151
|
Snow DM. Commentary on: "Facilitating transparency in spinal cord injury studies using data standards and ontologies". Neural Regen Res 2014; 9:8-9. [PMID: 25206737 PMCID: PMC4146318 DOI: 10.4103/1673-5374.125323] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/04/2014] [Indexed: 11/04/2022] Open
Affiliation(s)
- Diane M Snow
- Spinal Cord and Brain Injury Research Center (SCoBIRC), Department of Anatomy and Neurobiology, The University of Kentucky, Lexington, KY, USA
| |
Collapse
|
152
|
Davis J, Mengersen K, Bennett S, Mazerolle L. Viewing systematic reviews and meta-analysis in social research through different lenses. SPRINGERPLUS 2014; 3:511. [PMID: 25279303 PMCID: PMC4167883 DOI: 10.1186/2193-1801-3-511] [Citation(s) in RCA: 76] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/18/2014] [Accepted: 09/04/2014] [Indexed: 11/10/2022]
Abstract
Systematic reviews and meta-analyses are used to combine results across studies to determine an overall effect. Meta-analysis is especially useful for combining evidence to inform social policy, but meta-analyses of applied social science research may encounter practical issues arising from the nature of the research domain. The current paper identifies potential resolutions to four issues that may be encountered in systematic reviews and meta-analyses in social research. The four issues are: scoping and targeting research questions appropriate for meta-analysis; selecting eligibility criteria where primary studies vary in research design and choice of outcome measures; dealing with inconsistent reporting in primary studies; and identifying sources of heterogeneity with multiple confounded moderators. The paper presents an overview of each issue with a review of potential resolutions, identified from similar issues encountered in meta-analysis in medical and biological sciences. The discussion aims to share and improve methodology in systematic reviews and meta-analysis by promoting cross-disciplinary communication, that is, to encourage 'viewing through different lenses'.
Collapse
Affiliation(s)
- Jacqueline Davis
- Institute for Social Science Research, University of Queensland, Brisbane, St Lucia 4072 Australia
| | - Kerrie Mengersen
- School of Mathematical Sciences, Queensland University of Technology, GPO Box 2434, Brisbane, 4001 Australia
| | - Sarah Bennett
- Institute for Social Science Research, University of Queensland, Brisbane, St Lucia 4072 Australia
| | - Lorraine Mazerolle
- Institute for Social Science Research, University of Queensland, Brisbane, St Lucia 4072 Australia
| |
Collapse
|
153
|
Lemmon VP, Abeyruwan S, Visser U, Bixby JL. Facilitating transparency in spinal cord injury studies using data standards and ontologies. Neural Regen Res 2014; 9:6-7. [PMID: 25206736 PMCID: PMC4146316 DOI: 10.4103/1673-5374.125322] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/25/2013] [Indexed: 12/11/2022] Open
Affiliation(s)
- Vance P Lemmon
- Department of Neurological Surgery, University of Miami, Miami, FL 33136, USA ; Miami Project to Cure Paralysis, University of Miami, Miami, FL 33136, USA ; Center for Computational Science, University of Miami, Miami, FL 33146, USA
| | - Saminda Abeyruwan
- Center for Computational Science, University of Miami, Miami, FL 33146, USA ; Department of Computer Science, University of Miami, Miami, FL 33146, USA
| | - Ubbo Visser
- Center for Computational Science, University of Miami, Miami, FL 33146, USA ; Department of Computer Science, University of Miami, Miami, FL 33146, USA
| | - John L Bixby
- Department of Neurological Surgery, University of Miami, Miami, FL 33136, USA ; Miami Project to Cure Paralysis, University of Miami, Miami, FL 33136, USA ; Center for Computational Science, University of Miami, Miami, FL 33146, USA ; Department of Molecular & Cellular Pharmacology, University of Miami, Miami, FL 33136, USA
| |
Collapse
|
154
|
Abstract
The consultation of internet databases and the related use of computer software to retrieve, visualise and model data have become key components of many areas of scientific research. This paper focuses on the relation of these developments to understanding the biology of organisms, and examines the conditions under which the evidential value of data posted online is assessed and interpreted by the researchers who access them, in ways that underpin and guide the use of those data to foster discovery. I consider the types of knowledge required to interpret data as evidence for claims about organisms, and in particular the relevance of knowledge acquired through physical interaction with actual organisms to assessing the evidential value of data found online. I conclude that familiarity with research in vivo is crucial to assessing the quality and significance of data visualised in silico; and that studying how biological data are disseminated, visualised, assessed and interpreted in the digital age provides a strong rationale for viewing scientific understanding as a social and distributed, rather than individual and localised, achievement.
Collapse
Affiliation(s)
- Sabina Leonelli
- ESRC Centre for Genomics in Society, Department of Sociology and Philosophy, University of Exeter, St Germans Road, EX4 4PJ Exeter, UK,
| |
Collapse
|
155
|
Plant AL, Locascio LE, May WE, Gallagher PD. Improved reproducibility by assuring confidence in measurements in biomedical research. Nat Methods 2014; 11:895-8. [DOI: 10.1038/nmeth.3076] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
156
|
Flórez-Vargas O, Bramhall M, Noyes H, Cruickshank S, Stevens R, Brass A. The quality of methods reporting in parasitology experiments. PLoS One 2014; 9:e101131. [PMID: 25076044 PMCID: PMC4116335 DOI: 10.1371/journal.pone.0101131] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2014] [Accepted: 06/03/2014] [Indexed: 12/23/2022] Open
Abstract
There is a growing concern both inside and outside the scientific community over the lack of reproducibility of experiments. The depth and detail of reported methods are critical to the reproducibility of findings, but also for making it possible to compare and integrate data from different studies. In this study, we evaluated in detail the methods reporting in a comprehensive set of trypanosomiasis experiments that should enable valid reproduction, integration and comparison of research findings. We evaluated a subset of other parasitic (Leishmania, Toxoplasma, Plasmodium, Trichuris and Schistosoma) and non-parasitic (Mycobacterium) experimental infections in order to compare the quality of method reporting more generally. A systematic review using PubMed (2000-2012) of all publications describing gene expression in cells and animals infected with Trypanosoma spp was undertaken based on PRISMA guidelines; 23 papers were identified and included. We defined a checklist of essential parameters that should be reported and have scored the number of those parameters that are reported for each publication. Bibliometric parameters (impact factor, citations and h-index) were used to look for association between Journal and Author status and the quality of method reporting. Trichuriasis experiments achieved the highest scores and included the only paper to score 100% in all criteria. The mean of scores achieved by Trypanosoma articles through the checklist was 65.5% (range 32-90%). Bibliometric parameters were not correlated with the quality of method reporting (Spearman's rank correlation coefficient <-0.5; p>0.05). Our results indicate that the quality of methods reporting in experimental parasitology is a cause for concern and it has not improved over time, despite there being evidence that most of the assessed parameters do influence the results. We propose that our set of parameters be used as guidelines to improve the quality of the reporting of experimental infection models as a pre-requisite for integrating and comparing sets of data.
Collapse
Affiliation(s)
- Oscar Flórez-Vargas
- Bio-health Informatics Group, School of Computer Science, University of Manchester, Manchester, United Kingdom
| | - Michael Bramhall
- Bio-health Informatics Group, School of Computer Science, University of Manchester, Manchester, United Kingdom
| | - Harry Noyes
- School of Biological Science, University of Liverpool, Liverpool, United Kingdom
| | - Sheena Cruickshank
- Manchester Immunology Group, Faculty of Life Science, University of Manchester, Manchester, United Kingdom
| | - Robert Stevens
- Bio-health Informatics Group, School of Computer Science, University of Manchester, Manchester, United Kingdom
| | - Andy Brass
- Bio-health Informatics Group, School of Computer Science, University of Manchester, Manchester, United Kingdom
- Manchester Immunology Group, Faculty of Life Science, University of Manchester, Manchester, United Kingdom
| |
Collapse
|
157
|
Lemmon VP, Ferguson AR, Popovich PG, Xu XM, Snow DM, Igarashi M, Beattie CE, Bixby JL. Minimum information about a spinal cord injury experiment: a proposed reporting standard for spinal cord injury experiments. J Neurotrauma 2014; 31:1354-61. [PMID: 24870067 DOI: 10.1089/neu.2014.3400] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
The lack of reproducibility in many areas of experimental science has a number of causes, including a lack of transparency and precision in the description of experimental approaches. This has far-reaching consequences, including wasted resources and slowing of progress. Additionally, the large number of laboratories around the world publishing articles on a given topic make it difficult, if not impossible, for individual researchers to read all of the relevant literature. Consequently, centralized databases are needed to facilitate the generation of new hypotheses for testing. One strategy to improve transparency in experimental description, and to allow the development of frameworks for computer-readable knowledge repositories, is the adoption of uniform reporting standards, such as common data elements (data elements used in multiple clinical studies) and minimum information standards. This article describes a minimum information standard for spinal cord injury (SCI) experiments, its major elements, and the approaches used to develop it. Transparent reporting standards for experiments using animal models of human SCI aim to reduce inherent bias and increase experimental value.
Collapse
Affiliation(s)
- Vance P Lemmon
- 1 Miami Project to Cure Paralysis, University of Miami School of Medicine , Miami, Florida
| | | | | | | | | | | | | | | | | |
Collapse
|
158
|
Heise R, Arrivault S, Szecowka M, Tohge T, Nunes-Nesi A, Stitt M, Nikoloski Z, Fernie AR. Flux profiling of photosynthetic carbon metabolism in intact plants. Nat Protoc 2014; 9:1803-24. [DOI: 10.1038/nprot.2014.115] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
159
|
Dugan VG, Emrich SJ, Giraldo-Calderón GI, Harb OS, Newman RM, Pickett BE, Schriml LM, Stockwell TB, Stoeckert CJ, Sullivan DE, Singh I, Ward DV, Yao A, Zheng J, Barrett T, Birren B, Brinkac L, Bruno VM, Caler E, Chapman S, Collins FH, Cuomo CA, Di Francesco V, Durkin S, Eppinger M, Feldgarden M, Fraser C, Fricke WF, Giovanni M, Henn MR, Hine E, Hotopp JD, Karsch-Mizrachi I, Kissinger JC, Lee EM, Mathur P, Mongodin EF, Murphy CI, Myers G, Neafsey DE, Nelson KE, Nierman WC, Puzak J, Rasko D, Roos DS, Sadzewicz L, Silva JC, Sobral B, Squires RB, Stevens RL, Tallon L, Tettelin H, Wentworth D, White O, Will R, Wortman J, Zhang Y, Scheuermann RH. Standardized metadata for human pathogen/vector genomic sequences. PLoS One 2014; 9:e99979. [PMID: 24936976 PMCID: PMC4061050 DOI: 10.1371/journal.pone.0099979] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2014] [Accepted: 05/15/2014] [Indexed: 11/18/2022] Open
Abstract
High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats. Here we report the development of the GSCID/BRC Project and Sample Application Standard, developed by representatives of the Genome Sequencing Centers for Infectious Diseases (GSCIDs), the Bioinformatics Resource Centers (BRCs) for Infectious Diseases, and the U.S. National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), informed by interactions with numerous collaborating scientists. It includes mapping to terms from other data standards initiatives, including the Genomic Standards Consortium's minimal information (MIxS) and NCBI's BioSample/BioProjects checklists and the Ontology for Biomedical Investigations (OBI). The standard includes data fields about characteristics of the organism or environmental source of the specimen, spatial-temporal information about the specimen isolation event, phenotypic characteristics of the pathogen/vector isolated, and project leadership and support. By modeling metadata fields into an ontology-based semantic framework and reusing existing ontologies and minimum information checklists, the application standard can be extended to support additional project-specific data fields and integrated with other data represented with comparable standards. The use of this metadata standard by all ongoing and future GSCID sequencing projects will provide a consistent representation of these data in the BRC resources and other repositories that leverage these data, allowing investigators to identify relevant genomic sequences and perform comparative genomics analyses that are both statistically meaningful and biologically relevant.
Collapse
Affiliation(s)
- Vivien G. Dugan
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
- National Institute of Allergy and Infectious Diseases, Rockville, Maryland, United States of America
| | - Scott J. Emrich
- University of Notre Dame, Notre Dame, Indiana, United States of America
| | | | - Omar S. Harb
- University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Ruchi M. Newman
- Broad Institute, Cambridge, Massachusetts, United States of America
| | - Brett E. Pickett
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | - Lynn M. Schriml
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Timothy B. Stockwell
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | | | - Dan E. Sullivan
- Cyberinfrastructure Division, Virginia Bioinformatics Institute, Blacksburg, Virginia, United States of America
| | - Indresh Singh
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | - Doyle V. Ward
- Broad Institute, Cambridge, Massachusetts, United States of America
| | - Alison Yao
- National Institute of Allergy and Infectious Diseases, Rockville, Maryland, United States of America
| | - Jie Zheng
- University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Tanya Barrett
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, United States of America
| | - Bruce Birren
- Broad Institute, Cambridge, Massachusetts, United States of America
| | - Lauren Brinkac
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | - Vincent M. Bruno
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Elizabet Caler
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | - Sinéad Chapman
- Broad Institute, Cambridge, Massachusetts, United States of America
| | - Frank H. Collins
- University of Notre Dame, Notre Dame, Indiana, United States of America
| | | | - Valentina Di Francesco
- National Institute of Allergy and Infectious Diseases, Rockville, Maryland, United States of America
| | - Scott Durkin
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | - Mark Eppinger
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | | | - Claire Fraser
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - W. Florian Fricke
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Maria Giovanni
- National Institute of Allergy and Infectious Diseases, Rockville, Maryland, United States of America
| | - Matthew R. Henn
- Broad Institute, Cambridge, Massachusetts, United States of America
| | - Erin Hine
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Julie Dunning Hotopp
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Ilene Karsch-Mizrachi
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, United States of America
| | | | - Eun Mi Lee
- National Institute of Allergy and Infectious Diseases, Rockville, Maryland, United States of America
| | - Punam Mathur
- National Institute of Allergy and Infectious Diseases, Rockville, Maryland, United States of America
| | - Emmanuel F. Mongodin
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Cheryl I. Murphy
- Broad Institute, Cambridge, Massachusetts, United States of America
| | - Garry Myers
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | | | - Karen E. Nelson
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | - William C. Nierman
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | - Julia Puzak
- Kelly Government Solutions, Rockville, Maryland, United States of America
| | - David Rasko
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - David S. Roos
- University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Lisa Sadzewicz
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Joana C. Silva
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Bruno Sobral
- Cyberinfrastructure Division, Virginia Bioinformatics Institute, Blacksburg, Virginia, United States of America
| | - R. Burke Squires
- National Institute of Allergy and Infectious Diseases, Rockville, Maryland, United States of America
| | - Rick L. Stevens
- Argonne National Laboratory, Lemont, Illinois, United States of America
| | - Luke Tallon
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Herve Tettelin
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - David Wentworth
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | - Owen White
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Rebecca Will
- Cyberinfrastructure Division, Virginia Bioinformatics Institute, Blacksburg, Virginia, United States of America
| | - Jennifer Wortman
- Broad Institute, Cambridge, Massachusetts, United States of America
| | - Yun Zhang
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
| | - Richard H. Scheuermann
- J. Craig Venter Institute, Rockville, Maryland, and La Jolla, California, United States of America
- Department of Pathology, University of California San Diego, San Diego, California, United States of America
- * E-mail:
| |
Collapse
|
160
|
Merrill E, Corlosquet S, Ciccarese P, Clark T, Das S. Semantic Web repositories for genomics data using the eXframe platform. J Biomed Semantics 2014; 5:S3. [PMID: 25093072 PMCID: PMC4108874 DOI: 10.1186/2041-1480-5-s1-s3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
BACKGROUND With the advent of inexpensive assay technologies, there has been an unprecedented growth in genomics data as well as the number of databases in which it is stored. In these databases, sample annotation using ontologies and controlled vocabularies is becoming more common. However, the annotation is rarely available as Linked Data, in a machine-readable format, or for standardized queries using SPARQL. This makes large-scale reuse, or integration with other knowledge bases very difficult. METHODS To address this challenge, we have developed the second generation of our eXframe platform, a reusable framework for creating online repositories of genomics experiments. This second generation model now publishes Semantic Web data. To accomplish this, we created an experiment model that covers provenance, citations, external links, assays, biomaterials used in the experiment, and the data collected during the process. The elements of our model are mapped to classes and properties from various established biomedical ontologies. Resource Description Framework (RDF) data is automatically produced using these mappings and indexed in an RDF store with a built-in Sparql Protocol and RDF Query Language (SPARQL) endpoint. CONCLUSIONS Using the open-source eXframe software, institutions and laboratories can create Semantic Web repositories of their experiments, integrate it with heterogeneous resources and make it interoperable with the vast Semantic Web of biomedical knowledge.
Collapse
Affiliation(s)
- Emily Merrill
- Massachusetts General Hospital, Partners Research Building, 65 Landsdowne St, Cambridge, MA, 02139, USA
| | - Stéphane Corlosquet
- Massachusetts General Hospital, Partners Research Building, 65 Landsdowne St, Cambridge, MA, 02139, USA
| | - Paolo Ciccarese
- Massachusetts General Hospital, Partners Research Building, 65 Landsdowne St, Cambridge, MA, 02139, USA
- Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
| | - Tim Clark
- Massachusetts General Hospital, Partners Research Building, 65 Landsdowne St, Cambridge, MA, 02139, USA
- Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
- School of Computer Science, University of Manchester, Oxford Road, Manchester, M13 9PL, UK
| | - Sudeshna Das
- Massachusetts General Hospital, Partners Research Building, 65 Landsdowne St, Cambridge, MA, 02139, USA
- Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
| |
Collapse
|
161
|
Vempati UD, Chung C, Mader C, Koleti A, Datar N, Vidović D, Wrobel D, Erickson S, Muhlich JL, Berriz G, Benes CH, Subramanian A, Pillai A, Shamu CE, Schürer SC. Metadata Standard and Data Exchange Specifications to Describe, Model, and Integrate Complex and Diverse High-Throughput Screening Data from the Library of Integrated Network-based Cellular Signatures (LINCS). JOURNAL OF BIOMOLECULAR SCREENING 2014; 19:803-16. [PMID: 24518066 PMCID: PMC7723305 DOI: 10.1177/1087057114522514] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2013] [Accepted: 01/13/2014] [Indexed: 01/15/2023]
Abstract
The National Institutes of Health Library of Integrated Network-based Cellular Signatures (LINCS) program is generating extensive multidimensional data sets, including biochemical, genome-wide transcriptional, and phenotypic cellular response signatures to a variety of small-molecule and genetic perturbations with the goal of creating a sustainable, widely applicable, and readily accessible systems biology knowledge resource. Integration and analysis of diverse LINCS data sets depend on the availability of sufficient metadata to describe the assays and screening results and on their syntactic, structural, and semantic consistency. Here we report metadata specifications for the most important molecular and cellular components and recommend them for adoption beyond the LINCS project. We focus on the minimum required information to model LINCS assays and results based on a number of use cases, and we recommend controlled terminologies and ontologies to annotate assays with syntactic consistency and semantic integrity. We also report specifications for a simple annotation format (SAF) to describe assays and screening results based on our metadata specifications with explicit controlled vocabularies. SAF specifically serves to programmatically access and exchange LINCS data as a prerequisite for a distributed information management infrastructure. We applied the metadata specifications to annotate large numbers of LINCS cell lines, proteins, and small molecules. The resources generated and presented here are freely available.
Collapse
Affiliation(s)
- Uma D Vempati
- Center for Computational Science, University of Miami, Miami, FL, USA
| | - Caty Chung
- Center for Computational Science, University of Miami, Miami, FL, USA
| | - Chris Mader
- Center for Computational Science, University of Miami, Miami, FL, USA
| | - Amar Koleti
- Center for Computational Science, University of Miami, Miami, FL, USA
| | - Nakul Datar
- Center for Computational Science, University of Miami, Miami, FL, USA
| | - Dušica Vidović
- Center for Computational Science, University of Miami, Miami, FL, USA
| | - David Wrobel
- ICCB-Longwood Screening Facility, Harvard Medical School, Boston, MA, USA
| | - Sean Erickson
- ICCB-Longwood Screening Facility, Harvard Medical School, Boston, MA, USA
| | - Jeremy L Muhlich
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, USA
| | - Gabriel Berriz
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, USA
| | - Cyril H Benes
- Center for Molecular Therapeutics, Massachusetts General Hospital, Boston, Massachusetts, USA
| | | | - Ajay Pillai
- National Human Genome Research Institute, National Institutes of Health, Rockville, Maryland, USA
| | - Caroline E Shamu
- ICCB-Longwood Screening Facility, Harvard Medical School, Boston, MA, USA Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, USA
| | - Stephan C Schürer
- Center for Computational Science, University of Miami, Miami, FL, USA Department of Molecular and Cellular Pharmacology, University of Miami, Miami, Florida, USA
| |
Collapse
|
162
|
Optimizing methodologies for PCR-based DNA methylation analysis. Biotechniques 2014; 55:181-97. [PMID: 24107250 DOI: 10.2144/000114087] [Citation(s) in RCA: 118] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Accepted: 09/10/2013] [Indexed: 02/07/2023] Open
Abstract
Comprehensive analysis of DNA methylation patterns is critical for understanding the molecular basis of many human diseases. While hundreds of PCR-based DNA methylation studies are published every year, the selection and implementation of appropriate methods for these studies can be challenging for molecular genetics researchers not yet familiar with methylation analysis. Here we review the most commonly used PCR-based DNA methylation analysis techniques: bisulfite sequencing PCR (BSP), methylation specific PCR (MSP), MethyLight, and methylation-sensitive high resolution melting (MS-HRM). We provide critical analysis of the strengths and weaknesses of each approach as well as a series of guidelines to assist in selecting and implementing an appropriate method.
Collapse
|
163
|
Izzo M, Mortola F, Arnulfo G, Fato MM, Varesio L. A digital repository with an extensible data model for biobanking and genomic analysis management. BMC Genomics 2014; 15 Suppl 3:S3. [PMID: 25077808 PMCID: PMC4083403 DOI: 10.1186/1471-2164-15-s3-s3] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Motivation Molecular biology laboratories require extensive metadata to improve data collection and analysis. The heterogeneity of the collected metadata grows as research is evolving in to international multi-disciplinary collaborations and increasing data sharing among institutions. Single standardization is not feasible and it becomes crucial to develop digital repositories with flexible and extensible data models, as in the case of modern integrated biobanks management. Results We developed a novel data model in JSON format to describe heterogeneous data in a generic biomedical science scenario. The model is built on two hierarchical entities: processes and events, roughly corresponding to research studies and analysis steps within a single study. A number of sequential events can be grouped in a process building up a hierarchical structure to track patient and sample history. Each event can produce new data. Data is described by a set of user-defined metadata, and may have one or more associated files. We integrated the model in a web based digital repository with a data grid storage to manage large data sets located in geographically distinct areas. We built a graphical interface that allows authorized users to define new data types dynamically, according to their requirements. Operators compose queries on metadata fields using a flexible search interface and run them on the database and on the grid. We applied the digital repository to the integrated management of samples, patients and medical history in the BIT-Gaslini biobank. The platform currently manages 1800 samples of over 900 patients. Microarray data from 150 analyses are stored on the grid storage and replicated on two physical resources for preservation. The system is equipped with data integration capabilities with other biobanks for worldwide information sharing. Conclusions Our data model enables users to continuously define flexible, ad hoc, and loosely structured metadata, for information sharing in specific research projects and purposes. This approach can improve sensitively interdisciplinary research collaboration and allows to track patients' clinical records, sample management information, and genomic data. The web interface allows the operators to easily manage, query, and annotate the files, without dealing with the technicalities of the data grid.
Collapse
|
164
|
Cvijovic M, Almquist J, Hagmar J, Hohmann S, Kaltenbach HM, Klipp E, Krantz M, Mendes P, Nelander S, Nielsen J, Pagnani A, Przulj N, Raue A, Stelling J, Stoma S, Tobin F, Wodke JAH, Zecchina R, Jirstrand M. Bridging the gaps in systems biology. Mol Genet Genomics 2014; 289:727-34. [DOI: 10.1007/s00438-014-0843-3] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2014] [Accepted: 03/21/2014] [Indexed: 12/17/2022]
|
165
|
Razick S, Močnik R, Thomas LF, Ryeng E, Drabløs F, Sætrom P. The eGenVar data management system--cataloguing and sharing sensitive data and metadata for the life sciences. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014; 2014:bau027. [PMID: 24682735 PMCID: PMC4030636 DOI: 10.1093/database/bau027] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Systematic data management and controlled data sharing aim at increasing reproducibility,
reducing redundancy in work, and providing a way to efficiently locate complementing or
contradicting information. One method of achieving this is collecting data in a central
repository or in a location that is part of a federated system and providing interfaces to
the data. However, certain data, such as data from biobanks or clinical studies, may, for
legal and privacy reasons, often not be stored in public repositories. Instead, we
describe a metadata cataloguing system and a software suite for reporting the presence of
data from the life sciences domain. The system stores three types of metadata: file
information, file provenance and data lineage, and content descriptions. Our software
suite includes both graphical and command line interfaces that allow users to report and
tag files with these different metadata types. Importantly, the files remain in their
original locations with their existing access-control mechanisms in place, while our
system provides descriptions of their contents and relationships. Our system and software
suite thereby provide a common framework for cataloguing and sharing both public and
private data. Database URL:http://bigr.medisin.ntnu.no/data/eGenVar/
Collapse
Affiliation(s)
- Sabry Razick
- Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology, Prinsesse Kristinasgt. 1, NO-7491 Trondheim, Norway and Department of Computer and Information Science, Norwegian University of Science and Technology, Sem Sælands vei 9, NO-7491 Trondheim, Norway
| | | | | | | | | | | |
Collapse
|
166
|
Screening for host proteins with pro- and antiviral activity using high-throughput RNAi. Methods Mol Biol 2014; 1064:71-90. [PMID: 23996250 DOI: 10.1007/978-1-62703-601-6_5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
RNA interference (RNAi) describes the mechanism of posttranscriptional gene silencing by small (typically 18-24 nucleotides) RNA molecules and includes small-interfering RNAs (siRNAs) and microRNAs (miRNAs). As siRNAs and miRNAs are simple to use experimentally, they are easily adaptable to high-throughput methodologies and provide an ideal tool for genome-wide gene depletion studies. Over recent years RNAi has been used extensively to investigate the complex interactions between pathogen and host, and the identification of novel cellular factors and pathways influencing viral disease pathogenesis exemplifies the power of this technique. Here, the use of RNAi to investigate the functional role of cellular proteins in herpesvirus (Herpes Simplex Virus Type I; HSV-1) replication and how to identify novel antiviral and proviral host proteins is described.
Collapse
|
167
|
York WS, Agravat S, Aoki-Kinoshita KF, McBride R, Campbell MP, Costello CE, Dell A, Feizi T, Haslam SM, Karlsson N, Khoo KH, Kolarich D, Liu Y, Novotny M, Packer NH, Paulson JC, Rapp E, Ranzinger R, Rudd PM, Smith DF, Struwe WB, Tiemeyer M, Wells L, Zaia J, Kettner C. MIRAGE: the minimum information required for a glycomics experiment. Glycobiology 2014; 24:402-6. [PMID: 24653214 PMCID: PMC3976285 DOI: 10.1093/glycob/cwu018] [Citation(s) in RCA: 90] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
The MIRAGE (minimum information required for a glycomics experiment) initiative was founded in Seattle, WA, in November 2011 in order to develop guidelines for reporting the qualitative and quantitative results obtained by diverse types of glycomics analyses, including the conditions and techniques that were applied to prepare the glycans for analysis and generate the primary data along with the tools and parameters that were used to process and annotate this data. These guidelines must address a broad range of issues, as glycomics data are inherently complex and are generated using diverse methods, including mass spectrometry (MS), chromatography, glycan array-binding assays, nuclear magnetic resonance (NMR) and other rapidly developing technologies. The acceptance of these guidelines by scientists conducting research on biological systems in which glycans have a significant role will facilitate the evaluation and reproduction of glycomics experiments and data that is reported in scientific journals and uploaded to glycomics databases. As a first step, MIRAGE guidelines for glycan analysis by MS have been recently published (Kolarich D, Rapp E, Struwe WB, Haslam SM, Zaia J., et al. 2013. The minimum information required for a glycomics experiment (MIRAGE) project – Improving the standards for reporting mass spectrometry-based glycoanalytic data. Mol. Cell Proteomics. 12:991–995), allowing them to be implemented and evaluated in the context of real-world glycobiology research. In this paper, we set out the historical context, organization structure and overarching objectives of the MIRAGE initiative.
Collapse
Affiliation(s)
- William S York
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Road, Athens, GA 30602, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
168
|
Tenenbaum JD, Sansone SA, Haendel M. A sea of standards for omics data: sink or swim? J Am Med Inform Assoc 2014; 21:200-3. [PMID: 24076747 PMCID: PMC3932466 DOI: 10.1136/amiajnl-2013-002066] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Revised: 07/08/2013] [Accepted: 09/10/2013] [Indexed: 11/29/2022] Open
Abstract
In the era of Big Data, omic-scale technologies, and increasing calls for data sharing, it is generally agreed that the use of community-developed, open data standards is critical. Far less agreed upon is exactly which data standards should be used, the criteria by which one should choose a standard, or even what constitutes a data standard. It is impossible simply to choose a domain and have it naturally follow which data standards should be used in all cases. The 'right' standards to use is often dependent on the use case scenarios for a given project. Potential downstream applications for the data, however, may not always be apparent at the time the data are generated. Similarly, technology evolves, adding further complexity. Would-be standards adopters must strike a balance between planning for the future and minimizing the burden of compliance. Better tools and resources are required to help guide this balancing act.
Collapse
Affiliation(s)
- Jessica D Tenenbaum
- Duke Translational Medicine Institute, Duke University, Durham, North Carolina, USA
| | | | - Melissa Haendel
- Library and Department of Medical Informatics & Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| |
Collapse
|
169
|
Lourenço A, Coenye T, Goeres DM, Donelli G, Azevedo AS, Ceri H, Coelho FL, Flemming HC, Juhna T, Lopes SP, Oliveira R, Oliver A, Shirtliff ME, Sousa AM, Stoodley P, Pereira MO, Azevedo NF. Minimum information about a biofilm experiment (MIABiE): standards for reporting experiments and data on sessile microbial communities living at interfaces. Pathog Dis 2014; 70:250-6. [PMID: 24478124 DOI: 10.1111/2049-632x.12146] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Revised: 01/15/2014] [Accepted: 01/15/2014] [Indexed: 02/04/2023] Open
Abstract
The minimum information about a biofilm experiment (MIABiE) initiative has arisen from the need to find an adequate and scientifically sound way to control the quality of the documentation accompanying the public deposition of biofilm-related data, particularly those obtained using high-throughput devices and techniques. Thereby, the MIABiE consortium has initiated the identification and organization of a set of modules containing the minimum information that needs to be reported to guarantee the interpretability and independent verification of experimental results and their integration with knowledge coming from other fields. MIABiE does not intend to propose specific standards on how biofilms experiments should be performed, because it is acknowledged that specific research questions require specific conditions which may deviate from any standardization. Instead, MIABiE presents guidelines about the data to be recorded and published in order for the procedure and results to be easily and unequivocally interpreted and reproduced. Overall, MIABiE opens up the discussion about a number of particular areas of interest and attempts to achieve a broad consensus about which biofilm data and metadata should be reported in scientific journals in a systematic, rigorous and understandable manner.
Collapse
Affiliation(s)
- Anália Lourenço
- Departamento de Informática, Universidade de Vigo, ESEI - Escuela Superior de Ingeniería Informática, Ourense, Spain; IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Braga, Portugal
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
170
|
Brusic V, Gottardo R, Kleinstein SH, Davis MM. Computational resources for high-dimensional immune analysis from the Human Immunology Project Consortium. Nat Biotechnol 2014; 32:146-8. [PMID: 24441472 PMCID: PMC4294529 DOI: 10.1038/nbt.2777] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- Vladimir Brusic
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Raphael Gottardo
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Steven H Kleinstein
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA
- Department of Pathology, Yale School of Medicine, New Haven, Connecticut, USA
| | - Mark M Davis
- The Howard Hughes Medical Institute, The Institute for Immunity, Transplantation and Infection, and The Department of Microbiology and Immunology, Stanford University, Stanford, California, USA
| | | |
Collapse
|
171
|
Palla P, Frau G, Vargiu L, Rodriguez-Tomé P. QTREDS: a Ruby on Rails-based platform for omics laboratories. BMC Bioinformatics 2014; 15 Suppl 1:S13. [PMID: 24564791 PMCID: PMC4015218 DOI: 10.1186/1471-2105-15-s1-s13] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Background In recent years, the experimental aspects of the laboratory activities have been growing in complexity in terms of amount and diversity of data produced, equipment used, of computer-based workflows needed to process and analyze the raw data generated. To enhance the level of quality control over the laboratory activities and efficiently handle the large amounts of data produced, a Laboratory Management Information System (LIMS) is highly-recommended. A LIMS is a complex software platform that helps researchers to have a complete knowledge of the laboratory activities at each step encouraging them to adopt good laboratory practices. Results We have designed and implemented Quality and TRacEability Data System - QTREDS, a software platform born to address the specific needs of the CRS4 Sequencing and Genotyping Platform (CSGP). The system written in the Ruby programming language and developed using the Rails framework is based on four main functional blocks: a sample handler, a workflow generator, an inventory management system and a user management system. The wizard-based sample handler allows to manage one or multiple samples at a time, tracking the path of each sample and providing a full chain of custody. The workflow generator encapsulates a user-friendly JavaScript-based visual tool that allows users to design customized workflows even for those without a technical background. With the inventory management system, reagents, laboratory glassware and consumables can be easily added through their barcodes and minimum stock levels can be controlled to avoid shortages of essential laboratory supplies. QTREDS provides a system for privileges management and authorizations to create different user roles, each with a well-defined access profile. Conclusions Tracking and monitoring all the phases of the laboratory activities can help to identify and troubleshoot problems more quickly, reducing the risk of process failures and their related costs. QTREDS was designed to address the specific needs of the CSGP laboratory, where it has been successfully used for over a year, but thanks to its flexibility it can be easily adapted to other "omics" laboratories. The software is freely available for academic users from http://qtreds.crs4.it.
Collapse
|
172
|
Mayer G, Jones AR, Binz PA, Deutsch EW, Orchard S, Montecchi-Palazzi L, Vizcaíno JA, Hermjakob H, Oveillero D, Julian R, Stephan C, Meyer HE, Eisenacher M. Controlled vocabularies and ontologies in proteomics: overview, principles and practice. BIOCHIMICA ET BIOPHYSICA ACTA 2014; 1844:98-107. [PMID: 23429179 PMCID: PMC3898906 DOI: 10.1016/j.bbapap.2013.02.017] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/23/2012] [Revised: 02/05/2013] [Accepted: 02/09/2013] [Indexed: 11/30/2022]
Abstract
This paper focuses on the use of controlled vocabularies (CVs) and ontologies especially in the area of proteomics, primarily related to the work of the Proteomics Standards Initiative (PSI). It describes the relevant proteomics standard formats and the ontologies used within them. Software and tools for working with these ontology files are also discussed. The article also examines the "mapping files" used to ensure correct controlled vocabulary terms that are placed within PSI standards and the fulfillment of the MIAPE (Minimum Information about a Proteomics Experiment) requirements. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan.
Collapse
Affiliation(s)
- Gerhard Mayer
- Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, D-44801 Bochum, Germany
| | - Andrew R. Jones
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Pierre-Alain Binz
- SIB Swiss Institute of Bioinformatics, Swiss-Prot group, Rue Michel-Servet 1, CH-1211 Geneva 4, Switzerland
| | - Eric W. Deutsch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109, USA
| | - Sandra Orchard
- EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | | | | | - Henning Hermjakob
- EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - David Oveillero
- EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | | | - Christian Stephan
- Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, D-44801 Bochum, Germany
- Kairos GmbH, Universitätsstraße 136, D-44799 Bochum, Germany
| | - Helmut E. Meyer
- Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, D-44801 Bochum, Germany
| | - Martin Eisenacher
- Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, D-44801 Bochum, Germany
| |
Collapse
|
173
|
Martínez-Bartolomé S, Binz PA, Albar JP. The Minimal Information about a Proteomics Experiment (MIAPE) from the Proteomics Standards Initiative. Methods Mol Biol 2014; 1072:765-80. [PMID: 24136562 DOI: 10.1007/978-1-62703-631-3_53] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
During the last 10 years, the Proteomics Standards Initiative from the Human Proteome Organization (HUPO-PSI) has worked on defining standards for proteomics data representation as well as guidelines that state the minimum information that should be included when reporting a proteomics experiment (MIAPE). Such minimum information must describe the complete experiment, including both experimental protocols and data processing methods, allowing a critical evaluation of the whole process and the potential recreation of the work. In this chapter we describe the standardization work performed by the HUPO-PSI, and then we concentrate on the MIAPE guidelines, highlighting its importance when publishing proteomics experiments particularly in specialized proteomics journals. Finally, we describe existing bioinformatics resources that generate MIAPE compliant reports or that check proteomics data files for MIAPE compliance.
Collapse
|
174
|
Kolker E, Özdemir V, Martens L, Hancock W, Anderson G, Anderson N, Aynacioglu S, Baranova A, Campagna SR, Chen R, Choiniere J, Dearth SP, Feng WC, Ferguson L, Fox G, Frishman D, Grossman R, Heath A, Higdon R, Hutz MH, Janko I, Jiang L, Joshi S, Kel A, Kemnitz JW, Kohane IS, Kolker N, Lancet D, Lee E, Li W, Lisitsa A, Llerena A, MacNealy-Koch C, Marshall JC, Masuzzo P, May A, Mias G, Monroe M, Montague E, Mooney S, Nesvizhskii A, Noronha S, Omenn G, Rajasimha H, Ramamoorthy P, Sheehan J, Smarr L, Smith CV, Smith T, Snyder M, Rapole S, Srivastava S, Stanberry L, Stewart E, Toppo S, Uetz P, Verheggen K, Voy BH, Warnich L, Wilhelm SW, Yandl G. Toward more transparent and reproducible omics studies through a common metadata checklist and data publications. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2014; 18:10-4. [PMID: 24456465 PMCID: PMC3903324 DOI: 10.1089/omi.2013.0149] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.
Collapse
Affiliation(s)
- Eugene Kolker
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Vural Özdemir
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Office of the President, Gaziantep University, International Affairs and Global Development Strategy
- Faculty of Communications, Universite Bulvarı, Kilis Yolu, Turkey
| | - Lennart Martens
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Medical Protein Research, Vlaams Instituut voor Biotechnologie, Ghent, Belgium
- Department of Biochemistry, Ghent University; Ghent, Belgium
| | - William Hancock
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemistry, Barnett Institute, Northeastern University, Boston, Massachusetts
| | - Gordon Anderson
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Fundamental and Computational Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington
| | - Nathaniel Anderson
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Sukru Aynacioglu
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Pharmacology, Gaziantep University, Gaziantep, Turkey
| | - Ancha Baranova
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- School of Systems Biology, George Mason University, Manassas, Virginia
| | - Shawn R. Campagna
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemistry, University of Tennessee Knoxville, Knoxville, Tennessee
| | - Rui Chen
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Stanford University, Stanford, California
| | - John Choiniere
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Stephen P. Dearth
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemistry, University of Tennessee Knoxville, Knoxville, Tennessee
| | - Wu-Chun Feng
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia
- Department of Electrical and Computer Engineering, Virginia Tech, Blacksburg, Virginia
- Department of SyNeRGy Laboratory, Virginia Tech, Blacksburg, Virginia
| | - Lynnette Ferguson
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Nutrition, Auckland Cancer Society Research Centre, University of Auckland, Auckland, New Zealand
| | - Geoffrey Fox
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- School of Informatics and Computing, Indiana University, Bloomington, Indiana
| | - Dmitrij Frishman
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Technische Universitat Munchen, Wissenshaftzentrum Weihenstephan, Freising, Germany
| | - Robert Grossman
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Institute for Genomics and Systems Biology, University of Chicago, Chicago, Illinois
- Department of Medicine, University of Chicago, Chicago, Illinois
| | - Allison Heath
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Institute for Genomics and Systems Biology, University of Chicago, Chicago, Illinois
- Knapp Center for Biomedical Discovery, University of Chicago, Chicago, Illinois
| | - Roger Higdon
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Mara H. Hutz
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Departamento de Genetica, Instituto de Biociencias, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
| | - Imre Janko
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
| | - Lihua Jiang
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Stanford University, Stanford, California
| | - Sanjay Joshi
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Life Sciences, EMC, Hopkinton, Massachusetts
| | - Alexander Kel
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- GeneXplain GmbH, Wolfenbüttel, Germany
| | - Joseph W. Kemnitz
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin
- Wisconsin National Primate Research Center, University of Wisconsin-Madison, Madison, Wisconsin
| | - Isaac S. Kohane
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Pediatrics and Health Sciences Technology, Children's Hospital and Harvard Medical School, Boston, Massachusetts
- HMS Center for Biomedical Informatics, Countway Library of Medicine, Boston, Massachusetts
| | - Natali Kolker
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
| | - Doron Lancet
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Molecular Genetics, Crown Human Genome Center, Weizmann Institute of Science, Rehovot, Israel
| | - Elaine Lee
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
| | - Weizhong Li
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for Research in Biological Systems, University of California, San Diego, La Jolla, California
| | - Andrey Lisitsa
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Russian Human Proteome Organization (RHUPO), Moscow, Russia
- Institute of Biomedical Chemistry, Moscow, Russia
| | - Adrian Llerena
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Clinical Research Center, Extremadura University Hospital and Medical School, Badajoz, Spain
| | - Courtney MacNealy-Koch
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Jean-Claude Marshall
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for Translational Research, Catholic Health Initiatives, Towson, Maryland
| | - Paola Masuzzo
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Medical Protein Research, Vlaams Instituut voor Biotechnologie, Ghent, Belgium
- Department of Biochemistry, Ghent University; Ghent, Belgium
| | - Amanda May
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemistry, University of Tennessee Knoxville, Knoxville, Tennessee
| | - George Mias
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Stanford University, Stanford, California
| | - Matthew Monroe
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington
| | - Elizabeth Montague
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Sean Mooney
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- The Buck Institute for Research on Aging, Novato, California
| | - Alexey Nesvizhskii
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Pathology, University of Michigan, Ann Arbor, Michigan
- Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan
| | - Santosh Noronha
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemical Engineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
| | - Gilbert Omenn
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor Michigan
- Department of Molecular Medicine & Genetics and Human Genetics, University of Michigan, Ann Arbor Michigan
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor Michigan
- School of Public Health, University of Michigan, Ann Arbor Michigan
| | - Harsha Rajasimha
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Jeeva Informatics Solutions LLC, Derwood, Maryland
| | - Preveen Ramamoorthy
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Molecular Diagnostics Department, National Jewish Health, Denver, Colorado
| | - Jerry Sheehan
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- California Institute for Telecommunications and Information Technology, University of California-San Diego, La Jolla, California
| | - Larry Smarr
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- California Institute for Telecommunications and Information Technology, University of California-San Diego, La Jolla, California
| | - Charles V. Smith
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
| | - Todd Smith
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Digital World Biology, Seattle, Washington
| | - Michael Snyder
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Stanford University, Stanford, California
- Stanford Center for Genomics and Personalized Medicine, Stanford University, Stanford, California
| | - Srikanth Rapole
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Proteomics Laboratory, National Centre for Cell Science, University of Pune, Pune, India
| | - Sanjeeva Srivastava
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Proteomics Laboratory, Indian Institute of Technology Bombay, Mumbai, India
| | - Larissa Stanberry
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Elizabeth Stewart
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Stefano Toppo
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Molecular Medicine, University of Padova, Padova, Italy
| | - Peter Uetz
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for the Study of Biological Complexity (CSBC), Virginia Commonwealth University, Richmond, Virginia
| | - Kenneth Verheggen
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Medical Protein Research, Vlaams Instituut voor Biotechnologie, Ghent, Belgium
- Department of Biochemistry, Ghent University; Ghent, Belgium
| | - Brynn H. Voy
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Animal Science, University of Tennessee Institute of Agriculture, Knoxville, Tennessee
| | - Louise Warnich
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Faculty of AgriSciences, University of Stellenbosch, Stellenbosch, South Africa
| | - Steven W. Wilhelm
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Microbiology, University of Tennessee-Knoxville, Knoxville, Tennessee
| | - Gregory Yandl
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| |
Collapse
|
175
|
González-Beltrán A, Maguire E, Sansone SA, Rocca-Serra P. linkedISA: semantic representation of ISA-Tab experimental metadata. BMC Bioinformatics 2014; 15. [PMID: 25472428 PMCID: PMC4255742 DOI: 10.1186/1471-2105-15-s14-s4,] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/10/2023] Open
Abstract
BACKGROUND Reporting and sharing experimental metadata- such as the experimental design, characteristics of the samples, and procedures applied, along with the analysis results, in a standardised manner ensures that datasets are comprehensible and, in principle, reproducible, comparable and reusable. Furthermore, sharing datasets in formats designed for consumption by humans and machines will also maximize their use. The Investigation/Study/Assay (ISA) open source metadata tracking framework facilitates standards-compliant collection, curation, visualization, storage and sharing of datasets, leveraging on other platforms to enable analysis and publication. The ISA software suite includes several components used in increasingly diverse set of life science and biomedical domains; it is underpinned by a general-purpose format, ISA-Tab, and conversions exist into formats required by public repositories. While ISA-Tab works well mainly as a human readable format, we have also implemented a linked data approach to semantically define the ISA-Tab syntax. RESULTS We present a semantic web representation of the ISA-Tab syntax that complements ISA-Tab's syntactic interoperability with semantic interoperability. We introduce the linkedISA conversion tool from ISA-Tab to the Resource Description Framework (RDF), supporting mappings from the ISA syntax to multiple community-defined, open ontologies and capitalising on user-provided ontology annotations in the experimental metadata. We describe insights of the implementation and how annotations can be expanded driven by the metadata. We applied the conversion tool as part of Bio-GraphIIn, a web-based application supporting integration of the semantically-rich experimental descriptions. Designed in a user-friendly manner, the Bio-GraphIIn interface hides most of the complexities to the users, exposing a familiar tabular view of the experimental description to allow seamless interaction with the RDF representation, and visualising descriptors to drive the query over the semantic representation of the experimental design. In addition, we defined queries over the linkedISA RDF representation and demonstrated its use over the linkedISA conversion of datasets from Nature' Scientific Data online publication. CONCLUSIONS Our linked data approach has allowed us to: 1) make the ISA-Tab semantics explicit and machine-processable, 2) exploit the existing ontology-based annotations in the ISA-Tab experimental descriptions, 3) augment the ISA-Tab syntax with new descriptive elements, 4) visualise and query elements related to the experimental design. Reasoning over ISA-Tab metadata and associated data will facilitate data integration and knowledge discovery.
Collapse
Affiliation(s)
| | - Eamonn Maguire
- Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, UK
| | | | | |
Collapse
|
176
|
Altmäe S, Esteban FJ, Stavreus-Evers A, Simón C, Giudice L, Lessey BA, Horcajadas JA, Macklon NS, D'Hooghe T, Campoy C, Fauser BC, Salamonsen LA, Salumets A. Guidelines for the design, analysis and interpretation of 'omics' data: focus on human endometrium. Hum Reprod Update 2014; 20:12-28. [PMID: 24082038 PMCID: PMC3845681 DOI: 10.1093/humupd/dmt048] [Citation(s) in RCA: 101] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2013] [Revised: 08/04/2013] [Accepted: 08/19/2013] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND 'Omics' high-throughput analyses, including genomics, epigenomics, transcriptomics, proteomics and metabolomics, are widely applied in human endometrial studies. Analysis of endometrial transcriptome patterns in physiological and pathophysiological conditions has been to date the most commonly applied 'omics' technique in human endometrium. As the technologies improve, proteomics holds the next big promise for this field. The 'omics' technologies have undoubtedly advanced our knowledge of human endometrium in relation to fertility and different diseases. Nevertheless, the challenges arising from the vast amount of data generated and the broad variation of 'omics' profiling according to different environments and stimuli make it difficult to assess the validity, reproducibility and interpretation of such 'omics' data. With the expansion of 'omics' analyses in the study of the endometrium, there is a growing need to develop guidelines for the design of studies, and the analysis and interpretation of 'omics' data. METHODS Systematic review of the literature in PubMed, and references from relevant articles were investigated up to March 2013. RESULTS The current review aims to provide guidelines for future 'omics' studies on human endometrium, together with a summary of the status and trends, promise and shortcomings in the high-throughput technologies. In addition, the approaches presented here can be adapted to other areas of high-throughput 'omics' studies. CONCLUSION A highly rigorous approach to future studies, based on the guidelines provided here, is a prerequisite for obtaining data on biological systems which can be shared among researchers worldwide and will ultimately be of clinical benefit.
Collapse
Affiliation(s)
- Signe Altmäe
- Competence Centre on Reproductive Medicine and Biology, Tartu, Estonia
- School of Medicine, Department of Paediatrics, University of Granada, 18012 Granada, Spain
| | | | - Anneli Stavreus-Evers
- Department of Women's and Children's Health, Uppsala University, Akademiska Sjukhuset, 75185 Uppsala, Sweden
| | - Carlos Simón
- Fundación Instituto Valenciano de Infertilidad (FIVI) and Instituto Universitario IVI/INCLIVA, Valencia University, 46021 Valencia, Spain
| | - Linda Giudice
- Department of Obstetrics, Gynecology and Reproductive Sciences, University of California San Francisco, San Francisco, CA 94143-0132, USA
| | - Bruce A. Lessey
- Division of Reproductive Endocrinology, Department of Obstetrics and Gynecology, University Medical Group, Greenville Hospital System, Greenville, South Carolina, SC 29605, USA
| | - Jose A. Horcajadas
- Araid-Hospital Miguel Servet, 50004 Zaragoza, Spain
- Department of Genetics, Universidad Pablo de Olavide, 41013 Sevilla, Spain
| | - Nick S. Macklon
- Department of Obstetrics and Gynaecology, Division of Developmental Origins of Adult Disease, University of Southampton, Princess Anne Hospital, SO16 5YA Southampton, UK
- Department of Reproductive Medicine and Gynaecology, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands
| | - Thomas D'Hooghe
- Leuven University Fertility Center, Department of Obstetrics and Gynecology, University Hospital Leuven, Leuven, Belgium
- Department of Development and Regeneration, KU Leuven (Leuven University), 3000 Leuven, Belgium
| | - Cristina Campoy
- School of Medicine, Department of Paediatrics, University of Granada, 18012 Granada, Spain
| | - Bart C. Fauser
- Department of Reproductive Medicine and Gynaecology, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands
| | - Lois A. Salamonsen
- Prince Henry's Institute of Medical Research, Melbourne, Victoria 3168, Australia
| | - Andres Salumets
- Competence Centre on Reproductive Medicine and Biology, Tartu, Estonia
- Department of Obstetrics and Gynaecology, University of Tartu, 51014 Tartu, Estonia
| |
Collapse
|
177
|
Leonelli S. Integrating data to acquire new knowledge: Three modes of integration in plant science. STUDIES IN HISTORY AND PHILOSOPHY OF BIOLOGICAL AND BIOMEDICAL SCIENCES 2013; 44:503-514. [PMID: 23571025 DOI: 10.1016/j.shpsc.2013.03.020] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
This paper discusses what it means and what it takes to integrate data in order to acquire new knowledge about biological entities and processes. Maureen O'Malley and Orkun Soyer have pointed to the scientific work involved in data integration as important and distinct from the work required by other forms of integration, such as methodological and explanatory integration, which have been more successful in captivating the attention of philosophers of science. Here I explore what data integration involves in more detail and with a focus on the role of data-sharing tools, like online databases, in facilitating this process; and I point to the philosophical implications of focusing on data as a unit of analysis. I then analyse three cases of data integration in the field of plant science, each of which highlights a different mode of integration: (1) inter-level integration, which involves data documenting different features of the same species, aims to acquire an interdisciplinary understanding of organisms as complex wholes and is exemplified by research on Arabidopsis thaliana; (2) cross-species integration, which involves data acquired on different species, aims to understand plant biology in all its different manifestations and is exemplified by research on Miscanthus giganteus; and (3) translational integration, which involves data acquired from sources within as well as outside academia, aims at the provision of interventions to improve human health (e.g. by sustaining the environment in which humans thrive) and is exemplified by research on Phytophtora ramorum. Recognising the differences between these efforts sheds light on the dynamics and diverse outcomes of data dissemination and integrative research; and the relations between the social and institutional roles of science, the development of data-sharing infrastructures and the production of scientific knowledge.
Collapse
Affiliation(s)
- Sabina Leonelli
- Department of Sociology, Philosophy and Anthropology & Egenis, University of Exeter, Byrne House, St Germans Road, EX4 4PJ Exeter, UK.
| |
Collapse
|
178
|
Comparative study of candidate housekeeping genes for quantification of target gene messenger RNA expression by real-time PCR in patients with inflammatory bowel disease. Inflamm Bowel Dis 2013; 19:2840-7. [PMID: 24141710 DOI: 10.1097/01.mib.0000435440.22484.e8] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
BACKGROUND Mucosal expression of immunological mediators is modified in inflammatory bowel disease (IBD). Quantification of target gene messenger RNA (mRNA) transcripts depends on the normalization to a housekeeping or reference gene. Stability of housekeeping gene expression is critical for the accurate measurement of transcripts of the target gene. No studies have addressed the optimization of reference gene performance for mRNA studies in healthy intestinal mucosa and during mucosal inflammation. METHODS RNA was extracted from endoscopically obtained intestinal biopsies from healthy control subjects and patients with active IBD or non-IBD inflammatory diseases. Comparative analysis of 10 candidate housekeeping genes for quantitative real-time PCR was carried out according to predefined criteria, including use of the Web-based RefFinder platform. RESULTS We demonstrate that intestinal inflammation may significantly affect the stability of mucosal expression of housekeeping genes. Commonly used controls, such as glyceraldehyde-3-phosphate dehydrogenase, β-actin, or β2-microglobulin displayed high variability within the control group and/or between the healthy and inflamed mucosae. In contrast, we have identified novel genes with optimal stability, which may be used as appropriate housekeeping controls. The ribosomal proteins encoding genes (RPLPO and RPS9) were the most stable because their expression was not affected by interindividual differences, the presence of inflammation, or intestinal location. Normalization ofthe mRNA expression of mucosal tumor necrosis factor-α was highly dependent on the specific reference gene and varied significantly when normalized to genes with high or low stability. CONCLUSIONS Validation for optimal performance of the housekeeping gene is required for target mRNA quantification in healthy intestine and IBD-associated lesions. Suboptimal reference gene expression may explain conflicting results from published studies on IBD gene expression.
Collapse
|
179
|
Kolker E, Özdemir V, Martens L, Hancock W, Anderson G, Anderson N, Aynacioglu S, Baranova A, Campagna SR, Chen R, Choiniere J, Dearth SP, Feng WC, Ferguson L, Fox G, Frishman D, Grossman R, Heath A, Higdon R, Hutz MH, Janko I, Jiang L, Joshi S, Kel A, Kemnitz JW, Kohane IS, Kolker N, Lancet D, Lee E, Li W, Lisitsa A, Llerena A, MacNealy-Koch C, Marshall JC, Masuzzo P, May A, Mias G, Monroe M, Montague E, Mooney S, Nesvizhskii A, Noronha S, Omenn G, Rajasimha H, Ramamoorthy P, Sheehan J, Smarr L, Smith CV, Smith T, Snyder M, Rapole S, Srivastava S, Stanberry L, Stewart E, Toppo S, Uetz P, Verheggen K, Voy BH, Warnich L, Wilhelm SW, Yandl G. Toward More Transparent and Reproducible Omics Studies Through a Common Metadata Checklist and Data Publications. BIG DATA 2013; 1:196-201. [PMID: 27447251 DOI: 10.1089/big.2013.0039] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.
Collapse
Affiliation(s)
- Eugene Kolker
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 2 Predictive Analytics , Seattle Children's, Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Vural Özdemir
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 4 Office of the President, Gaziantep University , International Affairs and Global Development Strategy
- 5 Faculty of Communications, Universite Bulvarı , Kilis Yolu, Turkey
| | - Lennart Martens
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 6 Department of Medical Protein Research, Vlaams Instituut voor Biotechnologie , Ghent, Belgium
- 7 Department of Biochemistry, Ghent University, Ghent , Belgium
| | - William Hancock
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 8 Department of Chemistry, Barnett Institute, Northeastern University , Boston, Massachusetts
| | - Gordon Anderson
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 9 Fundamental & Computational Sciences Directorate, Pacific Northwest National Laboratory , Richland, Washington
| | - Nathaniel Anderson
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Sukru Aynacioglu
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 10 Department of Pharmacology, Gaziantep University , Gaziantep, Turkey
| | - Ancha Baranova
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 11 School of Systems Biology, George Mason University , Manassas, Virginia
| | - Shawn R Campagna
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 12 Department of Chemistry, University of Tennessee Knoxville , Knoxville, Tennessee
| | - Rui Chen
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 13 Department of Genetics, Stanford University , Stanford, California
| | - John Choiniere
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Stephen P Dearth
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 12 Department of Chemistry, University of Tennessee Knoxville , Knoxville, Tennessee
| | - Wu-Chun Feng
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 14 Department of Computer Science, Virginia Tech, Blacksburg Virginia
- 15 Department of Electrical and Computer Engineering, Virginia Tech, Blacksburg Virginia
- 16 SyNeRGy Laboratory, Virginia Tech, Blacksburg, Virginia
| | - Lynnette Ferguson
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 17 Department of Nutrition, Auckland Cancer Society Research Centre, University of Auckland , Auckland, New Zealand
| | - Geoffrey Fox
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 18 School of Informatics and Computing, Indiana University , Bloomington, Indiana
| | - Dmitrij Frishman
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 19 Technische Universitat Munchen , Wissenshaftzentrum Weihenstephan, Freising, Germany
| | - Robert Grossman
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 20 Institute for Genomics and Systems Biology, University of Chicago , Chicago Illinois
- 21 Department of Medicine, University of Chicago , Chicago, Illinois
| | - Allison Heath
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 20 Institute for Genomics and Systems Biology, University of Chicago , Chicago Illinois
- 22 Knapp Center for Biomedical Discovery, University of Chicago , Chicago, Illinois
| | - Roger Higdon
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 2 Predictive Analytics , Seattle Children's, Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Mara H Hutz
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 23 Departamento de Genetica, Instituto de Biociencias, Federal University of Rio Grande do Sul , Porto Alegre, Brazil
| | - Imre Janko
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 24 High-Throughput Analysis Core, Seattle Children's Research Institute , Seattle, Washington
| | - Lihua Jiang
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 13 Department of Genetics, Stanford University , Stanford, California
| | - Sanjay Joshi
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 25 Life Sciences , EMC, Hopkinton, Massachusetts
| | - Alexander Kel
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 26 GeneXplain GmbH , Wolfenbüttel, Germany
| | - Joseph W Kemnitz
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 27 Department of Cell and Regenerative Biology, University of Wisconsin-Madison , Madison, Wisconsin
- 28 Wisconsin National Primate Research Center, University of Wisconsin-Madison , Madison, Wisconsin
| | - Isaac S Kohane
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 29 Pediatrics and Health Sciences Technology, Children's Hospital and Harvard Medical School , Boston, Massachusetts
- 30 HMS Center for Biomedical Informatics, Countway Library of Medicine , Boston, Massachusetts
| | - Natali Kolker
- 2 Predictive Analytics , Seattle Children's, Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 24 High-Throughput Analysis Core, Seattle Children's Research Institute , Seattle, Washington
| | - Doron Lancet
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 31 Department of Molecular Genetics, Crown Human Genome Center , Weizmann Institute of Science, Rehovot, Israel
| | - Elaine Lee
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 24 High-Throughput Analysis Core, Seattle Children's Research Institute , Seattle, Washington
| | - Weizhong Li
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 32 Center for Research in Biological Systems, University of California , San Diego, La Jolla, California
| | - Andrey Lisitsa
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 33 Russian Human Proteome Organization (RHUPO) , Moscow, Russia
- 34 Institute of Biomedical Chemistry , Moscow, Russia
| | - Adrian Llerena
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 35 Clinical Research Center, Extremadura University Hospital and Medical School , Badajoz, Spain
| | - Courtney MacNealy-Koch
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Jean-Claude Marshall
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 36 Center for Translational Research, Catholic Health Initiatives , Towson, Maryland
| | - Paola Masuzzo
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 6 Department of Medical Protein Research, Vlaams Instituut voor Biotechnologie , Ghent, Belgium
- 7 Department of Biochemistry, Ghent University, Ghent , Belgium
| | - Amanda May
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 12 Department of Chemistry, University of Tennessee Knoxville , Knoxville, Tennessee
| | - George Mias
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 13 Department of Genetics, Stanford University , Stanford, California
| | - Matthew Monroe
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 37 Biological Sciences Division, Pacific Northwest National Laboratory , Richland, Washington
| | - Elizabeth Montague
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 2 Predictive Analytics , Seattle Children's, Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Sean Mooney
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 38 The Buck Institute for Research on Aging , Novato, California
| | - Alexey Nesvizhskii
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 39 Department of Pathology, University of Michigan , Ann Arbor, Michigan
- 40 Computational Medicine and Bioinformatics, University of Michigan , Ann Arbor, Michigan
| | - Santosh Noronha
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 41 Department of Chemical Engineering, Indian Institute of Technology Bombay , Powai, Mumbai, India
| | - Gilbert Omenn
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 42 Center for Computational Medicine and Bioinformatics, University of Michigan , Ann Arbor, Michigan
- 43 Departments of Molecular Medicine & Genetics and Human Genetics, University of Michigan , Ann Arbor Michigan
- 44 Department of Computational Medicine and Bioinformatics, University of Michigan , Ann Arbor, Michigan
- 45 School of Public Health, University of Michigan , Ann Arbor, Michigan
| | - Harsha Rajasimha
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 46 J eeva Informatics Solutions LLC , Derwood, Maryland
| | - Preveen Ramamoorthy
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 47 Molecular Diagnostics Department, National Jewish Health , Denver Colorado
| | - Jerry Sheehan
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 48 California Institute for Telecommunications and Information Technology, University of California-San Diego , La Jolla, California
| | - Larry Smarr
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 48 California Institute for Telecommunications and Information Technology, University of California-San Diego , La Jolla, California
| | - Charles V Smith
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 49 Center for Developmental Therapeutics, Seattle Children's Research Institute , Seattle, Washington
| | - Todd Smith
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 50 Digital World Biology , Seattle, Washington
| | - Michael Snyder
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 13 Department of Genetics, Stanford University , Stanford, California
- 51 Stanford Center for Genomics and Personalized Medicine, Stanford University , Stanford, California
| | - Srikanth Rapole
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 52 Proteomics Laboratory, National Centre for Cell Science, University of Pune , Pune, India
| | - Sanjeeva Srivastava
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 53 Proteomics Laboratory, Indian Institute of Technology Bombay , Mumbai, India
| | - Larissa Stanberry
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 2 Predictive Analytics , Seattle Children's, Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Elizabeth Stewart
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Stefano Toppo
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 54 Department of Molecular Medicine, University of Padova , Padova, Italy
| | - Peter Uetz
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 55 Center for the Study of Biological Complexity (CSBC), Virginia Commonwealth University , Richmond, Virginia
| | - Kenneth Verheggen
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 6 Department of Medical Protein Research, Vlaams Instituut voor Biotechnologie , Ghent, Belgium
- 7 Department of Biochemistry, Ghent University, Ghent , Belgium
| | - Brynn H Voy
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 56 Department of Animal Science, University of Tennessee Institute of Agriculture , Knoxville, Tennessee
| | - Louise Warnich
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 57 Department of Genetics, Faculty of AgriSciences, University of Stellenbosch , Stellenbosch, South Africa
| | - Steven W Wilhelm
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 58 Department of Microbiology, University of Tennessee-Knoxville , Knoxville, Tennessee
| | - Gregory Yandl
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 3 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| |
Collapse
|
180
|
Tummler K, Lubitz T, Schelker M, Klipp E. New types of experimental data shape the use of enzyme kinetics for dynamic network modeling. FEBS J 2013; 281:549-71. [PMID: 24034816 DOI: 10.1111/febs.12525] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Revised: 08/27/2013] [Accepted: 09/10/2013] [Indexed: 01/21/2023]
Abstract
Since the publication of Leonor Michaelis and Maude Menten's paper on the reaction kinetics of the enzyme invertase in 1913, molecular biology has evolved tremendously. New measurement techniques allow in vivo characterization of the whole genome, proteome or transcriptome of cells, whereas the classical enzyme essay only allows determination of the two Michaelis-Menten parameters V and K(m). Nevertheless, Michaelis-Menten kinetics are still commonly used, not only in the in vitro context of enzyme characterization but also as a rate law for enzymatic reactions in larger biochemical reaction networks. In this review, we give an overview of the historical development of kinetic rate laws originating from Michaelis-Menten kinetics over the past 100 years. Furthermore, we briefly summarize the experimental techniques used for the characterization of enzymes, and discuss web resources that systematically store kinetic parameters and related information. Finally, describe the novel opportunities that arise from using these data in dynamic mathematical modeling. In this framework, traditional in vitro approaches may be combined with modern genome-scale measurements to foster thorough understanding of the underlying complex mechanisms.
Collapse
Affiliation(s)
- Katja Tummler
- Theoretical Biophysics, Humboldt-Universität zu Berlin, Germany
| | | | | | | |
Collapse
|
181
|
Khan AM, Tan TW, Schönbach C, Ranganathan S. APBioNet-transforming bioinformatics in the Asia-Pacific region. PLoS Comput Biol 2013; 9:e1003317. [PMID: 24204244 PMCID: PMC3814852 DOI: 10.1371/journal.pcbi.1003317] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Affiliation(s)
- Asif M. Khan
- Perdana University Graduate School of Medicine, Selangor Darul Ehsan, Malaysia
- Department of Pharmacology and Molecular Sciences, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Tin Wee Tan
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- * E-mail: (TWT); (SR)
| | - Christian Schönbach
- School of Science and Technology, Department of Biology and Chemistry, Nazarbayev University, Astana, Republic of Kazakhstan
- Department of Bioscience and Bioinformatics and Biomedical Informatics R&D Center (BMIRC), Kyushu Institute of Technology, Fukuoka, Japan
| | - Shoba Ranganathan
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Department of Chemistry and Biomolecular Sciences and ARC Centre of Excellence in Bioinformatics, Macquarie University, Sydney, Australia
- * E-mail: (TWT); (SR)
| |
Collapse
|
182
|
Wittig U, Rey M, Kania R, Bittkowski M, Shi L, Golebiewski M, Weidemann A, Müller W, Rojas I. Challenges for an enzymatic reaction kinetics database. FEBS J 2013; 281:572-82. [PMID: 24165050 DOI: 10.1111/febs.12562] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Revised: 09/27/2013] [Accepted: 10/02/2013] [Indexed: 11/27/2022]
Abstract
The scientific literature contains a tremendous amount of kinetic data describing the dynamic behaviour of biochemical reactions over time. These data are needed for computational modelling to create models of biochemical reaction networks and to obtain a better understanding of the processes in living cells. To extract the knowledge from the literature, biocurators are required to understand a paper and interpret the data. For modellers, as well as experimentalists, this process is very time consuming because the information is distributed across the publication and, in most cases, is insufficiently structured and often described without standard terminology. In recent years, biological databases for different data types have been developed. The advantages of these databases lie in their unified structure, searchability and the potential for augmented analysis by software, which supports the modelling process. We have developed the SABIO-RK database for biochemical reaction kinetics. In the present review, we describe the challenges for database developers and curators, beginning with an analysis of relevant publications up to the export of database information in a standardized format. The aim of the present review is to draw the experimentalist's attention to the problem (from a data integration point of view) of incompletely and imprecisely written publications. We describe how to lower the barrier to curators and improve this situation. At the same time, we are aware that curating experimental data takes time. There is a community concerned with making the task of publishing data with the proper structure and annotation to ontologies much easier. In this respect, we highlight some useful initiatives and tools.
Collapse
Affiliation(s)
- Ulrike Wittig
- Scientific Databases and Visualization Group, Heidelberg Institute for Theoretical Studies (HITS), Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
183
|
Robert-Tissot C, Nguyen LT, Ohashi PS, Speiser DE. Mobilizing and evaluating anticancer T cells: pitfalls and solutions. Expert Rev Vaccines 2013; 12:1325-40. [PMID: 24127850 DOI: 10.1586/14760584.2013.843456] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Immunotherapy is a promising means to fight cancer, prompting a steady increase in clinical trials and correlative laboratory studies in this field. As antitumor T cells play central roles in immunity against malignant diseases, most immunotherapeutic protocols aim to induce and/or strengthen their function. Various treatment strategies have elicited encouraging clinical responses; however, major challenges have been uncovered that should be addressed in order to fully exploit the potential of immunotherapy. Here, we outline pitfalls for the mobilization of antitumor T cells and offer solutions to improve their therapeutic efficacy. We provide a critical perspective on the main methodologies used to characterize T-cell responses to cancer therapies, with a focus on discrepancies between T-cell attributes measured in vitro and protective responses in vivo. This review altogether provides recommendations to optimize the design of future clinical trials and highlights important considerations for the proficient analysis of clinical specimens available for research.
Collapse
Affiliation(s)
- Céline Robert-Tissot
- Campbell Family Institute for Breast Cancer Research, Princess Margaret Cancer Centre, University Health Network, Toronto, ON, M5G 2C1, Canada
| | | | | | | |
Collapse
|
184
|
Knüpfer C, Beckstein C. Function of dynamic models in systems biology: linking structure to behaviour. J Biomed Semantics 2013; 4:24. [PMID: 24103739 PMCID: PMC3853929 DOI: 10.1186/2041-1480-4-24] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2013] [Accepted: 05/23/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Dynamic models in Systems Biology are used in computational simulation experiments for addressing biological questions. The complexity of the modelled biological systems and the growing number and size of the models calls for computer support for modelling and simulation in Systems Biology. This computer support has to be based on formal representations of relevant knowledge fragments. RESULTS In this paper we describe different functional aspects of dynamic models. This description is conceptually embedded in our "meaning facets" framework which systematises the interpretation of dynamic models in structural, functional and behavioural facets. Here we focus on how function links the structure and the behaviour of a model. Models play a specific role (teleological function) in the scientific process of finding explanations for dynamic phenomena. In order to fulfil this role a model has to be used in simulation experiments (pragmatical function). A simulation experiment always refers to a specific situation and a state of the model and the modelled system (conditional function). We claim that the function of dynamic models refers to both the simulation experiment executed by software (intrinsic function) and the biological experiment which produces the phenomena under investigation (extrinsic function). We use the presented conceptual framework for the function of dynamic models to review formal accounts for functional aspects of models in Systems Biology, such as checklists, ontologies, and formal languages. Furthermore, we identify missing formal accounts for some of the functional aspects. In order to fill one of these gaps we propose an ontology for the teleological function of models. CONCLUSION We have thoroughly analysed the role and use of models in Systems Biology. The resulting conceptual framework for the function of models is an important first step towards a comprehensive formal representation of the functional knowledge involved in the modelling and simulation process. Any progress in this area will in turn improve computer-supported modelling and simulation in Systems Biology.
Collapse
|
185
|
Mikut R, Dickmeis T, Driever W, Geurts P, Hamprecht FA, Kausler BX, Ledesma-Carbayo MJ, Marée R, Mikula K, Pantazis P, Ronneberger O, Santos A, Stotzka R, Strähle U, Peyriéras N. Automated processing of zebrafish imaging data: a survey. Zebrafish 2013; 10:401-21. [PMID: 23758125 PMCID: PMC3760023 DOI: 10.1089/zeb.2013.0886] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Due to the relative transparency of its embryos and larvae, the zebrafish is an ideal model organism for bioimaging approaches in vertebrates. Novel microscope technologies allow the imaging of developmental processes in unprecedented detail, and they enable the use of complex image-based read-outs for high-throughput/high-content screening. Such applications can easily generate Terabytes of image data, the handling and analysis of which becomes a major bottleneck in extracting the targeted information. Here, we describe the current state of the art in computational image analysis in the zebrafish system. We discuss the challenges encountered when handling high-content image data, especially with regard to data quality, annotation, and storage. We survey methods for preprocessing image data for further analysis, and describe selected examples of automated image analysis, including the tracking of cells during embryogenesis, heartbeat detection, identification of dead embryos, recognition of tissues and anatomical landmarks, and quantification of behavioral patterns of adult fish. We review recent examples for applications using such methods, such as the comprehensive analysis of cell lineages during early development, the generation of a three-dimensional brain atlas of zebrafish larvae, and high-throughput drug screens based on movement patterns. Finally, we identify future challenges for the zebrafish image analysis community, notably those concerning the compatibility of algorithms and data formats for the assembly of modular analysis pipelines.
Collapse
Affiliation(s)
- Ralf Mikut
- Karlsruhe Institute of Technology, Karlsruhe, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
186
|
Wagner M, Vermeirssen ELM, Buchinger S, Behr M, Magdeburg A, Oehlmann J. Deriving bio-equivalents from in vitro bioassays: assessment of existing uncertainties and strategies to improve accuracy and reporting. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2013; 32:1906-17. [PMID: 23625661 DOI: 10.1002/etc.2256] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2012] [Revised: 03/18/2013] [Accepted: 04/15/2013] [Indexed: 05/12/2023]
Abstract
Bio-equivalents (e.g., 17β-estradiol or dioxin equivalents) are commonly employed to quantify the in vitro effects of complex human or environmental samples. However, there is no generally accepted data analysis strategy for estimating and reporting bio-equivalents. Therefore, the aims of the present study are to 1) identify common mathematical models for the derivation of bio-equivalents from the literature, 2) assess the ability of those models to correctly predict bio-equivalents, and 3) propose measures to reduce uncertainty in their calculation and reporting. We compiled a database of 234 publications that report bio-equivalents. From the database, we extracted 3 data analysis strategies commonly used to estimate bio-equivalents. These models are based on linear or nonlinear interpolation, and the comparison of effect concentrations (ECX ). To assess their accuracy, we employed simulated data sets in different scenarios. The results indicate that all models lead to a considerable misestimation of bio-equivalents if certain mathematical assumptions (e.g., goodness of fit, parallelism of dose-response curves) are violated. However, nonlinear interpolation is most suitable to predict bio-equivalents from single-point estimates. Regardless of the model, subsequent linear extrapolation of bio-equivalents generates additional inaccuracy if the prerequisite of parallel dose-response curves is not met. When all these factors are taken into consideration, it becomes clear that data analysis introduces considerable uncertainty in the derived bio-equivalents. To improve accuracy and transparency of bio-equivalents, we propose a novel data analysis strategy and a checklist for reporting Minimum Information about Bio-equivalent ESTimates (MIBEST).
Collapse
Affiliation(s)
- Martin Wagner
- Department of Aquatic Ecotoxicology, Faculty of Biological Sciences, Goethe University, Frankfurt am Main, Germany.
| | | | | | | | | | | |
Collapse
|
187
|
Milsted AJ, Hale JR, Frey JG, Neylon C. LabTrove: a lightweight, web based, laboratory "blog" as a route towards a marked up record of work in a bioscience research laboratory. PLoS One 2013; 8:e67460. [PMID: 23935832 PMCID: PMC3720848 DOI: 10.1371/journal.pone.0067460] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2012] [Accepted: 05/17/2013] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND The electronic laboratory notebook (ELN) has the potential to replace the paper notebook with a marked-up digital record that can be searched and shared. However, it is a challenge to achieve these benefits without losing the usability and flexibility of traditional paper notebooks. We investigate a blog-based platform that addresses the issues associated with the development of a flexible system for recording scientific research. METHODOLOGY/PRINCIPAL FINDINGS We chose a blog-based approach with the journal characteristics of traditional notebooks in mind, recognizing the potential for linking together procedures, materials, samples, observations, data, and analysis reports. We implemented the LabTrove blog system as a server process written in PHP, using a MySQL database to persist posts and other research objects. We incorporated a metadata framework that is both extensible and flexible while promoting consistency and structure where appropriate. Our experience thus far is that LabTrove is capable of providing a successful electronic laboratory recording system. CONCLUSIONS/SIGNIFICANCE LabTrove implements a one-item one-post system, which enables us to uniquely identify each element of the research record, such as data, samples, and protocols. This unique association between a post and a research element affords advantages for monitoring the use of materials and samples and for inspecting research processes. The combination of the one-item one-post system, consistent metadata, and full-text search provides us with a much more effective record than a paper notebook. The LabTrove approach provides a route towards reconciling the tensions and challenges that lie ahead in working towards the long-term goals for ELNs. LabTrove, an electronic laboratory notebook (ELN) system from the Smart Research Framework, based on a blog-type framework with full access control, facilitates the scientific experimental recording requirements for reproducibility, reuse, repurposing, and redeployment.
Collapse
Affiliation(s)
- Andrew J. Milsted
- Department of Chemistry, University of Southampton, Southampton, United Kingdom
| | - Jennifer R. Hale
- Department of Chemistry, University of Southampton, Southampton, United Kingdom
| | - Jeremy G. Frey
- Department of Chemistry, University of Southampton, Southampton, United Kingdom
- * E-mail:
| | - Cameron Neylon
- ISIS Neutron Facility, Science and Technology Facilities Council Rutherford Appleton Laboratory, Harwell Science and Innovation Campus, Didcot, United Kingdom
| |
Collapse
|
188
|
Janetzki S, Hoos A, Melief CJM, Odunsi K, Romero P, Britten CM. Structured reporting of T cell assay results. CANCER IMMUNITY 2013; 13:13. [PMID: 23882158 PMCID: PMC3718734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
|
189
|
Turbelin C, Boëlle PY. Open data in public health surveillance systems: a case study using the French Sentinelles network. Int J Med Inform 2013; 82:1012-21. [PMID: 23850384 DOI: 10.1016/j.ijmedinf.2013.06.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Revised: 12/04/2012] [Accepted: 06/14/2013] [Indexed: 12/22/2022]
Abstract
INTRODUCTION Public Health Surveillance (PHS) produces an increasing number of health indicators. Exposing these data is at the core of interoperability; however no standard has yet been adopted for such information on the internet. METHOD Here, we compared two approaches to expose data from the French Sentinelles network, an information system focusing on communicable diseases surveillance in the general population. We implemented SDMX-HD (Statistical Data and Metadata Exchange-Health Domain), a standard supported by government agencies to exchange statistical data and OpenData (OData), a general purpose protocol proposed by Microsoft Corp. The same data were described using SDMX-HD (available at http://sdmx.sentiweb.fr) and using OData (http://odata.sentiweb.fr). DISCUSSION These two use cases proved the feasibility of opening public health data on the internet, and highlighted difficulties: SDMX, a full-featured solution, encouraged harmonization and reusability, sustainability, but required complex developments and tools; OData was much simpler to implement but required a "from scratch" description and did not encourage reusability. From an end-user perspective, integration in every-day tools is not achieved yet. These two approaches are a first step to interoperability in PHS.
Collapse
Affiliation(s)
- Clément Turbelin
- UPMC Univ Paris 06, France; UMR S 707, Inserm, Paris, F-75012, France.
| | - Pierre-Yves Boëlle
- UPMC Univ Paris 06, France; UMR S 707, Inserm, Paris, F-75012, France; AP-HP, Hôpital Saint Antoine, Paris, France
| |
Collapse
|
190
|
Ison J, Kalas M, Jonassen I, Bolser D, Uludag M, McWilliam H, Malone J, Lopez R, Pettifer S, Rice P. EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats. Bioinformatics 2013; 29:1325-32. [PMID: 23479348 PMCID: PMC3654706 DOI: 10.1093/bioinformatics/btt113] [Citation(s) in RCA: 126] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2012] [Revised: 02/28/2013] [Accepted: 03/01/2013] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Advancing the search, publication and integration of bioinformatics tools and resources demands consistent machine-understandable descriptions. A comprehensive ontology allowing such descriptions is therefore required. RESULTS EDAM is an ontology of bioinformatics operations (tool or workflow functions), types of data and identifiers, application domains and data formats. EDAM supports semantic annotation of diverse entities such as Web services, databases, programmatic libraries, standalone tools, interactive applications, data schemas, datasets and publications within bioinformatics. EDAM applies to organizing and finding suitable tools and data and to automating their integration into complex applications or workflows. It includes over 2200 defined concepts and has successfully been used for annotations and implementations. AVAILABILITY The latest stable version of EDAM is available in OWL format from http://edamontology.org/EDAM.owl and in OBO format from http://edamontology.org/EDAM.obo. It can be viewed online at the NCBO BioPortal and the EBI Ontology Lookup Service. For documentation and license please refer to http://edamontology.org. This article describes version 1.2 available at http://edamontology.org/EDAM_1.2.owl. CONTACT jison@ebi.ac.uk.
Collapse
Affiliation(s)
- Jon Ison
- EMBL European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
191
|
Harrow I, Filsell W, Woollard P, Dix I, Braxenthaler M, Gedye R, Hoole D, Kidd R, Wilson J, Rebholz-Schuhmann D. Towards Virtual Knowledge Broker services for semantic integration of life science literature and data sources. Drug Discov Today 2013; 18:428-34. [DOI: 10.1016/j.drudis.2012.11.012] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Revised: 11/09/2012] [Accepted: 11/22/2012] [Indexed: 10/27/2022]
|
192
|
Eckels J, Nathe C, Nelson EK, Shoemaker SG, Nostrand EV, Yates NL, Ashley VC, Harris LJ, Bollenbeck M, Fong Y, Tomaras GD, Piehler B. Quality control, analysis and secure sharing of Luminex® immunoassay data using the open source LabKey Server platform. BMC Bioinformatics 2013; 14:145. [PMID: 23631706 PMCID: PMC3671158 DOI: 10.1186/1471-2105-14-145] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2012] [Accepted: 03/27/2013] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Immunoassays that employ multiplexed bead arrays produce high information content per sample. Such assays are now frequently used to evaluate humoral responses in clinical trials. Integrated software is needed for the analysis, quality control, and secure sharing of the high volume of data produced by such multiplexed assays. Software that facilitates data exchange and provides flexibility to perform customized analyses (including multiple curve fits and visualizations of assay performance over time) could increase scientists' capacity to use these immunoassays to evaluate human clinical trials. RESULTS The HIV Vaccine Trials Network and the Statistical Center for HIV/AIDS Research and Prevention collaborated with LabKey Software to enhance the open source LabKey Server platform to facilitate workflows for multiplexed bead assays. This system now supports the management, analysis, quality control, and secure sharing of data from multiplexed immunoassays that leverage Luminex xMAP® technology. These assays may be custom or kit-based. Newly added features enable labs to: (i) import run data from spreadsheets output by Bio-Plex Manager™ software; (ii) customize data processing, curve fits, and algorithms through scripts written in common languages, such as R; (iii) select script-defined calculation options through a graphical user interface; (iv) collect custom metadata for each titration, analyte, run and batch of runs; (v) calculate dose-response curves for titrations; (vi) interpolate unknown concentrations from curves for titrated standards; (vii) flag run data for exclusion from analysis; (viii) track quality control metrics across runs using Levey-Jennings plots; and (ix) automatically flag outliers based on expected values. Existing system features allow researchers to analyze, integrate, visualize, export and securely share their data, as well as to construct custom user interfaces and workflows. CONCLUSIONS Unlike other tools tailored for Luminex immunoassays, LabKey Server allows labs to customize their Luminex analyses using scripting while still presenting users with a single, graphical interface for processing and analyzing data. The LabKey Server system also stands out among Luminex tools for enabling smooth, secure transfer of data, quality control information, and analyses between collaborators. LabKey Server and its Luminex features are freely available as open source software at http://www.labkey.com under the Apache 2.0 license.
Collapse
Affiliation(s)
| | | | | | - Sara G Shoemaker
- Statistical Center for HIV/AIDS Research & Prevention (SCHARP), Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | | | - Nicole L Yates
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, USA
- Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - Vicki C Ashley
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, USA
- Department of Surgery, Duke University Medical Center, Durham, NC, USA
| | - Linda J Harris
- Statistical Center for HIV/AIDS Research & Prevention (SCHARP), Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Mark Bollenbeck
- Statistical Center for HIV/AIDS Research & Prevention (SCHARP), Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Youyi Fong
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Georgia D Tomaras
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, USA
- Department of Medicine, Duke University Medical Center, Durham, NC, USA
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, USA
- Department of Immunology, Duke University Medical Center, Durham, NC, USA
| | | |
Collapse
|
193
|
Juty N, Le Novère N, Hermjakob H, Laibe C. Towards the collaborative curation of the registry underlying Identifiers.org. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2013; 2013:bat017. [PMID: 23584831 PMCID: PMC3625955 DOI: 10.1093/database/bat017] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The MIRIAM Registry (http://www.ebi.ac.uk/miriam/) records information about collections of data in the life sciences, as well as where it can be obtained. This information is used, in combination with the resolving infrastructure of Identifiers.org (http://identifiers.org/), to generate globally unique identifiers, in the form of Uniform Resource Identifier. These identifiers are now widely used to provide perennial cross-references and annotations. The growing demand for these identifiers results in a significant increase in curational efforts to maintain the underlying registry. This requires the design and implementation of an economically viable and sustainable solution able to cope with such expansion. We briefly describe the Registry, the current curation duties entailed, and our plans to extend and distribute this workload through collaborative and community efforts.
Collapse
Affiliation(s)
- Nick Juty
- Proteomics Services, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | | | | |
Collapse
|
194
|
Kolarich D, Rapp E, Struwe WB, Haslam SM, Zaia J, McBride R, Agravat S, Campbell MP, Kato M, Ranzinger R, Kettner C, York WS. The minimum information required for a glycomics experiment (MIRAGE) project: improving the standards for reporting mass-spectrometry-based glycoanalytic data. Mol Cell Proteomics 2013; 12:991-5. [PMID: 23378518 DOI: 10.1074/mcp.o112.026492] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The MIRAGE guidelines are being developed in response to a critical need in the glycobiology community to clarify glycoanalytic results so that they are more readily evaluated (in terms of their scope and depth) and to facilitate the reproduction of important results in the laboratory. The molecular and biological complexity of the glycosylation process makes thorough reporting of the results of a glycomics experiment a highly challenging endeavor. The resulting data specify the identity and quantity of complex structures, the precise molecular features of which are sometimes inferred using prior knowledge, such as familiarity with a particular biosynthetic mechanism. Specifying the exact methods and assumptions that were used to assign and quantify reported structures allows the interested scientist to appreciate the scope and depth of the analysis. Mass spectrometry (MS) is the most widely used tool for glycomics experiments. The interpretation and reproducibility of MS-based glycomics data depend on comprehensive meta-data describing the instrumentation, instrument setup, and data acquisition protocols. The MIRAGE guidelines for MS-based glycomics have been designed to facilitate the collection and sharing of this critical information in order to assist the glycoanalyst in generating data sets with maximum information content and biological relevance.
Collapse
Affiliation(s)
- Daniel Kolarich
- Department of Biomolecular Systems, Max Planck Institute of Colloids and Interfaces, 14424 Potsdam, Germany
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
195
|
Kohonen P, Benfenati E, Bower D, Ceder R, Crump M, Cross K, Grafström RC, Healy L, Helma C, Jeliazkova N, Jeliazkov V, Maggioni S, Miller S, Myatt G, Rautenberg M, Stacey G, Willighagen E, Wiseman J, Hardy B. The ToxBank Data Warehouse: Supporting the Replacement of In Vivo Repeated Dose Systemic Toxicity Testing. Mol Inform 2013; 32:47-63. [PMID: 27481023 DOI: 10.1002/minf.201200114] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2012] [Accepted: 11/27/2012] [Indexed: 12/12/2022]
Abstract
The aim of the SEURAT-1 (Safety Evaluation Ultimately Replacing Animal Testing-1) research cluster, comprised of seven EU FP7 Health projects co-financed by Cosmetics Europe, is to generate a proof-of-concept to show how the latest technologies, systems toxicology and toxicogenomics can be combined to deliver a test replacement for repeated dose systemic toxicity testing on animals. The SEURAT-1 strategy is to adopt a mode-of-action framework to describe repeated dose toxicity, combining in vitro and in silico methods to derive predictions of in vivo toxicity responses. ToxBank is the cross-cluster infrastructure project whose activities include the development of a data warehouse to provide a web-accessible shared repository of research data and protocols, a physical compounds repository, reference or "gold compounds" for use across the cluster (available via wiki.toxbank.net), and a reference resource for biomaterials. Core technologies used in the data warehouse include the ISA-Tab universal data exchange format, REpresentational State Transfer (REST) web services, the W3C Resource Description Framework (RDF) and the OpenTox standards. We describe the design of the data warehouse based on cluster requirements, the implementation based on open standards, and finally the underlying concepts and initial results of a data analysis utilizing public data related to the gold compounds.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Lyn Healy
- National Institute for Biological Standards and Control, Potters Bar, UK
| | | | | | | | - Silvia Maggioni
- Istituto di Ricerche Farmacologiche Mario Negri, Milan, Italy
| | | | | | | | - Glyn Stacey
- National Institute for Biological Standards and Control, Potters Bar, UK
| | | | | | | |
Collapse
|
196
|
Nelson EK, Piehler B, Rauch A, Ramsay S, Holman D, Asare S, Asare A, Igra M. Ancillary study management systems: a review of needs. BMC Med Inform Decis Mak 2013; 13:5. [PMID: 23294514 PMCID: PMC3564696 DOI: 10.1186/1472-6947-13-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2012] [Accepted: 12/28/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The valuable clinical data, specimens, and assay results collected during a primary clinical trial or observational study can enable researchers to answer additional, pressing questions with relatively small investments in new measurements. However, management of such follow-on, "ancillary" studies is complex. It requires coordinating across institutions, sites, repositories, and approval boards, as well as distributing, integrating, and analyzing diverse data types. General-purpose software systems that simplify the management of ancillary studies have not yet been explored in the research literature. METHODS We have identified requirements for ancillary study management primarily as part of our ongoing work with a number of large research consortia. These organizations include the Center for HIV/AIDS Vaccine Immunology (CHAVI), the Immune Tolerance Network (ITN), the HIV Vaccine Trials Network (HVTN), the U.S. Military HIV Research Program (MHRP), and the Network for Pancreatic Organ Donors with Diabetes (nPOD). We also consulted with researchers at a range of other disease research organizations regarding their workflows and data management strategies. Lastly, to enhance breadth, we reviewed process documents for ancillary study management from other organizations. RESULTS By exploring characteristics of ancillary studies, we identify differentiating requirements and scenarios for ancillary study management systems (ASMSs). Distinguishing characteristics of ancillary studies may include the collection of additional measurements (particularly new analyses of existing specimens); the initiation of studies by investigators unaffiliated with the original study; cross-protocol data pooling and analysis; pre-existing participant consent; and pre-existing data context and provenance. For an ASMS to address these characteristics, it would need to address both operational requirements (e.g., allocating existing specimens) and data management requirements (e.g., securely distributing and integrating primary and ancillary data). CONCLUSIONS The scenarios and requirements we describe can help guide the development of systems that make conducting ancillary studies easier, less expensive, and less error-prone. Given the relatively consistent characteristics and challenges of ancillary study management, general-purpose ASMSs are likely to be useful to a wide range of organizations. Using the requirements identified in this paper, we are currently developing an open-source, general-purpose ASMS based on LabKey Server (http://www.labkey.org) in collaboration with CHAVI, the ITN and nPOD.
Collapse
Affiliation(s)
| | | | | | - Sarah Ramsay
- Statistical Center for HIV/AIDS Research & Prevention (SCHARP), Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Drienna Holman
- Statistical Center for HIV/AIDS Research & Prevention (SCHARP), Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Smita Asare
- University of California, San Francisco, CA, USA
- Immune Tolerance Network, Bethesda, MD, USA
| | - Adam Asare
- University of California, San Francisco, CA, USA
- Immune Tolerance Network, Bethesda, MD, USA
| | | |
Collapse
|
197
|
Abstract
The aim of this chapter is to provide sufficient information to enable a reader, new to the subject of Systems Biology, to create and use effectively controlled annotations, using resolvable Identifiers.org Uniform Resource Identifiers (URIs). The text details the underlying requirements that have led to the development of such an identification scheme and infrastructure, the principles that underpin its syntax and the benefits derived through its use. It also places into context the relationship with other standardization efforts, how it differs from other pre-existing identification schemes, recent improvements to the system, as well as those that are planned in the future. Throughout, the reader is provided with explicit examples of use and directed to supplementary information where necessary.
Collapse
Affiliation(s)
- Nick Juty
- The EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus Hinxton, Cambridge, UK
| | | | | |
Collapse
|
198
|
Improving biological relevancy of transcriptional biomarkers experiments by applying the MIQE guidelines to pre-clinical and clinical trials. Methods 2013; 59:147-53. [DOI: 10.1016/j.ymeth.2012.07.024] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2012] [Revised: 07/18/2012] [Accepted: 07/20/2012] [Indexed: 11/22/2022] Open
|
199
|
Abstract
Nucleic acids are the ultimate biomarker and real-time PCR (qPCR) is firmly established as the method of choice for nucleic acid detection. Together, they allow the accurate, sensitive and specific identification of pathogens, and the use of qPCR has become routine in diagnostic laboratories. The reliability of qPCR-based assays relies on a combination of optimal sample selection, assay design and validation as well as appropriate data analysis and the "Minimal Information for the Publication of real-time PCR" (MIQE) guidelines aim to improve both the reliability of assay design as well as the transparency of reporting, essential conditions if qPCR is to remain the benchmark technology for molecular diagnosis.
Collapse
Affiliation(s)
- Gemma Johnson
- Blizard Institute of Cell and Molecular Science, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK
| | | | | |
Collapse
|
200
|
Henderson VC, Kimmelman J, Fergusson D, Grimshaw JM, Hackam DG. Threats to validity in the design and conduct of preclinical efficacy studies: a systematic review of guidelines for in vivo animal experiments. PLoS Med 2013; 10:e1001489. [PMID: 23935460 PMCID: PMC3720257 DOI: 10.1371/journal.pmed.1001489] [Citation(s) in RCA: 193] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/11/2013] [Accepted: 06/13/2013] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND The vast majority of medical interventions introduced into clinical development prove unsafe or ineffective. One prominent explanation for the dismal success rate is flawed preclinical research. We conducted a systematic review of preclinical research guidelines and organized recommendations according to the type of validity threat (internal, construct, or external) or programmatic research activity they primarily address. METHODS AND FINDINGS We searched MEDLINE, Google Scholar, Google, and the EQUATOR Network website for all preclinical guideline documents published up to April 9, 2013 that addressed the design and conduct of in vivo animal experiments aimed at supporting clinical translation. To be eligible, documents had to provide guidance on the design or execution of preclinical animal experiments and represent the aggregated consensus of four or more investigators. Data from included guidelines were independently extracted by two individuals for discrete recommendations on the design and implementation of preclinical efficacy studies. These recommendations were then organized according to the type of validity threat they addressed. A total of 2,029 citations were identified through our search strategy. From these, we identified 26 guidelines that met our eligibility criteria--most of which were directed at neurological or cerebrovascular drug development. Together, these guidelines offered 55 different recommendations. Some of the most common recommendations included performance of a power calculation to determine sample size, randomized treatment allocation, and characterization of disease phenotype in the animal model prior to experimentation. CONCLUSIONS By identifying the most recurrent recommendations among preclinical guidelines, we provide a starting point for developing preclinical guidelines in other disease domains. We also provide a basis for the study and evaluation of preclinical research practice. Please see later in the article for the Editors' Summary.
Collapse
Affiliation(s)
- Valerie C. Henderson
- Studies of Translation, Ethics and Medicine (STREAM) Group, Biomedical Ethics Unit, Department of Social Studies of Medicine, McGill University, Montréal, Québec, Canada
| | - Jonathan Kimmelman
- Studies of Translation, Ethics and Medicine (STREAM) Group, Biomedical Ethics Unit, Department of Social Studies of Medicine, McGill University, Montréal, Québec, Canada
- * E-mail:
| | - Dean Fergusson
- Ottawa Hospital Research Institute, The Ottawa Hospital, Ottawa, Ontario, Canada
- Department of Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Jeremy M. Grimshaw
- Ottawa Hospital Research Institute, The Ottawa Hospital, Ottawa, Ontario, Canada
- Department of Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Dan G. Hackam
- Division of Clinical Pharmacology, Department of Medicine, University of Western Ontario, London, Ontario, Canada
| |
Collapse
|