1
|
McDavid A, Corbett AM, Dutra JL, Straw AG, Topham DJ, Pryhuber GS, Caserta MT, Gill SR, Scheible KM, Holden-Wiltse J. Eight practices for data management to enable team data science. J Clin Transl Sci 2020; 5:e14. [PMID: 33948240 PMCID: PMC8057476 DOI: 10.1017/cts.2020.501] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 06/16/2020] [Accepted: 06/17/2020] [Indexed: 12/16/2022] Open
Abstract
INTRODUCTION In clinical and translational research, data science is often and fortuitously integrated with data collection. This contrasts to the typical position of data scientists in other settings, where they are isolated from data collectors. Because of this, effective use of data science techniques to resolve translational questions requires innovation in the organization and management of these data. METHODS We propose an operational framework that respects this important difference in how research teams are organized. To maximize the accuracy and speed of the clinical and translational data science enterprise under this framework, we define a set of eight best practices for data management. RESULTS In our own work at the University of Rochester, we have strived to utilize these practices in a customized version of the open source LabKey platform for integrated data management and collaboration. We have applied this platform to cohorts that longitudinally track multidomain data from over 3000 subjects. CONCLUSIONS We argue that this has made analytical datasets more readily available and lowered the bar to interdisciplinary collaboration, enabling a team-based data science that is unique to the clinical and translational setting.
Collapse
Affiliation(s)
- Andrew McDavid
- Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, USA
| | - Anthony M. Corbett
- Clinical and Translational Science Institute, University of Rochester, Rochester, NY, USA
| | - Jennifer L. Dutra
- Clinical and Translational Science Institute, University of Rochester, Rochester, NY, USA
| | - Andrew G. Straw
- Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, USA
| | - David J. Topham
- Department of Microbiology and Immunology, University of Rochester, Rochester, NY, USA
| | | | - Mary T. Caserta
- Department of Pediatrics, University of Rochester, Rochester, NY, USA
| | - Steven R. Gill
- Department of Microbiology and Immunology, University of Rochester, Rochester, NY, USA
| | | | - Jeanne Holden-Wiltse
- Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, USA
- Clinical and Translational Science Institute, University of Rochester, Rochester, NY, USA
| |
Collapse
|
2
|
Eckels J, Nathe C, Nelson EK, Shoemaker SG, Nostrand EV, Yates NL, Ashley VC, Harris LJ, Bollenbeck M, Fong Y, Tomaras GD, Piehler B. Quality control, analysis and secure sharing of Luminex® immunoassay data using the open source LabKey Server platform. BMC Bioinformatics 2013; 14:145. [PMID: 23631706 PMCID: PMC3671158 DOI: 10.1186/1471-2105-14-145] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2012] [Accepted: 03/27/2013] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Immunoassays that employ multiplexed bead arrays produce high information content per sample. Such assays are now frequently used to evaluate humoral responses in clinical trials. Integrated software is needed for the analysis, quality control, and secure sharing of the high volume of data produced by such multiplexed assays. Software that facilitates data exchange and provides flexibility to perform customized analyses (including multiple curve fits and visualizations of assay performance over time) could increase scientists' capacity to use these immunoassays to evaluate human clinical trials. RESULTS The HIV Vaccine Trials Network and the Statistical Center for HIV/AIDS Research and Prevention collaborated with LabKey Software to enhance the open source LabKey Server platform to facilitate workflows for multiplexed bead assays. This system now supports the management, analysis, quality control, and secure sharing of data from multiplexed immunoassays that leverage Luminex xMAP® technology. These assays may be custom or kit-based. Newly added features enable labs to: (i) import run data from spreadsheets output by Bio-Plex Manager™ software; (ii) customize data processing, curve fits, and algorithms through scripts written in common languages, such as R; (iii) select script-defined calculation options through a graphical user interface; (iv) collect custom metadata for each titration, analyte, run and batch of runs; (v) calculate dose-response curves for titrations; (vi) interpolate unknown concentrations from curves for titrated standards; (vii) flag run data for exclusion from analysis; (viii) track quality control metrics across runs using Levey-Jennings plots; and (ix) automatically flag outliers based on expected values. Existing system features allow researchers to analyze, integrate, visualize, export and securely share their data, as well as to construct custom user interfaces and workflows. CONCLUSIONS Unlike other tools tailored for Luminex immunoassays, LabKey Server allows labs to customize their Luminex analyses using scripting while still presenting users with a single, graphical interface for processing and analyzing data. The LabKey Server system also stands out among Luminex tools for enabling smooth, secure transfer of data, quality control information, and analyses between collaborators. LabKey Server and its Luminex features are freely available as open source software at http://www.labkey.com under the Apache 2.0 license.
Collapse
Affiliation(s)
| | | | | | - Sara G Shoemaker
- Statistical Center for HIV/AIDS Research & Prevention (SCHARP), Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | | | - Nicole L Yates
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, USA
- Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - Vicki C Ashley
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, USA
- Department of Surgery, Duke University Medical Center, Durham, NC, USA
| | - Linda J Harris
- Statistical Center for HIV/AIDS Research & Prevention (SCHARP), Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Mark Bollenbeck
- Statistical Center for HIV/AIDS Research & Prevention (SCHARP), Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Youyi Fong
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Georgia D Tomaras
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, USA
- Department of Medicine, Duke University Medical Center, Durham, NC, USA
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, USA
- Department of Immunology, Duke University Medical Center, Durham, NC, USA
| | | |
Collapse
|