1
|
Pine PS, Lund SP, Parsons JR, Vang LK, Mahabal AA, Cinquini L, Kelly SC, Kincaid H, Crichton DJ, Spira A, Liu G, Gower AC, Pass HI, Goparaju C, Dubinett SM, Krysan K, Stass SA, Kukuruga D, Van Keuren-Jensen K, Courtright-Lim A, Thompson KL, Rosenzweig BA, Sorbara L, Srivastava S, Salit ML. Summarizing performance for genome scale measurement of miRNA: reference samples and metrics. BMC Genomics 2018; 19:180. [PMID: 29510677 PMCID: PMC5838960 DOI: 10.1186/s12864-018-4496-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 01/25/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The potential utility of microRNA as biomarkers for early detection of cancer and other diseases is being investigated with genome-scale profiling of differentially expressed microRNA. Processes for measurement assurance are critical components of genome-scale measurements. Here, we evaluated the utility of a set of total RNA samples, designed with between-sample differences in the relative abundance of miRNAs, as process controls. RESULTS Three pure total human RNA samples (brain, liver, and placenta) and two different mixtures of these components were evaluated as measurement assurance control samples on multiple measurement systems at multiple sites and over multiple rounds. In silico modeling of mixtures provided benchmark values for comparison with physical mixtures. Biomarker development laboratories using next-generation sequencing (NGS) or genome-scale hybridization assays participated in the study and returned data from the samples using their routine workflows. Multiplexed and single assay reverse-transcription PCR (RT-PCR) was used to confirm in silico predicted sample differences. Data visualizations and summary metrics for genome-scale miRNA profiling assessment were developed using this dataset, and a range of performance was observed. These metrics have been incorporated into an online data analysis pipeline and provide a convenient dashboard view of results from experiments following the described design. The website also serves as a repository for the accumulation of performance values providing new participants in the project an opportunity to learn what may be achievable with similar measurement processes. CONCLUSIONS The set of reference samples used in this study provides benchmark values suitable for assessing genome-scale miRNA profiling processes. Incorporation of these metrics into an online resource allows laboratories to periodically evaluate their performance and assess any changes introduced into their measurement process.
Collapse
Affiliation(s)
- P Scott Pine
- Joint Initiative for Metrology in Biology, National Institute of Standards and Technology, 443 Via Ortega, Stanford, CA, 94305, USA.
| | - Steven P Lund
- Statistical Engineering Division, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Jerod R Parsons
- Joint Initiative for Metrology in Biology, National Institute of Standards and Technology, 443 Via Ortega, Stanford, CA, 94305, USA
| | - Lindsay K Vang
- Joint Initiative for Metrology in Biology, National Institute of Standards and Technology, 443 Via Ortega, Stanford, CA, 94305, USA
| | - Ashish A Mahabal
- Center for Data Driven Discovery, California Institute of Technology, Pasadena, CA, USA
| | - Luca Cinquini
- Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA
| | - Sean C Kelly
- Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA
| | - Heather Kincaid
- Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA
| | - Daniel J Crichton
- Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA
| | - Avrum Spira
- Section of Computational Biomedicine, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Gang Liu
- Section of Computational Biomedicine, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Adam C Gower
- Section of Computational Biomedicine, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Harvey I Pass
- Department of Cardiothoracic Surgery, NYU Langone Medical Center, New York, NY, USA
| | - Chandra Goparaju
- Department of Cardiothoracic Surgery, NYU Langone Medical Center, New York, NY, USA
| | - Steven M Dubinett
- Department of Medicine, David Geffen School of Medicine at UCLA, Los Angeles, CA, USA.,Veterans Affairs Greater Los Angeles Healthcare System, Los Angeles, CA, USA
| | - Kostyantyn Krysan
- Department of Medicine, David Geffen School of Medicine at UCLA, Los Angeles, CA, USA
| | - Sanford A Stass
- Department of Pathology, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Debra Kukuruga
- Department of Pathology, University of Maryland School of Medicine, Baltimore, MD, USA
| | | | | | - Karol L Thompson
- Center for Drug Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - Barry A Rosenzweig
- Center for Drug Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - Lynn Sorbara
- Division of Cancer Prevention, National Cancer Institute, Rockville, MD, USA
| | - Sudhir Srivastava
- Division of Cancer Prevention, National Cancer Institute, Rockville, MD, USA
| | - Marc L Salit
- Joint Initiative for Metrology in Biology, National Institute of Standards and Technology, 443 Via Ortega, Stanford, CA, 94305, USA
| |
Collapse
|
2
|
Pine PS, Rosenzweig BA, Thompson KL. An adaptable method using human mixed tissue ratiometric controls for benchmarking performance on gene expression microarrays in clinical laboratories. BMC Biotechnol 2011; 11:38. [PMID: 21486464 PMCID: PMC3103427 DOI: 10.1186/1472-6750-11-38] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2011] [Accepted: 04/12/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Molecular biomarkers that are based on mRNA transcripts are being developed for the diagnosis and treatment of a number of diseases. DNA microarrays are one of the primary technologies being used to develop classifiers from gene expression data for clinically relevant outcomes. Microarray assays are highly multiplexed measures of comparative gene expression but have a limited dynamic range of measurement and show compression in fold change detection. To increase the clinical utility of microarrays, assay controls are needed that benchmark performance using metrics that are relevant to the analysis of genomic data generated with biological samples. RESULTS Ratiometric controls were prepared from commercial sources of high quality RNA from human tissues with distinctly different expression profiles and mixed in defined ratios. The samples were processed using six different target labeling protocols and replicate datasets were generated on high density gene expression microarrays. The area under the curve from receiver operating characteristic plots was calculated to measure diagnostic performance. The reliable region of the dynamic range was derived from log(2) ratio deviation plots made for each dataset. Small but statistically significant differences in diagnostic performance were observed between standardized assays available from the array manufacturer and alternative methods for target generation. Assay performance using the reliable range of comparative measurement as a metric was improved by adjusting sample hybridization conditions for one commercial kit. CONCLUSIONS Process improvement in microarray assay performance was demonstrated using samples prepared from commercially available materials and two metrics - diagnostic performance and the reliable range of measurement. These methods have advantages over approaches that use a limited set of external controls or correlations to reference sets, because they provide benchmark values that can be used by clinical laboratories to help optimize protocol conditions and laboratory proficiency with microarray assays.
Collapse
Affiliation(s)
- P Scott Pine
- Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD 20993 USA
| | | | | |
Collapse
|
3
|
Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles. BMC Genomics 2010; 11:134. [PMID: 20181233 PMCID: PMC2843619 DOI: 10.1186/1471-2164-11-134] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2009] [Accepted: 02/24/2010] [Indexed: 11/25/2022] Open
Abstract
Background Microarray technology is a popular means of producing whole genome transcriptional profiles, however high cost and scarcity of mRNA has led many studies to be conducted based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays on each chip, to evaluate intra-experiment technical variation using repeated hybridisations of universal human reference RNA (UHRR) and duplicate hybridisations of primary breast tumour samples from a clinical study. Results A clear batch-specific bias was detected in the measured expressions of both the UHRR and clinical samples. This bias was found to persist following standard microarray normalisation techniques. However, when mean-centering or empirical Bayes batch-correction methods (ComBat) were applied to the data, inter-batch variation in the UHRR and clinical samples were greatly reduced. Correlation between replicate UHRR samples improved by two orders of magnitude following batch-correction using ComBat (ranging from 0.9833-0.9991 to 0.9997-0.9999) and increased the consistency of the gene-lists from the duplicate clinical samples, from 11.6% in quantile normalised data to 66.4% in batch-corrected data. The use of UHRR as an inter-batch calibrator provided a small additional benefit when used in conjunction with ComBat, further increasing the agreement between the two gene-lists, up to 74.1%. Conclusion In the interests of practicalities and cost, these results suggest that single samples can generate reliable data, but only after careful compensation for technical bias in the experiment. We recommend that investigators appreciate the propensity for such variation in the design stages of a microarray experiment and that the use of suitable correction methods become routine during the statistical analysis of the data.
Collapse
|