1
|
Mizrahi-Man O, Woehrmann MH, Webster TA, Gollub J, Bivol A, Keeble SM, Aull KH, Mittal A, Roter AH, Wong BA, Schmidt JP. Novel genotyping algorithms for rare variants significantly improve the accuracy of Applied Biosystems™ Axiom™ array genotyping calls: Retrospective evaluation of UK Biobank array data. PLoS One 2022; 17:e0277680. [PMID: 36395175 PMCID: PMC9671364 DOI: 10.1371/journal.pone.0277680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 11/01/2022] [Indexed: 11/18/2022] Open
Abstract
The UK Biobank genotyped about 500k participants using Applied Biosystems Axiom microarrays. Participants were subsequently sequenced by the UK Biobank Exome Sequencing Consortium. Axiom genotyping was highly accurate in comparison to sequencing results, for almost 100,000 variants both directly genotyped on the UK Biobank Axiom array and via whole exome sequencing. However, in a study using the exome sequencing results of the first 50k individuals as reference (truth), it was observed that the positive predictive value (PPV) decreased along with the number of heterozygous array calls per variant. We developed a novel addition to the genotyping algorithm, Rare Heterozygous Adjusted (RHA), to significantly improve PPV in variants with minor allele frequency below 0.01%. The improvement in PPV was roughly equal when comparing to the exome sequencing of 50k individuals, or to the more recent ~200k individuals. Sensitivity was higher in the 200k data. The improved calling algorithm, along with enhanced quality control of array probesets, significantly improved the positive predictive value and the sensitivity of array data, making it suitable for the detection of ultra-rare variants.
Collapse
Affiliation(s)
- Orna Mizrahi-Man
- Thermo Fisher Scientific, Santa Clara, CA, United States of America
| | | | | | - Jeremy Gollub
- Thermo Fisher Scientific, Santa Clara, CA, United States of America
| | - Adrian Bivol
- Thermo Fisher Scientific, Santa Clara, CA, United States of America
| | - Sara M. Keeble
- Thermo Fisher Scientific, Santa Clara, CA, United States of America
| | | | - Anuradha Mittal
- Thermo Fisher Scientific, Santa Clara, CA, United States of America
| | - Alan H. Roter
- Thermo Fisher Scientific, Santa Clara, CA, United States of America
| | - Brant A. Wong
- Thermo Fisher Scientific, Santa Clara, CA, United States of America
| | - Jeanette P. Schmidt
- Thermo Fisher Scientific, Santa Clara, CA, United States of America
- * E-mail:
| |
Collapse
|
2
|
Schmit SL, Gollub J, Shapero MH, Huang SC, Rennert HS, Finn A, Rennert G, Gruber SB. MicroRNA polymorphisms and risk of colorectal cancer. Cancer Epidemiol Biomarkers Prev 2014; 24:65-72. [PMID: 25342389 DOI: 10.1158/1055-9965.epi-14-0219] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND miRNAs act as post-transcriptional regulators of gene expression. Genetic variation in miRNA-encoding sequences or their corresponding binding sites may affect the fidelity of the miRNA-mRNA interaction and subsequently alter the risk of cancer development. METHODS This study expanded the search for miRNA-related polymorphisms contributing to the etiology of colorectal cancer across the genome using a novel platform, the Axiom miRNA Target Site Genotyping Array (237,858 markers). After quality control, the study included 596 cases and 429 controls from the Molecular Epidemiology of Colorectal Cancer study, a population-based case-control study of colorectal cancer in northern Israel. The association between each marker and colorectal cancer status was examined assuming a log-additive genetic model using logistic regression adjusted for sex, age, and two principal components. RESULTS Twenty-three markers had P values less than 5.0E-04, and the most statistically significant association involved rs2985 (chr6:34845648; intronic of UHRF1BP1; OR = 0.66; P = 3.7E-05). Furthermore, this study replicated a previously published risk locus, rs1051690, in the 3'-untranslated region of the insulin receptor gene INSR (OR = 1.38; P = 0.03), with strong evidence of differences in INSR gene expression by genotype. CONCLUSIONS This study is the first to examine associations between genetic variation in miRNA target sites and colorectal cancer using a genome-wide approach. Functional studies to identify allele-specific effects on miRNA binding are needed to confirm the regulatory capacity of genetic variation to influence risk of colorectal cancer. IMPACT This study demonstrates the potential for an miRNA-targeted genome-wide association study to identify candidate susceptibility loci and prioritize them for functional characterization.
Collapse
Affiliation(s)
- Stephanie L Schmit
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California. USC Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, California
| | | | | | - Shu-Chen Huang
- USC Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Hedy S Rennert
- Department of Community Medicine and Epidemiology, Carmel Medical Center, Haifa, Israel
| | - Andrea Finn
- Affymetrix, Incorporated, Santa Clara, California
| | - Gad Rennert
- Department of Community Medicine and Epidemiology, Carmel Medical Center, Haifa, Israel. Clalit Health Services National Cancer Control Center, Haifa, Israel. Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
| | - Stephen B Gruber
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California. USC Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, California.
| |
Collapse
|
3
|
Stenzel SL, Gollub J, Shapero M, Finn A, Rennert G, Gruber SB. Abstract 2560: MicroRNA target site polymorphisms and colorectal cancer risk in the Ashkenazi Jewish population. Cancer Res 2013. [DOI: 10.1158/1538-7445.am2013-2560] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
MicroRNAs (miRNAs) act as key post-transcriptional regulators of gene expression via binding to the 3’ untranslated regions (UTRs) of mRNAs. Single nucleotide polymorphisms (SNPs) found in the miRNA sequence and/or corresponding binding sites can affect the fidelity of this interaction, and evidence from candidate miRNA studies suggests that such SNPs may increase or decrease risk of tumor development. This study was designed to expand the search for miRNA-related polymorphisms important in cancer etiology across the genome and to investigate the association between thousands of genetic variants in miRNA target sites and colorectal cancer (CRC) risk using a novel genotyping platform, the Axiom® miRNA Target Site Genotyping Array (237,858 markers). The final study sample after quality control filtering included 605 cases and 615 controls from the Molecular Epidemiology of Colorectal Cancer (MECC) study, a population-based case-control study of incident CRC cases in a geographically-defined area of northern Israel. The marginal association between each marker with minor allele frequency >=1% and CRC risk was examined assuming a log-additive genetic model using logistic regression adjusted for sex, age, and the first two principal components (PC) to capture fine-scale population structure. There were 11 markers with p-values less than 3E-4, and the most statistically significant association involved rs12130051 (chr1:19545053; 3’-UTR of KIAA0090) with an OR of 2.24 (1.52,3.31) and p=4.48E-5. In addition, rs1051690 (chr5:142605172; 3’-UTR of ARHGAP26), was statistically significantly associated with decreased CRC risk (OR (95% CI): 0.71 (0.59, 0.85); p = 2.03E-4). Further, this study provided evidence for replication of a previously published locus, rs1051690 in the 3’UTR region of the insulin receptor gene, INSR (OR (95% CI): 1.27 (0.96,1.67); p = 0.09). A fixed-effects meta-analysis of sample-size weighted p-values from a recent publication in Carcinogenesis describing the INSR variant (Landi et al., 2012) and from the current MECC study yielded a combined p-value of 0.01. To our knowledge, this is the first study to examine the association between genetic variation in miRNA target sites and cancer risk using a genome-wide approach. Work is ongoing to replicate findings from this study and to characterize the functional impact of prioritized polymorphisms.
Citation Format: Stephanie L. Stenzel, Jeremy Gollub, Michael Shapero, Andrea Finn, Gad Rennert, Stephen B. Gruber. MicroRNA target site polymorphisms and colorectal cancer risk in the Ashkenazi Jewish population. [abstract]. In: Proceedings of the 104th Annual Meeting of the American Association for Cancer Research; 2013 Apr 6-10; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2013;73(8 Suppl):Abstract nr 2560. doi:10.1158/1538-7445.AM2013-2560
Collapse
Affiliation(s)
| | | | | | | | - Gad Rennert
- 3Clalit Health Services National Cancer Control Center; Department of Community Medicine and Epidemiology, Carmel Medical Center; Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
| | - Stephen B. Gruber
- 4Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA
| |
Collapse
|
4
|
Duttagupta R, DiRienzo S, Jiang R, Bowers J, Gollub J, Kao J, Kearney K, Rudolph D, Dawany NB, Showe MK, Stamato T, Getts RC, Jones KW. Genome-wide maps of circulating miRNA biomarkers for ulcerative colitis. PLoS One 2012; 7:e31241. [PMID: 22359580 PMCID: PMC3281076 DOI: 10.1371/journal.pone.0031241] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2011] [Accepted: 01/05/2012] [Indexed: 12/17/2022] Open
Abstract
Inflammatory Bowel Disease--comprised of Crohn's Disease and Ulcerative Colitis (UC)--is a complex, multi-factorial inflammatory disorder of the gastrointestinal tract. In this study we have explored the utility of naturally occurring circulating miRNAs as potential blood-based biomarkers for non-invasive prediction of UC incidences. Whole genome maps of circulating miRNAs in micro-vesicles, Peripheral Blood Mononuclear Cells and platelets have been constructed from a cohort of 20 UC patients and 20 normal individuals. Through Significance Analysis of Microarrays, a signature of 31 differentially expressed platelet-derived miRNAs has been identified and biomarker performance estimated through a non-probabilistic binary linear classification using Support Vector Machines. Through this approach, classifier measurements reveal a predictive score of 92.8% accuracy, 96.2% specificity and 89.5% sensitivity in distinguishing UC patients from normal individuals. Additionally, the platelet-derived biomarker signature can be validated at 88% accuracy through qPCR assays, and a majority of the miRNAs in this panel can be demonstrated to sub-stratify into 4 highly correlated intensity based clusters. Analysis of predicted targets of these biomarkers reveal an enrichment of pathways associated with cytoskeleton assembly, transport, membrane permeability and regulation of transcription factors engaged in a variety of regulatory cascades that are consistent with a cell-mediated immune response model of intestinal inflammation. Interestingly, comparison of the miRNA biomarker panel and genetic loci implicated in IBD through genome-wide association studies identifies a physical linkage between hsa-miR-941 and a UC susceptibility loci located on Chr 20. Taken together, analysis of these expression maps outlines a promising catalog of novel platelet-derived miRNA biomarkers of clinical utility and provides insight into the potential biological function of these candidates in disease pathogenesis.
Collapse
Affiliation(s)
- Radha Duttagupta
- Applied Reasearch Group, Affymetrix Inc, Santa Clara, California, United States of America
| | - Sharon DiRienzo
- Lankenau Institute for Medical Research, Wynnewood, Pennsylvania, United States of America
| | - Rong Jiang
- Applied Reasearch Group, Affymetrix Inc, Santa Clara, California, United States of America
| | - Jessica Bowers
- Research and Development, Genisphere LLC, Hatfield, Pennsylvania, United States of America
| | - Jeremy Gollub
- Applied Reasearch Group, Affymetrix Inc, Santa Clara, California, United States of America
| | - Jessica Kao
- Applied Reasearch Group, Affymetrix Inc, Santa Clara, California, United States of America
| | - Keith Kearney
- Division of Gastroenterology, Lankenau Medical Center, Wynnewood, Pennsylvania, United States of America
| | - David Rudolph
- Division of Gastroenterology, Lankenau Medical Center, Wynnewood, Pennsylvania, United States of America
| | - Noor B. Dawany
- The Wistar Institute, Philadelphia, Pennsylvania, United States of America
| | - Michael K. Showe
- The Wistar Institute, Philadelphia, Pennsylvania, United States of America
| | - Tom Stamato
- Lankenau Institute for Medical Research, Wynnewood, Pennsylvania, United States of America
| | - Robert C. Getts
- Research and Development, Genisphere LLC, Hatfield, Pennsylvania, United States of America
| | - Keith W. Jones
- Applied Reasearch Group, Affymetrix Inc, Santa Clara, California, United States of America
| |
Collapse
|
5
|
Abstract
Effective diagnosis and surveillance of complex multi-factorial disorders such as cancer can be improved by screening of easily accessible biomarkers. Highly stable cell free Circulating Nucleic Acids (CNA) present as both RNA and DNA species have been discovered in the blood and plasma of humans. Correlations between tumor-associated genomic/epigenetic/transcriptional changes and alterations in CNA levels are strong predictors of the utility of this biomarker class as promising clinical indicators. Towards this goal microRNAs (miRNAs) representing a class of naturally occurring small non-coding RNAs of 19-25 nt in length have emerged as an important set of markers that can associate their specific expression profiles with cancer development. In this study we investigate some of the pre-analytic considerations for isolating plasma fractions for the study of miRNA biomarkers. We find that measurement of circulating miRNA levels are frequently confounded by varying levels of cellular miRNAs of different hematopoietic origins. In order to assess the relative proportions of this cell-derived class, we have fractionated whole blood into plasma and its ensuing sub-fractions. Cellular miRNA signatures in cohorts of normal individuals are catalogued and the abundance and gender specific expression of bona fide circulating markers explored after calibrating the signal for this interfering class. A map of differentially expressed profiles is presented and the intrinsic variability of circulating miRNA species investigated in subsets of healthy males and females.
Collapse
Affiliation(s)
- Radha Duttagupta
- Applied Research, Affymetrix Inc, Santa Clara, California, United States of America
| | - Rong Jiang
- Applied Research, Affymetrix Inc, Santa Clara, California, United States of America
| | - Jeremy Gollub
- Applied Research, Affymetrix Inc, Santa Clara, California, United States of America
| | - Robert C. Getts
- Research and Development, Genisphere LLC, Hatfield, Pennsylvania, United States of America
| | - Keith W. Jones
- Applied Research, Affymetrix Inc, Santa Clara, California, United States of America
| |
Collapse
|
6
|
Hoffmann TJ, Kvale MN, Hesselson SE, Zhan Y, Aquino C, Cao Y, Cawley S, Chung E, Connell S, Eshragh J, Ewing M, Gollub J, Henderson M, Hubbell E, Iribarren C, Kaufman J, Lao RZ, Lu Y, Ludwig D, Mathauda GK, McGuire W, Mei G, Miles S, Purdy MM, Quesenberry C, Ranatunga D, Rowell S, Sadler M, Shapero MH, Shen L, Shenoy TR, Smethurst D, Van den Eeden SK, Walter L, Wan E, Wearley R, Webster T, Wen CC, Weng L, Whitmer RA, Williams A, Wong SC, Zau C, Finn A, Schaefer C, Kwok PY, Risch N. Next generation genome-wide association tool: design and coverage of a high-throughput European-optimized SNP array. Genomics 2011; 98:79-89. [PMID: 21565264 DOI: 10.1016/j.ygeno.2011.04.005] [Citation(s) in RCA: 128] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2011] [Accepted: 04/15/2011] [Indexed: 10/18/2022]
Abstract
The success of genome-wide association studies has paralleled the development of efficient genotyping technologies. We describe the development of a next-generation microarray based on the new highly-efficient Affymetrix Axiom genotyping technology that we are using to genotype individuals of European ancestry from the Kaiser Permanente Research Program on Genes, Environment and Health (RPGEH). The array contains 674,517 SNPs, and provides excellent genome-wide as well as gene-based and candidate-SNP coverage. Coverage was calculated using an approach based on imputation and cross validation. Preliminary results for the first 80,301 saliva-derived DNA samples from the RPGEH demonstrate very high quality genotypes, with sample success rates above 94% and over 98% of successful samples having SNP call rates exceeding 98%. At steady state, we have produced 462 million genotypes per week for each Axiom system. The new array provides a valuable addition to the repertoire of tools for large scale genome-wide association studies.
Collapse
Affiliation(s)
- Thomas J Hoffmann
- Institute for Human Genetics, University of California, San Francisco 94143-0794, CA, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Nioi P, Pardo IDR, Sherratt PJ, Fielden MR, Gollub J, Nie A, Snyder RD. Erratum to "Prediction of non-genotoxic carcinogenesis in rats using changes in gene expression following acute dosing". Chem Biol Interact 2009; 176:252-60. [PMID: 19086139 DOI: 10.1016/j.cbi.2008.09.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Carcinogenicity of chemicals can currently only be evaluated in 2-year rodent bioassays. Therefore, the development of early biomarkers for carcinogenesis would result in substantial savings in time and expense. The current study investigates whether early changes in gene expression may be developed as markers for cancer. Animals were treated for 1 or 5 days with either non-genotoxic carcinogens or non-carcinogens and gene expression was analyzed by quantitative PCR (qPCR).We tested two gene signatures previously reported to detect non-genotoxic carcinogens. Using one gene signature it was confirmed that 3/3 nongenotoxic carcinogens and 2/2 non-carcinogens are correctly identified with data from 1 or 5 days of dosing. In contrast an alternative signature correctly identified 0/3 and 2/3 nongenotoxic carcinogens at 1 and 5 days of treatment, respectively and 2/2 non-carcinogens at both time-points. Additionally, we evaluated a novel panel of putative biomarker genes, from the literature, many of which have roles in cell growth and division, including myc, cdc2 and mcm6. These genes were significantly induced by non-genotoxic carcinogens and not by non-carcinogens. Using the average fold-induction across this panel, 2/3 non-genotoxic carcinogens were detected on both day 1 and day 5. These data support the idea that acute changes in gene expression may provide biomarkers for non-genotoxic carcinogenesis but also highlight interesting differences in the sensitivities of distinct gene signatures.
Collapse
Affiliation(s)
- Paul Nioi
- The Schering-Plough Research Institute, Building S12-2603, 181 Passaic Avenue, Summit, NJ 07901, United States.
| | | | | | | | | | | | | |
Collapse
|
8
|
Fielden MR, Nie A, McMillian M, Elangbam CS, Trela BA, Yang Y, Dunn RT, Dragan Y, Fransson-Stehen R, Bogdanffy M, Adams SP, Foster WR, Chen SJ, Rossi P, Kasper P, Jacobson-Kram D, Tatsuoka KS, Wier PJ, Gollub J, Halbert DN, Roter A, Young JK, Sina JF, Marlowe J, Martus HJ, Aubrecht J, Olaharski AJ, Roome N, Nioi P, Pardo I, Snyder R, Perry R, Lord P, Mattes W, Car BD. Interlaboratory evaluation of genomic signatures for predicting carcinogenicity in the rat. Toxicol Sci 2008; 103:28-34. [PMID: 18281259 DOI: 10.1093/toxsci/kfn022] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
The Critical Path Institute recently established the Predictive Safety Testing Consortium, a collaboration between several companies and the U.S. Food and Drug Administration, aimed at evaluating and qualifying biomarkers for a variety of toxicological endpoints. The Carcinogenicity Working Group of the Predictive Safety Testing Consortium has concentrated on sharing data to test the predictivity of two published hepatic gene expression signatures, including the signature by Fielden et al. (2007, Toxicol. Sci. 99, 90-100) for predicting nongenotoxic hepatocarcinogens, and the signature by Nie et al. (2006, Mol. Carcinog. 45, 914-933) for predicting nongenotoxic carcinogens. Although not a rigorous prospective validation exercise, the consortium approach created an opportunity to perform a meta-analysis to evaluate microarray data from short-term rat studies on over 150 compounds. Despite significant differences in study designs and microarray platforms between laboratories, the signatures proved to be relatively robust and more accurate than expected by chance. The accuracy of the Fielden et al. signature was between 63 and 69%, whereas the accuracy of the Nie et al. signature was between 55 and 64%. As expected, the predictivity was reduced relative to internal validation estimates reported under identical test conditions. Although the signatures were not deemed suitable for use in regulatory decision making, they were deemed worthwhile in the early assessment of drugs to aid decision making in drug development. These results have prompted additional efforts to rederive and evaluate a QPCR-based signature using these samples. When combined with a standardized test procedure and prospective interlaboratory validation, the accuracy and potential utility in preclinical applications can be ascertained.
Collapse
|
9
|
Kuo MD, Gollub J, Sirlin CB, Ooi C, Chen X. Radiogenomic analysis to identify imaging phenotypes associated with drug response gene expression programs in hepatocellular carcinoma. J Vasc Interv Radiol 2007; 18:821-31. [PMID: 17609439 DOI: 10.1016/j.jvir.2007.04.031] [Citation(s) in RCA: 126] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
PURPOSE To determine whether conventional contrast-enhanced computed tomography (CT) could be used to identify imaging phenotypes associated with a doxorubicin drug response gene expression program in hepatocellular carcinoma (HCC) by using an integrated imaging-genomic approach. MATERIALS AND METHODS Thirty HCCs were analyzed and scored individually across six predefined imaging phenotypes. Unsupervised and supervised bioinformatics analyses were performed to correlate the imaging scores with the corresponding tumor microarray data (each microarray contained gene expression measurements across approximately 18,000 genes) to identify relationships between the imaging traits and underlying tumor gene expression. Enrichment for a predefined doxorubicin-response gene expression program was then performed against the imaging phenotype-associated genes and enrichment determined. RESULTS An imaging phenotype related to tumor margins on arterial phase images demonstrated significant correlation with the doxorubicin-response transcriptional program (P < .05, q < 0.1). It was also significantly associated with HCC venous invasion and tumor stage (P < .05, q < 0.1). Tumors with higher tumor margin scores were more strongly associated with the doxorubicin resistance transcriptional program and had a greater prevalence of venous invasion and worse stage. Tumors with lower tumor margin scores, however, demonstrated a converse relationship. CONCLUSIONS It is possible to identify HCC imaging phenotypes at CT that correlate with a doxorubicin drug response gene expression program. Given the role of doxorubicin in regional therapies for HCC management, it is possible that such an approach could be used to guide HCC therapy on a tumor-by-tumor basis on the basis of underlying tumor gene expression patterns.
Collapse
Affiliation(s)
- Michael D Kuo
- Department of Radiology, University of California, San Diego Medical Center, San Diego, CA 92103, USA.
| | | | | | | | | |
Collapse
|
10
|
Fielden MR, Brennan R, Gollub J. A gene expression biomarker provides early prediction and mechanistic assessment of hepatic tumor induction by nongenotoxic chemicals. Toxicol Sci 2007; 99:90-100. [PMID: 17557906 DOI: 10.1093/toxsci/kfm156] [Citation(s) in RCA: 141] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
There are currently no accurate and well-validated short-term tests to identify nongenotoxic hepatic tumorigens, thus necessitating an expensive 2-year rodent bioassay before a risk assessment can begin. Using hepatic gene expression data from rats treated for 5 days with one of 100 structurally and mechanistically diverse nongenotoxic hepatocarcinogens and nonhepatocarcinogens, a novel multigenebiomarker (i.e., signature) was derived to predict the likelihood of nongenotoxic chemicals to induce liver tumors in longer term studies. Independent validation of the signature on 47 test chemicals indicates an assay sensitivity and specificity of 86% and 81%, respectively. Alternate short-term in vivo pathological and genomic biomarkers were evaluated in parallel for comparison, including liver weight, hepatocellular hypertrophy, hepatic necrosis, serum alanine aminotransferase activity, induction of cytochrome P450 genes, and repression of Tsc-22 or alpha2-macroglobulin messenger RNA. In contrast to these biomarkers, the gene expression-based signature was more accurate. Unlike existing tests, an understanding of potential modes of action for hepatic tumorigenicity can be derived by comparison of the signature profile of test chemicals to hepatic tumorigens of known mechanism, including regenerative proliferation, proliferation associated with xenobiotic receptor activation, peroxisome proliferation, and steroid hormone-mediated mechanisms. This signature is not only more accurate than current methods, but also facilitates the identification of mode of action to aid in the early assessment of human cancer risk.
Collapse
Affiliation(s)
- Mark R Fielden
- Iconix Biosciences, Inc., Mountain View, California 94043, USA.
| | | | | |
Collapse
|
11
|
Segal E, Sirlin CB, Ooi C, Adler AS, Gollub J, Chen X, Chan BK, Matcuk GR, Barry CT, Chang HY, Kuo MD. Decoding global gene expression programs in liver cancer by noninvasive imaging. Nat Biotechnol 2007; 25:675-80. [PMID: 17515910 DOI: 10.1038/nbt1306] [Citation(s) in RCA: 399] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2007] [Accepted: 04/24/2007] [Indexed: 12/15/2022]
Abstract
Paralleling the diversity of genetic and protein activities, pathologic human tissues also exhibit diverse radiographic features. Here we show that dynamic imaging traits in non-invasive computed tomography (CT) systematically correlate with the global gene expression programs of primary human liver cancer. Combinations of twenty-eight imaging traits can reconstruct 78% of the global gene expression profiles, revealing cell proliferation, liver synthetic function, and patient prognosis. Thus, genomic activity of human liver cancers can be decoded by noninvasive imaging, thereby enabling noninvasive, serial and frequent molecular profiling for personalized medicine.
Collapse
Affiliation(s)
- Eran Segal
- Dept. of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Demeter J, Beauheim C, Gollub J, Hernandez-Boussard T, Jin H, Maier D, Matese JC, Nitzberg M, Wymore F, Zachariah ZK, Brown PO, Sherlock G, Ball CA. The Stanford Microarray Database: implementation of new analysis tools and open source release of software. Nucleic Acids Res 2006; 35:D766-70. [PMID: 17182626 PMCID: PMC1781111 DOI: 10.1093/nar/gkl1019] [Citation(s) in RCA: 130] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The Stanford Microarray Database (SMD; ) is a research tool and archive that allows hundreds of researchers worldwide to store, annotate, analyze and share data generated by microarray technology. SMD supports most major microarray platforms, and is MIAME-supportive and can export or import MAGE-ML. The primary mission of SMD is to be a research tool that supports researchers from the point of data generation to data publication and dissemination, but it also provides unrestricted access to analysis tools and public data from 300 publications. In addition to supporting ongoing research, SMD makes its source code fully and freely available to others under an Open Source license, enabling other groups to create a local installation of SMD. In this article, we describe several data analysis tools implemented in SMD and we discuss features of our software release.
Collapse
Affiliation(s)
- Janos Demeter
- Department of Biochemistry, Stanford University School of MedicineStanford, CA, USA
| | - Catherine Beauheim
- Department of Genetics, Stanford University School of MedicineStanford, CA, USA
| | | | | | - Heng Jin
- Department of Biochemistry, Stanford University School of MedicineStanford, CA, USA
| | - Donald Maier
- Department of Biochemistry, Stanford University School of MedicineStanford, CA, USA
| | - John C. Matese
- Lewis-Sigler Institute for Integrative Genomics, Princeton UniversityPrinceton, NJ, USA
| | - Michael Nitzberg
- Department of Biochemistry, Stanford University School of MedicineStanford, CA, USA
| | - Farrell Wymore
- Department of Biochemistry, Stanford University School of MedicineStanford, CA, USA
| | | | - Patrick O. Brown
- Department of Biochemistry, Stanford University School of MedicineStanford, CA, USA
- Howard Hughes Medical InstituteStanford, CA, USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University School of MedicineStanford, CA, USA
| | - Catherine A. Ball
- Department of Biochemistry, Stanford University School of MedicineStanford, CA, USA
- To whom correspondence should be addressed. Tel: +1 650 724 3028; Fax: +1 650 724 3701;
| |
Collapse
|
13
|
Abstract
The Stanford Microarray Database (SMD) is a DNA microarray research database that provides a large amount of data for public use. This chapter describes the use of the primary tools for searching, browsing, retrieving, and analyzing data available for SMD. With this introduction, researchers and students will be able to examine and analyze a large body of gene expression and other experiments. Additional tools for depositing, annotating, sharing, and analyzing data, available only to registered users, are also described. SMD is available for installation as a local database.
Collapse
Affiliation(s)
- Jeremy Gollub
- Department of Biochemistry, Stanford University Medical School, Stanford, CA, USA
| | | | | |
Collapse
|
14
|
Abstract
Even a simple, small-scale, microarray experiment generates thousands to millions of data points. Clearly, spreadsheets or plotting programs do not suffice for analysis of such large volumes of data, and comprehensive analysis requires systematic methods for selection and organization of data. This chapter focuses on the concepts and algorithms of hierarchical clustering and the most commonly employed methods of partitioning or organizing microarray data, and freely available software that implements these algorithms.
Collapse
Affiliation(s)
- Jeremy Gollub
- Department of Biochemistry, Stanford University Medical School, Stanford, CA, USA
| | | |
Collapse
|
15
|
Ball CA, Awad IAB, Demeter J, Gollub J, Hebert JM, Hernandez-Boussard T, Jin H, Matese JC, Nitzberg M, Wymore F, Zachariah ZK, Brown PO, Sherlock G. The Stanford Microarray Database accommodates additional microarray platforms and data formats. Nucleic Acids Res 2005; 33:D580-2. [PMID: 15608265 PMCID: PMC539960 DOI: 10.1093/nar/gki006] [Citation(s) in RCA: 150] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
The Stanford Microarray Database (SMD) (http://smd.stanford.edu) is a research tool for hundreds of Stanford researchers and their collaborators. In addition, SMD functions as a resource for the entire biological research community by providing unrestricted access to microarray data published by SMD users and by disseminating its source code. In addition to storing GenePix (Axon Instruments) and ScanAlyze output from spotted microarrays, SMD has recently added the ability to store, retrieve, display and analyze the complete raw data produced by several additional microarray platforms and image analysis software packages, so that we can also now accept data from Affymetrix GeneChips (MAS5/GCOS or dChip), Agilent Catalog or Custom arrays (using Agilent's Feature Extraction software) or data created by SpotReader (Niles Scientific). We have implemented software that allows us to accept MAGE-ML documents from array manufacturers and to submit MIAME-compliant data in MAGE-ML format directly to ArrayExpress and GEO, greatly increasing the ease with which data from SMD can be published adhering to accepted standards and also increasing the accessibility of published microarray data to the general public. We have introduced a new tool to facilitate data sharing among our users, so that datasets can be shared during, before or after the completion of data analysis. The latest version of the source code for the complete database package was released in November 2004 (http://smd.stanford.edu/download/), allowing researchers around the world to deploy their own installations of SMD.
Collapse
Affiliation(s)
- Catherine A Ball
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Boyle EI, Weng S, Gollub J, Jin H, Botstein D, Cherry JM, Sherlock G. GO::TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes. Bioinformatics 2004; 20:3710-5. [PMID: 15297299 PMCID: PMC3037731 DOI: 10.1093/bioinformatics/bth456] [Citation(s) in RCA: 1391] [Impact Index Per Article: 69.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
SUMMARY GO::TermFinder comprises a set of object-oriented Perl modules for accessing Gene Ontology (GO) information and evaluating and visualizing the collective annotation of a list of genes to GO terms. It can be used to draw conclusions from microarray and other biological data, calculating the statistical significance of each annotation. GO::TermFinder can be used on any system on which Perl can be run, either as a command line application, in single or batch mode, or as a web-based CGI script. AVAILABILITY The full source code and documentation for GO::TermFinder are freely available from http://search.cpan.org/dist/GO-TermFinder/.
Collapse
Affiliation(s)
- Elizabeth I Boyle
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | | | | | | | | | | | | |
Collapse
|
17
|
Canellos GP, Gollub J, Neuberg D, Mauch P, Shulman LN. Primary systemic treatment of advanced Hodgkin's disease with EVA (etoposide, vinblastine, doxorubicin): 10-year follow-up. Ann Oncol 2003; 14:268-72. [PMID: 12562654 DOI: 10.1093/annonc/mdg076] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND The most commonly used regimen for the treatment of advanced Hodgkin's disease (HD) is ABVD (doxorubicin, bleomycin, vinblastine and dacarbazine). Two of these components, bleomycin and dacarbazine, have defined toxicities such as pulmonary fibrosis and nausea/vomiting, and also uncertain single-drug activity. The EVA regimen (etoposide, vinblastine, doxorubicin) is an attempt to substitute a known active agent, etoposide, for bleomycin and dacarbazine. PATIENTS AND METHODS A series of 51 patients with advanced HD without prior systemic therapy were treated. The series included 12 stage II patients with bulky (>10 cm) mediastinal tumors, 10 of whom received complementary radiation therapy. The remaining patients received EVA only. Response, duration of response, survival, toxicity and the efficacy of salvage therapy were evaluated in all patients. The median follow-up time was 111 months and permitted an assessment of the long-term effects of treatment and natural history of a cohort of treated patients. RESULTS EVA achieved a complete response (or clinical complete response) in 48/51 patients (94%). Of these 48 responders, 16 relapsed in a median of 11 months (range 3-48 months). In follow-up, 32/51 patients had no evidence of relapsed HD, although three died from other causes (two from vascular events and one from large cell lymphoma), resulting in progression-free survival for the entire group of 57% at 111 months. Eight of the 16 were alive and free from disease at follow-up at 111 months. In the entire series, only seven patients (14%) died of HD. 37 patients (73%) continued free from disease. There was no pulmonary toxicity. CONCLUSIONS The EVA regimen appears to have an overall survival (OS) outcome comparable to ABVD, but without the lung toxicity. The high salvage rate of second-line therapy, in most instances at conventional dosage, suggests an absence of cross-resistance to alkylating agents in patients treated with EVA.
Collapse
Affiliation(s)
- G P Canellos
- Dana-Farber Cancer Institute, Boston, MA 02115, USA.
| | | | | | | | | |
Collapse
|
18
|
Gollub J, Ball CA, Binkley G, Demeter J, Finkelstein DB, Hebert JM, Hernandez-Boussard T, Jin H, Kaloper M, Matese JC, Schroeder M, Brown PO, Botstein D, Sherlock G. The Stanford Microarray Database: data access and quality assessment tools. Nucleic Acids Res 2003; 31:94-6. [PMID: 12519956 PMCID: PMC165525 DOI: 10.1093/nar/gkg078] [Citation(s) in RCA: 260] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2002] [Revised: 10/11/2002] [Accepted: 10/11/2002] [Indexed: 11/12/2022] Open
Abstract
The Stanford Microarray Database (SMD; http://genome-www.stanford.edu/microarray/) serves as a microarray research database for Stanford investigators and their collaborators. In addition, SMD functions as a resource for the entire scientific community, by making freely available all of its source code and providing full public access to data published by SMD users, along with many tools to explore and analyze those data. SMD currently provides public access to data from 3500 microarrays, including data from 85 publications, and this total is increasing rapidly. In this article, we describe some of SMD's newer tools for accessing public data, assessing data quality and for data analysis.
Collapse
Affiliation(s)
- Jeremy Gollub
- Department of Genetics, Center for Clinical Sciences Research, 269 Campus Drive, Room 2255b, Stanford University, Stanford, CA 94305-5163, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Finkelstein D, Ewing R, Gollub J, Sterky F, Cherry JM, Somerville S. Microarray data quality analysis: lessons from the AFGC project. Arabidopsis Functional Genomics Consortium. Plant Mol Biol 2002; 48:119-131. [PMID: 11860205 DOI: 10.1023/a:1013765922672] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Genome-wide expression profiling with DNA microarrays has and will provide a great deal of data to the plant scientific community. However, reliability concerns have required the development data quality tests for common systematic biases. Fortunately, most large-scale systematic biases are detectable and some are correctable by normalization. Technical replication experiments and statistical surveys indicate that these biases vary widely in severity and appearance. As a result, no single normalization or correction method currently available is able to address all the issues. However, careful sequence selection, array design, experimental design and experimental annotation can substantially improve the quality and biological of microarray data. In this review, we discuss these issues with reference to examples from the Arabidopsis Functional Genomics Consortium (AFGC) microarray project.
Collapse
Affiliation(s)
- David Finkelstein
- Carnegie Institution of Washington, Department of Plant Biology, Stanford, CA 94305, USA.
| | | | | | | | | | | |
Collapse
|
20
|
Abstract
We have observed the effects of MgADP and thiophosphorylation on the conformational state of the light chain domain of myosin in skinned smooth muscle. Electron paramagnetic resonance (EPR) spectroscopy was used to monitor the orientation of spin probes attached to the myosin regulatory light chain (RLC). Two spectral states were seen, termed here "intermediate" and "final", that are distinguished by a approximately 24 degrees axial rotation of spin probes attached to the RLC. The two observed conformations are similar to those found previously for smooth muscle myosin S1; the final state corresponds to the major conformation of S1 in the absence of ADP, while the intermediate state corresponds to the conformation of S1 with ADP bound. Light chain domain orientation was observed as a function of the MgADP concentration and the extent of RLC thiophosphorylation. In rigor (no MgADP), LC domains were distributed equally between the intermediate state and the final state; upon addition of saturating (3.5 mM) MgADP, about one-third of the LC domains in the final state rotated approximately 20 degrees axially to the intermediate state. The progression of the change in populations was fit to a simple binding equation, yielding an apparent dissociation constant of approximately 110 microM for skinned smooth muscle fibers and approximately 730 microM for thiophosphorylated, skinned smooth muscle fibers. These observations suggest a model that explains the behavior of "latch bridges" in smooth muscle.
Collapse
Affiliation(s)
- J Gollub
- Graduate Group in Biophysics, Department of Biochemistry and Biophysics, Cardiovascular Research Institute, University of California, San Francisco 94143-0448, USA
| | | | | |
Collapse
|
21
|
Abstract
Current theories of muscle cross-bridge function suggest that force is generated by a change in the orientation of the myosin neck region. We attached a paramagnetic probe to a subunit in the neck region and measured the orientation of the probe using electron paramagnetic resonance spectroscopy. The angle of the probes on smooth myosin S1 were changed by 20 degrees +/- 4 degrees on addition of ADP (50% effect at 5 +/- 2 microM), but ADP produced little effect on skeletal S1. The orientation of smooth myosin, +ADP, resembled that of skeletal myosin, +/- ADP, suggesting that the release of ADP generates an extra rotation of the neck region in smooth muscle at the end of its power stroke.
Collapse
Affiliation(s)
- J Gollub
- Graduate Group in Biophysics, University of California, San Francisco 94143-0448, USA
| | | | | |
Collapse
|
22
|
Zhao L, Gollub J, Cooke R. Orientation of paramagnetic probes attached to gizzard regulatory light chain bound to myosin heads in rabbit skeletal muscle. Biochemistry 1996; 35:10158-65. [PMID: 8756480 DOI: 10.1021/bi960505v] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The orientation of the myosin neck was monitored using electron paramagnetic resonance (EPR) spectroscopy. Gizzard regulatory light chain was labeled with a nitroxide spin probe and exchanged for the native subunit, located in the myosin neck, in rabbit psoas muscle fibers. The EPR spectra of rigor fibers indicated a substantial degree of probe immobilization and showed a strong dependence on the orientation of the fiber axis relative to the magnetic field, indicating that the neck was ordered in this state. Spectra of relaxed fibers at 24 degrees C showed that the neck was disordered, but the spectra of relaxed fibers at 4 degrees C indicated that the neck was partially ordered. Active fibers at the two temperatures produced spectra identical to relaxed fibers, indicating that no novel angles could be seen in the neck during the powerstroke. Proteolytic fragments of myosin, S1 and HMM, were exchanged with labeled light chains and bound to thin filaments in unlabeled fibers. The distribution of probe orientations for HMM was identical to that of labeled rigor fibers, while S1 showed a slightly different distribution, suggesting that the neck is distorted (by a few degrees) by the interactions of the two heads of myosin when bound to actin.
Collapse
Affiliation(s)
- L Zhao
- Department of Biochemistry & Biophysics, University of California, San Francisco 94143-0448, USA
| | | | | |
Collapse
|