176
|
Speers C, Tsimelzon A, Sexton K, Herrick AM, Gutierrez C, Culhane A, Quackenbush J, Hilsenbeck S, Chang J, Brown P. Identification of novel kinase targets for the treatment of estrogen receptor-negative breast cancer. Clin Cancer Res 2009; 15:6327-40. [PMID: 19808870 DOI: 10.1158/1078-0432.ccr-09-1107] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
PURPOSE Previous gene expression profiling studies of breast cancer have focused on the entire genome to identify genes differentially expressed between estrogen receptor (ER) alpha-positive and ER-alpha-negative cancers. EXPERIMENTAL DESIGN Here, we used gene expression microarray profiling to identify a distinct kinase gene expression profile that identifies ER-negative breast tumors and subsets ER-negative breast tumors into four distinct subtypes. RESULTS Based on the types of kinases expressed in these clusters, we identify a cell cycle regulatory subset, a S6 kinase pathway cluster, an immunomodulatory kinase-expressing cluster, and a mitogen-activated protein kinase pathway cluster. Furthermore, we show that this specific kinase profile is validated using independent sets of human tumors and is also seen in a panel of breast cancer cell lines. Kinase expression knockdown studies show that many of these kinases are essential for the growth of ER-negative, but not ER-positive, breast cancer cell lines. Finally, survival analysis of patients with breast cancer shows that the S6 kinase pathway signature subtype of ER-negative cancers confers an extremely poor prognosis, whereas patients whose tumors express high levels of immunomodulatory kinases have a significantly better prognosis. CONCLUSIONS This study identifies a list of kinases that are prognostic and may serve as druggable targets for the treatment of ER-negative breast cancer.
Collapse
|
177
|
Culhane AC, Quackenbush J. Confounding effects in "A six-gene signature predicting breast cancer lung metastasis". Cancer Res 2009; 69:7480-5. [PMID: 19723662 DOI: 10.1158/0008-5472.can-08-3350] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The majority of breast cancer deaths result from metastases rather than from direct effects of the primary tumor itself. Recently, Landemaine and colleagues described a six-gene signature purported to predict lung metastasis risk. They analyzed gene expression in 23 metastases from breast cancer patients (5 lung, 18 non-lung) identifying a 21-gene signature. Expression of 16 of these was analyzed in primary breast tumors from 72 patients with known outcome, and six were selected that were predictive of lung metastases: DSC2, TFCP2L1, UGT8, ITGB8, ANP32E, and FERMT1. Despite the value of such a signature, our analysis indicates that this analysis ignored potentially important confounding factors and that their signature is instead a surrogate for molecular subtype.
Collapse
|
178
|
Colak D, Kaya N, Al-Zahrani J, Al Bakheet A, Muiya P, Andres E, Quackenbush J, Dzimiri N. Left ventricular global transcriptional profiling in human end-stage dilated cardiomyopathy. Genomics 2009; 94:20-31. [PMID: 19332114 PMCID: PMC4152850 DOI: 10.1016/j.ygeno.2009.03.003] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2008] [Revised: 02/17/2009] [Accepted: 03/17/2009] [Indexed: 02/07/2023]
Abstract
We employed ABI high-density oligonucleotide microarrays containing 31,700 sixty-mer probes (representing 27,868 annotated human genes) to determine differential gene expression in idiopathic dilated cardiomyopathy (DCM). We identified 626 up-regulated and 636 down-regulated genes in DCM compared to controls. Most significant changes occurred in the tricarboxylic acid cycle, angiogenesis, and apoptotic signaling pathways, among which 32 apoptosis- and 13 MAPK activity-related genes were altered. Inorganic cation transporter, catalytic activities, energy metabolism and electron transport-related processes were among the most critically influenced pathways. Among the up-regulated genes were HTRA1 (6.9-fold), PDCD8(AIFM1) (5.2) and PRDX2 (4.4) and the down-regulated genes were NR4A2 (4.8), MX1 (4.3), LGALS9 (4), IFNA13 (4), UNC5D (3.6) and HDAC2 (3) (p<0.05), all of which have no clearly defined cardiac-related function yet. Gene ontology and enrichment analysis also revealed significant alterations in mitochondrial oxidative phosphorylation, metabolism and Alzheimer's disease pathways. Concordance was also confirmed for a significant number of genes and pathways in an independent validation microarray dataset. Furthermore, verification by real-time RT-PCR showed a high degree of consistency with the microarray results. Our data demonstrate an association of DCM with alterations in various cellular events and multiple yet undeciphered genes that may contribute to heart muscle disease pathways.
Collapse
|
179
|
Hellner K, Mar J, Fang F, Quackenbush J, Münger K. HPV16 E7 oncogene expression in normal human epithelial cells causes molecular changes indicative of an epithelial to mesenchymal transition. Virology 2009; 391:57-63. [PMID: 19552933 DOI: 10.1016/j.virol.2009.05.036] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2009] [Revised: 04/01/2009] [Accepted: 05/26/2009] [Indexed: 11/25/2022]
Abstract
Cancer-associated epithelial to mesenchymal transition (EMT) is crucial for invasion and metastasis. Molecular hallmarks of EMT include down-regulation of the epithelial adhesion protein E-cadherin and de-novo expression of N-cadherin and the mesenchymal intermediate filament proteins vimentin and fibronectin. Expression of HPV16 E7 in normal human epithelial cells caused increased levels of vimentin and fibronectin, whereas the epithelial adhesion protein E-cadherin was expressed at decreased levels. Similar expression patterns of vimentin, fibronectin and E-cadherin were also detected in cells expressing HPV16 E6 and E7 or the entire HPV16 early transcriptional unit. HPV16 E6 and E7 were each able to induce N-cadherin expression. Interestingly, these changes in expression levels of EMT-associated proteins are not similarly reflected at the level of mRNA expression, suggesting that HPV16 oncoproteins also modulate EMT through non-transcriptional mechanisms. Hence, HPV16 oncoproteins may contribute to malignant progression through EMT induction.
Collapse
|
180
|
Mar JC, Kimura Y, Schroder K, Irvine KM, Hayashizaki Y, Suzuki H, Hume D, Quackenbush J. Data-driven normalization strategies for high-throughput quantitative RT-PCR. BMC Bioinformatics 2009; 10:110. [PMID: 19374774 PMCID: PMC2680405 DOI: 10.1186/1471-2105-10-110] [Citation(s) in RCA: 85] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2008] [Accepted: 04/19/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND High-throughput real-time quantitative reverse transcriptase polymerase chain reaction (qPCR) is a widely used technique in experiments where expression patterns of genes are to be profiled. Current stage technology allows the acquisition of profiles for a moderate number of genes (50 to a few thousand), and this number continues to grow. The use of appropriate normalization algorithms for qPCR-based data is therefore a highly important aspect of the data preprocessing pipeline. RESULTS We present and evaluate two data-driven normalization methods that directly correct for technical variation and represent robust alternatives to standard housekeeping gene-based approaches. We evaluated the performance of these methods against a single gene housekeeping gene method and our results suggest that quantile normalization performs best. These methods are implemented in freely-available software as an R package qpcrNorm distributed through the Bioconductor project. CONCLUSION The utility of the approaches that we describe can be demonstrated most clearly in situations where standard housekeeping genes are regulated by some experimental condition. For large qPCR-based data sets, our approaches represent robust, data-driven strategies for normalization.
Collapse
|
181
|
Fedorov VB, Goropashnaya AV, Tøien Ø, Stewart NC, Gracey AY, Chang C, Qin S, Pertea G, Quackenbush J, Showe LC, Showe MK, Boyer BB, Barnes BM. Elevated expression of protein biosynthesis genes in liver and muscle of hibernating black bears (Ursus americanus). Physiol Genomics 2009; 37:108-18. [PMID: 19240299 DOI: 10.1152/physiolgenomics.90398.2008] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
We conducted a large-scale gene expression screen using the 3,200 cDNA probe microarray developed specifically for Ursus americanus to detect expression differences in liver and skeletal muscle that occur during winter hibernation compared with animals sampled during summer. The expression of 12 genes, including RNA binding protein motif 3 (Rbm3), that are mostly involved in protein biosynthesis, was induced during hibernation in both liver and muscle. The Gene Ontology and Gene Set Enrichment analysis consistently showed a highly significant enrichment of the protein biosynthesis category by overexpressed genes in both liver and skeletal muscle during hibernation. Coordinated induction in transcriptional level of genes involved in protein biosynthesis is a distinctive feature of the transcriptome in hibernating black bears. This finding implies induction of translation and suggests an adaptive mechanism that contributes to a unique ability to reduce muscle atrophy over prolonged periods of immobility during hibernation. Comparing expression profiles in bears to small mammalian hibernators shows a general trend during hibernation of transcriptional changes that include induction of genes involved in lipid metabolism and carbohydrate synthesis as well as depression of genes involved in the urea cycle and detoxification function in liver.
Collapse
|
182
|
Rocke DM, Ideker T, Troyanskaya O, Quackenbush J, Dopazo J. Papers on normalization, variable selection, classification or clustering of microarray data. Bioinformatics 2009. [DOI: 10.1093/bioinformatics/btp038] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
183
|
|
184
|
Dutta B, Kanani H, Quackenbush J, Klapa MI. Time-series integrated "omic" analyses to elucidate short-term stress-induced responses in plant liquid cultures. Biotechnol Bioeng 2008; 102:264-279. [PMID: 18958862 DOI: 10.1002/bit.22036] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The research that aims at furthering our understanding of plant primary metabolism has intensified during the last decade. The presented study validated a systems biology methodological framework for the analysis of stress-induced molecular interaction networks in the context of plant primary metabolism, as these are expressed during the first hours of the stress treatment. The framework involves the application of time-series integrated full-genome transcriptomic and polar metabolomic analyses on plant liquid cultures. The latter were selected as the model system for this type of analysis, because they provide a well-controlled growth environment, ensuring that the observed plant response is due only to the applied perturbation. An enhanced gas chromatography-mass spectrometry (GC-MS) metabolomic data correction strategy and a new algorithm for the significance analysis of time-series "omic" data are used to extract information about the plant's transcriptional and metabolic response to the applied stress from the acquired datasets; in this article, it is the first time that these are applied for the analysis of a large biological dataset from a complex eukaryotic system. The case-study involved Arabidopsis thaliana liquid cultures subjected for 30 h to elevated (1%) CO2 stress. The advantages and validity of the methodological framework are discussed in the context of the known A. thaliana or plant, in general, physiology under the particular stress. Of note, the ability of the methodology to capture dynamic aspects of the observed molecular response allowed for 9 and 24 h of treatment to be indicated as corresponding to shifts in both the transcriptional and metabolic activity; analysis of the pathways through which these activity changes are manifested provides insight to regulatory processes.
Collapse
|
185
|
Quackenbush J. Extracting meaning from functional genomics experiments. Toxicol Appl Pharmacol 2008; 207:195-9. [PMID: 16002114 DOI: 10.1016/j.taap.2005.04.029] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2004] [Revised: 07/15/2004] [Accepted: 04/01/2005] [Indexed: 11/23/2022]
Abstract
The completion of draft genome sequences for human, mouse, rat, and an increasing number of other species, has provided us with preliminary gene catalogues for many organisms of medical and scientific interests. Interpreting these gene lists in the context of the organism's underlying biology, however, remains difficult. The development of DNA microarrays provided one potential source of data to help interpret gene function; by profiling global patterns of gene expression across diverse conditions, it was hoped that we might be able to develop insight into biological function. But the power of these functional genomics assays, as well as assays in proteomics and metabolomics, is that they primarily give us lists of differentially expressed genes that can be correlated with particular phenotypic states, but which remain difficult to link mechanistically to the biology driving the phenotype.
Collapse
|
186
|
Kitami T, Rubio R, O'Brien W, Quackenbush J, Nadeau JH. Gene-environment interactions reveal a homeostatic role for cholesterol metabolism during dietary folate perturbation in mice. Physiol Genomics 2008; 35:182-90. [PMID: 18697859 DOI: 10.1152/physiolgenomics.00294.2007] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Dietary folate supplementation can dramatically reduce the severity and incidence of several common birth defects and adult diseases that are associated with anomalies in homocysteine and folate metabolism. The common polymorphisms that adversely affect these metabolic pathways do not fully account for the particular birth defects and adult diseases that occur in at-risk individuals. To test involvement of folate, homocysteine, and other pathways in disease pathogenesis and treatment response, we analyzed global and pathway-specific changes in gene expression and levels of selected metabolites after depletion and repletion of dietary folate in two genetically distinct inbred strains of mice. Compared with the C57BL/6J strain, A/J showed greater homeostatic response to folate perturbation by retaining a higher serum folate level and minimizing global gene expression changes. Remarkably, folate perturbation led to systematic strain-specific differences only in the expression profile of the cholesterol biosynthesis pathway and to changes in levels of serum and liver total cholesterol. By genetically increasing serum and liver total cholesterol levels in APOE-deficient mice, we modestly but significantly improved folate retention during folate depletion, suggesting that homeostasis among the homocysteine, folate and cholesterol metabolic pathways contributes to the beneficial effects of dietary folate supplementation.
Collapse
|
187
|
Taylor CF, Field D, Sansone SA, Aerts J, Apweiler R, Ashburner M, Ball CA, Binz PA, Bogue M, Booth T, Brazma A, Brinkman RR, Michael Clark A, Deutsch EW, Fiehn O, Fostel J, Ghazal P, Gibson F, Gray T, Grimes G, Hancock JM, Hardy NW, Hermjakob H, Julian RK, Kane M, Kettner C, Kinsinger C, Kolker E, Kuiper M, Le Novère N, Leebens-Mack J, Lewis SE, Lord P, Mallon AM, Marthandan N, Masuya H, McNally R, Mehrle A, Morrison N, Orchard S, Quackenbush J, Reecy JM, Robertson DG, Rocca-Serra P, Rodriguez H, Rosenfelder H, Santoyo-Lopez J, Scheuermann RH, Schober D, Smith B, Snape J, Stoeckert CJ, Tipton K, Sterk P, Untergasser A, Vandesompele J, Wiemann S. Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project. Nat Biotechnol 2008; 26:889-96. [PMID: 18688244 PMCID: PMC2771753 DOI: 10.1038/nbt.1411] [Citation(s) in RCA: 358] [Impact Index Per Article: 22.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The Minimum Information for Biological and Biomedical Investigations (MIBBI) project provides a resource for those exploring the range of extant minimum information checklists and fosters coordinated development of such checklists.
Collapse
|
188
|
Aryee MJA, Quackenbush J. An optimized predictive strategy for interactome mapping. J Proteome Res 2008; 7:4089-94. [PMID: 18642945 DOI: 10.1021/pr700858e] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We present an optimized experimental strategy that can accelerate progress toward identifying the majority of pairwise protein interactions. Our method involves applying a predictive algorithm, based on the existing data, to identify protein pairs likely to interact and prioritizing these for screening. The approach is iterative as additional data allows one to refine predictions directing the next stage of experimentation.
Collapse
|
189
|
Djebbari A, Quackenbush J. Seeded Bayesian Networks: constructing genetic networks from microarray data. BMC SYSTEMS BIOLOGY 2008; 2:57. [PMID: 18601736 PMCID: PMC2474592 DOI: 10.1186/1752-0509-2-57] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2007] [Accepted: 07/04/2008] [Indexed: 11/10/2022]
Abstract
BACKGROUND DNA microarrays and other genomics-inspired technologies provide large datasets that often include hidden patterns of correlation between genes reflecting the complex processes that underlie cellular metabolism and physiology. The challenge in analyzing large-scale expression data has been to extract biologically meaningful inferences regarding these processes - often represented as networks - in an environment where the datasets are often imperfect and biological noise can obscure the actual signal. Although many techniques have been developed in an attempt to address these issues, to date their ability to extract meaningful and predictive network relationships has been limited. Here we describe a method that draws on prior information about gene-gene interactions to infer biologically relevant pathways from microarray data. Our approach consists of using preliminary networks derived from the literature and/or protein-protein interaction data as seeds for a Bayesian network analysis of microarray results. RESULTS Through a bootstrap analysis of gene expression data derived from a number of leukemia studies, we demonstrate that seeded Bayesian Networks have the ability to identify high-confidence gene-gene interactions which can then be validated by comparison to other sources of pathway data. CONCLUSION The use of network seeds greatly improves the ability of Bayesian Network analysis to learn gene interaction networks from gene expression data. We demonstrate that the use of seeds derived from the biomedical literature or high-throughput protein-protein interaction data, or the combination, provides improvement over a standard Bayesian Network analysis, allowing networks involving dynamic processes to be deduced from the static snapshots of biological systems that represent the most common source of microarray data. Software implementing these methods has been included in the widely used TM4 microarray analysis package.
Collapse
|
190
|
Boedigheimer MJ, Wolfinger RD, Bass MB, Bushel PR, Chou JW, Cooper M, Corton JC, Fostel J, Hester S, Lee JS, Liu F, Liu J, Qian HR, Quackenbush J, Pettit S, Thompson KL. Sources of variation in baseline gene expression levels from toxicogenomics study control animals across multiple laboratories. BMC Genomics 2008; 9:285. [PMID: 18549499 PMCID: PMC2453529 DOI: 10.1186/1471-2164-9-285] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2007] [Accepted: 06/12/2008] [Indexed: 01/22/2023] Open
Abstract
Background The use of gene expression profiling in both clinical and laboratory settings would be enhanced by better characterization of variance due to individual, environmental, and technical factors. Meta-analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies could yield useful information on baseline fluctuations in gene expression, although control animal data has not been available on a scale and in a form best served for data-mining. Results A dataset of control animal microarray expression data was assembled by a working group of the Health and Environmental Sciences Institute's Technical Committee on the Application of Genomics in Mechanism Based Risk Assessment in order to provide a public resource for assessments of variability in baseline gene expression. Data from over 500 Affymetrix microarrays from control rat liver and kidney were collected from 16 different institutions. Thirty-five biological and technical factors were obtained for each animal, describing a wide range of study characteristics, and a subset were evaluated in detail for their contribution to total variability using multivariate statistical and graphical techniques. Conclusion The study factors that emerged as key sources of variability included gender, organ section, strain, and fasting state. These and other study factors were identified as key descriptors that should be included in the minimal information about a toxicogenomics study needed for interpretation of results by an independent source. Genes that are the most and least variable, gender-selective, or altered by fasting were also identified and functionally categorized. Better characterization of gene expression variability in control animals will aid in the design of toxicogenomics studies and in the interpretation of their results.
Collapse
|
191
|
Lee Y, Quackenbush J. Using the TIGR gene index databases for biological discovery. ACTA ACUST UNITED AC 2008; Chapter 1:Unit 1.6. [PMID: 18428690 DOI: 10.1002/0471250953.bi0106s03] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The TIGR Gene Index web pages provide access to analyses of ESTs and gene sequences for nearly 60 species, as well as a number of resources derived from these. Each species-specific database is presented using a common format with a homepage. A variety of methods exist that allow users to search each species-specific database. Methods implemented currently include nucleotide or protein sequence queries using WU-BLAST, text-based searches using various sequence identifiers, searches by gene, tissue and library name, and searches using functional classes through Gene Ontology assignments. This protocol provides guidance for using the Gene Index Databases to extract information.
Collapse
|
192
|
Cheng ASL, Culhane AC, Chan MWY, Venkataramu CR, Ehrich M, Nasir A, Rodriguez BAT, Liu J, Yan PS, Quackenbush J, Nephew KP, Yeatman TJ, Huang THM. Epithelial progeny of estrogen-exposed breast progenitor cells display a cancer-like methylome. Cancer Res 2008; 68:1786-96. [PMID: 18339859 DOI: 10.1158/0008-5472.can-07-5547] [Citation(s) in RCA: 115] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Estrogen imprinting is used to describe a phenomenon in which early developmental exposure to endocrine disruptors increases breast cancer risk later in adult life. We propose that long-lived, self-regenerating stem and progenitor cells are more susceptible to the exposure injury than terminally differentiated epithelial cells in the breast duct. Mammospheres, containing enriched breast progenitors, were used as an exposure system to simulate this imprinting phenomenon in vitro. Using MeDIP-chip, a methylation microarray screening method, we found that 0.5% (120 loci) of human CpG islands were hypermethylated in epithelial cells derived from estrogen-exposed progenitors compared with the non-estrogen-exposed control cells. This epigenetic event may lead to progressive silencing of tumor suppressor genes, including RUNX3, in these epithelial cells, which also occurred in primary breast tumors. Furthermore, normal tissue in close proximity to the tumor site also displayed RUNX3 hypermethylation, suggesting that this aberrant event occurs in early breast carcinogenesis. The high prevalence of estrogen-induced epigenetic changes in primary tumors and the surrounding histologically normal tissues provides the first empirical link between estrogen injury of breast stem/progenitor cells and carcinogenesis. This finding also offers a mechanistic explanation as to why a tumor suppressor gene, such as RUNX3, can be heritably silenced by epigenetic mechanisms in breast cancer.
Collapse
|
193
|
Abstract
The promise of the genome project was that a complete sequence would provide us with information that would transform biology and medicine. But the 'parts list' that has emerged from the genome project is far from the 'wiring diagram' and 'circuit logic' we need to understand the link between genotype, environment and phenotype. While genomic technologies such as DNA microarrays, proteomics and metabolomics have given us new tools and new sources of data to address these problems, a number of crucial elements remain to be addressed before we can begin to close the loop and develop a predictive quantitative biology that is the stated goal of so much of current biological research, including systems biology. Our approach to this problem has largely been one of integration, bringing together a vast wealth of information to better interpret the experimental data we are generating in genomic assays and creating publicly available databases and software tools to facilitate the work of others. Recently, we have used a similar approach to trying to understand the biological networks that underlie the phenotypic responses we observe and starting us on the road to developing a predictive biology.
Collapse
|
194
|
Deutsch EW, Ball CA, Bova GS, Brazma A, Bumgarner RE, Campbell D, Causton HC, Christiansen J, Davidson D, Eichner LJ, Goo YA, Grimmond S, Henrich T, Johnson MH, Korb M, Mills JC, Oudes A, Parkinson HE, Pascal LE, Quackenbush J, Ramialison M, Ringwald M, Sansone SA, Sherlock G, Stoeckert CJ, Swedlow J, Taylor RC, Walashek L, Zhou Y, Liu AY, True LD. Development of the Minimum Information Specification for In Situ Hybridization and Immunohistochemistry Experiments (MISFISHIE). OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2007; 10:205-8. [PMID: 16901227 DOI: 10.1089/omi.2006.10.205] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We describe the creation process of the Minimum Information Specification for In Situ Hybridization and Immunohistochemistry Experiments (MISFISHIE). Modeled after the existing minimum information specification for microarray data, we created a new specification for gene expression localization experiments, initially to facilitate data sharing within a consortium. After successful use within the consortium, the specification was circulated to members of the wider biomedical research community for comment and refinement. After a period of acquiring many new suggested requirements, it was necessary to enter a final phase of excluding those requirements that were deemed inappropriate as a minimum requirement for all experiments. The full specification will soon be published as a version 1.0 proposal to the community, upon which a more full discussion must take place so that the final specification may be achieved with the involvement of the whole community.
Collapse
|
195
|
Hollingsworth JW, Whitehead G, Berman KG, Tekippe EM, Gilmour MI, Larkin JE, Quackenbush J, Schwartz DA. Genetic basis of murine antibacterial defense to streptococcal lung infection. Immunogenetics 2007; 59:713-24. [PMID: 17701033 DOI: 10.1007/s00251-007-0242-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2007] [Accepted: 07/03/2007] [Indexed: 12/14/2022]
Abstract
To evaluate the effect of genetic background on antibacterial defense to streptococcal infection, eight genetically diverse strains of mice (A/J, DBA/2J, CAST/Ei, FVB/NJ, BALB/cJ, C57BL/6J, 129/SvImJ, and C3H/HeJ) and tlr2-deficient mice (C57BL/6(tlr2-/-)) were infected with three doses of Streptococcus zooepidemicus (500, 5,000, or 50,000 colony-forming units) by alveolar challenge. There was a range of susceptibility between the strains at each dose and time point (6, 24, and 96 h). At the lowest dose, the 129/SvImJ and C3H/HeJ strains had significantly higher bacterial counts at all time points after infection, when compared to A/J, DBA/2J, CAST/Ei, FVB/NJ, which were resistant to infection at the low dose of innoculum. At the medium dose, 129/SvImJ and C3H/HeJ had higher bacterial counts, while A/J, DBA/2J, and BALB/cJ showed reduced streptococcal growth. After the highest dose of Streptococcus, there were minimal differences between strains, suggesting the protective impact of modifier genes can be overcome. TLR2-deficient animals contained increased bacterial load with reduced cytokines after 96 h when compared to C57BL/6J controls suggesting a role of innate immunity in late antibacterial defense. Overall, we identify vulnerable (129/SvlmJ and C3H/HeJ) and resistant (A/J, FVB, and DBA) mouse strains to streptococcal lung infection, which demonstrate divergent genetic expression profiles. These results demonstrate that innate differences in pulmonary host defense to S. zooepidemicus are dependent on host genetic factors.
Collapse
|
196
|
Mar JC, Rubio R, Quackenbush J. Inferring steady state single-cell gene expression distributions from analysis of mesoscopic samples. Genome Biol 2007; 7:R119. [PMID: 17169148 PMCID: PMC1794432 DOI: 10.1186/gb-2006-7-12-r119] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2006] [Revised: 11/08/2006] [Accepted: 12/14/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A great deal of interest has been generated by systems biology approaches that attempt to develop quantitative, predictive models of cellular processes. However, the starting point for all cellular gene expression, the transcription of RNA, has not been described and measured in a population of living cells. RESULTS Here we present a simple model for transcript levels based on Poisson statistics and provide supporting experimental evidence for genes known to be expressed at high, moderate, and low levels. CONCLUSION Although the model describes a microscopic process occurring at the level of an individual cell, the supporting data we provide uses a small number of cells where the echoes of the underlying stochastic processes can be seen. Not only do these data confirm our model, but this general strategy opens up a potential new approach, Mesoscopic Biology, that can be used to assess the natural variability of processes occurring at the cellular level in biological systems.
Collapse
|
197
|
Raj JU, Aliferis C, Caprioli RM, Cowley AW, Davies PF, Duncan MW, Erle DJ, Erzurum SC, Finn PW, Ischiropoulos H, Kaminski N, Kleeberger SR, Leikauf GD, Loyd JE, Martin TR, Matalon S, Moore JH, Quackenbush J, Sabo-Attwood T, Shapiro SD, Schnitzer JE, Schwartz DA, Schwiebert LM, Sheppard D, Ware LB, Weiss ST, Whitsett JA, Wurfel MM, Matthay MA. Genomics and proteomics of lung disease: conference summary. Am J Physiol Lung Cell Mol Physiol 2007; 293:L45-51. [PMID: 17468134 PMCID: PMC4212816 DOI: 10.1152/ajplung.00139.2007] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
198
|
Danley PD, Mullen SP, Liu F, Nene V, Quackenbush J, Shaw KL. A cricket Gene Index: a genomic resource for studying neurobiology, speciation, and molecular evolution. BMC Genomics 2007; 8:109. [PMID: 17459168 PMCID: PMC1878485 DOI: 10.1186/1471-2164-8-109] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2006] [Accepted: 04/25/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND As the developmental costs of genomic tools decline, genomic approaches to non-model systems are becoming more feasible. Many of these systems may lack advanced genetic tools but are extremely valuable models in other biological fields. Here we report the development of expressed sequence tags (EST's) in an orthopteroid insect, a model for the study of neurobiology, speciation, and evolution. RESULTS We report the sequencing of 14,502 EST's from clones derived from a nerve cord cDNA library, and the subsequent construction of a Gene Index from these sequences, from the Hawaiian trigonidiine cricket Laupala kohalensis. The Gene Index contains 8607 unique sequences comprised of 2575 tentative consensus (TC) sequences and 6032 singletons. For each of the unique sequences, an attempt was made to assign a provisional annotation and to categorize its function using a Gene Ontology-based classification through a sequence-based comparison to known proteins. In addition, a set of unique 70 base pair oligomers that can be used for DNA microarrays was developed. All Gene Index information is posted at the DFCI Gene Indices web page CONCLUSION Orthopterans are models used to understand the neurophysiological basis of complex motor patterns such as flight and stridulation. The sequences presented in the cricket Gene Index will provide neurophysiologists with many genetic tools that have been largely absent in this field. The cricket Gene Index is one of only two gene indices to be developed in an evolutionary model system. Species within the genus Laupala have speciated recently, rapidly, and extensively. Therefore, the genes identified in the cricket Gene Index can be used to study the genomics of speciation. Furthermore, this gene index represents a significant EST resources for basal insects. As such, this resource is a valuable comparative tool for the understanding of invertebrate molecular evolution. The sequences presented here will provide much needed genomic resources for three distinct but overlapping fields of inquiry: neurobiology, speciation, and molecular evolution.
Collapse
|
199
|
Dimitrakov JD, Kim J, Lee J, Quackenbush J, Zhou W, Liotta L, Petricoin E, Zurakowski D, Freeman MR, Nickel JC. 93: A Panel of Potential Diagnostic Biomarkers for Chronic Prostatitis/Chronic Pelvic Pain Syndrome. J Urol 2007. [DOI: 10.1016/s0022-5347(18)30358-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
200
|
Song JS, Maghsoudi K, Li W, Fox E, Quackenbush J, Shirley Liu X. Microarray blob-defect removal improves array analysis. Bioinformatics 2007; 23:966-71. [PMID: 17332024 DOI: 10.1093/bioinformatics/btm043] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
MOTIVATION New generation Affymetrix oligonucleotide microarrays often have blob-like image defects that will require investigators to either repeat their hybridization assays or analyze their data with the defects left in place. We investigated the effect of analyzing a spike-in experiment on Affymetrix ENCODE tiling arrays in the presence of simulated blobs covering between 1 and 9% of the array area. Using two different ChIP-chip tiling array analysis programs (Affymetrix tiling array software, TAS, and model-based analysis of tiling arrays, MAT), we found that even the smallest blob defects significantly decreased the sensitivity and increased the false discovery rate (FDR) of the spike-in target prediction. RESULTS We introduced a new software tool, the microarray blob remover (MBR), which allows rapid visualization, detection and removal of various blob defects from the .CEL files of different types of Affymetrix microarrays. It is shown that using MBR significantly improves the sensitivity and FDR of a tiling array analysis compared to leaving the affected probes in the analysis. AVAILABILITY The MBR software and the sample array .CEL files used in this article are available at: http://liulab.dfci.harvard.edu/Software/MBR/MBR.htm
Collapse
|