26
|
Ordway JM, Budiman MA, Korshunova Y, Maloney RK, Bedell JA, Citek RW, Bacher B, Peterson S, Rohlfing T, Hall J, Brown R, Lakey N, Doerge RW, Martienssen RA, Leon J, McPherson JD, Jeddeloh JA. Identification of novel high-frequency DNA methylation changes in breast cancer. PLoS One 2007; 2:e1314. [PMID: 18091988 PMCID: PMC2117343 DOI: 10.1371/journal.pone.0001314] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2007] [Accepted: 11/06/2007] [Indexed: 12/31/2022] Open
Abstract
Recent data have revealed that epigenetic alterations, including DNA methylation and chromatin structure changes, are among the earliest molecular abnormalities to occur during tumorigenesis. The inherent thermodynamic stability of cytosine methylation and the apparent high specificity of the alterations for disease may accelerate the development of powerful molecular diagnostics for cancer. We report a genome-wide analysis of DNA methylation alterations in breast cancer. The approach efficiently identified a large collection of novel differentially DNA methylated loci (∼200), a subset of which was independently validated across a panel of over 230 clinical samples. The differential cytosine methylation events were independent of patient age, tumor stage, estrogen receptor status or family history of breast cancer. The power of the global approach for discovery is underscored by the identification of a single differentially methylated locus, associated with the GHSR gene, capable of distinguishing infiltrating ductal breast carcinoma from normal and benign breast tissues with a sensitivity and specificity of 90% and 96%, respectively. Notably, the frequency of these molecular abnormalities in breast tumors substantially exceeds the frequency of any other single genetic or epigenetic change reported to date. The discovery of over 50 novel DNA methylation-based biomarkers of breast cancer may provide new routes for development of DNA methylation-based diagnostics and prognostics, as well as reveal epigenetically regulated mechanism involved in breast tumorigenesis.
Collapse
|
27
|
Zhao J, Wang J, An L, Doerge RW, Chen ZJ, Grau CR, Meng J, Osborn TC. Analysis of gene expression profiles in response to Sclerotinia sclerotiorum in Brassica napus. PLANTA 2007; 227:13-24. [PMID: 17665211 DOI: 10.1007/s00425-007-0586-z] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2007] [Accepted: 07/05/2007] [Indexed: 05/16/2023]
Abstract
Sclerotinia sclerotiorum is a necrotrophic plant pathogen which causes serious disease in agronomically important crop species. The molecular basis of plant defense to this pathogen is poorly understood. We investigated gene expression changes associated with S. sclerotiorum infection in a partially resistant and a susceptible genotype of oilseed Brassica napus using a whole genome microarray from Arabidopsis. A total of 686 and 1,547 genes were found to be differentially expressed after infection in the resistant and susceptible genotypes, respectively. The number of differentially expressed genes increased over infection time with the majority being up-regulated in both genotypes. The putative functions of the differentially expressed genes included pathogenesis-related (PR) proteins, proteins involved in the oxidative burst, protein kinase, molecule transporters, cell maintenance and development, abiotic stress, as well as proteins with unknown functions. The gene regulation patterns indicated that a large part of the defense response exhibited as a temporal and quantitative difference between the two genotypes. Genes associated with jasmonic acid (JA) and ethylene signal transduction pathways were induced, but no salicylic acid (SA) responsive genes were identified. Candidate defense genes were identified by integration of the early response genes in the partially resistant line with previously mapped quantitative trait loci (QTL). Expression levels of these genes were verified by Northern blot analyses. These results indicate that genes encoding various proteins involved in diverse roles, particularly WRKY transcription factors and plant cell wall related proteins may play an important role in the defense response to S. sclerotiorum disease.
Collapse
|
28
|
van Leeuwen H, Kliebenstein DJ, West MAL, Kim K, van Poecke R, Katagiri F, Michelmore RW, Doerge RW, St Clair DA. Natural variation among Arabidopsis thaliana accessions for transcriptome response to exogenous salicylic acid. THE PLANT CELL 2007; 19:2099-110. [PMID: 17630278 PMCID: PMC1955704 DOI: 10.1105/tpc.107.050641] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Little is known about how gene expression variation within a given species controls phenotypic variation under different treatments or environments. Here, we surveyed the transcriptome response of seven diverse Arabidopsis thaliana accessions in response to two treatments: the presence and absence of exogenously applied salicylic acid (SA), an important signaling molecule in plant defense. A factorial experiment was conducted with three biological replicates per accession with and without applications of SA and sampled at three time points posttreatment. Transcript level data from Affymetrix ATH1 microarrays were analyzed on both per-gene and gene-network levels to detect expression level polymorphisms associated with SA response. Significant variation in transcript levels for response to SA was detected among the accessions, with relatively few genes responding similarly across all accessions and time points. Twenty-five of 54 defined gene networks identified from other microarray studies (pathogen-challenged Columbia [Col-0]) showed a significant response to SA in one or more accessions. A comparison of gene-network relationships in our data to the pathogen-challenged Col-0 data demonstrated a higher-order conservation of linkages between defense response gene networks. Cvi-1 and Mt-0 appeared to have globally different SA responsiveness in comparison to the other five accessions. Expression level polymorphisms for SA response were abundant at both individual gene and gene-network levels in the seven accessions, suggesting that natural variation for SA response is prevalent in Arabidopsis.
Collapse
|
29
|
West MAL, Kim K, Kliebenstein DJ, van Leeuwen H, Michelmore RW, Doerge RW, St Clair DA. Global eQTL mapping reveals the complex genetic architecture of transcript-level variation in Arabidopsis. Genetics 2007; 175:1441-50. [PMID: 17179097 PMCID: PMC1840073 DOI: 10.1534/genetics.106.064972] [Citation(s) in RCA: 256] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2006] [Accepted: 12/01/2006] [Indexed: 01/09/2023] Open
Abstract
The genetic architecture of transcript-level variation is largely unknown. The genetic determinants of transcript-level variation were characterized in a recombinant inbred line (RIL) population (n = 211) of Arabidopsis thaliana using whole-genome microarray analysis and expression quantitative trait loci (eQTL) mapping of transcript levels as expression traits (e-traits). Genetic control of transcription was highly complex: one-third of the quantitatively controlled transcripts/e-traits were regulated by cis-eQTL, and many trans-eQTL mapped to hotspots that regulated hundreds to thousands of e-traits. Several thousand eQTL of large phenotypic effect were detected, but almost all (93%) of the 36,871 eQTL were associated with small phenotypic effects (R(2) < 0.3). Many transcripts/e-traits were controlled by multiple eQTL with opposite allelic effects and exhibited higher heritability in the RILs than their parents, suggesting nonadditive genetic variation. To our knowledge, this is the first large-scale global eQTL study in a relatively large plant mapping population. It reveals that the genetic control of transcript level is highly variable and multifaceted and that this complexity may be a general characteristic of eukaryotes.
Collapse
|
30
|
Jiang H, Doerge RW. A two-step multiple comparison procedure for a large number of tests and multiple treatments. Stat Appl Genet Mol Biol 2006; 5:Article28. [PMID: 17402912 DOI: 10.2202/1544-6115.1223] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
For situations where the number of tested hypotheses is increasingly large, the power to detect statistically significant multiple treatment effects decreases. As is the case with microarray technology, often researchers are interested in identifying differentially expressed genes for more than two types of cells or treatments. A two-step procedure is proposed for the purpose of increasing power to detect significant effects (i.e., to identify differentially expressed genes). Specifically, in the first step, the null hypothesis of equality across the mean expression levels for all treatments is tested for each gene. In the second step, only pairwise comparisons corresponding to the genes for which the treatment means are statistically different in the first step are tested. We propose an approach to estimate the overall FDR for both fixed rejection regions and fixed FDR significance levels. Also proposed is a procedure to find the FDR significance levels used in the first step and the second step such that the overall FDR can be controlled below a pre-specified FDR significance level. When compared via simulation the two-step approach has increased power over a one-step procedure, and controls the FDR at a desire significance level.
Collapse
|
31
|
Ordway JM, Bedell JA, Citek RW, Nunberg A, Garrido A, Kendall R, Stevens JR, Cao D, Doerge RW, Korshunova Y, Holemon H, McPherson JD, Lakey N, Leon J, Martienssen RA, Jeddeloh JA. Comprehensive DNA methylation profiling in a human cancer genome identifies novel epigenetic targets. Carcinogenesis 2006; 27:2409-23. [PMID: 16952911 DOI: 10.1093/carcin/bgl161] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Using a unique microarray platform for cytosine methylation profiling, the DNA methylation landscape of the human genome was monitored at more than 21,000 sites, including 79% of the annotated transcriptional start sites (TSS). Analysis of an oligodendroglioma derived cell line LN-18 revealed more than 4000 methylated TSS. The gene-centric analysis indicated a complex pattern of DNA methylation exists along each autosome, with a trend of increasing density approaching the telomeres. Remarkably, 2% of CpG islands (CGI) were densely methylated, and 17% had significant levels of 5 mC, whether or not they corresponded to a TSS. Substantial independent verification, obtained from 95 loci, suggested that this approach is capable of large scale detection of cytosine methylation with an accuracy approaching 90%. In addition, we detected large genomic domains that are also susceptible to DNA methylation reinforced inactivation, such as the HOX cluster on chromosome 7 (CH7). Extrapolation from the data suggests that more than 2000 genomic loci may be susceptible to methylation and associated inactivation, and most have yet to be identified. Finally, we report six new targets of epigenetic inactivation (IRX3, WNT10A, WNT6, RARalpha, BMP7 and ZGPAT). These targets displayed cell line and tumor specific differential methylation when compared with normal brain samples, suggesting they may have utility as biomarkers. Uniquely, hypermethylation of the CGI within an IRX3 exon was correlated with over-expression of IRX3 in tumor tissues and cell lines relative to normal brain samples.
Collapse
|
32
|
Kliebenstein DJ, West MAL, van Leeuwen H, Loudet O, Doerge RW, St Clair DA. Identification of QTLs controlling gene expression networks defined a priori. BMC Bioinformatics 2006; 7:308. [PMID: 16780591 PMCID: PMC1540440 DOI: 10.1186/1471-2105-7-308] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2006] [Accepted: 06/16/2006] [Indexed: 11/17/2022] Open
Abstract
Background Gene expression microarrays allow the quantification of transcript accumulation for many or all genes in a genome. This technology has been utilized for a range of investigations, from assessments of gene regulation in response to genetic or environmental fluctuation to global expression QTL (eQTL) analyses of natural variation. Current analysis techniques facilitate the statistical querying of individual genes to evaluate the significance of a change in response, also known as differential expression. Since genes are also known to respond as groups due to their membership in networks, effective approaches are needed to investigate transcriptome variation as related to gene network responses. Results We describe a statistical approach that is capable of assessing higher-order a priori defined gene network response, as measured by microarrays. This analysis detected significant network variation between two Arabidopsis thaliana accessions, Bay-0 and Shahdara. By extending this approach, we were able to identify eQTLs controlling network responses for 18 out of 20 a priori-defined gene networks in a recombinant inbred line population derived from accessions Bay-0 and Shahdara. Conclusion This approach has the potential to be expanded to facilitate direct tests of the relationship between phenotypic trait and transcript genetic architecture. The use of a priori definitions for network eQTL identification has enormous potential for providing direction toward future eQTL analyses.
Collapse
|
33
|
West MAL, van Leeuwen H, Kozik A, Kliebenstein DJ, Doerge RW, St Clair DA, Michelmore RW. High-density haplotyping with microarray-based expression and single feature polymorphism markers in Arabidopsis. Genome Res 2006; 16:787-95. [PMID: 16702412 PMCID: PMC1473188 DOI: 10.1101/gr.5011206] [Citation(s) in RCA: 153] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Expression microarrays hybridized with RNA can simultaneously provide both phenotypic (gene expression) and genotypic (marker) data. We developed two types of genetic markers from Affymetrix GeneChip expression data to generate detailed haplotypes for 148 recombinant inbred lines (RILs) derived from Arabidopsis thaliana accessions Bayreuth and Shahdara. Gene expression markers (GEMs) are based on differences in transcript levels that exhibit bimodal distributions in segregating progeny, while single feature polymorphism (SFP) markers rely on differences in hybridization to individual oligonucleotide probes. Unlike SFPs, GEMs can be derived from any type of DNA-based expression microarray. Our method identifies SFPs independent of a gene's expression level. Alleles for each GEM and SFP marker were ascertained with GeneChip data from parental accessions as well as RILs; a novel algorithm for allele determination using RIL distributions capitalized on the high level of genetic replication per locus. GEMs and SFP markers provided robust markers in 187 and 968 genes, respectively, which allowed estimation of gene order consistent with that predicted from the Col-0 genomic sequence. Using microarrays on a population to simultaneously measure gene expression variation and obtain genotypic data for a linkage map will facilitate expression QTL analyses without the need for separate genotyping. We have demonstrated that gene expression measurements from microarrays can be leveraged to identify polymorphisms across the genome and can be efficiently developed into genetic markers that are verifiable in a large segregating RIL population. Both marker types also offer opportunities for massively parallel mapping in unsequenced and less studied species.
Collapse
|
34
|
Wang J, Tian L, Lee HS, Wei NE, Jiang H, Watson B, Madlung A, Osborn TC, Doerge RW, Comai L, Chen ZJ. Genomewide nonadditive gene regulation in Arabidopsis allotetraploids. Genetics 2006; 172:507-17. [PMID: 16172500 PMCID: PMC1456178 DOI: 10.1534/genetics.105.047894] [Citation(s) in RCA: 390] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2005] [Accepted: 09/19/2005] [Indexed: 12/21/2022] Open
Abstract
Polyploidy has occurred throughout the evolutionary history of all eukaryotes and is extremely common in plants. Reunification of the evolutionarily divergent genomes in allopolyploids creates regulatory incompatibilities that must be reconciled. Here we report genomewide gene expression analysis of Arabidopsis synthetic allotetraploids, using spotted 70-mer oligo-gene microarrays. We detected >15% transcriptome divergence between the progenitors, and 2105 and 1818 genes were highly expressed in Arabidopsis thaliana and A. arenosa, respectively. Approximately 5.2% (1362) and 5.6% (1469) genes displayed expression divergence from the midparent value (MPV) in two independently derived synthetic allotetraploids, suggesting nonadditive gene regulation following interspecific hybridization. Remarkably, the majority of nonadditively expressed genes in the allotetraploids also display expression changes between the parents, indicating that transcriptome divergence is reconciled during allopolyploid formation. Moreover, >65% of the nonadditively expressed genes in the allotetraploids are repressed, and >94% of the repressed genes in the allotetraploids match the genes that are expressed at higher levels in A. thaliana than in A. arenosa, consistent with the silencing of A. thaliana rRNA genes subjected to nucleolar dominance and with overall suppression of the A. thaliana phenotype in the synthetic allotetraploids and natural A. suecica. The nonadditive gene regulation is involved in various biological pathways, and the changes in gene expression are developmentally regulated. In contrast to the small effects of genome doubling on gene regulation in autotetraploids, the combination of two divergent genomes in allotetraploids by interspecific hybridization induces genomewide nonadditive gene regulation, providing a molecular basis for de novo variation and allopolyploid evolution.
Collapse
|
35
|
Teuscher C, Doerge RW, Fillmore PD, Blankenhorn EP. eae36, a locus on mouse chromosome 4, controls susceptibility to experimental allergic encephalomyelitis in older mice and mice immunized in the winter. Genetics 2005; 172:1147-53. [PMID: 16299394 PMCID: PMC1456213 DOI: 10.1534/genetics.105.049049] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Genetic factors are believed to contribute to multiple sclerosis (MS) susceptibility; however, strong evidence implicating intrinsic and environmental factors in the etiopathogenesis of MS also exists. Susceptibility to experimental allergic encephalomyelitis (EAE), the principal animal model of MS, is also influenced by nongenetic factors, including age and season at immunization. This suggests that age- and season-by-gene interactions exist and that different susceptibility loci may influence disease as a function of the two parameters. In this study, linkage analysis based on genome exclusion mapping was carried out using age and season at immunization restricted cohorts of (B10.S x SJL/J) F2 intercross mice in an effort to identify such linkages. Significant linkage of EAE to eae4 and eae5 was detected with 6- to 12-week-old and summer cohorts. In contrast, significant linkage of EAE to eae4 and eae5 was not detected with the >12-week-old and winter/spring populations. Rather, significant linkage to D4Mit203 at 128.50 Mb on chromosome 4 was detected with animals that were >12 weeks old at the time of immunization or were immunized in the winter. This previously unidentified locus has been designated eae36. These results support the existence of age- and season-by-gene-specific interactions in the genetic control of susceptibility to autoimmune inflammatory disease of the central nervous system and suggest that late-onset MS may be immunogenetically distinct.
Collapse
MESH Headings
- Aging/genetics
- Animals
- Crosses, Genetic
- Encephalomyelitis, Autoimmune, Experimental/chemically induced
- Encephalomyelitis, Autoimmune, Experimental/genetics
- Encephalomyelitis, Autoimmune, Experimental/immunology
- Genetic Markers
- Genetic Predisposition to Disease/genetics
- Immunization
- Mice
- Mice, Inbred C57BL
- Mice, Inbred Strains
- Microsatellite Repeats
- Polymorphism, Single Nucleotide
- Seasons
Collapse
|
36
|
Kliebenstein DJ, West MAL, van Leeuwen H, Kim K, Doerge RW, Michelmore RW, St Clair DA. Genomic survey of gene expression diversity in Arabidopsis thaliana. Genetics 2005; 172:1179-89. [PMID: 16204207 PMCID: PMC1456216 DOI: 10.1534/genetics.105.049353] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Differential gene expression controls variation in numerous plant traits, such as flowering time and plant/pest interactions, but little is known about the genomic distribution of the determinants of transcript levels and their associated variation. Affymetrix ATH1 GeneChip microarrays representing 22,810 genes were used to survey the transcriptome of seven Arabidopsis thaliana accessions in the presence and absence of exogenously applied salicylic acid (SA). These accessions encompassed approximately 80% of the moderate- to high-frequency nucleotide polymorphisms in Arabidopsis. A factorial design, consisting of three biological replicates per accession for the two treatments at three time points (4, 28, and 52 hr post-treatment), and a total of 126 microarrays were used. Between any pair of Arabidopsis accessions, we detected on average 2234 genes (ranging from 1428 to 3334) that were significantly differentially expressed under the conditions of this experiment, using a split-plot analysis of variance. Upward of 6433 genes were differentially expressed between at least one pair of accessions. These results suggest that analysis of additional genetic, developmental, and environmental conditions may show that a significant fraction of the Arabidopsis genome is differentially expressed. Examination of sequence diversity demonstrated a significant positive association with diversity in gene expression.
Collapse
|
37
|
Bogdan M, Doerge RW. Biased estimators of quantitative trait locus heritability and location in interval mapping. Heredity (Edinb) 2005; 95:476-84. [PMID: 16189542 DOI: 10.1038/sj.hdy.6800747] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
In many empirical studies, it has been observed that genome scans yield biased estimates of heritability, as well as genetic effects. It is widely accepted that quantitative trait locus (QTL) mapping is a model selection procedure, and that the overestimation of genetic effects is the result of using the same data for model selection as estimation of parameters. There are two key steps in QTL modeling, each of which biases the estimation of genetic effects. First, test procedures are employed to select the regions of the genome for which there is significant evidence for the presence of QTL. Second, and most important for this demonstration, estimates of the genetic effects are reported only at the locations for which the evidence is maximal. We demonstrate that even when we know there is just one QTL present (ignoring the testing bias), and we use interval mapping to estimate its location and effect, the estimator of the effect will be biased. As evidence, we present results of simulations investigating the relative importance of the two sources of bias and the dependence of bias of heritability estimators on the true QTL heritability, sample size, and the length of the investigated part of the genome. Moreover, we present results of simulations demonstrating the skewness of the distribution of estimators of QTL locations and the resulting bias in estimation of location. We use computer simulations to investigate the dependence of this bias on the true QTL location, heritability, and the sample size.
Collapse
|
38
|
Martienssen RA, Doerge RW, Colot V. Epigenomic mapping in Arabidopsis using tiling microarrays. Chromosome Res 2005; 13:299-308. [PMID: 15868423 DOI: 10.1007/s10577-005-1507-2] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
In addition to genetic information, chromosomes transmit epigenetic information from cell to cell during division, and sometimes from generation to generation. While genetic information is encoded directly in the DNA sequence, epigenetic information is not, although it is usually associated with specific chromosomal regions. Epigenetic modifications in plants include cytosine methylation as well as modification of histones and other chromosomal proteins. Small interfering RNA play major roles in targeting these modifications to specific regions. Genomic tiling microarrays are powerful tools for analysing epigenetic information, and we review their application in building epigenomic maps in the model plant, Arabidopsis.
Collapse
|
39
|
Munneke B, Schlauch KA, Simonsen KL, Beavis WD, Doerge RW. Adding confidence to gene expression clustering. Genetics 2005; 170:2003-11. [PMID: 15944369 PMCID: PMC1449753 DOI: 10.1534/genetics.104.031500] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
It has been well established that gene expression data contain large amounts of random variation that affects both the analysis and the results of microarray experiments. Typically, microarray data are either tested for differential expression between conditions or grouped on the basis of profiles that are assessed temporally or across genetic or environmental conditions. While testing differential expression relies on levels of certainty to evaluate the relative worth of various analyses, cluster analysis is exploratory in nature and has not had the benefit of any judgment of statistical inference. By using a novel dissimilarity function to ascertain gene expression clusters and conditional randomization of the data space to illuminate distinctions between statistically significant clusters of gene expression patterns, we aim to provide a level of confidence to inferred clusters of gene expression data. We apply both permutation and convex hull approaches for randomization of the data space and show that both methods can provide an effective assessment of gene expression profiles whose coregulation is statistically different from that expected by random chance alone.
Collapse
|
40
|
Bradley KC, Boulware MB, Jiang H, Doerge RW, Meisel RL, Mermelstein PG. Changes in gene expression within the nucleus accumbens and striatum following sexual experience. GENES BRAIN AND BEHAVIOR 2005; 4:31-44. [PMID: 15660666 DOI: 10.1111/j.1601-183x.2004.00093.x] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Sexual experience, like repeated drug use, produces long-term changes including sensitization in the nucleus accumbens and dorsal striatum. To better understand the molecular mechanisms underlying the neuroadaptations following sexual experience, we employed a DNA microarray approach to identify genes differentially expressed between sexually experienced and sexually naive female hamsters within the nucleus accumbens and dorsal striatum. For 6 weeks, a stimulus male was placed in the home cage of one-half of the hormonally primed, ovariectomized female hamsters. On the seventh week, the two experimental groups were subdivided, with one half paired with a stimulus male. In comparison with sexually naive animals, sexually experienced hamsters receiving a stimulus male on week 7 exhibited an increase in a large number of genes. Conversely, sexually experienced female hamsters not receiving a stimulus male on week 7 exhibited a reduction in the expression of many genes. For directional changes and the categories of genes regulated by the experimental conditions, data were consistent across the nucleus accumbens and dorsal striatum. However, the specific genes exhibiting changes in expression were disparate. These experiments, among the first to profile genes regulated by female sexual behavior, will provide insight into the mechanisms by which both motivated behaviors and drugs of abuse induce long-term changes in the mesolimbic and nigrostriatal dopamine pathways.
Collapse
|
41
|
Bogdan M, Ghosh JK, Doerge RW. Modifying the Schwarz Bayesian information criterion to locate multiple interacting quantitative trait loci. Genetics 2005; 167:989-99. [PMID: 15238547 PMCID: PMC1470914 DOI: 10.1534/genetics.103.021683] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
The problem of locating multiple interacting quantitative trait loci (QTL) can be addressed as a multiple regression problem, with marker genotypes being the regressor variables. An important and difficult part in fitting such a regression model is the estimation of the QTL number and respective interactions. Among the many model selection criteria that can be used to estimate the number of regressor variables, none are used to estimate the number of interactions. Our simulations demonstrate that epistatic terms appearing in a model without the related main effects cause the standard model selection criteria to have a strong tendency to overestimate the number of interactions, and so the QTL number. With this as our motivation we investigate the behavior of the Schwarz Bayesian information criterion (BIC) by explaining the phenomenon of the overestimation and proposing a novel modification of BIC that allows the detection of main effects and pairwise interactions in a backcross population. Results of an extensive simulation study demonstrate that our modified version of BIC performs very well in practice. Our methodology can be extended to general populations and higher-order interactions.
Collapse
|
42
|
Stevens JR, Doerge RW. Combining Affymetrix microarray results. BMC Bioinformatics 2005; 6:57. [PMID: 15774008 PMCID: PMC1274254 DOI: 10.1186/1471-2105-6-57] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2004] [Accepted: 03/17/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND As the use of microarray technology becomes more prevalent it is not unusual to find several laboratories employing the same microarray technology to identify genes related to the same condition in the same species. Although the experimental specifics are similar, typically a different list of statistically significant genes result from each data analysis. RESULTS We propose a statistically-based meta-analytic approach to microarray analysis for the purpose of systematically combining results from the different laboratories. This approach provides a more precise view of genes that are significantly related to the condition of interest while simultaneously allowing for differences between laboratories. Of particular interest is the widely used Affymetrix oligonucleotide array, the results of which are naturally suited to a meta-analysis. A simulation model based on the Affymetrix platform is developed to examine the adaptive nature of the meta-analytic approach and to illustrate the usefulness of such an approach in combining microarray results across laboratories. The approach is then applied to real data involving a mouse model for multiple sclerosis. CONCLUSION The quantitative estimates from the meta-analysis model tend to be closer to the "true" degree of differential expression than any single lab. Meta-analytic methods can systematically combine Affymetrix results from different laboratories to gain a clearer understanding of genes' relationships to specific conditions of interest.
Collapse
|
43
|
Abstract
While extensive progress has been made in quantitative trait locus (QTL) mapping for diploid species, similar progress in QTL mapping for polyploids has been limited due to the complex genetic architecture of polyploids. To date, QTL mapping in polyploids has focused mainly on tetraploids with dominant and/or codominant markers. Here, we extend this view to include any even ploidy level under a dominant marker system. Our approach first selects the most likely chromosomal marker configurations using a Bayesian selection criterion and then fits an interval-mapping model to each candidate. Profiles of the likelihood-ratio test statistic and the maximum-likelihood estimates (MLEs) of parameters including QTL effects are obtained via the EM algorithm. Putative QTL are then detected using a resampling-based significance threshold, and the corresponding parental configuration is identified to be the underlying parental configuration from which the data are observed. Although presented via pseudo-doubled backcross experiments, this approach can be readily extended to other breeding systems. Our method is applied to single-dose restriction fragment autotetraploid alfalfa data, and the performance is investigated through simulation studies.
Collapse
|
44
|
Madlung A, Tyagi AP, Watson B, Jiang H, Kagochi T, Doerge RW, Martienssen R, Comai L. Genomic changes in synthetic Arabidopsis polyploids. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2005. [PMID: 15634199 DOI: 10.1111/j.1365-313x.2004.02297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Polyploids are common and arise frequently by genome duplication (autopolyploids) or interspecific hybridization (allopolyploids). Neoallopolyploids display sterility, lethality, phenotypic instability, gene silencing and epigenetic changes. Little is known about the molecular basis of these phenomena, and how much genomic remodeling happens upon allopolyploidization. Extensive genomic remodeling has been documented in wheat, but little remodeling occurs in cotton. Newly synthesized Arabidopsis allopolyploids, which display phenotypic instability and low fertility, displayed several, possibly related mechanisms that can remodel genomes. We detected transcriptional activity of several transposons although their transposition was limited. One represents a new family of conditionally active En-Spm-like transposons of Arabidopsis thaliana, which underwent remodeling of CG methylation upon allopolyploidization. A random amplified fragment length polymorphism survey suggested remodeling at few, specific loci. Meiotic analyses revealed the appearance of chromosomal fragments in a substantial fraction of anther meiocytes. In several individuals produced by hybrids between the synthetic and a natural allopolyploid pollen viability inversely correlated with meiotic instability. Activity of selected DNA transposons and the possibly related chromosomal breaks could cause changes by inducing translocations and rearrangements.
Collapse
|
45
|
Madlung A, Tyagi AP, Watson B, Jiang H, Kagochi T, Doerge RW, Martienssen R, Comai L. Genomic changes in synthetic Arabidopsis polyploids. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2005; 41:221-30. [PMID: 15634199 DOI: 10.1111/j.1365-313x.2004.02297.x] [Citation(s) in RCA: 220] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Polyploids are common and arise frequently by genome duplication (autopolyploids) or interspecific hybridization (allopolyploids). Neoallopolyploids display sterility, lethality, phenotypic instability, gene silencing and epigenetic changes. Little is known about the molecular basis of these phenomena, and how much genomic remodeling happens upon allopolyploidization. Extensive genomic remodeling has been documented in wheat, but little remodeling occurs in cotton. Newly synthesized Arabidopsis allopolyploids, which display phenotypic instability and low fertility, displayed several, possibly related mechanisms that can remodel genomes. We detected transcriptional activity of several transposons although their transposition was limited. One represents a new family of conditionally active En-Spm-like transposons of Arabidopsis thaliana, which underwent remodeling of CG methylation upon allopolyploidization. A random amplified fragment length polymorphism survey suggested remodeling at few, specific loci. Meiotic analyses revealed the appearance of chromosomal fragments in a substantial fraction of anther meiocytes. In several individuals produced by hybrids between the synthetic and a natural allopolyploid pollen viability inversely correlated with meiotic instability. Activity of selected DNA transposons and the possibly related chromosomal breaks could cause changes by inducing translocations and rearrangements.
Collapse
|
46
|
Wang J, Lee JJ, Tian L, Lee HS, Chen M, Rao S, Wei EN, Doerge RW, Comai L, Chen ZJ. Methods for genome-wide analysis of gene expression changes in polyploids. Methods Enzymol 2005; 395:570-96. [PMID: 15865985 PMCID: PMC1986650 DOI: 10.1016/s0076-6879(05)95030-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Polyploidy is an evolutionary innovation, providing extra sets of genetic material for phenotypic variation and adaptation. It is predicted that changes of gene expression by genetic and epigenetic mechanisms are responsible for novel variation in nascent and established polyploids (Liu and Wendel, 2002; Osborn et al., 2003; Pikaard, 2001). Studying gene expression changes in allopolyploids is more complicated than in autopolyploids, because allopolyploids contain more than two sets of genomes originating from divergent, but related, species. Here we describe two methods that are applicable to the genome-wide analysis of gene expression differences resulting from genome duplication in autopolyploids or interactions between homoeologous genomes in allopolyploids. First, we describe an amplified fragment length polymorphism (AFLP)--complementary DNA (cDNA) display method that allows the discrimination of homoeologous loci based on restriction polymorphisms between the progenitors. Second, we describe microarray analyses that can be used to compare gene expression differences between the allopolyploids and respective progenitors using appropriate experimental design and statistical analysis. We demonstrate the utility of these two complementary methods and discuss the pros and cons of using the methods to analyze gene expression changes in autopolyploids and allopolyploids. Furthermore, we describe these methods in general terms to be of wider applicability for comparative gene expression in a variety of evolutionary, genetic, biological, and physiological contexts.
Collapse
|
47
|
Churchill GA, Airey DC, Allayee H, Angel JM, Attie AD, Beatty J, Beavis WD, Belknap JK, Bennett B, Berrettini W, Bleich A, Bogue M, Broman KW, Buck KJ, Buckler E, Burmeister M, Chesler EJ, Cheverud JM, Clapcote S, Cook MN, Cox RD, Crabbe JC, Crusio WE, Darvasi A, Deschepper CF, Doerge RW, Farber CR, Forejt J, Gaile D, Garlow SJ, Geiger H, Gershenfeld H, Gordon T, Gu J, Gu W, de Haan G, Hayes NL, Heller C, Himmelbauer H, Hitzemann R, Hunter K, Hsu HC, Iraqi FA, Ivandic B, Jacob HJ, Jansen RC, Jepsen KJ, Johnson DK, Johnson TE, Kempermann G, Kendziorski C, Kotb M, Kooy RF, Llamas B, Lammert F, Lassalle JM, Lowenstein PR, Lu L, Lusis A, Manly KF, Marcucio R, Matthews D, Medrano JF, Miller DR, Mittleman G, Mock BA, Mogil JS, Montagutelli X, Morahan G, Morris DG, Mott R, Nadeau JH, Nagase H, Nowakowski RS, O'Hara BF, Osadchuk AV, Page GP, Paigen B, Paigen K, Palmer AA, Pan HJ, Peltonen-Palotie L, Peirce J, Pomp D, Pravenec M, Prows DR, Qi Z, Reeves RH, Roder J, Rosen GD, Schadt EE, Schalkwyk LC, Seltzer Z, Shimomura K, Shou S, Sillanpää MJ, Siracusa LD, Snoeck HW, Spearow JL, Svenson K, Tarantino LM, Threadgill D, Toth LA, Valdar W, de Villena FPM, Warden C, Whatley S, Williams RW, Wiltshire T, Yi N, Zhang D, Zhang M, Zou F. The Collaborative Cross, a community resource for the genetic analysis of complex traits. Nat Genet 2004; 36:1133-7. [PMID: 15514660 DOI: 10.1038/ng1104-1133] [Citation(s) in RCA: 754] [Impact Index Per Article: 37.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The goal of the Complex Trait Consortium is to promote the development of resources that can be used to understand, treat and ultimately prevent pervasive human diseases. Existing and proposed mouse resources that are optimized to study the actions of isolated genetic loci on a fixed background are less effective for studying intact polygenic networks and interactions among genes, environments, pathogens and other factors. The Collaborative Cross will provide a common reference panel specifically designed for the integrative analysis of complex systems and will change the way we approach human health and disease.
Collapse
|
48
|
Tian L, Fong MP, Wang JJ, Wei NE, Jiang H, Doerge RW, Chen ZJ. Reversible histone acetylation and deacetylation mediate genome-wide, promoter-dependent and locus-specific changes in gene expression during plant development. Genetics 2004; 169:337-45. [PMID: 15371352 PMCID: PMC1448893 DOI: 10.1534/genetics.104.033142] [Citation(s) in RCA: 136] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Histone acetylation and deacetylation activate or repress transcription, yet the physiological relevance of reversible changes in chromatin structure and gene expression is poorly understood. We have shown that disrupting the expression of AtHD1 that encodes a putative Arabidopsis thaliana histone deacetylase induces a variety of developmental abnormalities. However, causal effects of the AtHD1 disruption on chromatin structure and gene expression are unknown. Using Arabidopsis spotted oligo-gene microarray analysis, here we report that >7% of the transcriptome was up- or downregulated in A. thaliana plants containing a T-DNA insertion in AtHD1 (athd1-t1), indicating that AtHD1 provides positive and negative control of transcriptional regulation. Remarkably, genes involved in ionic homeostasis and protein synthesis were ectopically expressed, whereas genes in ionic homeostasis, protein transport, and plant hormonal regulation were repressed in athd1-t1 leaves or flowers, suggesting a role of AtHD1 in developmental and environmental regulation of gene expression. Moreover, defective AtHD1 induced site-specific and reversible acetylation changes in H3-Lys9, H4-Lys12, and H4 tetra-lysines (residues 5, 8, 12, and 16) in homozygous recessive and heterozygous plants. Transcriptional activation was locus specific and often associated with specific acetylation sites in the vicinity of promoters, whereas gene repression did not correlate with changes in histone acetylation or correlated directly with H3-Lys9 methylation but not with DNA methylation. The data suggest that histone acetylation and deacetylation are promoter dependent, locus specific, and genetically reversible, which provides a general mechanism for reversible gene regulation responsive to developmental and environmental changes.
Collapse
|
49
|
Jeffrey Chen Z, Wang J, Tian L, Lee HS, Wang JJ, Chen M, Lee JJ, Josefsson C, Madlung A, Watson B, Lippman Z, Vaughn M, Chris Pires J, Colot V, Doerge RW, Martienssen RA, Comai L, Osborn TC. The development of an Arabidopsis model system for genome-wide analysis of polyploidy effects. Biol J Linn Soc Lond 2004; 82:689-700. [PMID: 18079994 DOI: 10.1111/j.1095-8312.2004.00351.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Arabidopsis is a model system not only for studying numerous aspects of plant biology, but also for understanding mechanisms of the rapid evolutionary process associated with genome duplication and polyploidization. Although in animals interspecific hybrids are often sterile and aneuploids are related to disease syndromes, both Arabidopsis autopolyploids and allopolyploids occur in nature and can be readily formed in the laboratory, providing an attractive system for comparing changes in gene expression and genome structure among relatively 'young' and 'established' or 'ancient' polyploids. Powerful reverse and forward genetics in Arabidopsis offer an exceptional means by which regulatory mechanisms of gene and genome duplication may be revealed. Moreover, the Arabidopsis genome is completely sequenced; both coding and non-coding sequences are available. We have developed spotted oligo-gene and chromosome microarrays using the complete Arabidopsis genome sequence. The oligo-gene microarray consists of ~26 000 70-mer oligonucleotides that are designed from all annotated genes in Arabidopsis, and the chromosome microarray contains 1 kb genomic tiling fragments amplified from a chromosomal region or the complete sequence of chromosome 4. We have demonstrated the utility of microarrays for genome-wide analysis of changes in gene expression, genome organization and chromatin structure in Arabidopsis polyploids and related species.
Collapse
|
50
|
Lippman Z, Gendrel AV, Black M, Vaughn MW, Dedhia N, McCombie WR, Lavine K, Mittal V, May B, Kasschau KD, Carrington JC, Doerge RW, Colot V, Martienssen R. Role of transposable elements in heterochromatin and epigenetic control. Nature 2004; 430:471-6. [PMID: 15269773 DOI: 10.1038/nature02651] [Citation(s) in RCA: 822] [Impact Index Per Article: 41.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2004] [Accepted: 05/07/2004] [Indexed: 11/09/2022]
Abstract
Heterochromatin has been defined as deeply staining chromosomal material that remains condensed in interphase, whereas euchromatin undergoes de-condensation. Heterochromatin is found near centromeres and telomeres, but interstitial sites of heterochromatin (knobs) are common in plant genomes and were first described in maize. These regions are repetitive and late-replicating. In Drosophila, heterochromatin influences gene expression, a heterochromatin phenomenon called position effect variegation. Similarities between position effect variegation in Drosophila and gene silencing in maize mediated by "controlling elements" (that is, transposable elements) led in part to the proposal that heterochromatin is composed of transposable elements, and that such elements scattered throughout the genome might regulate development. Using microarray analysis, we show that heterochromatin in Arabidopsis is determined by transposable elements and related tandem repeats, under the control of the chromatin remodelling ATPase DDM1 (Decrease in DNA Methylation 1). Small interfering RNAs (siRNAs) correspond to these sequences, suggesting a role in guiding DDM1. We also show that transposable elements can regulate genes epigenetically, but only when inserted within or very close to them. This probably accounts for the regulation by DDM1 and the DNA methyltransferase MET1 of the euchromatic, imprinted gene FWA, as its promoter is provided by transposable-element-derived tandem repeats that are associated with siRNAs.
Collapse
|