51
|
Biddie SC, John S, Hager GL. Genome-wide mechanisms of nuclear receptor action. Trends Endocrinol Metab 2010; 21:3-9. [PMID: 19800253 PMCID: PMC2818176 DOI: 10.1016/j.tem.2009.08.006] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/20/2009] [Revised: 08/19/2009] [Accepted: 08/21/2009] [Indexed: 01/11/2023]
Abstract
Nuclear receptors are involved in a myriad of physiological processes, responding to ligands and binding to DNA at sequence-specific cis-regulatory elements. This binding occurs in the context of chromatin, a critical factor in regulating eukaryotic transcription. Recent high-throughput assays have examined nuclear receptor action genome-wide, advancing our understanding of receptor binding to regulatory elements. Here, we discuss current knowledge of genome-wide response element occupancy by receptors and the function of transcription factor networks in regulating nuclear receptor action. We highlight emerging roles for the epigenome, chromatin remodeling, histone modification, histone variants and long-range chromosomal interactions in nuclear receptor binding and receptor-dependent gene regulation. These mechanisms contribute importantly to the action of nuclear receptors in health and disease.
Collapse
Affiliation(s)
- Simon C. Biddie
- Laboratory of Receptor Biology and Gene Expression, Building 41, Rm B602, 41 Library Dr., National Cancer Institute, NIH, Bethesda, MD 20892-5055
- Henry Wellcome Laboratories for Integrative Neuroscience and Endocrinology, The Dorothy Hodgkin Building, University of Bristol, Whitson Street, Bristol, BS1 3NY United Kingdom
| | - Sam John
- Laboratory of Receptor Biology and Gene Expression, Building 41, Rm B602, 41 Library Dr., National Cancer Institute, NIH, Bethesda, MD 20892-5055
| | - Gordon L. Hager
- Laboratory of Receptor Biology and Gene Expression, Building 41, Rm B602, 41 Library Dr., National Cancer Institute, NIH, Bethesda, MD 20892-5055
| |
Collapse
|
52
|
Ohnesorg T, Eggers S, Leonhard WN, Sinclair AH, White SJ. Rapid high-throughput analysis of DNaseI hypersensitive sites using a modified Multiplex Ligation-dependent Probe Amplification approach. BMC Genomics 2009; 10:412. [PMID: 19728890 PMCID: PMC2748097 DOI: 10.1186/1471-2164-10-412] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2009] [Accepted: 09/04/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mapping DNaseI hypersensitive sites is commonly used to identify regulatory regions in the genome. However, currently available methods are either time consuming and laborious, expensive or require large numbers of cells. We aimed to develop a quick and straightforward method for the analysis of DNaseI hypersensitive sites that overcomes these problems. RESULTS We have developed a modified Multiplex Ligation-dependent Probe Amplification (MLPA) approach for the identification and analysis of genomic regulatory regions. The utility of this approach was demonstrated by simultaneously analysing 20 loci from the ENCODE project for DNaseI hypersensitivity in a range of different cell lines. We were able to obtain reproducible results with as little as 5 x 10(4) cells per DNaseI treatment. Our results broadly matched those previously reported by the ENCODE project, and both technical and biological replicates showed high correlations, indicating the sensitivity and reproducibility of this method. CONCLUSION This new method will considerably facilitate the identification and analysis of DNaseI hypersensitive sites. Due to the multiplexing potential of MLPA (up to 50 loci can be examined) it is possible to analyse dozens of DNaseI hypersensitive sites in a single reaction. Furthermore, the high sensitivity of MLPA means that fewer than 10(5) cells per DNaseI treatment can be used, allowing the discovery and analysis of tissue specific regulatory regions without the need for pooling. This method is quick and easy and results can be obtained within 48 hours after harvesting of cells or tissues. As no special equipment is required, this method can be applied by any laboratory interested in the analysis of DNaseI hypersensitive regions.
Collapse
Affiliation(s)
- Thomas Ohnesorg
- Murdoch Children's Research Institute and Department of Paediatrics, University of Melbourne, Royal Children's Hospital, Parkville, VIC, Australia
| | | | | | | | | |
Collapse
|
53
|
The HBS1L-MYB intergenic interval associated with elevated HbF levels shows characteristics of a distal regulatory region in erythroid cells. Blood 2009; 114:1254-62. [DOI: 10.1182/blood-2009-03-210146] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Abstract
HBS1L-MYB intergenic polymorphism (HMIP) on chromosome 6q23 is associated with elevated fetal hemoglobin levels and has pleiotropic effects on several hematologic parameters. To investigate potential regulatory activity in the region, we have measured sensitivity of the sequences to DNase I cleavage that identified 3 tissue-specific DNase I hypersensitive sites in the core intergenic interval. Chromatin immunoprecipitation with microarray (ChIP-chip) analysis showed strong histone acetylation in a defined interval of 65 kb corresponding to the core HBS1L-MYB intergenic region in primary human erythroid cells but not in non–MYB-expressing HeLa cells. ChIP-chip analysis also identified several potential cis-regulatory elements as strong GATA-1 signals that coincided with the DNase I hypersensitive sites present in MYB-expressing erythroid cells. We suggest that HMIP contains regulatory sequences that could be important in hematopoiesis by controlling MYB expression. This study provides the functional link between genetic association of HMIP with control of fetal hemoglobin and other hematologic parameters. We also present a large-scale analysis of histone acetylation as well as RNA polymerase II and GATA-1 interactions on chromosome 6q, and α and β globin gene loci. The data suggest that GATA-1 regulates numerous genes of various functions on chromosome 6q.
Collapse
|
54
|
Placek K, Coffre M, Maiella S, Bianchi E, Rogge L. Genetic and epigenetic networks controlling T helper 1 cell differentiation. Immunology 2009; 127:155-62. [PMID: 19476511 DOI: 10.1111/j.1365-2567.2009.03059.x] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Significant progress has been made during the past years in our understanding of the mechanisms that control the differentiation of naïve CD4(+) T cells into effector T-cell subsets with distinct functional properties. Previous work allowed the identification of key molecules involved in regulating this highly complex process, such as cytokines and their receptors, signal transducers and transcription factors. More recently, the emphasis of research in this field has been to elucidate how the multiplicity of signals is integrated to shape a T helper subset-specific gene-expression program controlling differentiation and effector functions. In this review we will highlight advances that have been made in unravelling the genetic and epigenetic networks controlling differentiation of naïve CD4(+) T cells into interferon-gamma(IFN-gamma)-secreting T helper type 1 (Th1) cells.
Collapse
|
55
|
Ng CEL, Ito Y, Osato M. Retroviral integration sites (RIS) mark cis-regulatory elements. Crit Rev Oncol Hematol 2009; 71:1-11. [DOI: 10.1016/j.critrevonc.2008.10.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2008] [Revised: 09/29/2008] [Accepted: 10/15/2008] [Indexed: 12/15/2022] Open
|
56
|
Mann T, Humbert R, Dorschner M, Stamatoyannopoulos J, Noble WS. A thermodynamic approach to PCR primer design. Nucleic Acids Res 2009; 37:e95. [PMID: 19528077 PMCID: PMC2715258 DOI: 10.1093/nar/gkp443] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We developed a primer design method, Pythia, in which state of the art DNA binding affinity computations are directly integrated into the primer design process. We use chemical reaction equilibrium analysis to integrate multiple binding energy calculations into a conservative measure of polymerase chain reaction (PCR) efficiency, and a precomputed index on genomic sequences to evaluate primer specificity. We show that Pythia can design primers with success rates comparable with those of current methods, but yields much higher coverage in difficult genomic regions. For example, in RepeatMasked sequences in the human genome, Pythia achieved a median coverage of 89% as compared with a median coverage of 51% for Primer3. For parameter settings yielding sensitivities of 81%, our method has a recall of 97%, compared with the Primer3 recall of 48%. Because our primer design approach is based on the chemistry of DNA interactions, it has fewer and more physically meaningful parameters than current methods, and is therefore easier to adjust to specific experimental requirements. Our software is freely available at http://pythia.sourceforge.net.
Collapse
Affiliation(s)
- Tobias Mann
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | | | | | | | | |
Collapse
|
57
|
Astrand C, Belikov S, Wrange O. Histone acetylation characterizes chromatin presetting by NF1 and Oct1 and enhances glucocorticoid receptor binding to the MMTV promoter. Exp Cell Res 2009; 315:2604-15. [PMID: 19463811 DOI: 10.1016/j.yexcr.2009.05.012] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2009] [Revised: 04/29/2009] [Accepted: 05/15/2009] [Indexed: 10/20/2022]
Abstract
Transcription from the mouse mammary tumor virus (MMTV) promoter is induced by the glucocorticoid receptor (GR). This switch was reconstituted in Xenopus oocytes. Previously, we showed that Nuclear Factor 1 (NF1) and Octamer Transcription Factor 1 (Oct1) bind constitutively to the MMTV promoter and thereby induce translational nucleosome positioning representing an intermediary, i.e. preset, state of nucleosome organization. Here we further characterize this NF1 and Oct1 induced preset chromatin in relation to the inactive and the hormone-activated state. The preset chromatin exhibits increased histone acetylation but does not cause dissociation of histone H1 as oppose to the hormone-activated state. Furthermore, upon hormone induction the preset MMTV chromatin displays an enhanced and prolonged GR binding capacity and transcription during an intrinsic and time-dependent silencing of the injected template. The silencing process correlates with a reduced histone acetylation. However, a histone deacetylase inhibitor, trichostatin A (TSA), does not counteract silencing in spite of its distinct stimulation of GR-DNA binding. The latter indicates the importance of histone acetylation to maintain DNA access for inducible factor binding. We discuss how constitutively bound factors such as NF1 and Oct1 may participate in the maintenance of tissue specificity of hormone responsive genes.
Collapse
Affiliation(s)
- Carolina Astrand
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden.
| | | | | |
Collapse
|
58
|
Abstract
The glucocorticoid receptor regulates the expression of a large number of genes in mammalian cells. The interaction of this receptor with regulatory elements has been discovered to be highly dynamic, with occupancy states measured in seconds, rather than minutes or hours. This finding has led to a paradigm shift in our understanding of receptor function throughout the genome. The mechanisms involved in these rapid exchange events, as well as the implications for receptor function, are discussed.
Collapse
Affiliation(s)
- Simon C Biddie
- Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, NIH, Bethesda, MD 20892-5055, USA
| | | |
Collapse
|
59
|
Giresi PG, Lieb JD. Isolation of active regulatory elements from eukaryotic chromatin using FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements). Methods 2009; 48:233-9. [PMID: 19303047 DOI: 10.1016/j.ymeth.2009.03.003] [Citation(s) in RCA: 157] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2008] [Revised: 03/05/2009] [Accepted: 03/07/2009] [Indexed: 10/21/2022] Open
Abstract
The binding of sequence-specific regulatory factors and the recruitment of chromatin remodeling activities cause nucleosomes to be evicted from chromatin in eukaryotic cells. Traditionally, these active sites have been identified experimentally through their sensitivity to nucleases. Here we describe the details of a simple procedure for the genome-wide isolation of nucleosome-depleted DNA from human chromatin, termed FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements). We also provide protocols for different methods of detecting FAIRE-enriched DNA, including use of PCR, DNA microarrays, and next-generation sequencing. FAIRE works on all eukaryotic chromatin tested to date. To perform FAIRE, chromatin is crosslinked with formaldehyde, sheared by sonication, and phenol-chloroform extracted. Most genomic DNA is crosslinked to nucleosomes and is sequestered to the interphase, whereas DNA recovered in the aqueous phase corresponds to nucleosome-depleted regions of the genome. The isolated regions are largely coincident with the location of DNaseI hypersensitive sites, transcriptional start sites, enhancers, insulators, and active promoters. Given its speed and simplicity, FAIRE has utility in establishing chromatin profiles of diverse cell types in health and disease, isolating DNA regulatory elements en masse for further characterization, and as a screening assay for the effects of small molecules on chromatin organization.
Collapse
Affiliation(s)
- Paul G Giresi
- Department of Biology and Carolina Center for the Genome Sciences, University of North Carolina at Chapel Hill, CB #3280, 408 Fordham Hall, Chapel Hill, NC 27599-3280, USA
| | | |
Collapse
|
60
|
George AA, Louis Schiltz R, Hager GL. Dynamic access of the glucocorticoid receptor to response elements in chromatin. Int J Biochem Cell Biol 2009; 41:214-24. [PMID: 18930837 PMCID: PMC2632576 DOI: 10.1016/j.biocel.2008.09.019] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2008] [Revised: 09/22/2008] [Accepted: 09/23/2008] [Indexed: 12/22/2022]
Abstract
Transcriptional activation as a rate-limiting step of gene expression is often triggered by an environmental stimulus that is transmitted through a signaling cascade to specific transcription factors. Transcription factors must then find appropriate target genes in the context of chromatin. Subsequent modulation of local chromatin domains is now recognized as a major mechanism of gene regulation. The interactions of transcription factors with chromatin structures have recently been observed to be highly dynamic, with residence times measured in seconds. Thus, the concept of static, multi-protein complexes forming at regulatory elements in the genome has been replaced by a new paradigm that envisages rapid and continuous exchange events with the template. These highly dynamic interactions are a property of both DNA-protein and protein-protein interactions and are inherent to every stage of the transcriptional response. In this review we discuss the dynamics of a nuclear receptor, and its transcriptional response in the chromatin context.
Collapse
Affiliation(s)
- Anuja A. George
- Laboratory of Receptor Biology and Gene Expression, Building 41, B602, 41 Library Dr., National Cancer Institute, NIH, Bethesda, MD 20892-5055
| | - R. Louis Schiltz
- Laboratory of Receptor Biology and Gene Expression, Building 41, B602, 41 Library Dr., National Cancer Institute, NIH, Bethesda, MD 20892-5055
| | - Gordon L. Hager
- Laboratory of Receptor Biology and Gene Expression, Building 41, B602, 41 Library Dr., National Cancer Institute, NIH, Bethesda, MD 20892-5055
| |
Collapse
|
61
|
Attanasio C, Reymond A, Humbert R, Lyle R, Kuehn MS, Neph S, Sabo PJ, Goldy J, Weaver M, Haydock A, Lee K, Dorschner M, Dermitzakis ET, Antonarakis SE, Stamatoyannopoulos JA. Assaying the regulatory potential of mammalian conserved non-coding sequences in human cells. Genome Biol 2008; 9:R168. [PMID: 19055709 PMCID: PMC2646272 DOI: 10.1186/gb-2008-9-12-r168] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2008] [Revised: 09/24/2008] [Accepted: 12/02/2008] [Indexed: 01/26/2023] Open
Abstract
The fraction of experimentally active conserved non-coding sequences within any given cell type is low, so classical assays are unlikely to expose their potential. Background Conserved non-coding sequences in the human genome are approximately tenfold more abundant than known genes, and have been hypothesized to mark the locations of cis-regulatory elements. However, the global contribution of conserved non-coding sequences to the transcriptional regulation of human genes is currently unknown. Deeply conserved elements shared between humans and teleost fish predominantly flank genes active during morphogenesis and are enriched for positive transcriptional regulatory elements. However, such deeply conserved elements account for <1% of the conserved non-coding sequences in the human genome, which are predominantly mammalian. Results We explored the regulatory potential of a large sample of these 'common' conserved non-coding sequences using a variety of classic assays, including chromatin remodeling, and enhancer/repressor and promoter activity. When tested across diverse human model cell types, we find that the fraction of experimentally active conserved non-coding sequences within any given cell type is low (approximately 5%), and that this proportion increases only modestly when considered collectively across cell types. Conclusions The results suggest that classic assays of cis-regulatory potential are unlikely to expose the functional potential of the substantial majority of mammalian conserved non-coding sequences in the human genome.
Collapse
Affiliation(s)
- Catia Attanasio
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1 rue Michel Servet, 1211, Geneva 4, Switzerland.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
62
|
Failure of terminal erythroid differentiation in EKLF-deficient mice is associated with cell cycle perturbation and reduced expression of E2F2. Mol Cell Biol 2008; 28:7394-401. [PMID: 18852285 DOI: 10.1128/mcb.01087-08] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Erythroid Krüppel-like factor (EKLF) is a Krüppel-like transcription factor identified as a transcriptional activator and chromatin modifier in erythroid cells. EKLF-deficient (Eklf(-/-)) mice die at day 14.5 of gestation from severe anemia. In this study, we demonstrate that early progenitor cells fail to undergo terminal erythroid differentiation in Eklf(-/-) embryos. To discover potential EKLF target genes responsible for the failure of erythropoiesis, transcriptional profiling was performed with RNA from wild-type and Eklf(-/-) early erythroid progenitor cells. These analyses identified significant perturbation of a network of genes involved in cell cycle regulation, with the critical regulator of the cell cycle, E2f2, at a hub. E2f2 mRNA and protein levels were markedly decreased in Eklf(-/-) early erythroid progenitor cells, which showed a delay in the G(1)-to-S-phase transition. Chromatin immunoprecipitation analysis demonstrated EKLF occupancy at the proximal E2f2 promoter in vivo. Consistent with the role of EKLF as a chromatin modifier, EKLF binding sites in the E2f2 promoter were located in a region of EKLF-dependent DNase I sensitivity in early erythroid progenitor cells. We propose a model in which EKLF-dependent activation and modification of the E2f2 locus is required for cell cycle progression preceding terminal erythroid differentiation.
Collapse
|
63
|
Esumi N, Kachi S, Hackler L, Masuda T, Yang Z, Campochiaro PA, Zack DJ. BEST1 expression in the retinal pigment epithelium is modulated by OTX family members. Hum Mol Genet 2008; 18:128-41. [PMID: 18849347 DOI: 10.1093/hmg/ddn323] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A number of genes preferentially expressed in the retinal pigment epithelium (RPE) are associated with retinal degenerative disease. One of these, BEST1, encodes bestrophin-1, a protein that when mutated causes Best macular dystrophy. As a model for RPE gene regulation, we have been studying the mechanisms that control BEST1 expression, and recently demonstrated that members of the MITF-TFE family modulate BEST1 transcription. The human BEST1 upstream region from -154 to +38 bp is sufficient to direct expression in the RPE, and positive-regulatory elements exist between -154 and -104 bp. Here, we show that the -154 to -104 bp region is necessary for RPE expression in transgenic mice and contains a predicted OTX-binding site (Site 1). Since another non-canonical OTX site (Site 2) is located nearby, we tested the function of these sites using BEST1 promoter/luciferase constructs by in vivo electroporation and found that mutation of both sites reduces promoter activity. Three OTX family proteins - OTX1, OTX2 and CRX - bound to both Sites 1 and 2 in vitro, and all of them increased BEST1 promoter activity. Surprisingly, we found that human and bovine RPE expressed not only OTX2 but also CRX, the CRX genomic region in bovine RPE was hypersensitive to DNase I, consistent with active transcription, and that both OTX2 and CRX bound to the BEST1 proximal promoter in vivo. These results demonstrate for the first time CRX expression in the RPE, and suggest that OTX2 and CRX may act as positive modulators of the BEST1 promoter in the RPE.
Collapse
Affiliation(s)
- Noriko Esumi
- The Guerrieri Center for Genetic Engineering and Molecular Ophthalmology at The Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD 21287-9289, USA
| | | | | | | | | | | | | |
Collapse
|
64
|
Taylor JM, Wicks K, Vandiedonck C, Knight JC. Chromatin profiling across the human tumour necrosis factor gene locus reveals a complex, cell type-specific landscape with novel regulatory elements. Nucleic Acids Res 2008; 36:4845-62. [PMID: 18653526 PMCID: PMC2528168 DOI: 10.1093/nar/gkn444] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The TNF locus on chromosome 6p21 encodes a family of proteins with key roles in the immune response whose dysregulation leads to severe disease. Transcriptional regulation is important, with cell type and stimulus-specific enhancer complexes involving the proximal TNF promoter. We show how quantitative chromatin profiling across a 34 kb region spanning the TNF locus has allowed us to identify a number of novel DNase hypersensitive sites and characterize more distant regulatory elements. We demonstrate DNase hypersensitive sites corresponding to the lymphotoxin alpha (LTA) and tumour necrosis factor (TNF) promoter regions, a CpG island in exon 4 of lymphotoxin beta (LTB), the 3′ end of nuclear factor of kappa light polypeptide gene enhancer in B-cells inhibitor-like 1 (NFKBIL1) and 3.4 kb upstream of LTA. These sites co-localize to highly conserved DNA sequences and show evidence of cell type specificity when lymphoblastoid, Jurkat, U937, HeLa and HEK293T cell lines are analysed using Southern blotting. For Jurkat T cells, we define histone modifications across the locus. Peaks of acetylated histone H3 and H4, together with tri-methyl K4 of histone H3, correspond to hypersensitive sites, notably in exon 4 of LTB. We provide evidence of a functional role for an intergenic DNase I hypersensitive site distal to LTA in Jurkat cells based on reporter gene analysis, with evidence of recruitment of upstream stimulatory factors (USF) transcription factors.
Collapse
Affiliation(s)
- Jennifer M Taylor
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | | | | | | |
Collapse
|
65
|
Lian H, Thompson WA, Thurman R, Stamatoyannopoulos JA, Noble WS, Lawrence CE. Automated mapping of large-scale chromatin structure in ENCODE. ACTA ACUST UNITED AC 2008; 24:1911-6. [PMID: 18591192 PMCID: PMC2519158 DOI: 10.1093/bioinformatics/btn335] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Motivation: A recently developed DNaseI assay has given us our first genome-wide view of chromatin structure. In addition to cataloging DNaseI hypersensitive sites, these data allows us to more completely characterize overall features of chromatin accessibility. We employed a Bayesian hierarchical change-point model (CPM), a generalization of a hidden Markov Model (HMM), to characterize tiled microarray DNaseI sensitivity data available from the ENCODE project. Results: Our analysis shows that the accessibility of chromatin to cleavage by DNaseI is well described by a four state model of local segments with each state described by a continuous mixture of Gaussian variables. The CPM produces a better fit to the observed data than the HMM. The large posterior probability for the four-state CPM suggests that the data falls naturally into four classes of regions, which we call major and minor DNaseI hypersensitive sites (DHSs), regions of intermediate sensitivity, and insensitive regions. These classes agree well with a model of chromatin in which local disruptions (DHSs) are concentrated within larger domains of intermediate sensitivity, the accessibility islands. The CPM assigns 92% of the bases within the ENCODE regions to the insensitive regions. The 5.8% of the bases that are in regions of intermediate sensitivity are clearly enriched in functional elements, including genes and activating histone modifications, while the remaining 2.2% of the bases in hypersensitive regions are very strongly enriched in these elements. Availability: The CPM software is available upon request from the authors. Contact:jstam@stamlab.org; noble@gs.washington.edu; Charles_Lawrence@brown.edu Supplementary information:Supplementary data are available at Bioinformatics online. Source code is available at http://noble.gs.washington.edu/proj/segment.
Collapse
Affiliation(s)
- Heng Lian
- Division of Mathematical Sciences, SPMS, Nanyang Technological University, Singapore
| | | | | | | | | | | |
Collapse
|
66
|
Chapuis J, Moisan F, Mellick G, Elbaz A, Silburn P, Pasquier F, Hannequin D, Lendon C, Campion D, Amouyel P, Lambert JC. Association study of the NEDD9 gene with the risk of developing Alzheimer's and Parkinson's disease. Hum Mol Genet 2008; 17:2863-7. [PMID: 18579580 DOI: 10.1093/hmg/ddn183] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Alzheimer's disease (AD) and Parkinson's disease (PD), the two most common neurodegenerative disorders in the elderly, have been hypothesized to share genetic determinants. Recently, Li et al. proposed that a variant in the NEDD9 gene may be one of these common genetic factors. We attempted to confirm this initial observation by conducting an equivalent analysis in terms of pathologies and sample size. We genotyped the NEDD9 rs760678 SNP in three independent AD case-control studies (n = 3176) and two independent PD case-control studies (n = 1855). However, we failed to detect an association of this SNP with the risk of developing AD or PD, in any of these populations. In conclusion, these data indicate that the rs760678 SNP of the NEDD9 gene is at best a weak genetic determinant of AD or PD.
Collapse
Affiliation(s)
- Julien Chapuis
- Inserm, U744, Institut Pasteur de Lille, Université de Lille 2, Lille, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
67
|
Rowell E, Merkenschlager M, Wilson CB. Long-range regulation of cytokine gene expression. Curr Opin Immunol 2008; 20:272-80. [PMID: 18485687 DOI: 10.1016/j.coi.2008.03.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2008] [Revised: 03/01/2008] [Accepted: 03/25/2008] [Indexed: 11/30/2022]
Abstract
In metazoans, transcription is regulated by promoters and additional elements, which may be located far from their target gene(s). Moreover, genes (including those encoding cytokines and cytokine receptors) are commonly clustered in the genome, providing the opportunity for the shared, competitive, or sequential use of regulatory elements. New techniques, discussed here, are generating an avalanche of high-resolution genome-wide data through which candidate regulatory elements have been identified in specific cell types (including T cells), their functions inferred, and their physical interactions in three-dimensional space demonstrated. As a result, a nearly comprehensive list of regulatory elements in the Th2 cytokine locus, a growing list of elements in the interferon-gamma gene locus, and maps of their three-dimensional interactions are now available, though much remains to be learned about the molecular mechanisms at play, the dynamics of these interactions, and their functional importance.
Collapse
Affiliation(s)
- Emily Rowell
- Department of Immunology, University of Washington, Seattle, WA 98195, USA
| | | | | |
Collapse
|
68
|
John S, Sabo PJ, Johnson TA, Sung MH, Biddie SC, Lightman SL, Voss TC, Davis SR, Meltzer PS, Stamatoyannopoulos JA, Hager GL. Interaction of the glucocorticoid receptor with the chromatin landscape. Mol Cell 2008; 29:611-24. [PMID: 18342607 DOI: 10.1016/j.molcel.2008.02.010] [Citation(s) in RCA: 248] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2007] [Revised: 10/22/2007] [Accepted: 02/27/2008] [Indexed: 11/18/2022]
Abstract
The generality and spectrum of chromatin-remodeling requirements for nuclear receptor function are unknown. We have characterized glucocorticoid receptor (GR) binding events and chromatin structural transitions across GR-induced or -repressed genes. This analysis reveals that GR binding invariably occurs at nuclease-accessible sites (DHS). A remarkable diversity of mechanisms, however, render these sites available for GR binding. Accessibility of the GR binding sites is either constitutive or hormone inducible. Within each category, some DHS sites require the Brg1-containing Swi/Snf complex, but others are Brg1 independent, implicating a different remodeling complex. The H2A.Z histone variant is highly enriched at both inducible and constitutive DHS sites and is subject to exchange during hormone activation. The DHS profile is highly cell specific, implicating cell-selective organization of the chromatin landscape as a critical determinant of tissue-selective receptor function. Furthermore, the widespread requirement for chromatin remodeling supports the recent hypothesis that the rapid exchange of receptor proteins occurs during nucleosome reorganization.
Collapse
Affiliation(s)
- Sam John
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD 20892-5055, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
69
|
Petrykowska HM, Vockley CM, Elnitski L. Detection and characterization of silencers and enhancer-blockers in the greater CFTR locus. Genome Res 2008; 18:1238-46. [PMID: 18436892 DOI: 10.1101/gr.073817.107] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Silencers and enhancer-blockers (EBs) are cis-acting, negative regulatory elements (NREs) that control interactions between promoters and enhancers. Although relatively uncharacterized in terms of biological mechanisms, these elements are likely to be abundant in the genome. We developed an experimental strategy to identify silencers and EBs using transient transfection assays. A known insulator and EB from the chicken beta-globin locus, cHS4, served as a control element for these assays. We examined 47 sequences from a 1.8-Mb region of human chromosome 7 for silencer and EB activities. The majority of functional elements displayed directional and promoter-specific activities. A limited number of sequences acted in a dual manner, as both silencers and EBs. We examined genomic data, epigenetic modifications, and sequence motifs within these regions. Strong silencer elements contained a novel CT-rich motif, often in multiple copies. Deletion of the motif from three regions caused a measurable loss of silencing ability in these sequences. Moreover, five duplicate occurrences of this motif were identified in the cHS4 insulator. These motifs provided an explanation for an uncharacterized silencing activity we measured in the insulator element. Overall, we identified 15 novel NREs, which contribute new insights into the prevalence and composition of sequences that negatively regulate gene expression.
Collapse
Affiliation(s)
- Hanna M Petrykowska
- Genome Technology Branch, National Human Genome Research Institute, NIH, Rockville, MD 20852, USA
| | | | | |
Collapse
|
70
|
Abstract
The human genome is predominantly composed of nonprotein-coding sequences whose function remains largely undefined. A significant portion of the noncoding DNA is believed to serve as transcriptional regulatory elements that control gene expression in specific cell types at appropriate developmental stages. Identifying these regulatory sequences and determining the mechanisms by which they act present a great challenge in the postgenomic era. Previous investigations using genetic, molecular, and biochemical approaches have uncovered a large number of proteins involved in regulating transcription. Knowledge of the genomic locations of DNA binding for these proteins in the nucleus should define the identity and nature of the transcriptional regulatory sequences and reveal the gene regulatory networks in cells. Chromatin immunoprecipitation (ChIP) is a common method for detecting interactions between a protein and a DNA sequence in vivo. In recent years, this method has been combined with DNA microarrays and other high-throughput technologies to enable genome-wide identification of DNA-binding sites for various nuclear proteins. Here, we review recent advances in ChIP-based methods for genome-wide detection of protein-DNA interactions, and discuss their significance in enhancing our knowledge of the gene regulatory networks and epigenetic mechanisms in cells.
Collapse
Affiliation(s)
- Tae Hoon Kim
- Ludwig Institute for Cancer Research, University of California, San Diego School of Medicine, La Jolla, California 92093-0653, USA.
| | | |
Collapse
|
71
|
Levasseur DN, Wang J, Dorschner MO, Stamatoyannopoulos JA, Orkin SH. Oct4 dependence of chromatin structure within the extended Nanog locus in ES cells. Genes Dev 2008; 22:575-80. [PMID: 18283123 DOI: 10.1101/gad.1606308] [Citation(s) in RCA: 98] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Embryonic stem (ES) cells offer insight into early developmental fate decisions, and their controlled differentiation may yield vast regenerative potential. The molecular determinants supporting ES cell self-renewal are incompletely understood. The homeodomain proteins Nanog and Oct4 are essential for mouse ES cell self-renewal. Using a high-throughput approach, we discovered DNaseI hypersensitive sites and potential regulatory elements along a 160-kb region of the genome that includes GDF3, Dppa3, and Nanog. We analyzed gene expression, chromatin occupancy, and higher-order chromatin structure throughout this gene locus and found that expression of the reprogramming factor Oct4 is required to maintain its integrity.
Collapse
Affiliation(s)
- Dana N Levasseur
- Division of Hematology-Oncology, Children's Hospital and Dana Farber Cancer Institute, Harvard Medical School, Harvard Stem Cell Institute, Boston, MA 02115, USA
| | | | | | | | | |
Collapse
|
72
|
Higgs DR, Vernimmen D, Hughes J, Gibbons R. Using genomics to study how chromatin influences gene expression. Annu Rev Genomics Hum Genet 2007; 8:299-325. [PMID: 17506662 DOI: 10.1146/annurev.genom.8.080706.092323] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
A postgenome challenge is to understand how the code in DNA is converted into the biological processes underlying various cell fates. By establishing the appropriate technical tools, we are moving from an era in which such questions have been asked by studying individual genes to one in which large domains, whole chromosomes, and the entire human genome can be investigated. These developments will allow us to study in parallel the transcriptional program and components of the epigenetic program (nuclear position, timing of replication, chromatin structure and modification, DNA methylation) to determine the hierarchy and order of events required to switch genes on and off during differentiation and development.
Collapse
Affiliation(s)
- Douglas R Higgs
- MRC Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford OX3 9DS, United Kingdom.
| | | | | | | |
Collapse
|
73
|
Yu X, Lin J, Zack DJ, Qian J. Identification of tissue-specific cis-regulatory modules based on interactions between transcription factors. BMC Bioinformatics 2007; 8:437. [PMID: 17996093 PMCID: PMC2194798 DOI: 10.1186/1471-2105-8-437] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2007] [Accepted: 11/09/2007] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Evolutionary conservation has been used successfully to help identify cis-acting DNA regions that are important in regulating tissue-specific gene expression. Motivated by increasing evidence that some DNA regulatory regions are not evolutionary conserved, we have developed an approach for cis-regulatory region identification that does not rely upon evolutionary sequence conservation. RESULTS The conservation-independent approach is based on an empirical potential energy between interacting transcription factors (TFs). In this analysis, the potential energy is defined as a function of the number of TF interactions in a genomic region and the strength of the interactions. By identifying sets of interacting TFs, the analysis locates regions enriched with the binding sites of these interacting TFs. We applied this approach to 30 human tissues and identified 6232 putative cis-regulatory modules (CRMs) regulating 2130 tissue-specific genes. Interestingly, some genes appear to be regulated by different CRMs in different tissues. Known regulatory regions are highly enriched in our predicted CRMs. In addition, DNase I hypersensitive sites, which tend to be associated with active regulatory regions, significantly overlap with the predicted CRMs, but not with more conserved regions. We also find that conserved and non-conserved CRMs regulate distinct gene groups. Conserved CRMs control more essential genes and genes involved in fundamental cellular activities such as transcription. In contrast, non-conserved CRMs, in general, regulate more non-essential genes, such as genes related to neural activity. CONCLUSION These results demonstrate that identifying relevant sets of binding motifs can help in the mapping of DNA regulatory regions, and suggest that non-conserved CRMs play an important role in gene regulation.
Collapse
Affiliation(s)
- Xueping Yu
- Wilmer Institute, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA.
| | | | | | | |
Collapse
|
74
|
Greenbaum JA, Parker SC, Tullius TD. Detection of DNA structural motifs in functional genomic elements. Genome Res 2007; 17:940-6. [PMID: 17568009 PMCID: PMC1891352 DOI: 10.1101/gr.5602807] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The completion of the human genome project has fueled the search for regulatory elements by a variety of different approaches. Many successful analyses have focused on examining primary DNA sequence and/or chromatin structure. However, it has been difficult to detect common sequence motifs within the feature of chromatin structure most closely associated with regulatory elements, DNase I hypersensitive sites (DHSs). Considering just the nucleotide sequence and/or the chromatin structure of regulatory elements may neglect a critical feature of what is recognized by the regulatory machinery--DNA structure. We introduce a new computational method to detect common DNA structural motifs in a large collection of DHSs that are found in the ENCODE regions of the human genome. We show that DHSs have common DNA structural motifs that show no apparent sequence consensus. One such structural motif is much more highly enriched in experimentally identified DHSs that are in CpG islands and near transcription start sites (TSSs), compared to DHSs not in CpG islands and farther from TSSs, suggesting that DNA structural motifs may participate in the formation of functional regulatory elements. We propose that studies of the conservation of DNA structure, independent of sequence conservation, will provide new information about the link between the nucleotide sequence of a DNA molecule and its experimentally demonstrated function.
Collapse
Affiliation(s)
- Jason A. Greenbaum
- Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA
| | - Stephen C.J. Parker
- Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA
| | - Thomas D. Tullius
- Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA
- Department of Chemistry, Boston University, Boston, Massachusetts 02215, USA
- Corresponding author.E-mail ; fax (617) 353-6466
| |
Collapse
|
75
|
Martins RP, Platts AE, Krawetz SA. Tracking chromatin states using controlled DNase I treatment and real-time PCR. Cell Mol Biol Lett 2007; 12:545-55. [PMID: 17588221 PMCID: PMC6275682 DOI: 10.2478/s11658-007-0024-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2007] [Accepted: 05/22/2007] [Indexed: 11/22/2022] Open
Abstract
A novel approach to DNase I-sensitivity analysis was applied to examining genes of the spermatogenic pathway, reflective of the substantial morphological and genomic changes that occur during this program of differentiation. A new real-time PCR-based strategy that considers the nuances of response to nuclease treatment was used to assess the nuclease susceptibility through differentiation. Data analysis was automated with the K-Lab PCR algorithm, facilitating the rapid analysis of multiple samples while eliminating the subjectivity usually associated with Ct analyses. The utility of this assay and analytical paradigm as applied to nuclease-sensitivity mapping is presented.
Collapse
Affiliation(s)
- Rui Pires Martins
- Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, Michigan USA
| | - Adrian E. Platts
- Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, Michigan USA
- Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan USA
| | - Stephen A. Krawetz
- Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, Michigan USA
- Institute for Scientific Computing, Wayne State University, Detroit, Michigan USA
| |
Collapse
|
76
|
Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, Kuehn MS, Taylor CM, Neph S, Koch CM, Asthana S, Malhotra A, Adzhubei I, Greenbaum JA, Andrews RM, Flicek P, Boyle PJ, Cao H, Carter NP, Clelland GK, Davis S, Day N, Dhami P, Dillon SC, Dorschner MO, Fiegler H, Giresi PG, Goldy J, Hawrylycz M, Haydock A, Humbert R, James KD, Johnson BE, Johnson EM, Frum TT, Rosenzweig ER, Karnani N, Lee K, Lefebvre GC, Navas PA, Neri F, Parker SCJ, Sabo PJ, Sandstrom R, Shafer A, Vetrie D, Weaver M, Wilcox S, Yu M, Collins FS, Dekker J, Lieb JD, Tullius TD, Crawford GE, Sunyaev S, Noble WS, Dunham I, Denoeud F, Reymond A, Kapranov P, Rozowsky J, Zheng D, Castelo R, Frankish A, Harrow J, Ghosh S, Sandelin A, Hofacker IL, Baertsch R, Keefe D, Dike S, Cheng J, Hirsch HA, Sekinger EA, Lagarde J, Abril JF, Shahab A, Flamm C, Fried C, Hackermüller J, Hertel J, Lindemeyer M, Missal K, Tanzer A, Washietl S, Korbel J, Emanuelsson O, Pedersen JS, Holroyd N, Taylor R, Swarbreck D, Matthews N, Dickson MC, Thomas DJ, Weirauch MT, Gilbert J, Drenkow J, Bell I, Zhao X, Srinivasan KG, Sung WK, Ooi HS, Chiu KP, Foissac S, Alioto T, Brent M, Pachter L, Tress ML, Valencia A, Choo SW, Choo CY, Ucla C, Manzano C, Wyss C, Cheung E, Clark TG, Brown JB, Ganesh M, Patel S, Tammana H, Chrast J, Henrichsen CN, Kai C, Kawai J, Nagalakshmi U, Wu J, Lian Z, Lian J, Newburger P, Zhang X, Bickel P, Mattick JS, Carninci P, Hayashizaki Y, Weissman S, Hubbard T, Myers RM, Rogers J, Stadler PF, Lowe TM, Wei CL, Ruan Y, Struhl K, Gerstein M, Antonarakis SE, Fu Y, Green ED, Karaöz U, Siepel A, Taylor J, Liefer LA, Wetterstrand KA, Good PJ, Feingold EA, Guyer MS, Cooper GM, Asimenos G, Dewey CN, Hou M, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Huang H, Zhang NR, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Seringhaus M, Church D, Rosenbloom K, Kent WJ, Stone EA, Batzoglou S, Goldman N, Hardison RC, Haussler D, Miller W, Sidow A, Trinklein ND, Zhang ZD, Barrera L, Stuart R, King DC, Ameur A, Enroth S, Bieda MC, Kim J, Bhinge AA, Jiang N, Liu J, Yao F, Vega VB, Lee CWH, Ng P, Shahab A, Yang A, Moqtaderi Z, Zhu Z, Xu X, Squazzo S, Oberley MJ, Inman D, Singer MA, Richmond TA, Munn KJ, Rada-Iglesias A, Wallerman O, Komorowski J, Fowler JC, Couttet P, Bruce AW, Dovey OM, Ellis PD, Langford CF, Nix DA, Euskirchen G, Hartman S, Urban AE, Kraus P, Van Calcar S, Heintzman N, Kim TH, Wang K, Qu C, Hon G, Luna R, Glass CK, Rosenfeld MG, Aldred SF, Cooper SJ, Halees A, Lin JM, Shulha HP, Zhang X, Xu M, Haidar JNS, Yu Y, Ruan Y, Iyer VR, Green RD, Wadelius C, Farnham PJ, Ren B, Harte RA, Hinrichs AS, Trumbower H, Clawson H, Hillman-Jackson J, Zweig AS, Smith K, Thakkapallayil A, Barber G, Kuhn RM, Karolchik D, Armengol L, Bird CP, de Bakker PIW, Kern AD, Lopez-Bigas N, Martin JD, Stranger BE, Woodroffe A, Davydov E, Dimas A, Eyras E, Hallgrímsdóttir IB, Huppert J, Zody MC, Abecasis GR, Estivill X, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VVB, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Koriabine M, Nefedov M, Osoegawa K, Yoshinaga Y, Zhu B, de Jong PJ. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 2007; 447:799-816. [PMID: 17571346 PMCID: PMC2212820 DOI: 10.1038/nature05874] [Citation(s) in RCA: 3814] [Impact Index Per Article: 224.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
Collapse
|
77
|
Schoenborn JR, Dorschner MO, Sekimata M, Santer DM, Shnyreva M, Fitzpatrick DR, Stamatoyannopoulos JA, Stamatoyonnapoulos JA, Wilson CB. Comprehensive epigenetic profiling identifies multiple distal regulatory elements directing transcription of the gene encoding interferon-gamma. Nat Immunol 2007; 8:732-42. [PMID: 17546033 PMCID: PMC2144744 DOI: 10.1038/ni1474] [Citation(s) in RCA: 234] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2006] [Accepted: 04/25/2007] [Indexed: 12/22/2022]
Abstract
Unlike the well defined T helper type 2 cytokine locus, little is known about the regulatory elements that govern the expression of Ifng, which encodes the 'signature' T helper type 1 cytokine interferon-gamma. Here our evolutionary analysis showed that the mouse Ifng locus diverged from the ancestral locus as a result of structural rearrangements producing deletion of the neighboring gene encoding interleukin 26 and disrupting synteny 57 kilobases upstream of Ifng. Proximal to that disruption, we identified by high-resolution mapping many regions with CD4+ T cell subset-specific epigenetic modifications. A subset of those regions represented enhancers, whereas others blocked the activity of upstream enhancers or insulated Ifng from neighboring chromatin. Our findings suggest that proper expression of Ifng is maintained through the collective action of multiple distal regulatory elements present in a region of about 100 kilobases flanking Ifng.
Collapse
Affiliation(s)
- Jamie R Schoenborn
- Department of Immunology, University of Washington, Seattle, Washington 98195, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
78
|
Follows GA, Janes ME, Vallier L, Green AR, Gottgens B. Real-time PCR mapping of DNaseI-hypersensitive sites using a novel ligation-mediated amplification technique. Nucleic Acids Res 2007; 35:e56. [PMID: 17389645 PMCID: PMC1885650 DOI: 10.1093/nar/gkm108] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Mapping sites within the genome that are hypersensitive to digestion with DNaseI is an important method for identifying DNA elements that regulate transcription. The standard approach to locating these DNaseI-hypersensitive sites (DHSs) has been to use Southern blotting techniques, although we, and others, have recently published alternative methods using a range of technologies including high-throughput sequencing and genomic array tiling paths. In this article, we describe a novel protocol to use real-time PCR to map DHS. Advantages of the technique reported here include the small cell numbers required for each analysis, rapid, relatively low-cost experiments with minimal need for specialist equipment. Presented examples include comparative DHS mapping of known TAL1/SCL regulatory elements between human embryonic stem cells and K562 cells.
Collapse
Affiliation(s)
- George A Follows
- Department of Haematology, Cambridge Institute for Medical Research, Addenbrooke's Hospital, University of Cambridge, Cambridge CB2 2XY, UK.
| | | | | | | | | |
Collapse
|
79
|
Kodama Y, Nagaya S, Shinmyo A, Kato K. Mapping and characterization of DNase I hypersensitive sites in Arabidopsis chromatin. PLANT & CELL PHYSIOLOGY 2007; 48:459-70. [PMID: 17283013 DOI: 10.1093/pcp/pcm017] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Recent genome-wide analyses of yeast and human chromatin revealed the widespread prevalence of DNase I hypersensitive sites (DNase I HSs) at gene regulatory regions with possible roles in eukaryotic gene regulation. The presence of DNase I HSs in plants has been described for only a few genes, and we analyzed the chromatin structure of an 80 kb genomic region containing 30 variably expressed genes by DNase I sensitivity assay at 500 bp resolution in Arabidopsis. Distinct DNase I HSs were found at the 5' and/or 3' ends of most genes irrespective of their expression levels. Further analysis of well-characterized genes showed that the DNase I HSs occurred near cis-regulatory elements in the promoters of these genes. Upon transcriptional activation of a heat-inducible gene, the DNase I HS was extended into the vicinity of a cis-element and adjacent TATA element in the promoter. Concomitant with this change in DNase I HS, histones were acetylated, removed from the promoter, and a transcription activator bound to this cis-element. These results suggest that the DNase I HSs participate in the transcriptional regulation of Arabidopsis genes by enhancing the access of chromatin remodeling factors and/or transcription factors to their target sites as seen in yeast and human chromatin.
Collapse
Affiliation(s)
- Yuichi Kodama
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara, 630-0192, Japan
| | | | | | | |
Collapse
|
80
|
Tang XB, Feng DX, Di LJ, Huang Y, Fu XH, Liu G, Tang Y, Liu DP, Liang CC. HS-48 alone has no enhancement role on the expression of human α-globin gene cluster. Blood Cells Mol Dis 2007; 38:32-6. [PMID: 17110138 DOI: 10.1016/j.bcmd.2006.09.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2006] [Revised: 08/29/2006] [Accepted: 09/21/2006] [Indexed: 10/23/2022]
Abstract
To investigate the in vivo function of the newly defined DNase I hypersensitive site HS-48 on the whole human alpha-globin gene cluster, the region containing all the other known 5 hypersensitive sites HS-4 to HS-40 was deleted from a 117 kb bacterial artificial chromosome clone bearing the whole human alpha-globin gene cluster. Transgenic mice were generated from this construct. The RNase protection assays showed that with HS-48 left and all the other 5 hypersensitive sites deleted, the expression of human alpha-like globin genes was completely silenced in embryonic, fetal and adult stages in all tissues. This finding indicates that HS-48 alone has no enhancer activity on the expression of human alpha-like globin genes, and that the region of HS-4 to HS-40 already contains all the upstream cis-elements needed for regulating human alpha-like globin genes.
Collapse
Affiliation(s)
- Xiao-bin Tang
- National Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100005, PR China
| | | | | | | | | | | | | | | | | |
Collapse
|
81
|
Abstract
High-throughput experiments in biology often produce sets of genes of potential interests. Some of those gene sets might be of considerable size. Therefore, computer-assisted analysis is necessary for the biological interpretation of the gene sets, and for creating working hypotheses, which can be tested experimentally. One obvious way to analyze gene set data is to associate the genes with a particular biological feature, for example, a given pathway. Statistical analysis could be used to evaluate if a gene set is truly associated with a feature. Over the past few years many tools that perform such analysis have been created. In this chapter, using WebGestalt as an example, it will be explained in detail how to associate gene sets with functional annotations, pathways, publication records, and protein domains.
Collapse
Affiliation(s)
- Stefan A Kirov
- Oak Ridge National Laboratory, University of Tennessee, USA
| | | | | |
Collapse
|
82
|
Roh TY, Wei G, Farrell CM, Zhao K. Genome-wide prediction of conserved and nonconserved enhancers by histone acetylation patterns. Genome Res 2006; 17:74-81. [PMID: 17135569 PMCID: PMC1716270 DOI: 10.1101/gr.5767907] [Citation(s) in RCA: 109] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Comparative genomic studies have been useful in identifying transcriptional regulatory elements in higher eukaryotic genomes, but many important regulatory elements cannot be detected by such analyses due to evolutionary variations and alignment tool limitations. Therefore, in this study we exploit the highly conserved nature of epigenetic modifications to identify potential transcriptional enhancers. By using a high-resolution genome-wide mapping technique, which combines the chromatin immunoprecipitation and serial analysis of gene expression assays, we have recently determined the distribution of lysine 9/14-diacetylated histone H3 in human T cells. We showed the existence of 46,813 regions with clusters of histone acetylation, termed histone acetylation islands, some of which correspond to known transcriptional regulatory elements. In the present study, we find that 4679 sequences conserved between human and pufferfish coincide with histone acetylation islands, and random sampling shows that 33% (13/39) of these can function as transcriptional enhancers in human Jurkat T cells. In addition, by comparing the human histone acetylation island sequences with mouse genome sequences, we find that despite the conservation of many of these regions between these species, 21,855 of these sequences are not conserved. Furthermore, we demonstrate that about 50% (26/51) of these nonconserved sequences have enhancer activity in Jurkat cells, and that many of the orthologous mouse sequences also have enhancer activity in addition to conserved epigenetic modification patterns in mouse T-cell chromatin. Therefore, by combining epigenetic modification and sequence data, we have established a novel genome-wide method for identifying regulatory elements not discernable by comparative genomics alone.
Collapse
Affiliation(s)
- Tae-young Roh
- Laboratory of Molecular Immunology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Gang Wei
- Laboratory of Molecular Immunology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Catherine M. Farrell
- Laboratory of Molecular Immunology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Keji Zhao
- Laboratory of Molecular Immunology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
- Corresponding authorE-mail ; fax (301) 480-0961
| |
Collapse
|
83
|
Eeckhoute J, Carroll JS, Geistlinger TR, Torres-Arzayus MI, Brown M. A cell-type-specific transcriptional network required for estrogen regulation of cyclin D1 and cell cycle progression in breast cancer. Genes Dev 2006; 20:2513-26. [PMID: 16980581 PMCID: PMC1578675 DOI: 10.1101/gad.1446006] [Citation(s) in RCA: 227] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Estrogen stimulates the proliferation of the most common type of human breast cancer that expresses estrogen receptor alpha (ERalpha) through the activation of the cyclin D1 (CCND1) oncogene. However, our knowledge of ERalpha transcriptional mechanisms remains limited. Hence, it is still elusive why ERalpha ectopically expressed in ER-negative breast cancer cells (BCC) is functional on ectopic reporter constructs but lacks activity on many endogenous target genes, including CCND1. Here, we show that estradiol (E2) stimulation of CCND1 expression in BCC depends on a novel cell-type-specific enhancer downstream from the CCND1 coding region, which is the primary ERalpha recruitment site in estrogen-responsive cells. The pioneer factor FoxA1 is specifically required for the active chromatin state of this enhancer and as such is crucial for both CCND1 expression and subsequent cell cycle progression. Interestingly, even in BCC, CCND1 levels and proliferation are tightly controlled by E2 through the establishment of a negative feedforward loop involving the induction of NFIC, a putative tumor suppressor capable of directly repressing CCND1 transcription. Taken together, our results reveal an estrogen-regulated combinatorial network including cell-specific cis- and trans-regulators of CCND1 expression where ERalpha collaborates with other transcription factors associated with the ER-positive breast cancer phenotype, including FoxA1 and NFIC.
Collapse
Affiliation(s)
- Jérôme Eeckhoute
- Division of Molecular and Cellular Oncology, Department of Medical Oncology, Dana-Farber Cancer Institute and Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
| | | | | | | | | |
Collapse
|
84
|
Walhout AJM. Unraveling transcription regulatory networks by protein-DNA and protein-protein interaction mapping. Genome Res 2006; 16:1445-54. [PMID: 17053092 DOI: 10.1101/gr.5321506] [Citation(s) in RCA: 113] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Metazoan genomes contain thousands of protein-coding and noncoding RNA genes, most of which are differentially expressed, i.e., at different locations or at different times during development, function, or pathology of the organism. Differential gene expression is achieved in part by the action of regulatory transcription factors (TFs) that bind to cis-regulatory elements that are often located in or near their target genes. Each TF likely regulates many targets in the context of intricate transcription regulatory networks. Up to 10% of a genome may encode TFs, but only a handful of these have been studied in detail. Here, I will discuss the different steps involved in the mapping and analysis of transcription regulatory networks, including the identification of network nodes (TFs and their target sequences) and edges (TF-TF dimers and TF-DNA target interactions), integration with other data types, and network properties and emerging principles that provide insights into differential gene expression.
Collapse
Affiliation(s)
- Albertha J M Walhout
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, Massachusetts 01605, USA.
| |
Collapse
|
85
|
Elnitski L, Jin VX, Farnham PJ, Jones SJM. Locating mammalian transcription factor binding sites: a survey of computational and experimental techniques. Genome Res 2006; 16:1455-64. [PMID: 17053094 DOI: 10.1101/gr.4140006] [Citation(s) in RCA: 168] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Fields such as genomics and systems biology are built on the synergism between computational and experimental techniques. This type of synergism is especially important in accomplishing goals like identifying all functional transcription factor binding sites in vertebrate genomes. Precise detection of these elements is a prerequisite to deciphering the complex regulatory networks that direct tissue specific and lineage specific patterns of gene expression. This review summarizes approaches for in silico, in vitro, and in vivo identification of transcription factor binding sites. A variety of techniques useful for localized- and high-throughput analyses are discussed here, with emphasis on aspects of data generation and verification.
Collapse
Affiliation(s)
- Laura Elnitski
- Genomic Functional Analysis Section, National Human Genome Research Institute, National Institutes of Health, Rockville, Maryland 20878, USA.
| | | | | | | |
Collapse
|
86
|
Follows GA, Dhami P, Göttgens B, Bruce AW, Campbell PJ, Dillon SC, Smith AM, Koch C, Donaldson IJ, Scott MA, Dunham I, Janes ME, Vetrie D, Green AR. Identifying gene regulatory elements by genomic microarray mapping of DNaseI hypersensitive sites. Genome Res 2006; 16:1310-9. [PMID: 16963707 PMCID: PMC1581440 DOI: 10.1101/gr.5373606] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The identification of cis-regulatory elements is central to understanding gene transcription. Hypersensitivity of cis-regulatory elements to digestion with DNaseI remains the gold-standard approach to locating such elements. Traditional methods used to identify DNaseI hypersensitive sites are cumbersome and can only be applied to short stretches of DNA at defined locations. Here we report the development of a novel genomic array-based approach to DNaseI hypersensitive site mapping (ADHM) that permits precise, large-scale identification of such sites from as few as 5 million cells. Using ADHM we identified all previously recognized hematopoietic regulatory elements across 200 kb of the mouse T-cell acute lymphocytic leukemia-1 (Tal1) locus, and, in addition, identified two novel elements within the locus, which show transcriptional regulatory activity. We further validated the ADHM protocol by mapping the DNaseI hypersensitive sites across 250 kb of the human TAL1 locus in CD34+ primary stem/progenitor cells and K562 cells and by mapping the previously known DNaseI hypersensitive sites across 240 kb of the human alpha-globin locus in K562 cells. ADHM provides a powerful approach to identifying DNaseI hypersensitive sites across large genomic regions.
Collapse
Affiliation(s)
- George A Follows
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, CB2 2XY, United Kingdom.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
87
|
Sabo PJ, Kuehn MS, Thurman R, Johnson BE, Johnson EM, Cao H, Yu M, Rosenzweig E, Goldy J, Haydock A, Weaver M, Shafer A, Lee K, Neri F, Humbert R, Singer MA, Richmond TA, Dorschner MO, McArthur M, Hawrylycz M, Green RD, Navas PA, Noble WS, Stamatoyannopoulos JA. Genome-scale mapping of DNase I sensitivity in vivo using tiling DNA microarrays. Nat Methods 2006; 3:511-8. [PMID: 16791208 DOI: 10.1038/nmeth890] [Citation(s) in RCA: 273] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2006] [Accepted: 05/22/2006] [Indexed: 11/09/2022]
Abstract
Localized accessibility of critical DNA sequences to the regulatory machinery is a key requirement for regulation of human genes. Here we describe a high-resolution, genome-scale approach for quantifying chromatin accessibility by measuring DNase I sensitivity as a continuous function of genome position using tiling DNA microarrays (DNase-array). We demonstrate this approach across 1% ( approximately 30 Mb) of the human genome, wherein we localized 2,690 classical DNase I hypersensitive sites with high sensitivity and specificity, and also mapped larger-scale patterns of chromatin architecture. DNase I hypersensitive sites exhibit marked aggregation around transcriptional start sites (TSSs), though the majority mark nonpromoter functional elements. We also developed a computational approach for visualizing higher-order features of chromatin structure. This revealed that human chromatin organization is dominated by large (100-500 kb) 'superclusters' of DNase I hypersensitive sites, which encompass both gene-rich and gene-poor regions. DNase-array is a powerful and straightforward approach for systematic exposition of the cis-regulatory architecture of complex genomes.
Collapse
Affiliation(s)
- Peter J Sabo
- Department of Genome Sciences, University of Washington, 1705 NE Pacific St., Box 357730, Seattle, Washington 98195, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
88
|
Crawford GE, Davis S, Scacheri PC, Renaud G, Halawi MJ, Erdos MR, Green R, Meltzer PS, Wolfsberg TG, Collins FS. DNase-chip: a high-resolution method to identify DNase I hypersensitive sites using tiled microarrays. Nat Methods 2006; 3:503-9. [PMID: 16791207 PMCID: PMC2698431 DOI: 10.1038/nmeth888] [Citation(s) in RCA: 181] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2006] [Accepted: 05/11/2006] [Indexed: 12/15/2022]
Abstract
Mapping DNase I hypersensitive sites is an accurate method of identifying the location of gene regulatory elements, including promoters, enhancers, silencers and locus control regions. Although Southern blots are the traditional method of identifying DNase I hypersensitive sites, the conventional manual method is not readily scalable to studying large chromosomal regions, much less the entire genome. Here we describe DNase-chip, an approach that can rapidly identify DNase I hypersensitive sites for any region of interest, or potentially for the entire genome, by using tiled microarrays. We used DNase-chip to identify DNase I hypersensitive sites accurately from a representative 1% of the human genome in both primary and immortalized cell types. We found that although most DNase I hypersensitive sites were present in both cell types studied, some of them were cell-type specific. This method can be applied globally or in a targeted fashion to any tissue from any species with a sequenced genome.
Collapse
Affiliation(s)
- Gregory E Crawford
- National Human Genome Research Institute, National Institutes of Health, Building 31, Room 4B09, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
89
|
Abstract
One of the most exciting frontiers in both epigenetics and genome sciences is the new field of epigenomics. This new discipline promises novel insights into the genome because of its potential to detect quantitative alterations, multiplex modifications and regulatory sequences outside of genes. A number of new epigenomic strategies are emerging to exploit microarray formats with varying substrate choice, pre-processing and data analysis. These approaches are designed to detect large numbers of variations in DNA methylation and chromatin modification. Many groups are joining forces toward developing an organized Human Epigenome Project to exploit these new technologies to better understand the basis of normal development and human disease.
Collapse
Affiliation(s)
- Pauline A Callinan
- Division of Molecular Medicine, Department of Medicine and Center for the Epigenetics of Common Human Disease, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | | |
Collapse
|
90
|
Lieb JD, Beck S, Bulyk ML, Farnham P, Hattori N, Henikoff S, Liu XS, Okumura K, Shiota K, Ushijima T, Greally JM. Applying whole-genome studies of epigenetic regulation to study human disease. Cytogenet Genome Res 2006; 114:1-15. [PMID: 16717444 PMCID: PMC2734277 DOI: 10.1159/000091922] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2005] [Accepted: 10/06/2005] [Indexed: 12/15/2022] Open
Affiliation(s)
- J D Lieb
- Department of Biology, Carolina Center for Genome Sciences, The University of North Carolina, Chapel Hill, NC, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
91
|
Graffmann N, Santourlidis S, Christ J, Wernet P, Uhrberg M. Direct and quantitative analysis of chromatin accessibility by MIRECAL--a Micrococcus nuclease/real-time PCR chromatin accessibility assay with locus specificity. Anal Biochem 2006; 354:308-10. [PMID: 16750813 DOI: 10.1016/j.ab.2006.03.039] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2006] [Revised: 03/16/2006] [Accepted: 03/22/2006] [Indexed: 10/24/2022]
Affiliation(s)
- Nina Graffmann
- Institute for Transplantation Diagnostics and Cell Therapeutics, Heinrich Heine University, D-40225 Düsseldorf, Germany
| | | | | | | | | |
Collapse
|
92
|
Blanchette M, Bataille AR, Chen X, Poitras C, Laganière J, Lefèbvre C, Deblois G, Giguère V, Ferretti V, Bergeron D, Coulombe B, Robert F. Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression. Genome Res 2006; 16:656-68. [PMID: 16606704 PMCID: PMC1457048 DOI: 10.1101/gr.4866006] [Citation(s) in RCA: 216] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The identification of regulatory regions is one of the most important and challenging problems toward the functional annotation of the human genome. In higher eukaryotes, transcription-factor (TF) binding sites are often organized in clusters called cis-regulatory modules (CRM). While the prediction of individual TF-binding sites is a notoriously difficult problem, CRM prediction has proven to be somewhat more reliable. Starting from a set of predicted binding sites for more than 200 TF families documented in Transfac, we describe an algorithm relying on the principle that CRMs generally contain several phylogenetically conserved binding sites for a few different TFs. The method allows the prediction of more than 118,000 CRMs within the human genome. A subset of these is shown to be bound in vivo by TFs using ChIP-chip. Their analysis reveals, among other things, that CRM density varies widely across the genome, with CRM-rich regions often being located near genes encoding transcription factors involved in development. Predicted CRMs show a surprising enrichment near the 3' end of genes and in regions far from genes. We document the tendency for certain TFs to bind modules located in specific regions with respect to their target genes and identify TFs likely to be involved in tissue-specific regulation. The set of predicted CRMs, which is made available as a public database called PReMod (http://genomequebec.mcgill.ca/PReMod), will help analyze regulatory mechanisms in specific biological systems.
Collapse
Affiliation(s)
- Mathieu Blanchette
- McGill Centre for Bioinformatics, Montreal, Quebec, Canada, H3A 2B4
- Corresponding authors.E-mail ; fax (514) 398-3387.E-mail ; fax (514) 987-5743
| | - Alain R. Bataille
- Institut de Recherches Cliniques de Montréal, Montreal, Quebec, Canada H2W 1R7
| | - Xiaoyu Chen
- McGill Centre for Bioinformatics, Montreal, Quebec, Canada, H3A 2B4
| | - Christian Poitras
- Institut de Recherches Cliniques de Montréal, Montreal, Quebec, Canada H2W 1R7
| | - Josée Laganière
- Molecular Oncology Group Department of Medicine, Oncology and Biochemistry, McGill University, Montreal, Quebec, Canada H3A 1A1
| | - Céline Lefèbvre
- Molecular Oncology Group Department of Medicine, Oncology and Biochemistry, McGill University, Montreal, Quebec, Canada H3A 1A1
| | - Geneviève Deblois
- Molecular Oncology Group Department of Medicine, Oncology and Biochemistry, McGill University, Montreal, Quebec, Canada H3A 1A1
| | - Vincent Giguère
- Molecular Oncology Group Department of Medicine, Oncology and Biochemistry, McGill University, Montreal, Quebec, Canada H3A 1A1
| | - Vincent Ferretti
- McGill University and Genome Quebec Innovation Center, Montreal, Quebec, Canada H3A 1A4
| | - Dominique Bergeron
- Institut de Recherches Cliniques de Montréal, Montreal, Quebec, Canada H2W 1R7
| | - Benoit Coulombe
- Institut de Recherches Cliniques de Montréal, Montreal, Quebec, Canada H2W 1R7
| | - François Robert
- Institut de Recherches Cliniques de Montréal, Montreal, Quebec, Canada H2W 1R7
- Corresponding authors.E-mail ; fax (514) 398-3387.E-mail ; fax (514) 987-5743
| |
Collapse
|
93
|
Mann TP, Humbert R, Stamatoyannopolous JA, Noble WS. AUTOMATED VALIDATION OF POLYMERASE CHAIN REACTION AMPLICON MELTING CURVES. J Bioinform Comput Biol 2006; 4:299-315. [PMID: 16819785 DOI: 10.1142/s0219720006001989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2005] [Revised: 12/01/2005] [Accepted: 01/31/2006] [Indexed: 11/18/2022]
Abstract
The polymerase chain reaction (PCR) is a fundamental tool of molecular biology. Quantitative PCR is the gold-standard methodology for determination of DNA copy numbers, quantitating transcription, and numerous other applications. A major barrier to large-scale application of PCR for quantitative genomic analyses is the current requirement for manual validation of individual PCRs to ensure generation of a single product. This typically requires visual inspection either of gel electrophoreses or temperature dissociation ("melting") curves of individual PCRs — a time-consuming and costly process. Here we describe a robust computational solution to this fundamental problem. Using a training set of 10 080 reactions comprising multiple quantitative PCRs from each of 1728 unique human genomic amplicons, we developed a support vector machine classifier capable of discriminating single-product PCRs with better than 99% accuracy. This approach has broad utility, and eliminates a major bottleneck to widespread application of PCR for high-throughput genomic applications.
Collapse
Affiliation(s)
- Tobias P Mann
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
| | | | | | | |
Collapse
|
94
|
Pipkin ME, Lichtenheld MG. A reliable method to display authentic DNase I hypersensitive sites at long-ranges in single-copy genes from large genomes. Nucleic Acids Res 2006; 34:e34. [PMID: 16510851 PMCID: PMC1388096 DOI: 10.1093/nar/gkl006] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The study of eukaryotic gene transcription depends on methods to discover distal cis-acting control sequences. Comparative bioinformatics is one powerful strategy to reveal these domains, but still requires conventional wet-bench techniques to elucidate their specificity and function. The DNase I hypersensitivity assay (DHA) is also a method to identify regulatory domains, but can also suggest their function. Technically however, the classical DHA is constrained to mapping gene loci in small increments of approximately 20 kb. This limitation hinders efficient and comprehensive analysis of distal gene regions. Here, we report an improved method termed mega-DHA that extends the range of existing DHAs to facilitate assaying intervals that approach 100 kb. We demonstrate its feasibility for efficient analysis of single-copy genes within a large and complex genome by assaying 230 kb of the human ADAMTS14-perforin-paladin gene cluster in four experiments. The results identify distinct networks of regulatory domains specific to expression of perforin and its two neighboring genes.
Collapse
Affiliation(s)
- Matthew E. Pipkin
- Department of Microbiology and Immunology, University of Miami, Miller School of MedicineMiami, FL, USA
| | - Mathias G. Lichtenheld
- Department of Microbiology and Immunology, University of Miami, Miller School of MedicineMiami, FL, USA
- The Sylvester Comprehensive Cancer Center, University of Miami, Miller School of MedicineMiami, FL, USA
- The Center for HIV Research, University of Miami, Miller School of MedicineMiami, FL, USA
- To whom correspondence should be addressed at Mathias G. Lichtenheld, Department of Microbiology and Immunology, University of Miami Miller School of Medicine, 1580 N.W. 10th Avenue, Batchelor Children Research Institute Room 738, Miami, FL 33136, USA. Tel: +1 305 243 3301; Fax: +1 305 243 7211;
| |
Collapse
|
95
|
Mann TP, Humbert R, Stamatoyannopolous JA, Noble WS. Automated validation of polymerase chain reactions using amplicon melting curves. PROCEEDINGS. IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE 2005:377-85. [PMID: 16447995 DOI: 10.1109/csb.2005.17] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
PCR, the polymerase chain reaction, is a fundamental tool of molecular biology. Quantitative PCR is the gold-standard methodology for determination of DNA copy numbers, quantitating transcription, and numerous other applications. A major barrier to large-scale application of PCR for quantitative genomic analyses is the current requirement for manual validation of individual PCR reactions to ensure generation of a single product. This typically requires visual inspection either of gel electrophoreses or temperature dissociation ("melting") curves of individual PCR reactions - a time-consuming and costly process. Here we describe a robust computational solution to this fundamental problem. Using a training set of 10,080 reactions comprising multiple quantitative PCR reactions from each of 1,728 unique human genomic amplicons, we developed a support vector machine classifier capable of discriminating single-product PCR reactions with better than 99% accuracy. This approach has broad utility, and eliminates a major bottleneck to widespread application of PCR for high-throughput genomic applications.
Collapse
Affiliation(s)
- Tobias P Mann
- Department of Genome Science, University of Washington, Seattle, WA, USA
| | | | | | | |
Collapse
|