201
|
MacArthur S, Li XY, Li J, Brown JB, Chu HC, Zeng L, Grondona BP, Hechmer A, Simirenko L, Keränen SVE, Knowles DW, Stapleton M, Bickel P, Biggin MD, Eisen MB. Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions. Genome Biol 2009; 10:R80. [PMID: 19627575 PMCID: PMC2728534 DOI: 10.1186/gb-2009-10-7-r80] [Citation(s) in RCA: 267] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2009] [Revised: 05/15/2009] [Accepted: 07/23/2009] [Indexed: 01/05/2023] Open
Abstract
Distinct developmental fates in Drosophila melanogaster are specified by quantitative differences in transcription factor occupancy on a common set of bound regions. Background We previously established that six sequence-specific transcription factors that initiate anterior/posterior patterning in Drosophila bind to overlapping sets of thousands of genomic regions in blastoderm embryos. While regions bound at high levels include known and probable functional targets, more poorly bound regions are preferentially associated with housekeeping genes and/or genes not transcribed in the blastoderm, and are frequently found in protein coding sequences or in less conserved non-coding DNA, suggesting that many are likely non-functional. Results Here we show that an additional 15 transcription factors that regulate other aspects of embryo patterning show a similar quantitative continuum of function and binding to thousands of genomic regions in vivo. Collectively, the 21 regulators show a surprisingly high overlap in the regions they bind given that they belong to 11 DNA binding domain families, specify distinct developmental fates, and can act via different cis-regulatory modules. We demonstrate, however, that quantitative differences in relative levels of binding to shared targets correlate with the known biological and transcriptional regulatory specificities of these factors. Conclusions It is likely that the overlap in binding of biochemically and functionally unrelated transcription factors arises from the high concentrations of these proteins in nuclei, which, coupled with their broad DNA binding specificities, directs them to regions of open chromatin. We suggest that most animal transcription factors will be found to show a similar broad overlapping pattern of binding in vivo, with specificity achieved by modulating the amount, rather than the identity, of bound factor.
Collapse
Affiliation(s)
- Stewart MacArthur
- Genomics Division, Lawrence Berkeley National Laboratory, Cyclotron Road MS 84-181, Berkeley, CA 94720, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
202
|
Abstract
Eukaryotic genomes are packaged into a nucleoprotein complex known as chromatin, which affects most processes that occur on DNA. Along with genetic and biochemical studies of resident chromatin proteins and their modifying enzymes, mapping of chromatin structure in vivo is one of the main pillars in our understanding of how chromatin relates to cellular processes. In this review, we discuss the use of genomic technologies to characterize chromatin structure in vivo, with a focus on data from budding yeast and humans. The picture emerging from these studies is the detailed chromatin structure of a typical gene, where the typical behavior gives insight into the mechanisms and deep rules that establish chromatin structure. Important deviation from the archetype is also observed, usually as a consequence of unique regulatory mechanisms at special genomic loci. Chromatin structure shows substantial conservation from yeast to humans, but mammalian chromatin has additional layers of complexity that likely relate to the requirements of multicellularity such as the need to establish faithful gene regulatory mechanisms for cell differentiation.
Collapse
Affiliation(s)
- Oliver J Rando
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| | | |
Collapse
|
203
|
Deplancke B. Experimental advances in the characterization of metazoan gene regulatory networks. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:12-27. [PMID: 19324929 DOI: 10.1093/bfgp/elp001] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Gene regulatory networks (GRNs) play a vital role in metazoan development and function, and deregulation of these networks is often implicated in disease. GRNs depict the dynamic interactions between genomic and regulatory state components. The genomic components comprise genes and their associated cis-regulatory elements. The regulatory state components consist primarily of transcriptional complexes that bind the latter elements. With the availability of complete genome sequences, several approaches have recently been developed which promise to significantly enhance our ability to identify either the genomic or regulatory state components, or the interactions between these two. In this review, I provide an in-depth overview of these approaches and detail how each contributes to a more comprehensive understanding of GRN composition and function.
Collapse
Affiliation(s)
- Bart Deplancke
- Ecole Polytechnique Fédérale de Lausanne, School of Life Sciences, Institute of Bioengineering, Lausanne, Switzerland.
| |
Collapse
|
204
|
Arpanahi A, Brinkworth M, Iles D, Krawetz SA, Paradowska A, Platts AE, Saida M, Steger K, Tedder P, Miller D. Endonuclease-sensitive regions of human spermatozoal chromatin are highly enriched in promoter and CTCF binding sequences. Genome Res 2009; 19:1338-49. [PMID: 19584098 DOI: 10.1101/gr.094953.109] [Citation(s) in RCA: 202] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
During the haploid phase of mammalian spermatogenesis, nucleosomal chromatin is ultimately repackaged by small, highly basic protamines to generate an extremely compact, toroidal chromatin architecture that is critical to normal spermatozoal function. In common with several species, however, the human spermatozoon retains a small proportion of its chromatin packaged in nucleosomes. As nucleosomal chromatin in spermatozoa is structurally more open than protamine-packaged chromatin, we considered it likely to be more accessible to exogenously applied endonucleases. Accordingly, we have used this premise to identify a population of endonuclease-sensitive DNA sequences in human and murine spermatozoa. Our results show unequivocally that, in contrast to the endonuclease-resistant sperm chromatin packaged by protamines, regions of increased endonuclease sensitivity are closely associated with gene regulatory regions, including many promoter sequences and sequences recognized by CCCTC-binding factor (CTCF). Similar differential packaging of promoters is observed in the spermatozoal chromatin of both mouse and man. These observations imply the existence of epigenetic marks that distinguish gene regulatory regions in male germ cells and prevent their repackaging by protamines during spermiogenesis. The ontology of genes under the control of endonuclease-sensitive regulatory regions implies a role for this phenomenon in subsequent embryonic development.
Collapse
Affiliation(s)
- Ali Arpanahi
- Reproduction and Early Development Unit, Leeds Institute of Genetics and Health Therapeutics, University of Leeds, Leeds, United Kingdom
| | | | | | | | | | | | | | | | | | | |
Collapse
|
205
|
Ng CEL, Ito Y, Osato M. Retroviral integration sites (RIS) mark cis-regulatory elements. Crit Rev Oncol Hematol 2009; 71:1-11. [DOI: 10.1016/j.critrevonc.2008.10.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2008] [Revised: 09/29/2008] [Accepted: 10/15/2008] [Indexed: 12/15/2022] Open
|
206
|
Improved ChIP-chip analysis by a mixture model approach. BMC Bioinformatics 2009; 10:173. [PMID: 19500407 PMCID: PMC2700807 DOI: 10.1186/1471-2105-10-173] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2008] [Accepted: 06/07/2009] [Indexed: 11/14/2022] Open
Abstract
Background Microarray analysis of immunoprecipitated chromatin (ChIP-chip) has evolved from a novel technique to a standard approach for the systematic study of protein-DNA interactions. In ChIP-chip, sites of protein-DNA interactions are identified by signals from the hybridization of selected DNA to tiled oligomers and are graphically represented as peaks. Most existing methods were designed for the identification of relatively sparse peaks, in the presence of replicates. Results We propose a data normalization method and a statistical method for peak identification from ChIP-chip data based on a mixture model approach. In contrast to many existing methods, including methods that also employ mixture model approaches, our method is more flexible by imposing less restrictive assumptions and allowing a relatively large proportion of peak regions. In addition, our method does not require experimental replicates and is computationally efficient. We compared the performance of our method with several representative existing methods on three datasets, including a spike-in dataset. These comparisons demonstrate that our approach is more robust and has comparable or higher power than the other methods, especially in the context of abundant peak regions. Conclusion Our data normalization and peak detection methods have improved performance to detect peak regions in ChIP-chip data.
Collapse
|
207
|
Liu J, Zhang Z, Bando M, Itoh T, Deardorff MA, Clark D, Kaur M, Tandy S, Kondoh T, Rappaport E, Spinner NB, Vega H, Jackson LG, Shirahige K, Krantz ID. Transcriptional dysregulation in NIPBL and cohesin mutant human cells. PLoS Biol 2009; 7:e1000119. [PMID: 19468298 PMCID: PMC2680332 DOI: 10.1371/journal.pbio.1000119] [Citation(s) in RCA: 178] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2008] [Accepted: 04/17/2009] [Indexed: 01/13/2023] Open
Abstract
Genome-wide studies using cells from patients with Cornelia de Lange Syndrome reveal a role for cohesin in regulating gene expression in human cells. Cohesin regulates sister chromatid cohesion during the mitotic cell cycle with Nipped-B-Like (NIPBL) facilitating its loading and unloading. In addition to this canonical role, cohesin has also been demonstrated to play a critical role in regulation of gene expression in nondividing cells. Heterozygous mutations in the cohesin regulator NIPBL or cohesin structural components SMC1A and SMC3 result in the multisystem developmental disorder Cornelia de Lange Syndrome (CdLS). Genome-wide assessment of transcription in 16 mutant cell lines from severely affected CdLS probands has identified a unique profile of dysregulated gene expression that was validated in an additional 101 samples and correlates with phenotypic severity. This profile could serve as a diagnostic and classification tool. Cohesin binding analysis demonstrates a preference for intergenic regions suggesting a cis-regulatory function mimicking that of a boundary/insulator interacting protein. However, the binding sites are enriched within the promoter regions of the dysregulated genes and are significantly decreased in CdLS proband, indicating an alternative role of cohesin as a transcription factor. Appropriate segregation of chromosomes to daughter cells depends upon proper cohesion of sister chromatids during mitosis. The multiprotein cohesin complex and its regulators are key factors in this process. Intriguingly, recent work has shown that the cohesin complex also has other cellular roles, including a role in regulating gene expression. Additionally, mutations in cohesin structural and regulatory components have been linked to human multisystem developmental disorders such as Cornelia de Lange Syndrome (CdLS), but the role cohesin is playing in the pathogenesis of this disorder is unknown. To define the role that cohesin plays in regulating gene expression in human cells, we analyzed gene expression and genome-wide cohesin binding patterns in cells from normal subjects and from CdLS probands with mutations in the cohesin regulator NIPBL or in the cohesin structural component SMC1A. We found a strikingly conserved pattern of gene dysregulation in these different cell lines that correlates with disease severity and a significant correlation between gene dysregulation and cohesin binding around misexpressed genes. The observed pattern of binding and misexpression is consistent with cohesin having a putative role as a boundary/insulator interacting protein or transcription factor, the activity of which is disrupted in CdLS probands.
Collapse
Affiliation(s)
- Jinglan Liu
- Division of Human Genetics, Abramson Research Institute, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Zhe Zhang
- Center for Biomedical Informatics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Masashige Bando
- Laboratory of Chromosome Structure and Function, Department of Biological Science, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama City, Kanagawa, Japan
| | - Takehiko Itoh
- Laboratory of Chromosome Structure and Function, Department of Biological Science, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama City, Kanagawa, Japan
| | - Matthew A. Deardorff
- Division of Human Genetics, Abramson Research Institute, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
- The University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Dinah Clark
- Division of Human Genetics, Abramson Research Institute, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Maninder Kaur
- Division of Human Genetics, Abramson Research Institute, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Stephany Tandy
- Division of Human Genetics, Abramson Research Institute, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Tatsuro Kondoh
- Division of Developmental Disability, Misakaenosono Mutsumi Developmental, Medical, and Welfare Center, Isahaya, Japan
| | - Eric Rappaport
- NAPCORE, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Nancy B. Spinner
- Division of Human Genetics, Abramson Research Institute, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
- The University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Hugo Vega
- Instituto de Genética, Universidad Nacional de Colombia, Bogotá, Colombia
| | - Laird G. Jackson
- Department of Obstetrics and Gynecology, Drexel University School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Katsuhiko Shirahige
- Laboratory of Chromosome Structure and Function, Department of Biological Science, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama City, Kanagawa, Japan
| | - Ian D. Krantz
- Division of Human Genetics, Abramson Research Institute, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
- The University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
208
|
Abstract
The glucocorticoid receptor regulates the expression of a large number of genes in mammalian cells. The interaction of this receptor with regulatory elements has been discovered to be highly dynamic, with occupancy states measured in seconds, rather than minutes or hours. This finding has led to a paradigm shift in our understanding of receptor function throughout the genome. The mechanisms involved in these rapid exchange events, as well as the implications for receptor function, are discussed.
Collapse
Affiliation(s)
- Simon C Biddie
- Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, NIH, Bethesda, MD 20892-5055, USA
| | | |
Collapse
|
209
|
|
210
|
Bina M, Wyss P, Lazarus SA, Shah SR, Ren W, Szpankowski W, Crawford GE, Park SP, Song XC. Discovering sequences with potential regulatory characteristics. Genomics 2009; 93:314-22. [DOI: 10.1016/j.ygeno.2008.11.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2007] [Revised: 05/28/2008] [Accepted: 11/17/2008] [Indexed: 11/25/2022]
|
211
|
Global mapping of protein-DNA interactions in vivo by digital genomic footprinting. Nat Methods 2009; 6:283-9. [PMID: 19305407 PMCID: PMC2668528 DOI: 10.1038/nmeth.1313] [Citation(s) in RCA: 441] [Impact Index Per Article: 29.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2008] [Accepted: 02/19/2009] [Indexed: 11/26/2022]
Abstract
The orchestrated binding of transcriptional activators and repressors to specific DNA sequences in the context of chromatin defines the regulatory program of eukaryotic genomes. We developed a digital approach to assay regulatory protein occupancy on genomic DNA in vivo by dense mapping of individual DNase I cleavages from intact nuclei using massively parallel DNA sequencing. Analysis of > 23 million cleavages across the Saccharomyces cerevisiae genome revealed thousands of protected regulatory protein footprints, enabling de novo derivation of factor binding motifs as well as the identification of hundreds of novel binding sites for major regulators. We observed striking correspondence between nucleotide-level DNase I cleavage patterns and protein-DNA interactions determined by crystallography. The data also yielded a detailed view of larger chromatin features including positioned nucleosomes flanking factor binding regions. Digital genomic footprinting provides a powerful approach to delineate the cis-regulatory framework of any organism with an available genome sequence.
Collapse
|
212
|
Giresi PG, Lieb JD. Isolation of active regulatory elements from eukaryotic chromatin using FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements). Methods 2009; 48:233-9. [PMID: 19303047 DOI: 10.1016/j.ymeth.2009.03.003] [Citation(s) in RCA: 157] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2008] [Revised: 03/05/2009] [Accepted: 03/07/2009] [Indexed: 10/21/2022] Open
Abstract
The binding of sequence-specific regulatory factors and the recruitment of chromatin remodeling activities cause nucleosomes to be evicted from chromatin in eukaryotic cells. Traditionally, these active sites have been identified experimentally through their sensitivity to nucleases. Here we describe the details of a simple procedure for the genome-wide isolation of nucleosome-depleted DNA from human chromatin, termed FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements). We also provide protocols for different methods of detecting FAIRE-enriched DNA, including use of PCR, DNA microarrays, and next-generation sequencing. FAIRE works on all eukaryotic chromatin tested to date. To perform FAIRE, chromatin is crosslinked with formaldehyde, sheared by sonication, and phenol-chloroform extracted. Most genomic DNA is crosslinked to nucleosomes and is sequestered to the interphase, whereas DNA recovered in the aqueous phase corresponds to nucleosome-depleted regions of the genome. The isolated regions are largely coincident with the location of DNaseI hypersensitive sites, transcriptional start sites, enhancers, insulators, and active promoters. Given its speed and simplicity, FAIRE has utility in establishing chromatin profiles of diverse cell types in health and disease, isolating DNA regulatory elements en masse for further characterization, and as a screening assay for the effects of small molecules on chromatin organization.
Collapse
Affiliation(s)
- Paul G Giresi
- Department of Biology and Carolina Center for the Genome Sciences, University of North Carolina at Chapel Hill, CB #3280, 408 Fordham Hall, Chapel Hill, NC 27599-3280, USA
| | | |
Collapse
|
213
|
Shi B, Guo X, Wu T, Sheng S, Wang J, Skogerbø G, Zhu X, Chen R. Genome-scale identification of Caenorhabditis elegans regulatory elements by tiling-array mapping of DNase I hypersensitive sites. BMC Genomics 2009; 10:92. [PMID: 19243610 PMCID: PMC2651899 DOI: 10.1186/1471-2164-10-92] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2008] [Accepted: 02/25/2009] [Indexed: 11/24/2022] Open
Abstract
Background A major goal of post-genomics research is the integrated analysis of genes, regulatory elements and the chromatin architecture on a genome-wide scale. Mapping DNase I hypersensitive sites within the nuclear chromatin is a powerful and well-established method of identifying regulatory element candidates. Results Here, we report the first genome-wide analysis of DNase I hypersensitive sites (DHSs) in Caenorhabditis elegans. The data was obtained by hybridizing DNase I-treated and end-captured material from young adult worms to a high-resolution tiling microarray. The data show that C. elegans DHSs were significantly enriched within intergenic regions located 2 kb upstream and downstream of coding genes, and also that a considerable fraction of all DHSs mapped to intergenic positions distant to annotated coding genes. Annotated transcribed loci were generally depleted in DHSs relative to intergenic regions, but DHSs were nonetheless enriched in coding exons and UTRs, whereas introns were significantly depleted in DHSs. Many DHSs appeared to be associated with annotated non-coding RNAs and recently detected transcripts of unknown function. It has been reported that nematode highly conserved non-coding elements were associated with cis-regulatory elements, and we also found that DHSs, particularly distal intergenic DHSs, were significantly enriched in regions that were conserved between the C. elegans and C. briggsae genomes. Conclusion We describe the first genome-wide analysis of C. elegans DHSs, and show that the distribution of DHSs is strongly associated with functional elements in the genome.
Collapse
Affiliation(s)
- Baochen Shi
- Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, PR China.
| | | | | | | | | | | | | | | |
Collapse
|
214
|
Mani P, Yadav VK, Das SK, Chowdhury S. Genome-wide analyses of recombination prone regions predict role of DNA structural motif in recombination. PLoS One 2009; 4:e4399. [PMID: 19198658 PMCID: PMC2635932 DOI: 10.1371/journal.pone.0004399] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2008] [Accepted: 12/17/2008] [Indexed: 11/18/2022] Open
Abstract
HapMap findings reveal surprisingly asymmetric distribution of recombinogenic regions. Short recombinogenic regions (hotspots) are interspersed between large relatively non-recombinogenic regions. This raises the interesting possibility of DNA sequence and/or other cis- elements as determinants of recombination. We hypothesized the involvement of non-canonical sequences that can result in local non-B DNA structures and tested this using the G-quadruplex DNA as a model. G-quadruplex or G4 DNA is a unique form of four-stranded non-B DNA structure that engages certain G-rich sequences, presence of such motifs has been noted within telomeres. In support of this hypothesis, genome-wide computational analyses presented here reveal enrichment of potential G4 (PG4) DNA forming sequences within 25618 human hotspots relative to 9290 coldspots (p<0.0001). Furthermore, co-occurrence of PG4 DNA within several short sequence elements that are associated with recombinogenic regions was found to be significantly more than randomly expected. Interestingly, analyses of more than 50 DNA binding factors revealed that co-occurrence of PG4 DNA with target DNA binding sites of transcription factors c-Rel, NF-kappa B (p50 and p65) and Evi-1 was significantly enriched in recombination-prone regions. These observations support involvement of G4 DNA in recombination, predicting a functional model that is consistent with duplex-strand separation induced by formation of G4 motifs in supercoiled DNA and/or when assisted by other cellular factors.
Collapse
Affiliation(s)
- Prithvi Mani
- G. N. Ramachandran Knowledge Centre for Genome Informatics, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
| | - Vinod Kumar Yadav
- G. N. Ramachandran Knowledge Centre for Genome Informatics, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
| | - Swapan Kumar Das
- Functional Genomics Unit, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
| | - Shantanu Chowdhury
- G. N. Ramachandran Knowledge Centre for Genome Informatics, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
- Proteomics and Structural Biology Unit, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
- * E-mail:
| |
Collapse
|
215
|
Gargiulo G, Minucci S. Epigenomic profiling of cancer cells. Int J Biochem Cell Biol 2009; 41:127-35. [DOI: 10.1016/j.biocel.2008.07.021] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2008] [Revised: 07/15/2008] [Accepted: 07/16/2008] [Indexed: 11/26/2022]
|
216
|
Prokesch A, Hackl H, Hakim-Weber R, Bornstein SR, Trajanoski Z. Novel insights into adipogenesis from omics data. Curr Med Chem 2009; 16:2952-64. [PMID: 19689276 PMCID: PMC2765082 DOI: 10.2174/092986709788803132] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2009] [Revised: 05/15/2009] [Accepted: 05/16/2009] [Indexed: 01/05/2023]
Abstract
Obesity, the excess accumulation of adipose tissue, is one of the most pressing health problems in both the Western world and in developing countries. Adipose tissue growth results from two processes: the increase in number of adipocytes (hyperplasia) that develop from precursor cells, and the growth of individual fat cells (hypertrophy) due to incorporation of triglycerides. Adipogenesis, the process of fat cell development, has been extensively studied using various cell and animal models. While these studies pointed out a number of key factors involved in adipogenesis, the list of molecular components is far from complete. The advance of high-throughput technologies has sparked many experimental studies aimed at the identification of novel molecular components regulating adipogenesis. This paper examines the results of recent studies on adipogenesis using high-throughput technologies. Specifically, it provides an overview of studies employing microarrays for gene expression profiling and studies using gel based and non-gel based proteomics as well as a chromatin immunoprecipitation followed by microarray analysis (ChIP-chip) or sequencing (ChIP-seq). Due to the maturity of the technology, the bulk of the available data was generated using microarrays. Therefore these data sets were not only reviewed but also underwent meta analysis. The review also shows that large-scale omics technologies in conjunction with sophisticated bioinformatics analyses can provide not only a list of novel players, but also a global view on biological processes and molecular networks. Finally, developing technologies and computational challenges associated with the data analyses are highlighted, and an outlook on the questions not previously addressed is provided.
Collapse
Affiliation(s)
- Andreas Prokesch
- Institute for Genomics and Bioinformatics, Graz University of Technology, Graz, Austria
| | - Hubert Hackl
- Institute for Genomics and Bioinformatics, Graz University of Technology, Graz, Austria
| | - Robab Hakim-Weber
- Department of Internal Medicine, Technical University Dresden, Dresden, Germany
| | - Stefan R Bornstein
- Department of Internal Medicine, Technical University Dresden, Dresden, Germany
| | - Zlatko Trajanoski
- Institute for Genomics and Bioinformatics, Graz University of Technology, Graz, Austria
| |
Collapse
|
217
|
Abstract
Regions of DNA that bind to the nuclear matrix, or nucleoskeleton, are known as Matrix Attachment Regions (MARs). MARs are thought to play an important role in higher-order structure and chromatin organization within the nucleus. MARs are also thought to act as boundaries of chromosomal domains that act to separate regions of gene-rich, decondensed euchromatin from highly repetitive, condensed heterochromatin. Herein I will present evidence that MARs do indeed act as domain boundaries and can prevent the spread of silencing into active genes. Many fundamental questions remain unanswered about how MARs function in the nucleus. New findings in epigenetics indicate that MARs may also play an important role in the organization of genes and the eventual transport of their mRNAs through the nuclear pore.
Collapse
|
218
|
Henikoff S, Henikoff JG, Sakai A, Loeb GB, Ahmad K. Genome-wide profiling of salt fractions maps physical properties of chromatin. Genome Res 2008; 19:460-9. [PMID: 19088306 DOI: 10.1101/gr.087619.108] [Citation(s) in RCA: 189] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
We applied genome-wide profiling to successive salt-extracted fractions of micrococcal nuclease-treated Drosophila chromatin. Chromatin fractions extracted with 80 mM or 150 mM NaCl after digestion contain predominantly mononucleosomes and represent classical "active" chromatin. Profiles of these low-salt soluble fractions display phased nucleosomes over transcriptionally active genes that are locally depleted of histone H3.3 and correspond closely to profiles of histone H2Av (H2A.Z) and RNA polymerase II. This correspondence suggests that transcription can result in loss of H3.3+H2Av nucleosomes and generate low-salt soluble nucleosomes. Nearly quantitative recovery of chromatin is obtained with 600 mM NaCl; however, the remaining insoluble chromatin is enriched in actively transcribed regions. Salt-insoluble chromatin likely represents oligonucleosomes that are attached to large protein complexes. Both low-salt extracted and insoluble chromatin are rich in sequences that correspond to epigenetic regulatory elements genome-wide. The presence of active chromatin at both extremes of salt solubility suggests that these salt fractions capture bound and unbound intermediates in active processes, thus providing a simple, powerful strategy for mapping epigenome dynamics.
Collapse
Affiliation(s)
- Steven Henikoff
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109-1024, USA.
| | | | | | | | | |
Collapse
|
219
|
Abstract
Chromatin immunoprecipitation followed by high-throughput (HTP) sequencing (ChIP-seq) is a powerful tool to establish protein-DNA interactions genome-wide. The primary limitation of its broad application at present is the often-limited access to sequencers. Here we report a protocol, Mab-seq, that generates genome-scale quality evaluations for nucleic acid libraries intended for deep-sequencing. We show how commercially available genomic microarrays can be used to maximize the efficiency of library creation and quickly generate reliable preliminary data on a chromosomal scale in advance of deep sequencing. We also exploit this technique to compare enriched regions identified using microarrays with those identified by sequencing, demonstrating that they agree on a core set of clearly identified enriched regions, while characterizing the additional enriched regions identifiable using HTP sequencing.
Collapse
|
220
|
de Wit E, van Steensel B. Chromatin domains in higher eukaryotes: insights from genome-wide mapping studies. Chromosoma 2008; 118:25-36. [PMID: 18853173 DOI: 10.1007/s00412-008-0186-0] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2008] [Revised: 09/22/2008] [Accepted: 09/23/2008] [Indexed: 01/22/2023]
Abstract
In genomes of higher eukaryotes, adjacent genes often show coordinated regulation of their expression. Compartmentalization of multiple neighboring genes into a shared chromatin environment can facilitate this coordinated expression. New mapping techniques have begun to reveal that such multigene chromatin domains are a common feature of fly and mammalian genomes. Many different types of chromatin domains have been identified based on the genomic binding patterns of various proteins and histone modifications. In addition, maps of genome-nuclear lamina associations and of looping interactions between loci provide the first systematic views of the three-dimensional folding of interphase chromosomes. These genome-wide datasets uncover new architectural principles of eukaryotic genomes and indicate that multigene chromatin domains are prevalent and important regulatory units.
Collapse
Affiliation(s)
- Elzo de Wit
- Division of Gene Regulation, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | | |
Collapse
|
221
|
Schliep A, Krause R. Efficient algorithms for the computational design of optimal tiling arrays. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2008; 5:557-567. [PMID: 18989043 DOI: 10.1109/tcbb.2008.50] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
The representation of a genome by oligonucleotide probes is a prerequisite for the analysis of many of its basic properties, such as transcription factor binding sites, chromosomal breakpoints, gene expression of known genes and detection of novel genes, in particular those coding for small RNAs. An ideal representation would consist of a high density set of oligonucleotides with similar melting temperatures that do not cross-hybridize with other regions of the genome and are equidistantly spaced. The implementation of such design is typically called a tiling array or genome array. We formulate the minimal cost tiling path problem for the selection of oligonucleotides from a set of candidates. Computing the selection of probes requires multi-criterion optimization, which we cast into a shortest path problem. Standard algorithms running in linear time allow us to compute globally optimal tiling paths from millions of candidate oligonucleotides on a standard desktop computer for most problem variants. The solutions to this multi-criterion optimization are spatially adaptive to the problem instance. Our formulation incorporates experimental constraints with respect to specific regions of interest and trade offs between hybridization parameters, probe quality and tiling density easily. A web application is available at http://tileomatic.org.
Collapse
Affiliation(s)
- Alexander Schliep
- Department Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 69-73, 14195 Berlin, Germany
| | | |
Collapse
|
222
|
Kharchenko PV, Woo CJ, Tolstorukov MY, Kingston RE, Park PJ. Nucleosome positioning in human HOX gene clusters. Genome Res 2008; 18:1554-61. [PMID: 18723689 DOI: 10.1101/gr.075952.107] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The distribution of nucleosomes along the genome is a significant aspect of chromatin structure and is thought to influence gene regulation through modulation of DNA accessibility. However, properties of nucleosome organization remain poorly understood, particularly in mammalian genomes. Toward this goal we used tiled microarrays to identify stable nucleosome positions along the HOX gene clusters in human cell lines. We show that nucleosome positions exhibit sequence properties and long-range organization that are different from those characterized in other organisms. Despite overall variability of internucleosome distances, specific loci contain regular nucleosomal arrays with 195-bp periodicity. Moreover, such arrays tend to occur preferentially toward the 3' ends of genes. Through comparison of different cell lines, we find that active transcription is correlated with increased positioning of nucleosomes, suggesting an unexpected role for transcription in the establishment of well-positioned nucleosomes.
Collapse
Affiliation(s)
- Peter V Kharchenko
- Harvard Partners Center for Genetics and Genomics, Boston, Massachusetts 02115, USA
| | | | | | | | | |
Collapse
|
223
|
Inda MA, van Batenburg MF, Roos M, Belloum ASZ, Vasunin D, Wibisono A, van Kampen AHC, Breit TM. SigWin-detector: a Grid-enabled workflow for discovering enriched windows of genomic features related to DNA sequences. BMC Res Notes 2008; 1:63. [PMID: 18710516 PMCID: PMC2533338 DOI: 10.1186/1756-0500-1-63] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2008] [Accepted: 08/08/2008] [Indexed: 11/16/2022] Open
Abstract
Background Chromosome location is often used as a scaffold to organize genomic information in both the living cell and molecular biological research. Thus, ever-increasing amounts of data about genomic features are stored in public databases and can be readily visualized by genome browsers. To perform in silico experimentation conveniently with this genomics data, biologists need tools to process and compare datasets routinely and explore the obtained results interactively. The complexity of such experimentation requires these tools to be based on an e-Science approach, hence generic, modular, and reusable. A virtual laboratory environment with workflows, workflow management systems, and Grid computation are therefore essential. Findings Here we apply an e-Science approach to develop SigWin-detector, a workflow-based tool that can detect significantly enriched windows of (genomic) features in a (DNA) sequence in a fast and reproducible way. For proof-of-principle, we utilize a biological use case to detect regions of increased and decreased gene expression (RIDGEs and anti-RIDGEs) in human transcriptome maps. We improved the original method for RIDGE detection by replacing the costly step of estimation by random sampling with a faster analytical formula for computing the distribution of the null hypothesis being tested and by developing a new algorithm for computing moving medians. SigWin-detector was developed using the WS-VLAM workflow management system and consists of several reusable modules that are linked together in a basic workflow. The configuration of this basic workflow can be adapted to satisfy the requirements of the specific in silico experiment. Conclusion As we show with the results from analyses in the biological use case on RIDGEs, SigWin-detector is an efficient and reusable Grid-based tool for discovering windows enriched for features of a particular type in any sequence of values. Thus, SigWin-detector provides the proof-of-principle for the modular e-Science based concept of integrative bioinformatics experimentation.
Collapse
Affiliation(s)
- Márcia A Inda
- Integrative Bioinformatics Unit, Swammerdam Institute for Life Sciences, Faculty of Science, University of Amsterdam, PO Box 94062, 1090 GB Amsterdam, The Netherlands.
| | | | | | | | | | | | | | | |
Collapse
|
224
|
Extrapolating ENCODE data to the whole human genome. Gene 2008; 419:66-9. [DOI: 10.1016/j.gene.2008.02.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2007] [Revised: 02/04/2008] [Accepted: 02/05/2008] [Indexed: 11/18/2022]
|
225
|
Nikolaev LG, Akopov SB, Chernov IP, Sverdlov ED. Maps of cis-Regulatory Nodes in Megabase Long Genome Segments are an Inevitable Intermediate Step Toward Whole Genome Functional Mapping. Curr Genomics 2008; 8:137-49. [PMID: 18660850 DOI: 10.2174/138920207780368178] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2007] [Revised: 02/22/2007] [Accepted: 02/27/2007] [Indexed: 11/22/2022] Open
Abstract
The availability of complete human and other metazoan genome sequences has greatly facilitated positioning and analysis of various genomic functional elements, with initial emphasis on coding sequences. However, complete functional maps of sequenced eukaryotic genomes should include also positions of all non-coding regulatory elements. Unfortunately, experimental data on genomic positions of a multitude of regulatory sequences, such as enhancers, silencers, insulators, transcription terminators, and replication origins are very limited, especially at the whole genome level. Since most genomic regulatory elements (e.g. enhancers) are generally gene-, tissue-, or cell-specific, the prediction of these elements by computational methods is difficult and often ambiguous. Therefore, the development of high-throughput experimental approaches for identifying and mapping genomic functional elements is highly desirable. At the same time, the creation of whole-genome map of hundreds of thousands of regulatory elements in several hundreds of tissue/cell types is presently far beyond our capabilities. A possible alternative for the whole genome approach is to concentrate efforts on individual genomic segments and then to integrate the data obtained into a whole genome functional map. Moreover, the maps of polygenic fragments with functional cis-regulatory elements would provide valuable data on complex regulatory systems, including their variability and evolution. Here, we reviewed experimental approaches to the realization of these ideas, including our own developments of experimental techniques for selection of cis-acting functionally active DNA fragments from large (megabase-sized) segments of mammalian genomes.
Collapse
Affiliation(s)
- Lev G Nikolaev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10 Miklukho-Maklaya,117997, Moscow, Russia
| | | | | | | |
Collapse
|
226
|
Singer GAC, Wu J, Yan P, Plass C, Huang THM, Davuluri RV. Genome-wide analysis of alternative promoters of human genes using a custom promoter tiling array. BMC Genomics 2008; 9:349. [PMID: 18655706 PMCID: PMC2527337 DOI: 10.1186/1471-2164-9-349] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2007] [Accepted: 07/25/2008] [Indexed: 11/23/2022] Open
Abstract
Background Independent lines of evidence suggested that a large fraction of human genes possess multiple promoters driving gene expression from distinct transcription start sites. Understanding which promoter is employed in which cellular context is required to unravel gene regulatory networks within the cell. Results We have developed a custom microarray platform that tiles roughly 35,000 alternative putative promoters from nearly 7,000 genes in the human genome. To demonstrate the utility of this array platform, we have analyzed the patterns of promoter usage in 17β-estradiol (E2)-treated and untreated MCF7 cells and show widespread usage of alternative promoters. Most intriguingly, we show that the downstream promoter in E2-sensitive multiple promoter genes tends to be very close to the 3'-terminus of the gene, suggesting exotic mechanisms of expression regulation in these genes. Conclusion The usage of alternative promoters greatly multiplies the transcriptional complexity available within the human genome. The fact that many of these promoters are incapable of driving the synthesis of a meaningful protein-encoding transcript further complicates the story.
Collapse
Affiliation(s)
- Gregory A C Singer
- Human Cancer Genetics Program, Comprehensive Cancer Center, Department of Molecular Virology, Immunology, and Medical Genetics, The Ohio State University, Columbus, OH, USA.
| | | | | | | | | | | |
Collapse
|
227
|
Taylor JM, Wicks K, Vandiedonck C, Knight JC. Chromatin profiling across the human tumour necrosis factor gene locus reveals a complex, cell type-specific landscape with novel regulatory elements. Nucleic Acids Res 2008; 36:4845-62. [PMID: 18653526 PMCID: PMC2528168 DOI: 10.1093/nar/gkn444] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The TNF locus on chromosome 6p21 encodes a family of proteins with key roles in the immune response whose dysregulation leads to severe disease. Transcriptional regulation is important, with cell type and stimulus-specific enhancer complexes involving the proximal TNF promoter. We show how quantitative chromatin profiling across a 34 kb region spanning the TNF locus has allowed us to identify a number of novel DNase hypersensitive sites and characterize more distant regulatory elements. We demonstrate DNase hypersensitive sites corresponding to the lymphotoxin alpha (LTA) and tumour necrosis factor (TNF) promoter regions, a CpG island in exon 4 of lymphotoxin beta (LTB), the 3′ end of nuclear factor of kappa light polypeptide gene enhancer in B-cells inhibitor-like 1 (NFKBIL1) and 3.4 kb upstream of LTA. These sites co-localize to highly conserved DNA sequences and show evidence of cell type specificity when lymphoblastoid, Jurkat, U937, HeLa and HEK293T cell lines are analysed using Southern blotting. For Jurkat T cells, we define histone modifications across the locus. Peaks of acetylated histone H3 and H4, together with tri-methyl K4 of histone H3, correspond to hypersensitive sites, notably in exon 4 of LTB. We provide evidence of a functional role for an intergenic DNase I hypersensitive site distal to LTA in Jurkat cells based on reporter gene analysis, with evidence of recruitment of upstream stimulatory factors (USF) transcription factors.
Collapse
Affiliation(s)
- Jennifer M Taylor
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | | | | | | |
Collapse
|
228
|
Lian H, Thompson WA, Thurman R, Stamatoyannopoulos JA, Noble WS, Lawrence CE. Automated mapping of large-scale chromatin structure in ENCODE. ACTA ACUST UNITED AC 2008; 24:1911-6. [PMID: 18591192 PMCID: PMC2519158 DOI: 10.1093/bioinformatics/btn335] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Motivation: A recently developed DNaseI assay has given us our first genome-wide view of chromatin structure. In addition to cataloging DNaseI hypersensitive sites, these data allows us to more completely characterize overall features of chromatin accessibility. We employed a Bayesian hierarchical change-point model (CPM), a generalization of a hidden Markov Model (HMM), to characterize tiled microarray DNaseI sensitivity data available from the ENCODE project. Results: Our analysis shows that the accessibility of chromatin to cleavage by DNaseI is well described by a four state model of local segments with each state described by a continuous mixture of Gaussian variables. The CPM produces a better fit to the observed data than the HMM. The large posterior probability for the four-state CPM suggests that the data falls naturally into four classes of regions, which we call major and minor DNaseI hypersensitive sites (DHSs), regions of intermediate sensitivity, and insensitive regions. These classes agree well with a model of chromatin in which local disruptions (DHSs) are concentrated within larger domains of intermediate sensitivity, the accessibility islands. The CPM assigns 92% of the bases within the ENCODE regions to the insensitive regions. The 5.8% of the bases that are in regions of intermediate sensitivity are clearly enriched in functional elements, including genes and activating histone modifications, while the remaining 2.2% of the bases in hypersensitive regions are very strongly enriched in these elements. Availability: The CPM software is available upon request from the authors. Contact:jstam@stamlab.org; noble@gs.washington.edu; Charles_Lawrence@brown.edu Supplementary information:Supplementary data are available at Bioinformatics online. Source code is available at http://noble.gs.washington.edu/proj/segment.
Collapse
Affiliation(s)
- Heng Lian
- Division of Mathematical Sciences, SPMS, Nanyang Technological University, Singapore
| | | | | | | | | | | |
Collapse
|
229
|
Chapuis J, Moisan F, Mellick G, Elbaz A, Silburn P, Pasquier F, Hannequin D, Lendon C, Campion D, Amouyel P, Lambert JC. Association study of the NEDD9 gene with the risk of developing Alzheimer's and Parkinson's disease. Hum Mol Genet 2008; 17:2863-7. [PMID: 18579580 DOI: 10.1093/hmg/ddn183] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Alzheimer's disease (AD) and Parkinson's disease (PD), the two most common neurodegenerative disorders in the elderly, have been hypothesized to share genetic determinants. Recently, Li et al. proposed that a variant in the NEDD9 gene may be one of these common genetic factors. We attempted to confirm this initial observation by conducting an equivalent analysis in terms of pathologies and sample size. We genotyped the NEDD9 rs760678 SNP in three independent AD case-control studies (n = 3176) and two independent PD case-control studies (n = 1855). However, we failed to detect an association of this SNP with the risk of developing AD or PD, in any of these populations. In conclusion, these data indicate that the rs760678 SNP of the NEDD9 gene is at best a weak genetic determinant of AD or PD.
Collapse
Affiliation(s)
- Julien Chapuis
- Inserm, U744, Institut Pasteur de Lille, Université de Lille 2, Lille, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
230
|
Rowell E, Merkenschlager M, Wilson CB. Long-range regulation of cytokine gene expression. Curr Opin Immunol 2008; 20:272-80. [PMID: 18485687 DOI: 10.1016/j.coi.2008.03.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2008] [Revised: 03/01/2008] [Accepted: 03/25/2008] [Indexed: 11/30/2022]
Abstract
In metazoans, transcription is regulated by promoters and additional elements, which may be located far from their target gene(s). Moreover, genes (including those encoding cytokines and cytokine receptors) are commonly clustered in the genome, providing the opportunity for the shared, competitive, or sequential use of regulatory elements. New techniques, discussed here, are generating an avalanche of high-resolution genome-wide data through which candidate regulatory elements have been identified in specific cell types (including T cells), their functions inferred, and their physical interactions in three-dimensional space demonstrated. As a result, a nearly comprehensive list of regulatory elements in the Th2 cytokine locus, a growing list of elements in the interferon-gamma gene locus, and maps of their three-dimensional interactions are now available, though much remains to be learned about the molecular mechanisms at play, the dynamics of these interactions, and their functional importance.
Collapse
Affiliation(s)
- Emily Rowell
- Department of Immunology, University of Washington, Seattle, WA 98195, USA
| | | | | |
Collapse
|
231
|
John S, Sabo PJ, Johnson TA, Sung MH, Biddie SC, Lightman SL, Voss TC, Davis SR, Meltzer PS, Stamatoyannopoulos JA, Hager GL. Interaction of the glucocorticoid receptor with the chromatin landscape. Mol Cell 2008; 29:611-24. [PMID: 18342607 DOI: 10.1016/j.molcel.2008.02.010] [Citation(s) in RCA: 248] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2007] [Revised: 10/22/2007] [Accepted: 02/27/2008] [Indexed: 11/18/2022]
Abstract
The generality and spectrum of chromatin-remodeling requirements for nuclear receptor function are unknown. We have characterized glucocorticoid receptor (GR) binding events and chromatin structural transitions across GR-induced or -repressed genes. This analysis reveals that GR binding invariably occurs at nuclease-accessible sites (DHS). A remarkable diversity of mechanisms, however, render these sites available for GR binding. Accessibility of the GR binding sites is either constitutive or hormone inducible. Within each category, some DHS sites require the Brg1-containing Swi/Snf complex, but others are Brg1 independent, implicating a different remodeling complex. The H2A.Z histone variant is highly enriched at both inducible and constitutive DHS sites and is subject to exchange during hormone activation. The DHS profile is highly cell specific, implicating cell-selective organization of the chromatin landscape as a critical determinant of tissue-selective receptor function. Furthermore, the widespread requirement for chromatin remodeling supports the recent hypothesis that the rapid exchange of receptor proteins occurs during nucleosome reorganization.
Collapse
Affiliation(s)
- Sam John
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD 20892-5055, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
232
|
Boyle AP, Davis S, Shulha HP, Meltzer P, Margulies EH, Weng Z, Furey TS, Crawford GE. High-resolution mapping and characterization of open chromatin across the genome. Cell 2008; 132:311-22. [PMID: 18243105 DOI: 10.1016/j.cell.2007.12.014] [Citation(s) in RCA: 1018] [Impact Index Per Article: 63.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2007] [Revised: 10/22/2007] [Accepted: 12/04/2007] [Indexed: 12/28/2022]
Abstract
Mapping DNase I hypersensitive (HS) sites is an accurate method of identifying the location of genetic regulatory elements, including promoters, enhancers, silencers, insulators, and locus control regions. We employed high-throughput sequencing and whole-genome tiled array strategies to identify DNase I HS sites within human primary CD4+ T cells. Combining these two technologies, we have created a comprehensive and accurate genome-wide open chromatin map. Surprisingly, only 16%-21% of the identified 94,925 DNase I HS sites are found in promoters or first exons of known genes, but nearly half of the most open sites are in these regions. In conjunction with expression, motif, and chromatin immunoprecipitation data, we find evidence of cell-type-specific characteristics, including the ability to identify transcription start sites and locations of different chromatin marks utilized in these cells. In addition, and unexpectedly, our analyses have uncovered detailed features of nucleosome structure.
Collapse
Affiliation(s)
- Alan P Boyle
- Institute for Genome Sciences & Policy, Duke University, Durham, NC 27708, USA
| | | | | | | | | | | | | | | |
Collapse
|
233
|
Song JS, Johnson WE, Zhu X, Zhang X, Li W, Manrai AK, Liu JS, Chen R, Liu XS. Model-based analysis of two-color arrays (MA2C). Genome Biol 2008; 8:R178. [PMID: 17727723 PMCID: PMC2375008 DOI: 10.1186/gb-2007-8-8-r178] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2007] [Revised: 07/02/2007] [Accepted: 08/29/2007] [Indexed: 01/14/2023] Open
Abstract
A novel normalization method based on the GC content of probes is developed for two-color tiling arrays. The proposed method, together with robust estimates of the model parameters, is shown to perform superbly on published data sets. A robust algorithm for detecting peak regions is also formulated and shown to perform well compared to other approaches. The tools have been implemented as a stand-alone Java program called MA2C, which can display various plots of statistical analysis for quality control.
Collapse
Affiliation(s)
- Jun S Song
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA
- Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA
| | - W Evan Johnson
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA
- Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA
| | - Xiaopeng Zhu
- Bioinformatics Laboratory, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Xinmin Zhang
- NimbleGen Systems, Inc., Science Court, Madison, Wisconsin 53711, USA
| | - Wei Li
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA
- Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA
| | - Arjun K Manrai
- Department of Physics, Harvard University, Cambridge, MA 02138, USA
| | - Jun S Liu
- Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA
- Department of Statistics, Harvard University, 1 Oxford Street, Cambridge, MA 02138, USA
| | - Runsheng Chen
- Bioinformatics Laboratory, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - X Shirley Liu
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA
- Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA
| |
Collapse
|
234
|
Di Filippo M, Bernardi G. Mapping DNase-I hypersensitive sites on human isochores. Gene 2008; 419:62-5. [PMID: 18436395 DOI: 10.1016/j.gene.2008.02.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2007] [Revised: 02/04/2008] [Accepted: 02/05/2008] [Indexed: 10/22/2022]
Abstract
Mapping DNase-I hypersensitive sites (HS) was used in the past to identify regulatory elements of specific genes. More recently, thousands of HS were identified in the human genome by using high-throughput methods. These approaches showed a general enrichment of HS near or within known genes, within CpG islands, within human-mouse conserved regions and in GC-rich regions of the genome. Here we show that HS: (i) are characterized by a much higher GC level (approximately 56%) than the average GC level of the human genome (approximately 41%); (ii) are overwhelmingly located in the GC-richest compartment of the genome, which is predominantly associated with an open chromatin structure; (iii) and are slightly more and slightly less frequent than genes, respectively, in the gene-rich and in the gene-poor isochore families.
Collapse
Affiliation(s)
- Miriam Di Filippo
- Laboratory of Molecular Evolution, Stazione Zoologica Anton Dohrn, 80121 Naples, Italy
| | | |
Collapse
|
235
|
Abstract
While the hereditary information encoded in the Watson-Crick base pairing of genomes is largely static within a given individual, access to this information is controlled by dynamic mechanisms. The human genome is pervasively transcribed, but the roles played by the majority of the non-protein-coding genome sequences are still largely unknown. In this review we focus on insights to gene transcriptional regulation by placing special emphasis on genome-wide approaches, and on how non-coding RNAs, which derive from global transcription of the genome, in turn control gene expression. We review recent progress in the field with highlights on the immune system.
Collapse
Affiliation(s)
- Matthew E Pipkin
- Immune Disease Institute and Harvard Medical School, Boston, MA, USA
| | | |
Collapse
|
236
|
Henikoff S. Nucleosome destabilization in the epigenetic regulation of gene expression. Nat Rev Genet 2008; 9:15-26. [DOI: 10.1038/nrg2206] [Citation(s) in RCA: 351] [Impact Index Per Article: 21.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
237
|
Interferon regulatory factors are transcriptional regulators of adipogenesis. Cell Metab 2008; 7:86-94. [PMID: 18177728 PMCID: PMC2278019 DOI: 10.1016/j.cmet.2007.11.002] [Citation(s) in RCA: 107] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/14/2007] [Revised: 08/03/2007] [Accepted: 11/05/2007] [Indexed: 12/31/2022]
Abstract
We have sought to identify transcriptional pathways in adipogenesis using an integrated experimental and computational approach. Here, we employ high-throughput DNase hypersensitivity analysis to find regions of altered chromatin structure surrounding key adipocyte genes. Regions that display differentiation-dependent changes in hypersensitivity were used to predict binding sites for proteins involved in adipogenesis. A high-scoring example was a binding motif for interferon regulatory factor (IRF) family members. Expression of all nine mammalian IRF mRNAs is regulated during adipogenesis, and several bind to the identified motifs in a differentiation-dependent manner. Furthermore, several IRF proteins repress differentiation. This analysis suggests an important role for IRF proteins in adipocyte biology and demonstrates the utility of this approach in identifying cis- and trans-acting factors not previously suspected to participate in adipogenesis.
Collapse
|
238
|
Higgs DR, Vernimmen D, Hughes J, Gibbons R. Using genomics to study how chromatin influences gene expression. Annu Rev Genomics Hum Genet 2007; 8:299-325. [PMID: 17506662 DOI: 10.1146/annurev.genom.8.080706.092323] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
A postgenome challenge is to understand how the code in DNA is converted into the biological processes underlying various cell fates. By establishing the appropriate technical tools, we are moving from an era in which such questions have been asked by studying individual genes to one in which large domains, whole chromosomes, and the entire human genome can be investigated. These developments will allow us to study in parallel the transcriptional program and components of the epigenetic program (nuclear position, timing of replication, chromatin structure and modification, DNA methylation) to determine the hierarchy and order of events required to switch genes on and off during differentiation and development.
Collapse
Affiliation(s)
- Douglas R Higgs
- MRC Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford OX3 9DS, United Kingdom.
| | | | | | | |
Collapse
|
239
|
Affiliation(s)
- X Shirley Liu
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Harvard School of Public Health, Boston, Massachusetts, USA.
| |
Collapse
|
240
|
Buckley NJ. Analysis of transcription, chromatin dynamics and epigenetic changes in neural genes. Prog Neurobiol 2007; 83:195-210. [PMID: 17884276 DOI: 10.1016/j.pneurobio.2007.07.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2006] [Revised: 06/14/2007] [Accepted: 07/18/2007] [Indexed: 01/08/2023]
Abstract
The ways in which gene transcription is investigated have undergone radical change since the turn of the millennium. Piece-meal approaches focussed upon model genes have increasingly been complemented by genome-wide approaches that allow interrogation of multiple cohorts of genes or even entire genomes. This sea change has been founded upon the increasing availability of whole genome sequences and the attendant evolution of microarray based discovery platforms. Collectively, these approaches are being used to build a global and dynamic perspective of transcription factor occupancy, co-factor recruitment and epigenetic signature. As yet, few of these approaches have been applied to the study of neuronal gene transcription, but this is set to change. Here, I review these key developments and point to their potential application to the study of transcriptional and epigenetic changes in neurons in health and disease.
Collapse
Affiliation(s)
- Noel J Buckley
- King's College London, Department of Neuroscience, Institute of Psychiatry, Centre for the Cellular Basis of Behaviour, CCBB/CCIB, Room 1-045, 125 Coldharbour Lane, London SE5 9NU, UK.
| |
Collapse
|
241
|
Abstract
Computational analysis of eukaryotic promoters is one of the most difficult problems in computational genomics and is essential for understanding gene expression profiles and reverse-engineering gene regulation network circuits. Here I give a basic introduction of the problem and recent update on both experimental and computational approaches. More details may be found in the extended references. This review is based on a summer lecture given at Max Planck Institute at Berlin in 2005.
Collapse
Affiliation(s)
- Michael Q Zhang
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| |
Collapse
|
242
|
So AYL, Chaivorapol C, Bolton EC, Li H, Yamamoto KR. Determinants of cell- and gene-specific transcriptional regulation by the glucocorticoid receptor. PLoS Genet 2007; 3:e94. [PMID: 17559307 PMCID: PMC1904358 DOI: 10.1371/journal.pgen.0030094] [Citation(s) in RCA: 240] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2007] [Accepted: 04/23/2007] [Indexed: 11/19/2022] Open
Abstract
The glucocorticoid receptor (GR) associates with glucocorticoid response elements (GREs) and regulates selective gene transcription in a cell-specific manner. Native GREs are typically thought to be composite elements that recruit GR as well as other regulatory factors into functional complexes. We assessed whether GR occupancy is commonly a limiting determinant of GRE function as well as the extent to which core GR binding sequences and GRE architecture are conserved at functional loci. We surveyed 100-kb regions surrounding each of 548 known or potentially glucocorticoid-responsive genes in A549 human lung cells for GR-occupied GREs. We found that GR was bound in A549 cells predominately near genes responsive to glucocorticoids in those cells and not at genes regulated by GR in other cells. The GREs were positionally conserved at each responsive gene but across the set of responsive genes were distributed equally upstream and downstream of the transcription start sites, with 63% of them >10 kb from those sites. Strikingly, although the core GR binding sequences across the set of GREs varied extensively around a consensus, the precise sequence at an individual GRE was conserved across four mammalian species. Similarly, sequences flanking the core GR binding sites also varied among GREs but were conserved at individual GREs. We conclude that GR occupancy is a primary determinant of glucocorticoid responsiveness in A549 cells and that core GR binding sequences as well as GRE architecture likely harbor gene-specific regulatory information. The glucocorticoid receptor (GR) regulates a myriad of physiological functions, such as cell differentiation and metabolism, achieved through modulating transcription in a cell- and gene-specific manner. However, the determinants that specify cell- and gene-specific GR transcriptional regulation are not well established. We describe three properties that contribute to this specificity: (1) GR occupancy at genomic glucocorticoid response elements (GREs) appears to be a primary determinant of glucocorticoid responsiveness; (2) the DNA sequences bound by GR vary widely around a consensus, but the precise sequences of individual GREs are highly conserved, suggesting a role for these sequences in gene-specific GR transcriptional regulation; and (3) native chromosomal GREs were generally found to be composite elements, comprised of multiple factor binding sites that were highly variable in composition, but as with the GR binding sequences, highly conserved at individual GREs. In addition, we discovered that most GREs were positioned far from their GR target genes and that they were equally distributed upstream and downstream of the target genes. These findings, which may be applicable to other regulatory factors, provide fundamental insights for understanding cell- and gene-specific transcriptional regulation.
Collapse
Affiliation(s)
- Alex Yick-Lun So
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
- Chemistry and Chemical Biology Graduate Program, University of California San Francisco, San Francisco, California, United States of America
| | - Christina Chaivorapol
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
- California Institute for Quantitative Biomedical Research, University of California San Francisco, San Francisco, California, United States of America
- Graduate Program in Biological and Medical Informatics, University of California San Francisco, San Francisco, California, United States of America
| | - Eric C Bolton
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
| | - Hao Li
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
- California Institute for Quantitative Biomedical Research, University of California San Francisco, San Francisco, California, United States of America
- Graduate Program in Biological and Medical Informatics, University of California San Francisco, San Francisco, California, United States of America
| | - Keith R Yamamoto
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
- Chemistry and Chemical Biology Graduate Program, University of California San Francisco, San Francisco, California, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
243
|
Sharp AJ, Itsara A, Cheng Z, Alkan C, Schwartz S, Eichler EE. Optimal design of oligonucleotide microarrays for measurement of DNA copy-number. Hum Mol Genet 2007; 16:2770-9. [PMID: 17725982 DOI: 10.1093/hmg/ddm234] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Copy-number variants (CNVs) occur frequently within the human genome, and may be associated with many human phenotypes. If disease association studies of CNVs are to be performed routinely, it is essential that the copy-number status be accurately genotyped. We systematically assessed the dynamic range response of an oligonucleotide microarray platform to accurately predict copy-number in a set of seven patients who had previously been shown to carry between 1 and 6 copies of an approximately 4 Mb region of 15q12.2-q13.1. We identify probe uniqueness, probe length, uniformity of probe melting temperature, overlap with SNPs and common repeats (particularly Alu elements) and guanine homopolymer content as parameters that significantly affect probe performance. Further, we prove the influence of these criteria on array performance by using these parameters to prospectively filter data from a second array design covering an independent genomic region and observing significant improvements in data quality. The informed selection of probes which have superior performance characteristics allows the prospective design of oligonucleotide arrays which show increased sensitivity and specificity compared with current designs. Although based on the analysis of data from comparative genomic hybridization experiments, we anticipate that our results are relevant to the design of improved oligonucleotide arrays for high-throughput copy-number genotyping of complex regions of the human genome.
Collapse
Affiliation(s)
- Andrew J Sharp
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | | | | | | | | |
Collapse
|
244
|
Trinklein ND, Karaöz U, Wu J, Halees A, Force Aldred S, Collins PJ, Zheng D, Zhang ZD, Gerstein MB, Snyder M, Myers RM, Weng Z. Integrated analysis of experimental data sets reveals many novel promoters in 1% of the human genome. Genome Res 2007; 17:720-31. [PMID: 17567992 PMCID: PMC1891333 DOI: 10.1101/gr.5716607] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The regulation of transcriptional initiation in the human genome is a critical component of global gene regulation, but a complete catalog of human promoters currently does not exist. In order to identify regulatory regions, we developed four computational methods to integrate 129 sets of ENCODE-wide chromatin immunoprecipitation data. They collectively predicted 1393 regions. Roughly 47% of the regions were unique to one method, as each method makes different assumptions about the data. Overall, predicted regions tend to localize to highly conserved, DNase I hypersensitive, and actively transcribed regions in the genome. Interestingly, a significant portion of the regions overlaps with annotated 3'-UTRs, suggesting that some of them might regulate anti-sense transcription. The majority of the predicted regions are >2 kb away from the 5'-ends of previously annotated human cDNAs and hence are novel. These novel regions may regulate unannotated transcripts or may represent new alternative transcription start sites of known genes. We tested 163 such regions for promoter activity in four cell lines using transient transfection assays, and 25% of them showed transcriptional activity above background in at least one cell line. We also performed 5'-RACE experiments on 62 novel regions, and 76% of the regions were associated with the 5'-ends of at least two RACE products. Our results suggest that there are at least 35% more functional promoters in the human genome than currently annotated.
Collapse
Affiliation(s)
- Nathan D. Trinklein
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Ulaş Karaöz
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
| | - Jiaqian Wu
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA
| | - Anason Halees
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
| | - Shelley Force Aldred
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Patrick J. Collins
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Deyou Zheng
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Zhengdong D. Zhang
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Mark B. Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Michael Snyder
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Richard M. Myers
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
- Corresponding authors.E-mail ; fax (617) 353-6766.E-mail ; fax (650) 725-9689
| | - Zhiping Weng
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
- Biomedical Engineering Department, Boston University, Boston, Massachusetts 02215, USA
- Corresponding authors.E-mail ; fax (617) 353-6766.E-mail ; fax (650) 725-9689
| |
Collapse
|
245
|
Greenbaum JA, Parker SC, Tullius TD. Detection of DNA structural motifs in functional genomic elements. Genome Res 2007; 17:940-6. [PMID: 17568009 PMCID: PMC1891352 DOI: 10.1101/gr.5602807] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The completion of the human genome project has fueled the search for regulatory elements by a variety of different approaches. Many successful analyses have focused on examining primary DNA sequence and/or chromatin structure. However, it has been difficult to detect common sequence motifs within the feature of chromatin structure most closely associated with regulatory elements, DNase I hypersensitive sites (DHSs). Considering just the nucleotide sequence and/or the chromatin structure of regulatory elements may neglect a critical feature of what is recognized by the regulatory machinery--DNA structure. We introduce a new computational method to detect common DNA structural motifs in a large collection of DHSs that are found in the ENCODE regions of the human genome. We show that DHSs have common DNA structural motifs that show no apparent sequence consensus. One such structural motif is much more highly enriched in experimentally identified DHSs that are in CpG islands and near transcription start sites (TSSs), compared to DHSs not in CpG islands and farther from TSSs, suggesting that DNA structural motifs may participate in the formation of functional regulatory elements. We propose that studies of the conservation of DNA structure, independent of sequence conservation, will provide new information about the link between the nucleotide sequence of a DNA molecule and its experimentally demonstrated function.
Collapse
Affiliation(s)
- Jason A. Greenbaum
- Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA
| | - Stephen C.J. Parker
- Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA
| | - Thomas D. Tullius
- Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA
- Department of Chemistry, Boston University, Boston, Massachusetts 02215, USA
- Corresponding author.E-mail ; fax (617) 353-6466
| |
Collapse
|
246
|
King DC, Taylor J, Zhang Y, Cheng Y, Lawson HA, Martin J, Chiaromonte F, Miller W, Hardison RC. Finding cis-regulatory elements using comparative genomics: some lessons from ENCODE data. Genome Res 2007; 17:775-86. [PMID: 17567996 PMCID: PMC1891337 DOI: 10.1101/gr.5592107] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Identification of functional genomic regions using interspecies comparison will be most effective when the full span of relationships between genomic function and evolutionary constraint are utilized. We find that sets of putative transcriptional regulatory sequences, defined by ENCODE experimental data, have a wide span of evolutionary histories, ranging from stringent constraint shown by deep phylogenetic comparisons to recent selection on lineage-specific elements. This diversity of evolutionary histories can be captured, at least in part, by the suite of available comparative genomics tools, especially after correction for regional differences in the neutral substitution rate. Putative transcriptional regulatory regions show alignability in different clades, and the genes associated with them are enriched for distinct functions. Some of the putative regulatory regions show evidence for recent selection, including a primate-specific, distal promoter that may play a novel role in regulation.
Collapse
Affiliation(s)
- David C. King
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - James Taylor
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Ying Zhang
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Yong Cheng
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Heather A. Lawson
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Anthropology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Joel Martin
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | - Francesca Chiaromonte
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Webb Miller
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Ross C. Hardison
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Corresponding author.E-mail ; fax (814) 863-7024
| |
Collapse
|
247
|
Eeckhoute J, Keeton EK, Lupien M, Krum SA, Carroll JS, Brown M. Positive Cross-Regulatory Loop Ties GATA-3 to Estrogen Receptor α Expression in Breast Cancer. Cancer Res 2007; 67:6477-83. [PMID: 17616709 DOI: 10.1158/0008-5472.can-07-0746] [Citation(s) in RCA: 280] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The transcription factor GATA-3 is required for normal mammary gland development, and its expression is highly correlated with estrogen receptor alpha (ER alpha) in human breast tumors. However, the functional role of GATA-3 in ER alpha-positive breast cancers is yet to be established. Here, we show that GATA-3 is required for estradiol stimulation of cell cycle progression in breast cancer cells. The role of GATA-3 in estradiol signaling requires the direct positive regulation of the expression of the ER alpha gene itself by GATA-3. GATA-3 binds to two cis-regulatory elements located within the ER alpha gene, and this is required for RNA polymerase II recruitment to ER alpha promoters. Reciprocally, ER alpha directly stimulates the transcription of the GATA-3 gene, indicating that these two factors are involved in a positive cross-regulatory loop. Moreover, GATA-3 and ER alpha regulate their own expression in breast cancer cells. Hence, this transcriptional coregulatory mechanism accounts for the robust coexpression of GATA-3 and ER alpha in human breast cancers. In addition, these results highlight the crucial role of GATA-3 for the response of ER alpha-positive breast cancers to estradiol. Moreover, they identify GATA-3 as a critical component of the master cell-type-specific transcriptional network including ER alpha and FoxA1 that dictates the phenotype of hormone-dependent breast cancer.
Collapse
Affiliation(s)
- Jérôme Eeckhoute
- Division of Molecular and Cellular Oncology, Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA 02115, USA
| | | | | | | | | | | |
Collapse
|
248
|
Martins RP, Platts AE, Krawetz SA. Tracking chromatin states using controlled DNase I treatment and real-time PCR. Cell Mol Biol Lett 2007; 12:545-55. [PMID: 17588221 PMCID: PMC6275682 DOI: 10.2478/s11658-007-0024-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2007] [Accepted: 05/22/2007] [Indexed: 11/22/2022] Open
Abstract
A novel approach to DNase I-sensitivity analysis was applied to examining genes of the spermatogenic pathway, reflective of the substantial morphological and genomic changes that occur during this program of differentiation. A new real-time PCR-based strategy that considers the nuances of response to nuclease treatment was used to assess the nuclease susceptibility through differentiation. Data analysis was automated with the K-Lab PCR algorithm, facilitating the rapid analysis of multiple samples while eliminating the subjectivity usually associated with Ct analyses. The utility of this assay and analytical paradigm as applied to nuclease-sensitivity mapping is presented.
Collapse
Affiliation(s)
- Rui Pires Martins
- Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, Michigan USA
| | - Adrian E. Platts
- Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, Michigan USA
- Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan USA
| | - Stephen A. Krawetz
- Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, Michigan USA
- Institute for Scientific Computing, Wayne State University, Detroit, Michigan USA
| |
Collapse
|
249
|
Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, Kuehn MS, Taylor CM, Neph S, Koch CM, Asthana S, Malhotra A, Adzhubei I, Greenbaum JA, Andrews RM, Flicek P, Boyle PJ, Cao H, Carter NP, Clelland GK, Davis S, Day N, Dhami P, Dillon SC, Dorschner MO, Fiegler H, Giresi PG, Goldy J, Hawrylycz M, Haydock A, Humbert R, James KD, Johnson BE, Johnson EM, Frum TT, Rosenzweig ER, Karnani N, Lee K, Lefebvre GC, Navas PA, Neri F, Parker SCJ, Sabo PJ, Sandstrom R, Shafer A, Vetrie D, Weaver M, Wilcox S, Yu M, Collins FS, Dekker J, Lieb JD, Tullius TD, Crawford GE, Sunyaev S, Noble WS, Dunham I, Denoeud F, Reymond A, Kapranov P, Rozowsky J, Zheng D, Castelo R, Frankish A, Harrow J, Ghosh S, Sandelin A, Hofacker IL, Baertsch R, Keefe D, Dike S, Cheng J, Hirsch HA, Sekinger EA, Lagarde J, Abril JF, Shahab A, Flamm C, Fried C, Hackermüller J, Hertel J, Lindemeyer M, Missal K, Tanzer A, Washietl S, Korbel J, Emanuelsson O, Pedersen JS, Holroyd N, Taylor R, Swarbreck D, Matthews N, Dickson MC, Thomas DJ, Weirauch MT, Gilbert J, Drenkow J, Bell I, Zhao X, Srinivasan KG, Sung WK, Ooi HS, Chiu KP, Foissac S, Alioto T, Brent M, Pachter L, Tress ML, Valencia A, Choo SW, Choo CY, Ucla C, Manzano C, Wyss C, Cheung E, Clark TG, Brown JB, Ganesh M, Patel S, Tammana H, Chrast J, Henrichsen CN, Kai C, Kawai J, Nagalakshmi U, Wu J, Lian Z, Lian J, Newburger P, Zhang X, Bickel P, Mattick JS, Carninci P, Hayashizaki Y, Weissman S, Hubbard T, Myers RM, Rogers J, Stadler PF, Lowe TM, Wei CL, Ruan Y, Struhl K, Gerstein M, Antonarakis SE, Fu Y, Green ED, Karaöz U, Siepel A, Taylor J, Liefer LA, Wetterstrand KA, Good PJ, Feingold EA, Guyer MS, Cooper GM, Asimenos G, Dewey CN, Hou M, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Huang H, Zhang NR, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Seringhaus M, Church D, Rosenbloom K, Kent WJ, Stone EA, Batzoglou S, Goldman N, Hardison RC, Haussler D, Miller W, Sidow A, Trinklein ND, Zhang ZD, Barrera L, Stuart R, King DC, Ameur A, Enroth S, Bieda MC, Kim J, Bhinge AA, Jiang N, Liu J, Yao F, Vega VB, Lee CWH, Ng P, Shahab A, Yang A, Moqtaderi Z, Zhu Z, Xu X, Squazzo S, Oberley MJ, Inman D, Singer MA, Richmond TA, Munn KJ, Rada-Iglesias A, Wallerman O, Komorowski J, Fowler JC, Couttet P, Bruce AW, Dovey OM, Ellis PD, Langford CF, Nix DA, Euskirchen G, Hartman S, Urban AE, Kraus P, Van Calcar S, Heintzman N, Kim TH, Wang K, Qu C, Hon G, Luna R, Glass CK, Rosenfeld MG, Aldred SF, Cooper SJ, Halees A, Lin JM, Shulha HP, Zhang X, Xu M, Haidar JNS, Yu Y, Ruan Y, Iyer VR, Green RD, Wadelius C, Farnham PJ, Ren B, Harte RA, Hinrichs AS, Trumbower H, Clawson H, Hillman-Jackson J, Zweig AS, Smith K, Thakkapallayil A, Barber G, Kuhn RM, Karolchik D, Armengol L, Bird CP, de Bakker PIW, Kern AD, Lopez-Bigas N, Martin JD, Stranger BE, Woodroffe A, Davydov E, Dimas A, Eyras E, Hallgrímsdóttir IB, Huppert J, Zody MC, Abecasis GR, Estivill X, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VVB, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Koriabine M, Nefedov M, Osoegawa K, Yoshinaga Y, Zhu B, de Jong PJ. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 2007; 447:799-816. [PMID: 17571346 PMCID: PMC2212820 DOI: 10.1038/nature05874] [Citation(s) in RCA: 3826] [Impact Index Per Article: 225.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
Collapse
|
250
|
Akopov SB, Chernov IP, Bulanenkova SS, Skvortsova YV, Vetchinova AS, Nikolaev LG. Methods for identification of epigenetic elements in mammalian long multigenic genome sequences. BIOCHEMISTRY (MOSCOW) 2007; 72:589-94. [PMID: 17630903 DOI: 10.1134/s0006297907060016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Epigenetic elements of the genome, i.e. elements that determine stably inherited changes in gene expression without changes in the genomic DNA sequence, are essential tools of genetic regulation in higher eukaryotes. The complete sequencing of the human and other genomes allowed studies to be started on positioning of these elements within long multigenic regions of the genome, which is a prerequisite for a comprehensive functional annotation of genomes. This mini-review considers some recent experimental approaches to the high-throughput identification and mapping of epigenetic elements of mammalian genomes, including the mapping of methylated CpG sites, open and closed chromatin regions, and DNase I hypersensitivity sites.
Collapse
Affiliation(s)
- S B Akopov
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia
| | | | | | | | | | | |
Collapse
|