51
|
Gusmão EG, Dieterich C, Costa IG. Prediction of Transcription Factor Binding Sites by Integrating DNase Digestion and Histone Modification. ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY 2012. [DOI: 10.1007/978-3-642-31927-3_10] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
52
|
Splinter E, de Laat W. The complex transcription regulatory landscape of our genome: control in three dimensions. EMBO J 2011; 30:4345-55. [PMID: 21952046 DOI: 10.1038/emboj.2011.344] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2011] [Accepted: 08/29/2011] [Indexed: 11/09/2022] Open
Abstract
The non-coding part of our genome contains sequence motifs that can control gene transcription over distance. Here, we discuss functional genomics studies that uncover and characterize these sequences across the mammalian genome. The picture emerging is of a genome being a complex regulatory landscape. We explore the principles that underlie the wiring of regulatory DNA sequences and genes. We argue transcriptional control over distance can be understood when considering action in the context of the folded genome. Genome topology is expected to differ between individual cells, and this may cause variegated expression. High-resolution three-dimensional genome topology maps, ultimately of single cells, are required to understand the cis-regulatory networks that underlie cellular transcriptomes.
Collapse
Affiliation(s)
- Erik Splinter
- Hubrecht Insitute-KNAW & University Medical Center Utrecht, Utrecht, The Netherlands
| | | |
Collapse
|
53
|
Abstract
In eukaryotes, all DNA-templated reactions occur in the context of chromatin. Nucleosome packaging inherently restricts DNA accessibility for regulatory proteins but also provides an opportunity to regulate DNA-based processes through modulating nucleosome positions and local chromatin structure. Recent advances in genome-scale methods are yielding increasingly detailed profiles of the genomic distribution of nucleosomes, their modifications and their modifiers. The picture now emerging is one in which the dynamic control of genome accessibility is governed by contributions from DNA sequence, ATP-dependent chromatin remodelling and nucleosome modifications. Here we discuss the interplay of these processes by reviewing our current understanding of how chromatin access contributes to the regulation of transcription, replication and repair.
Collapse
|
54
|
Shu W, Chen H, Bo X, Wang S. Genome-wide analysis of the relationships between DNaseI HS, histone modifications and gene expression reveals distinct modes of chromatin domains. Nucleic Acids Res 2011; 39:7428-43. [PMID: 21685456 PMCID: PMC3177195 DOI: 10.1093/nar/gkr443] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
To understand the molecular mechanisms that underlie global transcriptional regulation, it is essential to first identify all the transcriptional regulatory elements in the human genome. The advent of next-generation sequencing has provided a powerful platform for genome-wide analysis of different species and specific cell types; when combined with traditional techniques to identify regions of open chromatin [DNaseI hypersensitivity (DHS)] or specific binding locations of transcription factors [chromatin immunoprecipitation (ChIP)], and expression data from microarrays, we become uniquely poised to uncover the mysteries of the genome and its regulation. To this end, we have performed global meta-analysis of the relationship among data from DNaseI-seq, ChIP-seq and expression arrays, and found that specific correlations exist among regulatory elements and gene expression across different cell types. These correlations revealed four distinct modes of chromatin domain structure reflecting different functions: repressive, active, primed and bivalent. Furthermore, CCCTC-binding factor (CTCF) binding sites were identified based on these integrative data. Our findings uncovered a complex regulatory process involving by DNaseI HS sites and histone modifications, and suggest that these dynamic elements may be responsible for maintaining chromatin structure and integrity of the human genome. Our integrative approach provides an example by which data from diverse technology platforms may be integrated to provide more meaningful insights into global transcriptional regulation.
Collapse
Affiliation(s)
- Wenjie Shu
- Beijing Institute of Radiation Medicine, Beijing 100850, China
| | | | | | | |
Collapse
|
55
|
Genomic views of STAT function in CD4+ T helper cell differentiation. Nat Rev Immunol 2011; 11:239-50. [PMID: 21436836 DOI: 10.1038/nri2958] [Citation(s) in RCA: 226] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Signal transducer and activator of transcription (STAT) proteins are well known for their essential roles in transmitting cytokine-mediated signals and specifying T helper (T(H)) cell differentiation. Recent technological advances have revealed that STAT proteins have broad and complex roles in gene regulation and epigenetic control, including important roles as functional repressors. However, the challenge of how to link signal transduction, nucleosome biology and gene regulation remains. The relevance of tackling this problem is highlighted by genome-wide association studies that link cytokine signalling and STATs to various autoimmune or immune deficiency disorders. Defining exactly how extrinsic signals control the specification and plasticity of T(H) cells will provide important insights and perhaps therapeutic opportunities in these diseases.
Collapse
|
56
|
An NK and T cell enhancer lies 280 kilobase pairs 3' to the gata3 structural gene. Mol Cell Biol 2011; 31:1894-904. [PMID: 21383068 DOI: 10.1128/mcb.05065-11] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Transcription factor GATA-3 is vital for multiple stages of T cell and natural killer (NK) cell development, and yet the factors that directly regulate Gata3 transcription during hematopoiesis are only marginally defined. Here, we show that neither of the Gata3 promoters, previously implicated in its tissue-specific regulation, is alone capable of directing Gata3 transcription in T lymphocytes. In contrast, by surveying large swaths of DNA surrounding the Gata3 locus, we located a cis element that can recapitulate aspects of the Gata3-dependent T cell regulatory program in vivo. This element, located 280 kbp 3' to the structural gene, directs both T cell- and NK cell-specific transcription in vivo but harbors no other tissue activity. This novel, distant element regulates multiple major developmental stages that require GATA-3 activity.
Collapse
|
57
|
Dynamic BRG1 recruitment during T helper differentiation and activation reveals distal regulatory elements. Mol Cell Biol 2011; 31:1512-27. [PMID: 21262765 DOI: 10.1128/mcb.00920-10] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
T helper cell differentiation and activation require specific transcriptional programs accompanied by changes in chromatin structure. However, little is known about the chromatin remodeling enzymes responsible. We performed genome-wide analysis to determine the general principles of BRG1 binding, followed by analysis of specific genes to determine whether these general rules were typical of key T cell genes. We found that binding of the remodeling protein BRG1 was programmed by both lineage and activation signals. BRG1 binding positively correlated with gene activity at protein-coding and microRNA (miRNA) genes. BRG1 binding was found at promoters and distal regions, including both novel and previously validated distal regulatory elements. Distal BRG1 binding correlated with expression, and novel distal sites in the Gata3 locus possessed enhancer-like activity, suggesting a general role for BRG1 in long-distance gene regulation. BRG1 recruitment to distal sites in Gata3 was impaired in cells lacking STAT6, a transcription factor that regulates lineage-specific genes. Together, these findings suggest that BRG1 interprets both differentiation and activation signals and plays a causal role in gene regulation, chromatin structure, and cell fate. Our findings suggest that BRG1 binding is a useful marker for identifying active cis-regulatory regions in protein-coding and miRNA genes.
Collapse
|
58
|
Sakabe NJ, Nobrega MA. Genome-wide maps of transcription regulatory elements. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2010; 2:422-437. [PMID: 20836039 DOI: 10.1002/wsbm.70] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Expression of eukaryotic genes with complex spatial-temporal regulation during development requires finer regulation than that of genes with simpler expression patterns. Given the high degree of conservation of the developmental gene set across distantly related phylogenetic taxa, it is argued that evolutionary variation has occurred by tweaking regulation of expression of developmental genes, rather than by changes in genes themselves. Complex regulation is often achieved through the coordinated action of transcription regulatory elements spread across the genome up to tens of kilobases from the promoters of their target genes. Disruption of regulatory elements has been implicated in several diseases and studies showing associations between disease traits and nonprotein coding variation hint for a role of regulatory elements as cause of diseases. Therefore, the identification and mapping of regulatory elements in genome scale is crucial to understand how gene expression is regulated, how organisms evolve, and to identify sequence variation causing diseases. Previously developed experimental techniques have been adapted to identify regulatory elements in genome scale and high-throughput, allowing a global view of their biological roles. We review methods as chromatin immunoprecipitation, DNase I hypersensitivity, and computational approaches and how they have been employed to generate maps of histone modifications, open chromatin, nucleosome positioning, and transcription factor binding regions in whole mammalian genomes. Given the importance of non-promoter elements in gene regulation and the recent explosion in the number of studies devoted to them, we focus on these elements and discuss the insights on gene regulation being obtained by these studies.
Collapse
Affiliation(s)
- Noboru J Sakabe
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Marcelo A Nobrega
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| |
Collapse
|
59
|
Genomic profiling of HMGN1 reveals an association with chromatin at regulatory regions. Mol Cell Biol 2010; 31:700-9. [PMID: 21173166 DOI: 10.1128/mcb.00740-10] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The interaction of architectural proteins such as the linker histone H1 and high-mobility-group (HMG) proteins with nucleosomes leads to changes in chromatin structure and histone modifications and alters the cellular transcription profile. The interaction of HMG proteins with chromatin is dynamic. However, it is not clear whether the proteins are constantly and randomly redistributed among all the nucleosomes or whether they preferentially associate with, and turn over at, specific regions in chromatin. To address this question, we examined the genome-wide distribution of the nucleosome binding protein HMGN1 and compared it to that of regulatory chromatin marks. We find that HMGN1 is not randomly distributed throughout the genome. Instead, the protein preferentially localizes to DNase I hypersensitive (HS) sites, promoters, functional enhancers, and transcription factor binding sites. Our results suggest that HMGN1 is part of the cellular machinery that modulates transcriptional fidelity by generating, maintaining, or preferentially interacting with specific sites in chromatin.
Collapse
|
60
|
Stitzel ML, Sethupathy P, Pearson DS, Chines PS, Song L, Erdos MR, Welch R, Parker SCJ, Boyle AP, Scott LJ, Margulies EH, Boehnke M, Furey TS, Crawford GE, Collins FS. Global epigenomic analysis of primary human pancreatic islets provides insights into type 2 diabetes susceptibility loci. Cell Metab 2010; 12:443-55. [PMID: 21035756 PMCID: PMC3026436 DOI: 10.1016/j.cmet.2010.09.012] [Citation(s) in RCA: 144] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/07/2010] [Revised: 07/22/2010] [Accepted: 08/26/2010] [Indexed: 01/17/2023]
Abstract
Identifying cis-regulatory elements is important to understanding how human pancreatic islets modulate gene expression in physiologic or pathophysiologic (e.g., diabetic) conditions. We conducted genome-wide analysis of DNase I hypersensitive sites, histone H3 lysine methylation modifications (K4me1, K4me3, K79me2), and CCCTC factor (CTCF) binding in human islets. This identified ∼18,000 putative promoters (several hundred unannotated and islet-active). Surprisingly, active promoter modifications were absent at genes encoding islet-specific hormones, suggesting a distinct regulatory mechanism. Of 34,039 distal (nonpromoter) regulatory elements, 47% are islet unique and 22% are CTCF bound. In the 18 type 2 diabetes (T2D)-associated loci, we identified 118 putative regulatory elements and confirmed enhancer activity for 12 of 33 tested. Among six regulatory elements harboring T2D-associated variants, two exhibit significant allele-specific differences in activity. These findings present a global snapshot of the human islet epigenome and should provide functional context for noncoding variants emerging from genetic studies of T2D and other islet disorders.
Collapse
Affiliation(s)
- Michael L Stitzel
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
61
|
Pekowska A, Benoukraf T, Ferrier P, Spicuglia S. A unique H3K4me2 profile marks tissue-specific gene regulation. Genome Res 2010; 20:1493-502. [PMID: 20841431 DOI: 10.1101/gr.109389.110] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Characterization of the epigenetic landscape fundamentally contributes toward deciphering the regulatory mechanisms that govern gene expression. However, despite an increasing flow of newly generated data, no clear pattern of chromatin modifications has so far been linked to specific modes of transcriptional regulation. Here, we used high-throughput genomic data from CD4(+) T lymphocytes to provide a comprehensive analysis of histone H3 lysine 4 dimethylation (H3K4me2) enrichment in genomic regions surrounding transcriptional start sites (TSSs). We discovered that a subgroup of genes linked to T cell functions displayed high levels of H3K4me2 within their gene body, in sharp contrast to the TSS-centered profile typical of housekeeping genes. Analysis of additional chromatin modifications and DNase I hypersensitive sites (DHSS) revealed a combinatorial chromatin signature characteristic of this subgroup. We propose that this epigenetic feature reflects the activity of an as yet unrecognized, intragenic cis-regulatory platform dedicated to refining tissue-specificity in gene expression.
Collapse
Affiliation(s)
- Aleksandra Pekowska
- Centre d'Immunologie de Marseille-Luminy, Université Aix Marseille, Marseille 13009, France
| | | | | | | |
Collapse
|
62
|
Naumova N, Dekker J. Integrating one-dimensional and three-dimensional maps of genomes. J Cell Sci 2010; 123:1979-88. [PMID: 20519580 DOI: 10.1242/jcs.051631] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Genomes exist in vivo as complex physical structures, and their functional output (i.e. the gene expression profile of a cell) is related to their spatial organization inside the nucleus as well as to local chromatin status. Chromatin modifications and chromosome conformation are distinct in different tissues and cell types, which corresponds closely with the diversity in gene-expression patterns found in different tissues of the body. The biological processes and mechanisms driving these general correlations are currently the topic of intense study. An emerging theme is that genome compartmentalization - both along the linear length of chromosomes, and in three dimensions by the spatial colocalization of chromatin domains and genomic loci from across the genome - is a crucial parameter in regulating genome expression. In this Commentary, we propose that a full understanding of genome regulation requires integrating three different types of data: first, one-dimensional data regarding the state of local chromatin - such as patterns of protein binding along chromosomes; second, three-dimensional data that describe the population-averaged folding of chromatin inside cells and; third, single-cell observations of three-dimensional spatial colocalization of genetic loci and trans factors that reveal information about their dynamics and frequency of colocalization.
Collapse
Affiliation(s)
- Natalia Naumova
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605-0103, USA
| | | |
Collapse
|
63
|
Paquet Y, Anderson A. Sequence composition similarities with the 7SL RNA are highly predictive of functional genomic features. Nucleic Acids Res 2010; 38:4907-16. [PMID: 20392819 PMCID: PMC2926601 DOI: 10.1093/nar/gkq234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Transposable elements derived from the 7SL RNA gene, such as Alu elements in primates, have had remarkable success in several mammalian lineages. The results presented here show a broad spectrum of functions for genomic segments that display sequence composition similarities with the 7SL RNA gene. Using thoroughly documented loci, we report that DNaseI-hypersensitive sites can be singled out in large genomic sequences by an assessment of sequence composition similarities with the 7SL RNA gene. We apply a root word frequency approach to illustrate a distinctive relationship between the sequence of the 7SL RNA gene and several classes of functional genomic features that are not presumed to be of transposable origin. Transposable elements that show noticeable similarities with the 7SL sequence include Alu sequences, as expected, but also long terminal repeats and the 5′-untranslated regions of long interspersed repetitive elements. In sequences masked for repeated elements, we find, when using the 7SL RNA gene as query sequence, distinctive similarities with promoters, exons and distal gene regulatory regions. The latter being the most notoriously difficult to detect, this approach may be useful for finding genomic segments that have regulatory functions and that may have escaped detection by existing methods.
Collapse
Affiliation(s)
- Yanick Paquet
- Centre de recherche en cancérologie de l’Université Laval, L’Hôtel-Dieu de Québec, Centre hospitalier universitaire de Québec, Québec G1R 2J6 and Département de biologie, Université Laval, Québec G1K 7P4, Canada
| | - Alan Anderson
- Centre de recherche en cancérologie de l’Université Laval, L’Hôtel-Dieu de Québec, Centre hospitalier universitaire de Québec, Québec G1R 2J6 and Département de biologie, Université Laval, Québec G1K 7P4, Canada
- *To whom correspondence should be addressed. Tel: + 418 691 5281; Fax: +418 691 5439;
| |
Collapse
|
64
|
Lee JS, Kim MR, Kim NS, Kim YS, Yang JM, Cho AY, Lee Y, Kim CD, Lee JH. Expression profiling of calcium induced genes in cultured human keratinocytes. J Korean Med Sci 2010; 25:619-25. [PMID: 20358008 PMCID: PMC2844606 DOI: 10.3346/jkms.2010.25.4.619] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/05/2009] [Accepted: 07/27/2009] [Indexed: 11/20/2022] Open
Abstract
Terminal differentiation of skin keratinocytes is a vertically directed multi-step process that is tightly controlled by the sequential expression of a variety of genes. To examine the gene expression profile in calcium-induced keratinocyte differentiation, we constructed a normalized cDNA library using mRNA isolated from these calcium-treated keratinocytes. After sequencing about 10,000 clones, we were able to obtain 4,104 independent genes. They consisted of 3,699 annotated genes and 405 expressed sequence tags (ESTs). Some were the genes involved in constituting epidermal structures and others were unknown genes that are probably associated with keratinocytes. In particular, we were able to identify genes located at the chromosome 1q21, the locus for the epidermal differentiation complex, and 19q13.1, another probable locus for epidermal differentiation-related gene clusters. One EST located at the chromosome 19q13.1 showed increased expression by calcium treatment, suggesting a novel candidate gene relevant to keratinocyte differentiation. These results demonstrate the complexity of the transcriptional profile of keratinocytes, providing important clues on which to base further investigations of the molecular events underlying keratinocyte differentiation.
Collapse
Affiliation(s)
- Jung-Suk Lee
- Department of Dermatology and Research Institute for Medical Sciences, School of Medicine, Chungnam National University, Daejeon, Korea
| | - Mi-Rang Kim
- Human Genomics Laboratory, Genome Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | - Nam-Soon Kim
- Human Genomics Laboratory, Genome Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | - Yong Sung Kim
- Human Genomics Laboratory, Genome Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | - Jun-Mo Yang
- Department of Dermatology, Sungkyunkwan University School of Medicine, Samsung Medical Center, Seoul, Korea
| | - Ah Young Cho
- Department of Dermatology and Research Institute for Medical Sciences, School of Medicine, Chungnam National University, Daejeon, Korea
| | - Young Lee
- Department of Dermatology and Research Institute for Medical Sciences, School of Medicine, Chungnam National University, Daejeon, Korea
| | - Chang Deok Kim
- Department of Dermatology and Research Institute for Medical Sciences, School of Medicine, Chungnam National University, Daejeon, Korea
| | - Jeung-Hoon Lee
- Department of Dermatology and Research Institute for Medical Sciences, School of Medicine, Chungnam National University, Daejeon, Korea
| |
Collapse
|
65
|
de Guzman Strong C, Conlan S, Deming CB, Cheng J, Sears KE, Segre JA. A milieu of regulatory elements in the epidermal differentiation complex syntenic block: implications for atopic dermatitis and psoriasis. Hum Mol Genet 2010; 19:1453-60. [PMID: 20089530 DOI: 10.1093/hmg/ddq019] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Two common inflammatory skin disorders with impaired barrier, atopic dermatitis (AD) and psoriasis, share distinct genetic linkage to the Epidermal Differentiation Complex (EDC) locus on 1q21. The EDC is comprised of tandemly arrayed gene families encoding proteins involved in skin cell differentiation. Discovery of semi-dominant mutations in filaggrin (FLG) associated with AD and a copy number variation within the LCE genes associated with psoriasis provide compelling evidence for the role of EDC genes in the pathogenesis of these diseases. To date, little is known about the potentially complex regulatory landscape within the EDC. Here, we report a computational approach to identify conserved non-coding elements (CNEs) in the EDC queried for regulatory function. Coordinate expression of EDC genes during mouse embryonic skin development and a striking degree of synteny and linearity in the EDC locus across a wide range of mammalian (placental and marsupial) genomes suggests an evolutionary conserved regulatory milieu in the EDC. CNEs identified by comparative genomics exhibit dynamic regulatory activity (enhancer or repressor) in differentiating or proliferating conditions. We further demonstrate epidermal-specific, developmental in vivo enhancer activities (DNaseI and transgenic mouse assays) in CNEs, including one within the psoriasis-associated deletion, LCE3C_LCE3B-del. Together, our multidisciplinary study features a network of regulatory elements coordinating developmental EDC gene expression as an unexplored resource for genetic variants in skin diseases.
Collapse
Affiliation(s)
- Cristina de Guzman Strong
- Genetics and Molecular Biology Branch, National Human Genome Research Institute, NIH, Bethesda, MD, USA
| | | | | | | | | | | |
Collapse
|
66
|
Boyle AP, Furey TS. High-resolution mapping studies of chromatin and gene regulatory elements. Epigenomics 2009; 1:319-329. [PMID: 20514362 DOI: 10.2217/epi.09.29] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Microarray and high-throughput sequencing technologies have enabled the development of comprehensive assays to identify locations of particular chromatin structures and regulatory elements. It is now possible to create genome-wide maps of DNA methylation, trans-factor binding sites, histone variants and histone tail modifications, nucleosome positions, regions of open chromatin, and chromosome locations and interactions. This review provides a summary of these new assays that are changing the way in which molecular biology research is being performed. While the generation of large amounts of data from these experiments is becoming increasingly easier, the development of corresponding analysis methods has progressed more slowly. It will likely be years before the full extent of the information contained in these data is fully appreciated.
Collapse
|
67
|
Shibata Y, Crawford GE. Mapping regulatory elements by DNaseI hypersensitivity chip (DNase-Chip). Methods Mol Biol 2009; 556:177-90. [PMID: 19488879 DOI: 10.1007/978-1-60327-192-9_13] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Historically, the simplest method to robustly identify active gene regulatory elements has been enzymatic digestion of nuclear DNA by nucleases such as DNaseI. Regions of extreme chromatin accessibility to DNaseI, commonly known as DNaseI hypersensitive sites, have been repeatedly shown to be markers for all types of active cis-acting regulatory elements, including promoters, enhancers, silencers, insulators, and locus control regions. However, the original classical method, which for over 25 years relied on Southern blot, was limited to studying only small regions of the genome. Here we describe the detailed protocol for DNase-chip, a high-throughput method that allows for a targeted or genome-wide identification of cis-acting gene regulatory elements.
Collapse
Affiliation(s)
- Yoichiro Shibata
- Institute for Genome Sciences & Policy, Duke University, Durham, NC, USA
| | | |
Collapse
|
68
|
Abstract
The CFTR (cystic fibrosis transmembrane conductance regulator) gene, which when mutated causes cystic fibrosis, encompasses nearly 200 kb of genomic DNA at chromosome 7q31.2. It is flanked by two genes ASZ1 [ankyrin repeat, SAM (sterile alpha-motif) and basic leucine zipper] and CTTNBP2 (cortactin-binding protein 2), which have very different expression profiles. CFTR is expressed primarily in specialized epithelial cells, whereas ASZ1 is transcribed exclusively in the testis and ovary, and CTTNBP2 is highly expressed in the brain, kidney and pancreas, with lower levels of expression in other tissues. Despite its highly regulated pattern of expression, the promoter of the CFTR gene apparently lacks the necessary elements to achieve this. We previously suggested that cis-acting regulatory elements elsewhere in the locus, both flanking the gene and within introns, were required to co-ordinate regulated, tissue-specific expression of CFTR. We identified a number of crucial elements, including enhancer-blocking insulators flanking the locus, intronic tissue-specific enhancers and also characterized some of the interacting proteins. We recently employed a high-resolution method of mapping DHS (DNase I-hypersensitive sites) using tiled microarrays. DHS are often associated with regulatory elements and use of this technique generated cell-specific profiles of potential regulatory sequences in primary cells and cell lines. We characterized a set of cis-acting elements within the CFTR locus and demonstrated direct physical interaction between them and the CFTR promoter, by chromosome conformation capture (3C). These results provide the first insight into the three-dimensional structure of the active CFTR gene.
Collapse
|
69
|
Ng CEL, Ito Y, Osato M. Retroviral integration sites (RIS) mark cis-regulatory elements. Crit Rev Oncol Hematol 2009; 71:1-11. [DOI: 10.1016/j.critrevonc.2008.10.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2008] [Revised: 09/29/2008] [Accepted: 10/15/2008] [Indexed: 12/15/2022] Open
|
70
|
Gargiulo G, Levy S, Bucci G, Romanenghi M, Fornasari L, Beeson KY, Goldberg SM, Cesaroni M, Ballarini M, Santoro F, Bezman N, Frigè G, Gregory PD, Holmes MC, Strausberg RL, Pelicci PG, Urnov FD, Minucci S. NA-Seq: a discovery tool for the analysis of chromatin structure and dynamics during differentiation. Dev Cell 2009; 16:466-81. [PMID: 19289091 DOI: 10.1016/j.devcel.2009.02.002] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2008] [Revised: 12/19/2008] [Accepted: 02/06/2009] [Indexed: 12/19/2022]
Abstract
It is well established that epigenetic modulation of genome accessibility in chromatin occurs during biological processes. Here we describe a method based on restriction enzymes and next-generation sequencing for identifying accessible DNA elements using a small amount of starting material, and use it to examine myeloid differentiation of primary human CD34+ cells. The accessibility of several classes of cis-regulatory elements was a predictive marker of in vivo DNA binding by transcription factors, and was associated with distinct patterns of histone posttranslational modifications. We also mapped large chromosomal domains with differential accessibility in progenitors and maturing cells. Accessibility became restricted during differentiation, correlating with a decreased number of expressed genes and loss of regulatory potential. Our data suggest that a permissive chromatin structure in multipotent cells is progressively and selectively closed during differentiation, and illustrate the use of our method for the identification of functional cis-regulatory elements.
Collapse
Affiliation(s)
- Gaetano Gargiulo
- Department of Experimental Oncology, IFOM-IEO Campus, European Institute of Oncology (IEO), Via Ripamonti 435, 20141 Milan, Italy
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
71
|
Brady T, Lee YN, Ronen K, Malani N, Berry CC, Bieniasz PD, Bushman FD. Integration target site selection by a resurrected human endogenous retrovirus. Genes Dev 2009; 23:633-42. [PMID: 19270161 DOI: 10.1101/gad.1762309] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
At least 8% of the human genome was formed by integration of retroviral DNA sequences. Here we analyze the forces directing the accumulation of human endogenous retroviruses (HERVs) by comparing de novo HERV integration targeting with the distribution of fixed HERV elements in the human genome. All known genomic HERVs are inactive due to mutation, but we were able to study integration targeting using a reconstituted consensus HERV-K (designated HERV-K(Con)). We found that HERV-K(Con) integrated preferentially in transcription units, in gene-rich regions, and near features associated with active transcription units and associated regulatory regions. In contrast, genomic HERV-K proviruses are found preferentially outside transcription units. The minority of genomic HERVKs present inside transcription units are in opposite transcriptional orientation relative to the host gene, the orientation predicted to be minimally disruptive to host mRNA synthesis, but de novo HERV-K(Con) integration within transcription units showed no orientation bias. We also found that the youngest HERV-K elements in the human genome showed a distribution intermediate between de novo HERV-K(Con) integration sites and older fixed HERV-Ks. These findings indicate that accumulation of HERVs in the human germline is a two-step process: integration targeting biases direct initial accumulation, then purifying selection leads to loss of proviruses disrupting gene function.
Collapse
Affiliation(s)
- Troy Brady
- Department of Microbiology, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA
| | | | | | | | | | | | | |
Collapse
|
72
|
Basheer A, Berger H, Reyes-Dominguez Y, Gorfer M, Strauss J. A library-based method to rapidly analyse chromatin accessibility at multiple genomic regions. Nucleic Acids Res 2009; 37:e42. [PMID: 19251760 PMCID: PMC2665225 DOI: 10.1093/nar/gkp037] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Traditional chromatin analysis methods only test one locus at the time or use different templates for each locus, making a standardized analysis of large genomic regions or many co-regulated genes at different loci a difficult task. On the other hand, genome-wide high-resolution mapping of chromatin accessibility employing massive parallel sequencing platforms generates an extensive data set laborious to analyse and is a cost-intensive method, only applicable to the analysis of a limited set of biological samples. To close this gap between the traditional and the high-throughput procedures we have developed a method in which a condition-specific, genome-wide chromatin fragment library is produced and then used for locus-specific DNA fragment analysis. To validate the method, we used, as a test locus, the well-studied promoter of the divergently transcribed niiA and niaD genes coding for nitrate assimilation enzymes in Aspergillus. Additionally, we have used the condition-specific libraries to study nucleosomal positioning at two different loci, the promoters of the general nitrogen regulator areA and the regulator of secondary metabolism, aflR.
Collapse
Affiliation(s)
- Asjad Basheer
- Austrian Research Centers, Department of Applied Genetics and Cell Biology, BOKU University Vienna, Vienna, Austria
| | | | | | | | | |
Collapse
|
73
|
Integration site preference of xenotropic murine leukemia virus-related virus, a new human retrovirus associated with prostate cancer. J Virol 2008; 82:9964-77. [PMID: 18684813 DOI: 10.1128/jvi.01299-08] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Xenotropic murine leukemia virus-related virus (XMRV) is a new human gammaretrovirus identified in prostate cancer tissue from patients homozygous for a reduced-activity variant of the antiviral enzyme RNase L. Neither a casual relationship between XMRV infection and prostate cancer nor a mechanism of tumorigenesis has been established. To determine the integration site preferences of XMRV and the potential risk of proviral insertional mutagenesis, we carried out a genome-wide analysis of viral integration sites in the prostate cell line DU145 after an acute XMRV infection and compared the integration site pattern of XMRV with those found for murine leukemia virus and two human retroviruses, human immunodeficiency virus type 1 and human T-cell leukemia virus type 1. Among all retroviruses analyzed, XMRV has the strongest preference for transcription start sites, CpG islands, DNase-hypersensitive sites, and gene-dense regions; all are features frequently associated with structurally open transcription regulatory regions of a chromosome. Analyses of XMRV integration sites in tissues from prostate cancer patients found a similar preference for the aforementioned chromosomal features. Additionally, XMRV integration sites in cancer tissues were associated with cancer breakpoints, common fragile sites, microRNA, and cancer-related genes, suggesting a selection process that favors certain chromosomal integration sites. In both acutely infected cells and cancer tissues, no common integration site was detected within or near proto-oncogenes or tumor suppressor genes. These results are consistent with a model in which XMRV may contribute to tumorigenicity via a paracrine mechanism.
Collapse
|
74
|
Nikolaev LG, Akopov SB, Chernov IP, Sverdlov ED. Maps of cis-Regulatory Nodes in Megabase Long Genome Segments are an Inevitable Intermediate Step Toward Whole Genome Functional Mapping. Curr Genomics 2008; 8:137-49. [PMID: 18660850 DOI: 10.2174/138920207780368178] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2007] [Revised: 02/22/2007] [Accepted: 02/27/2007] [Indexed: 11/22/2022] Open
Abstract
The availability of complete human and other metazoan genome sequences has greatly facilitated positioning and analysis of various genomic functional elements, with initial emphasis on coding sequences. However, complete functional maps of sequenced eukaryotic genomes should include also positions of all non-coding regulatory elements. Unfortunately, experimental data on genomic positions of a multitude of regulatory sequences, such as enhancers, silencers, insulators, transcription terminators, and replication origins are very limited, especially at the whole genome level. Since most genomic regulatory elements (e.g. enhancers) are generally gene-, tissue-, or cell-specific, the prediction of these elements by computational methods is difficult and often ambiguous. Therefore, the development of high-throughput experimental approaches for identifying and mapping genomic functional elements is highly desirable. At the same time, the creation of whole-genome map of hundreds of thousands of regulatory elements in several hundreds of tissue/cell types is presently far beyond our capabilities. A possible alternative for the whole genome approach is to concentrate efforts on individual genomic segments and then to integrate the data obtained into a whole genome functional map. Moreover, the maps of polygenic fragments with functional cis-regulatory elements would provide valuable data on complex regulatory systems, including their variability and evolution. Here, we reviewed experimental approaches to the realization of these ideas, including our own developments of experimental techniques for selection of cis-acting functionally active DNA fragments from large (megabase-sized) segments of mammalian genomes.
Collapse
Affiliation(s)
- Lev G Nikolaev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10 Miklukho-Maklaya,117997, Moscow, Russia
| | | | | | | |
Collapse
|
75
|
Taylor JM, Wicks K, Vandiedonck C, Knight JC. Chromatin profiling across the human tumour necrosis factor gene locus reveals a complex, cell type-specific landscape with novel regulatory elements. Nucleic Acids Res 2008; 36:4845-62. [PMID: 18653526 PMCID: PMC2528168 DOI: 10.1093/nar/gkn444] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The TNF locus on chromosome 6p21 encodes a family of proteins with key roles in the immune response whose dysregulation leads to severe disease. Transcriptional regulation is important, with cell type and stimulus-specific enhancer complexes involving the proximal TNF promoter. We show how quantitative chromatin profiling across a 34 kb region spanning the TNF locus has allowed us to identify a number of novel DNase hypersensitive sites and characterize more distant regulatory elements. We demonstrate DNase hypersensitive sites corresponding to the lymphotoxin alpha (LTA) and tumour necrosis factor (TNF) promoter regions, a CpG island in exon 4 of lymphotoxin beta (LTB), the 3′ end of nuclear factor of kappa light polypeptide gene enhancer in B-cells inhibitor-like 1 (NFKBIL1) and 3.4 kb upstream of LTA. These sites co-localize to highly conserved DNA sequences and show evidence of cell type specificity when lymphoblastoid, Jurkat, U937, HeLa and HEK293T cell lines are analysed using Southern blotting. For Jurkat T cells, we define histone modifications across the locus. Peaks of acetylated histone H3 and H4, together with tri-methyl K4 of histone H3, correspond to hypersensitive sites, notably in exon 4 of LTB. We provide evidence of a functional role for an intergenic DNase I hypersensitive site distal to LTA in Jurkat cells based on reporter gene analysis, with evidence of recruitment of upstream stimulatory factors (USF) transcription factors.
Collapse
Affiliation(s)
- Jennifer M Taylor
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | | | | | | |
Collapse
|
76
|
Gómez M, Antequera F. Overreplication of short DNA regions during S phase in human cells. Genes Dev 2008; 22:375-85. [PMID: 18245449 DOI: 10.1101/gad.445608] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
DNA replication origins (ORI) are regulatory regions from which the genome is replicated once every cell cycle. A widely used method for their identification in mammalian chromosomes relies on quantitative PCR of DNA nascent strands across candidate regions. We developed a new high-resolution PCR strategy to localize ORIs directly on total unfractionated human DNA. The increase in sensitivity provided by this approach has revealed that a short region of approximately 200-base-pair overlapping well-characterized replication origins undergoes several rounds of replication, coinciding with their specific time of activation during S phase. This process generates a population of discrete dsDNA fragments detectable as free molecules in preparations of total DNA in normally proliferating cells. Overreplicated regions have precise boundaries at the edge of the nucleosome-free gap that encompasses the transcription initiation sites of CpG island promoters. By itself, active transcription does not induce overreplication but does stimulate it at ORIs associated with promoters. The coincidence in time and space between the overproduction of short DNA fragments and ORI activity predicts the precise localization of thousands of ORIs in the human genome and uncovers a previously unnoticed step in the initiation of DNA replication.
Collapse
Affiliation(s)
- María Gómez
- Instituto de Microbiología Bioquímica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, Edificio Departamental, Campus Miguel de Unamuno, 37007-Salamanca, Spain.
| | | |
Collapse
|
77
|
Maston GA, Evans SK, Green MR. Transcriptional regulatory elements in the human genome. Annu Rev Genomics Hum Genet 2008; 7:29-59. [PMID: 16719718 DOI: 10.1146/annurev.genom.7.080505.115623] [Citation(s) in RCA: 546] [Impact Index Per Article: 34.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The faithful execution of biological processes requires a precise and carefully orchestrated set of steps that depend on the proper spatial and temporal expression of genes. Here we review the various classes of transcriptional regulatory elements (core promoters, proximal promoters, distal enhancers, silencers, insulators/boundary elements, and locus control regions) and the molecular machinery (general transcription factors, activators, and coactivators) that interacts with the regulatory elements to mediate precisely controlled patterns of gene expression. The biological importance of transcriptional regulation is highlighted by examples of how alterations in these transcriptional components can lead to disease. Finally, we discuss the methods currently used to identify transcriptional regulatory elements, and the ability of these methods to be scaled up for the purpose of annotating the entire human genome.
Collapse
Affiliation(s)
- Glenn A Maston
- Howard Hughes Medical Institute, Programs in Gene Function and Expression and Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts 01605, USA.
| | | | | |
Collapse
|
78
|
Lähdesmäki H, Rust AG, Shmulevich I. Probabilistic inference of transcription factor binding from multiple data sources. PLoS One 2008; 3:e1820. [PMID: 18364997 PMCID: PMC2268002 DOI: 10.1371/journal.pone.0001820] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2007] [Accepted: 02/04/2008] [Indexed: 11/21/2022] Open
Abstract
An important problem in molecular biology is to build a complete understanding of transcriptional regulatory processes in the cell. We have developed a flexible, probabilistic framework to predict TF binding from multiple data sources that differs from the standard hypothesis testing (scanning) methods in several ways. Our probabilistic modeling framework estimates the probability of binding and, thus, naturally reflects our degree of belief in binding. Probabilistic modeling also allows for easy and systematic integration of our binding predictions into other probabilistic modeling methods, such as expression-based gene network inference. The method answers the question of whether the whole analyzed promoter has a binding site, but can also be extended to estimate the binding probability at each nucleotide position. Further, we introduce an extension to model combinatorial regulation by several TFs. Most importantly, the proposed methods can make principled probabilistic inference from multiple evidence sources, such as, multiple statistical models (motifs) of the TFs, evolutionary conservation, regulatory potential, CpG islands, nucleosome positioning, DNase hypersensitive sites, ChIP-chip binding segments and other (prior) sequence-based biological knowledge. We developed both a likelihood and a Bayesian method, where the latter is implemented with a Markov chain Monte Carlo algorithm. Results on a carefully constructed test set from the mouse genome demonstrate that principled data fusion can significantly improve the performance of TF binding prediction methods. We also applied the probabilistic modeling framework to all promoters in the mouse genome and the results indicate a sparse connectivity between transcriptional regulators and their target promoters. To facilitate analysis of other sequences and additional data, we have developed an on-line web tool, ProbTF, which implements our probabilistic TF binding prediction method using multiple data sources. Test data set, a web tool, source codes and supplementary data are available at: http://www.probtf.org.
Collapse
Affiliation(s)
- Harri Lähdesmäki
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - Alistair G. Rust
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - Ilya Shmulevich
- Institute for Systems Biology, Seattle, Washington, United States of America
| |
Collapse
|
79
|
Boyle AP, Davis S, Shulha HP, Meltzer P, Margulies EH, Weng Z, Furey TS, Crawford GE. High-resolution mapping and characterization of open chromatin across the genome. Cell 2008; 132:311-22. [PMID: 18243105 DOI: 10.1016/j.cell.2007.12.014] [Citation(s) in RCA: 1003] [Impact Index Per Article: 62.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2007] [Revised: 10/22/2007] [Accepted: 12/04/2007] [Indexed: 12/28/2022]
Abstract
Mapping DNase I hypersensitive (HS) sites is an accurate method of identifying the location of genetic regulatory elements, including promoters, enhancers, silencers, insulators, and locus control regions. We employed high-throughput sequencing and whole-genome tiled array strategies to identify DNase I HS sites within human primary CD4+ T cells. Combining these two technologies, we have created a comprehensive and accurate genome-wide open chromatin map. Surprisingly, only 16%-21% of the identified 94,925 DNase I HS sites are found in promoters or first exons of known genes, but nearly half of the most open sites are in these regions. In conjunction with expression, motif, and chromatin immunoprecipitation data, we find evidence of cell-type-specific characteristics, including the ability to identify transcription start sites and locations of different chromatin marks utilized in these cells. In addition, and unexpectedly, our analyses have uncovered detailed features of nucleosome structure.
Collapse
Affiliation(s)
- Alan P Boyle
- Institute for Genome Sciences & Policy, Duke University, Durham, NC 27708, USA
| | | | | | | | | | | | | | | |
Collapse
|
80
|
Noyes MB, Meng X, Wakabayashi A, Sinha S, Brodsky MH, Wolfe SA. A systematic characterization of factors that regulate Drosophila segmentation via a bacterial one-hybrid system. Nucleic Acids Res 2008; 36:2547-60. [PMID: 18332042 PMCID: PMC2377422 DOI: 10.1093/nar/gkn048] [Citation(s) in RCA: 139] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Specificity data for groups of transcription factors (TFs) in a common regulatory network can be used to computationally identify the location of cis-regulatory modules in a genome. The primary limitation for this type of analysis is the paucity of specificity data that is available for the majority of TFs. We describe an omega-based bacterial one-hybrid system that provides a rapid method for characterizing DNA-binding specificities on a genome-wide scale. Using this system, 35 members of the Drosophila melanogaster segmentation network have been characterized, including representative members of all of the major classes of DNA-binding domains. A suite of web-based tools was created that uses this binding site dataset and phylogenetic comparisons to identify cis-regulatory modules throughout the fly genome. These tools allow specificities for any combination of factors to be used to perform rapid local or genome-wide searches for cis-regulatory modules. The utility of these factor specificities and tools is demonstrated on the well-characterized segmentation network. By incorporating specificity data on an additional 66 factors that we have characterized, our tools utilize ∼14% of the predicted factors within the fly genome and provide an important new community resource for the identification of cis-regulatory modules.
Collapse
Affiliation(s)
- Marcus B Noyes
- Program in Gene Function and Expression, Department of Biochemistry and Molecular Pharmacology, Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | | | | | | | | | | |
Collapse
|
81
|
Korb M, Rust AG, Thorsson V, Battail C, Li B, Hwang D, Kennedy KA, Roach JC, Rosenberger CM, Gilchrist M, Zak D, Johnson C, Marzolf B, Aderem A, Shmulevich I, Bolouri H. The Innate Immune Database (IIDB). BMC Immunol 2008; 9:7. [PMID: 18321385 PMCID: PMC2268913 DOI: 10.1186/1471-2172-9-7] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2007] [Accepted: 03/05/2008] [Indexed: 02/04/2023] Open
Abstract
Background As part of a National Institute of Allergy and Infectious Diseases funded collaborative project, we have performed over 150 microarray experiments measuring the response of C57/BL6 mouse bone marrow macrophages to toll-like receptor stimuli. These microarray expression profiles are available freely from our project web site . Here, we report the development of a database of computationally predicted transcription factor binding sites and related genomic features for a set of over 2000 murine immune genes of interest. Our database, which includes microarray co-expression clusters and a host of web-based query, analysis and visualization facilities, is available freely via the internet. It provides a broad resource to the research community, and a stepping stone towards the delineation of the network of transcriptional regulatory interactions underlying the integrated response of macrophages to pathogens. Description We constructed a database indexed on genes and annotations of the immediate surrounding genomic regions. To facilitate both gene-specific and systems biology oriented research, our database provides the means to analyze individual genes or an entire genomic locus. Although our focus to-date has been on mammalian toll-like receptor signaling pathways, our database structure is not limited to this subject, and is intended to be broadly applicable to immunology. By focusing on selected immune-active genes, we were able to perform computationally intensive expression and sequence analyses that would currently be prohibitive if applied to the entire genome. Using six complementary computational algorithms and methodologies, we identified transcription factor binding sites based on the Position Weight Matrices available in TRANSFAC. For one example transcription factor (ATF3) for which experimental data is available, over 50% of our predicted binding sites coincide with genome-wide chromatin immnuopreciptation (ChIP-chip) results. Our database can be interrogated via a web interface. Genomic annotations and binding site predictions can be automatically viewed with a customized version of the Argo genome browser. Conclusion We present the Innate Immune Database (IIDB) as a community resource for immunologists interested in gene regulatory systems underlying innate responses to pathogens. The database website can be freely accessed at .
Collapse
Affiliation(s)
- Martin Korb
- Institute for Systems Biology, 1441 North 34thStreet, Seattle, Washington 98103-8904, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
82
|
Abstract
Over two metres of DNA is packaged into each nucleus in the human body in a manner that still allows for gene regulation. This remarkable feat is accomplished by the wrapping of DNA around histone proteins in repeating units of nucleosomes to form a structure known as chromatin. This chromatin structure is subject to various modifications that have profound influences on gene expression. Recently developed techniques to study chromatin modifications at a genome-wide scale are now allowing researchers to probe the complex components that make up epigenomes. Here we review genome-wide approaches to studying epigenomic structure and the exciting findings that have been obtained using these technologies.
Collapse
Affiliation(s)
- Dustin E Schones
- Laboratory of Molecular Immunology, The National Heart, Lung and Blood Institute, National Institutes of Health, Building 10, Room 7B05, 9,000 Rockville Pike, Bethesda, Maryland 20892, USA.
| | | |
Collapse
|
83
|
Di Filippo M, Bernardi G. Mapping DNase-I hypersensitive sites on human isochores. Gene 2008; 419:62-5. [PMID: 18436395 DOI: 10.1016/j.gene.2008.02.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2007] [Revised: 02/04/2008] [Accepted: 02/05/2008] [Indexed: 10/22/2022]
Abstract
Mapping DNase-I hypersensitive sites (HS) was used in the past to identify regulatory elements of specific genes. More recently, thousands of HS were identified in the human genome by using high-throughput methods. These approaches showed a general enrichment of HS near or within known genes, within CpG islands, within human-mouse conserved regions and in GC-rich regions of the genome. Here we show that HS: (i) are characterized by a much higher GC level (approximately 56%) than the average GC level of the human genome (approximately 41%); (ii) are overwhelmingly located in the GC-richest compartment of the genome, which is predominantly associated with an open chromatin structure; (iii) and are slightly more and slightly less frequent than genes, respectively, in the gene-rich and in the gene-poor isochore families.
Collapse
Affiliation(s)
- Miriam Di Filippo
- Laboratory of Molecular Evolution, Stazione Zoologica Anton Dohrn, 80121 Naples, Italy
| | | |
Collapse
|
84
|
Abstract
While the hereditary information encoded in the Watson-Crick base pairing of genomes is largely static within a given individual, access to this information is controlled by dynamic mechanisms. The human genome is pervasively transcribed, but the roles played by the majority of the non-protein-coding genome sequences are still largely unknown. In this review we focus on insights to gene transcriptional regulation by placing special emphasis on genome-wide approaches, and on how non-coding RNAs, which derive from global transcription of the genome, in turn control gene expression. We review recent progress in the field with highlights on the immune system.
Collapse
Affiliation(s)
- Matthew E Pipkin
- Immune Disease Institute and Harvard Medical School, Boston, MA, USA
| | | |
Collapse
|
85
|
Interferon regulatory factors are transcriptional regulators of adipogenesis. Cell Metab 2008; 7:86-94. [PMID: 18177728 PMCID: PMC2278019 DOI: 10.1016/j.cmet.2007.11.002] [Citation(s) in RCA: 107] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/14/2007] [Revised: 08/03/2007] [Accepted: 11/05/2007] [Indexed: 12/31/2022]
Abstract
We have sought to identify transcriptional pathways in adipogenesis using an integrated experimental and computational approach. Here, we employ high-throughput DNase hypersensitivity analysis to find regions of altered chromatin structure surrounding key adipocyte genes. Regions that display differentiation-dependent changes in hypersensitivity were used to predict binding sites for proteins involved in adipogenesis. A high-scoring example was a binding motif for interferon regulatory factor (IRF) family members. Expression of all nine mammalian IRF mRNAs is regulated during adipogenesis, and several bind to the identified motifs in a differentiation-dependent manner. Furthermore, several IRF proteins repress differentiation. This analysis suggests an important role for IRF proteins in adipocyte biology and demonstrates the utility of this approach in identifying cis- and trans-acting factors not previously suspected to participate in adipogenesis.
Collapse
|
86
|
Buckley NJ. Analysis of transcription, chromatin dynamics and epigenetic changes in neural genes. Prog Neurobiol 2007; 83:195-210. [PMID: 17884276 DOI: 10.1016/j.pneurobio.2007.07.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2006] [Revised: 06/14/2007] [Accepted: 07/18/2007] [Indexed: 01/08/2023]
Abstract
The ways in which gene transcription is investigated have undergone radical change since the turn of the millennium. Piece-meal approaches focussed upon model genes have increasingly been complemented by genome-wide approaches that allow interrogation of multiple cohorts of genes or even entire genomes. This sea change has been founded upon the increasing availability of whole genome sequences and the attendant evolution of microarray based discovery platforms. Collectively, these approaches are being used to build a global and dynamic perspective of transcription factor occupancy, co-factor recruitment and epigenetic signature. As yet, few of these approaches have been applied to the study of neuronal gene transcription, but this is set to change. Here, I review these key developments and point to their potential application to the study of transcriptional and epigenetic changes in neurons in health and disease.
Collapse
Affiliation(s)
- Noel J Buckley
- King's College London, Department of Neuroscience, Institute of Psychiatry, Centre for the Cellular Basis of Behaviour, CCBB/CCIB, Room 1-045, 125 Coldharbour Lane, London SE5 9NU, UK.
| |
Collapse
|
87
|
Kim SI, Gelvin SB. Genome-wide analysis of Agrobacterium T-DNA integration sites in the Arabidopsis genome generated under non-selective conditions. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2007; 51:779-91. [PMID: 17605756 DOI: 10.1111/j.1365-313x.2007.03183.x] [Citation(s) in RCA: 126] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Previous work from numerous laboratories has suggested that integration of Agrobacterium tumefaciens T-DNA into the plant genome occurs preferentially in promoter or transcriptionally active regions. However, all of these studies were conducted on plants recovered from selective conditions requiring the expression of transgenes. The conclusions of these studies may therefore have been biased because of the selection of transformants. In this study, we investigated T-DNA integration sites in the Arabidopsis genome by analyzing T-DNA/plant DNA junctions generated under non-selective conditions. We found a relatively high frequency of T-DNA insertions in heterochromatic regions, including centromeres, telomeres and rDNA repeats. These T-DNA insertion regions are disfavored under selective conditions. The frequency with which T-DNA insertions mapped to exon, intron, 5' upstream and 3' downstream regions closely resembled their respective proportions in the Arabidopsis genome. Transcriptional profiling indicated that expression levels of T-DNA pre-integration target sites recovered using selective conditions were significantly higher than those of random Arabidopsis sequences, whereas expression levels of genomic sequences targeted by T-DNA under non-selective conditions were similar to those of random Arabidopsis sequences. T-DNA target sites identified using non-selective conditions did not correlate with DNA methylation status, suggesting that T-DNA integration occurs without regard to DNA methylation. Our results indicate that T-DNA integration may occur more randomly than previously indicated, and that selection pressure may shift the recovery of T-DNA insertions into gene-rich or transcriptionally active regions of chromatin.
Collapse
Affiliation(s)
- Sang-Ic Kim
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907-1392, USA
| | | |
Collapse
|
88
|
Greenbaum JA, Parker SC, Tullius TD. Detection of DNA structural motifs in functional genomic elements. Genome Res 2007; 17:940-6. [PMID: 17568009 PMCID: PMC1891352 DOI: 10.1101/gr.5602807] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The completion of the human genome project has fueled the search for regulatory elements by a variety of different approaches. Many successful analyses have focused on examining primary DNA sequence and/or chromatin structure. However, it has been difficult to detect common sequence motifs within the feature of chromatin structure most closely associated with regulatory elements, DNase I hypersensitive sites (DHSs). Considering just the nucleotide sequence and/or the chromatin structure of regulatory elements may neglect a critical feature of what is recognized by the regulatory machinery--DNA structure. We introduce a new computational method to detect common DNA structural motifs in a large collection of DHSs that are found in the ENCODE regions of the human genome. We show that DHSs have common DNA structural motifs that show no apparent sequence consensus. One such structural motif is much more highly enriched in experimentally identified DHSs that are in CpG islands and near transcription start sites (TSSs), compared to DHSs not in CpG islands and farther from TSSs, suggesting that DNA structural motifs may participate in the formation of functional regulatory elements. We propose that studies of the conservation of DNA structure, independent of sequence conservation, will provide new information about the link between the nucleotide sequence of a DNA molecule and its experimentally demonstrated function.
Collapse
Affiliation(s)
- Jason A. Greenbaum
- Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA
| | - Stephen C.J. Parker
- Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA
| | - Thomas D. Tullius
- Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA
- Department of Chemistry, Boston University, Boston, Massachusetts 02215, USA
- Corresponding author.E-mail ; fax (617) 353-6466
| |
Collapse
|
89
|
Elnitski LL, Shah P, Moreland RT, Umayam L, Wolfsberg TG, Baxevanis AD. The ENCODEdb portal: simplified access to ENCODE Consortium data. Genome Res 2007; 17:954-9. [PMID: 17568011 PMCID: PMC1891354 DOI: 10.1101/gr.5582207] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The Encyclopedia of DNA Elements (ENCODE) project aims to identify and characterize all functional elements in a representative chromosomal sample comprising 1% of the human genome. Data generated by members of The ENCODE Project Consortium are housed in a number of public databases, such as the UCSC Genome Browser, NCBI's Gene Expression Omnibus (GEO), and EBI's ArrayExpress. As such, it is often difficult for biologists to gather all of the ENCODE data from a particular genomic region of interest and integrate them with relevant information found in other public databases. The ENCODEdb portal was developed to address this problem. ENCODEdb provides a unified, single point-of-access to data generated by the ENCODE Consortium, as well as to data from other source databases that lie within ENCODE regions; this provides the user a complete view of all known data in a particular region of interest. ENCODEdb Genomic Context searches allow for the retrieval of information on functional elements annotated within ENCODE regions, including mRNA, EST, and STS sequences; single nucleotide polymorphisms, and UniGene clusters. Information is also retrieved from GEO, OMIM, and major genome sequence browsers. ENCODEdb Consortium Data searches allow users to perform compound queries on array-based ENCODE data available both from GEO and from the UCSC Genome Browser. Results are retrieved from a specific genomic area of interest and can be further manipulated in a variety of contexts, including the UCSC Genome Browser and the Galaxy large-scale genome analysis platform. The ENCODEdb portal is freely accessible at http://research.nhgri.nih.gov/ENCODEdb.
Collapse
Affiliation(s)
- Laura L. Elnitski
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Prachi Shah
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - R. Travis Moreland
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Lowell Umayam
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Tyra G. Wolfsberg
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Andreas D. Baxevanis
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
- Corresponding author.E-mail ; fax (301) 480-2634
| |
Collapse
|
90
|
Peng Y, Song J, Lu J, Chen X. The histone deacetylase inhibitor sodium butyrate inhibits baculovirus-mediated transgene expression in Sf9 cells. J Biotechnol 2007; 131:180-7. [PMID: 17655959 DOI: 10.1016/j.jbiotec.2007.06.009] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2007] [Revised: 05/26/2007] [Accepted: 06/14/2007] [Indexed: 10/23/2022]
Abstract
Recent studies have indicated that histone deacetylase inhibitors (HDACis) could enhance and prolong expression of exogenous genes delivered by various viral vehicles in mammalian cells, including baculovirus vectors. In this study, the effects of HDACis on expression of a baculovirus-mediated eGFP reporter gene under control of baculovirus late promoter p10 in Sf9 cells were evaluated. It was found that sodium butyrate (NaBu) decreased the expression level of the target gene driven by p10 promoter by four to fivefold. Moreover, addition of NaBu increased DNaseI-sensitivity of transgene p10 promoter region and did not influence viral DNA replication. FACS assay has shown that both NaBu and fluorodeoxyuridine (FdUrd) blocked Sf9 cells at G1 phase and inhibited the target gene expression. Another HDACi, trichostatin, had little effects on both cell cycle and Ac-p10-eGFP expression, strongly suggesting that cell cycle arrest accounts for the mechanisms by which NaBu inhibits Ac-p10-eGFP expression. The inhibiting effects of NaBu on baculovirus transgene expression in Sf9 cells are promoter specific since the enhancement of NaBu on transgene expression in insect and mammalian cells are mediated by baculovirus harboring a murine cytomegalovirus (mCMV) immediate early promoter. This study was aimed at improving the productivity of the recombinant proteins and providing a better understanding of the epigenetic regulation of baculovirus gene expression.
Collapse
Affiliation(s)
- Ying Peng
- State Key Laboratory of Virology, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, People's Republic of China
| | | | | | | |
Collapse
|
91
|
Akopov SB, Chernov IP, Bulanenkova SS, Skvortsova YV, Vetchinova AS, Nikolaev LG. Methods for identification of epigenetic elements in mammalian long multigenic genome sequences. BIOCHEMISTRY (MOSCOW) 2007; 72:589-94. [PMID: 17630903 DOI: 10.1134/s0006297907060016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Epigenetic elements of the genome, i.e. elements that determine stably inherited changes in gene expression without changes in the genomic DNA sequence, are essential tools of genetic regulation in higher eukaryotes. The complete sequencing of the human and other genomes allowed studies to be started on positioning of these elements within long multigenic regions of the genome, which is a prerequisite for a comprehensive functional annotation of genomes. This mini-review considers some recent experimental approaches to the high-throughput identification and mapping of epigenetic elements of mammalian genomes, including the mapping of methylated CpG sites, open and closed chromatin regions, and DNase I hypersensitivity sites.
Collapse
Affiliation(s)
- S B Akopov
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia
| | | | | | | | | | | |
Collapse
|
92
|
Derse D, Crise B, Li Y, Princler G, Lum N, Stewart C, McGrath CF, Hughes SH, Munroe DJ, Wu X. Human T-cell leukemia virus type 1 integration target sites in the human genome: comparison with those of other retroviruses. J Virol 2007; 81:6731-41. [PMID: 17409138 PMCID: PMC1900082 DOI: 10.1128/jvi.02752-06] [Citation(s) in RCA: 125] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Retroviral integration into the host genome is not entirely random, and integration site preferences vary among different retroviruses. Human immunodeficiency virus (HIV) prefers to integrate within active genes, whereas murine leukemia virus (MLV) prefers to integrate near transcription start sites and CpG islands. On the other hand, integration of avian sarcoma-leukosis virus (ASLV) shows little preference either for genes, transcription start sites, or CpG islands. While host cellular factors play important roles in target site selection, the viral integrase is probably the major viral determinant. It is reasonable to hypothesize that retroviruses with similar integrases have similar preferences for target site selection. Although integration profiles are well defined for members of the lentivirus, spumaretrovirus, alpharetrovirus, and gammaretrovirus genera, no members of the deltaretroviruses, for example, human T-cell leukemia virus type 1 (HTLV-1), have been evaluated. We have mapped 541 HTLV-1 integration sites in human HeLa cells and show that HTLV-1, like ASLV, does not specifically target transcription units and transcription start sites. Comparing the integration sites of HTLV-1 with those of ASLV, HIV, simian immunodeficiency virus, MLV, and foamy virus, we show that global and local integration site preferences correlate with the sequence/structure of virus-encoded integrases, supporting the idea that integrase is the major determinant of retroviral integration site selection. Our results suggest that the global integration profiles of other retroviruses could be predicted from phylogenetic comparisons of the integrase proteins. Our results show that retroviruses that engender different insertional mutagenesis risks can have similar integration profiles.
Collapse
Affiliation(s)
- David Derse
- HIV Drug Resistance Program, Laboratory of Molecular Technology, SAIC-Frederick, Inc., NCI-Frederick, 915 Toll House Avenue, Frederick, MD 21702, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
93
|
Follows GA, Janes ME, Vallier L, Green AR, Gottgens B. Real-time PCR mapping of DNaseI-hypersensitive sites using a novel ligation-mediated amplification technique. Nucleic Acids Res 2007; 35:e56. [PMID: 17389645 PMCID: PMC1885650 DOI: 10.1093/nar/gkm108] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Mapping sites within the genome that are hypersensitive to digestion with DNaseI is an important method for identifying DNA elements that regulate transcription. The standard approach to locating these DNaseI-hypersensitive sites (DHSs) has been to use Southern blotting techniques, although we, and others, have recently published alternative methods using a range of technologies including high-throughput sequencing and genomic array tiling paths. In this article, we describe a novel protocol to use real-time PCR to map DHS. Advantages of the technique reported here include the small cell numbers required for each analysis, rapid, relatively low-cost experiments with minimal need for specialist equipment. Presented examples include comparative DHS mapping of known TAL1/SCL regulatory elements between human embryonic stem cells and K562 cells.
Collapse
Affiliation(s)
- George A Follows
- Department of Haematology, Cambridge Institute for Medical Research, Addenbrooke's Hospital, University of Cambridge, Cambridge CB2 2XY, UK.
| | | | | | | | | |
Collapse
|
94
|
Nagel S, Scherr M, Kel A, Hornischer K, Crawford GE, Kaufmann M, Meyer C, Drexler HG, MacLeod RAF. Activation of TLX3 and NKX2-5 in t(5;14)(q35;q32) T-cell acute lymphoblastic leukemia by remote 3'-BCL11B enhancers and coregulation by PU.1 and HMGA1. Cancer Res 2007; 67:1461-71. [PMID: 17308084 DOI: 10.1158/0008-5472.can-06-2615] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In T-cell acute lymphoblastic leukemia, alternative t(5;14)(q35;q32.2) forms effect dysregulation of either TLX3 or NKX2-5 homeobox genes at 5q35 by juxtaposition with 14q32.2 breakpoints dispersed across the BCL11B downstream genomic desert. Leukemic gene dysregulation by t(5;14) was investigated by DNA inhibitory treatments with 26-mer double-stranded DNA oligonucleotides directed against candidate enhancers at, or near, orphan T-cell DNase I hypersensitive sites located between 3'-BCL11B and VRK1. NKX2-5 down-regulation in t(5;14) PEER cells was almost entirely restricted to DNA inhibitory treatment targeting enhancers within the distal breakpoint cluster region and was dose and sequence dependent, whereas enhancers near 3'-BCL11B regulated that gene only. Chromatin immunoprecipitation assays showed that the four most effectual NKX2-5 ectopic enhancers were hyperacetylated. These enhancers clustered approximately 1 Mbp downstream of BCL11B, within a region displaying multiple regulatory stigmata, including a TCRA enhancer motif, deep sequence conservation, and tight nuclear matrix attachment relaxed by trichostatin A treatment. Intriguingly, although TLX3/NKX2-5 promoter/exon 1 regions were hypoacetylated, their expression was trichostatin A sensitive, implying extrinsic regulation by factor(s) under acetylation control. Knockdown of PU.1, known to be trichostatin A responsive and which potentially binds TLX3/NKX2-5 promoters, effected down-regulation of both homeobox genes. Moreover, genomic analysis showed preferential enrichment near ectopic enhancers of binding sites for the PU.1 cofactor HMGA1, the knockdown of which also inhibited NKX2-5. We suggest that HMGA1 and PU.1 coregulate ectopic homeobox gene expression in t(5;14) T-cell acute lymphoblastic leukemia by interactions mediated at the nuclear matrix. Our data document homeobox gene dysregulation by a novel regulatory region at 3'-BCL11B responsive to histone deacetylase inhibition and highlight a novel class of potential therapeutic target amid noncoding DNA.
Collapse
MESH Headings
- Acetylation
- Chromosome Breakage
- Chromosomes, Human, Pair 14
- Chromosomes, Human, Pair 5
- DNA-Binding Proteins/genetics
- Deoxyribonuclease I/metabolism
- Enhancer Elements, Genetic
- Gene Expression Regulation, Leukemic
- HMGA Proteins/genetics
- Histones/metabolism
- Homeobox Protein Nkx-2.5
- Homeodomain Proteins/genetics
- Humans
- Leukemia-Lymphoma, Adult T-Cell/genetics
- Leukemia-Lymphoma, Adult T-Cell/metabolism
- Multigene Family
- Nuclear Matrix/metabolism
- Oligonucleotides/genetics
- Oncogene Proteins/genetics
- Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics
- Precursor Cell Lymphoblastic Leukemia-Lymphoma/metabolism
- Proto-Oncogene Proteins/genetics
- RNA, Small Interfering/genetics
- Repressor Proteins/genetics
- Trans-Activators/genetics
- Transcription Factors/genetics
- Translocation, Genetic
- Tumor Suppressor Proteins/genetics
Collapse
Affiliation(s)
- Stefan Nagel
- German Collection of Microorganisms and Cell Cultures, Department of Cell Cultures, Inhoffenstrasse 7B, 38124 Braunschweig, Germany.
| | | | | | | | | | | | | | | | | |
Collapse
|
95
|
Kodama Y, Nagaya S, Shinmyo A, Kato K. Mapping and characterization of DNase I hypersensitive sites in Arabidopsis chromatin. PLANT & CELL PHYSIOLOGY 2007; 48:459-70. [PMID: 17283013 DOI: 10.1093/pcp/pcm017] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Recent genome-wide analyses of yeast and human chromatin revealed the widespread prevalence of DNase I hypersensitive sites (DNase I HSs) at gene regulatory regions with possible roles in eukaryotic gene regulation. The presence of DNase I HSs in plants has been described for only a few genes, and we analyzed the chromatin structure of an 80 kb genomic region containing 30 variably expressed genes by DNase I sensitivity assay at 500 bp resolution in Arabidopsis. Distinct DNase I HSs were found at the 5' and/or 3' ends of most genes irrespective of their expression levels. Further analysis of well-characterized genes showed that the DNase I HSs occurred near cis-regulatory elements in the promoters of these genes. Upon transcriptional activation of a heat-inducible gene, the DNase I HS was extended into the vicinity of a cis-element and adjacent TATA element in the promoter. Concomitant with this change in DNase I HS, histones were acetylated, removed from the promoter, and a transcription activator bound to this cis-element. These results suggest that the DNase I HSs participate in the transcriptional regulation of Arabidopsis genes by enhancing the access of chromatin remodeling factors and/or transcription factors to their target sites as seen in yeast and human chromatin.
Collapse
Affiliation(s)
- Yuichi Kodama
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara, 630-0192, Japan
| | | | | | | |
Collapse
|
96
|
Berry C, Hannenhalli S, Leipzig J, Bushman FD. Selection of target sites for mobile DNA integration in the human genome. PLoS Comput Biol 2007; 2:e157. [PMID: 17166054 PMCID: PMC1664696 DOI: 10.1371/journal.pcbi.0020157] [Citation(s) in RCA: 176] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2006] [Accepted: 10/04/2006] [Indexed: 01/01/2023] Open
Abstract
DNA sequences from retroviruses, retrotransposons, DNA transposons, and parvoviruses can all become integrated into the human genome. Accumulation of such sequences accounts for at least 40% of our genome today. These integrating elements are also of interest as gene-delivery vectors for human gene therapy. Here we present a comprehensive bioinformatic analysis of integration targeting by HIV, MLV, ASLV, SFV, L1, SB, and AAV. We used a mathematical method which allowed annotation of each base pair in the human genome for its likelihood of hosting an integration event by each type of element, taking advantage of more than 200 types of genomic annotation. This bioinformatic resource documents a wealth of new associations between genomic features and integration targeting. The study also revealed that the length of genomic intervals analyzed strongly affected the conclusions drawn--thus, answering the question "What genomic features affect integration?" requires carefully specifying the length scale of interest.
Collapse
Affiliation(s)
- Charles Berry
- Department of Family and Preventive Medicine, School of Medicine, University of California San Diego, La Jolla, California, United States of America
| | - Sridhar Hannenhalli
- Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Jeremy Leipzig
- Department of Microbiology, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Frederic D Bushman
- Department of Microbiology, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
97
|
Ganapathi M, Singh GP, Sandhu KS, Brahmachari SK, Brahmachari V. A whole genome analysis of 5' regulatory regions of human genes for putative cis-acting modulators of nucleosome positioning. Gene 2007; 391:242-51. [PMID: 17321698 DOI: 10.1016/j.gene.2007.01.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2006] [Revised: 01/16/2007] [Accepted: 01/16/2007] [Indexed: 01/10/2023]
Abstract
We have carried out in silico analysis of upstream regions of 23,034 genes from the human genome for sequence motifs, which can potentially affect nucleosome positioning. Nucleosome exclusion elements (NEE) occur in 12% of the genes while less than 1% contain nucleosome positioning elements (NPE). NEE are significantly higher in 5' regions of certain categories of genes, namely, genes with active promoters, genes localised to gene-rich chromosomes 16, 17 and 19, genes having significantly higher expression levels and higher levels of occupancy of general transcription machinery proteins. NEE are also enriched in housekeeping and TATA-less genes, but are significantly under-represented in the upstream region of genes functionally classified under 'organ development' and 'morphogenesis' categories. Further, DNase I hypersensitive sites which co-localise with NEE, preferentially occur in 5' regulatory regions. Considering the positioning sequences identified so far, we speculate that low affinity nucleosome positioning in the upstream sequences of genes in the human genome is the default state requiring activation through chromatin remodelling, while, there appears to be a selection for nucleosome excluding sequences in the upstream sequences of genes that are ubiquitously expressed.
Collapse
Affiliation(s)
- Mythily Ganapathi
- Dr. B. R. Ambedkar Centre for Biomedical Research, University of Delhi, Delhi-110007, India
| | | | | | | | | |
Collapse
|
98
|
Giresi PG, Kim J, McDaniell RM, Iyer VR, Lieb JD. FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) isolates active regulatory elements from human chromatin. Genome Res 2006; 17:877-85. [PMID: 17179217 PMCID: PMC1891346 DOI: 10.1101/gr.5533506] [Citation(s) in RCA: 666] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
DNA segments that actively regulate transcription in vivo are typically characterized by eviction of nucleosomes from chromatin and are experimentally identified by their hypersensitivity to nucleases. Here we demonstrate a simple procedure for the isolation of nucleosome-depleted DNA from human chromatin, termed FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements). To perform FAIRE, chromatin is crosslinked with formaldehyde in vivo, sheared by sonication, and phenol-chloroform extracted. The DNA recovered in the aqueous phase is fluorescently labeled and hybridized to a DNA microarray. FAIRE performed in human cells strongly enriches DNA coincident with the location of DNaseI hypersensitive sites, transcriptional start sites, and active promoters. Evidence for cell-type-specific patterns of FAIRE enrichment is also presented. FAIRE has utility as a positive selection for genomic regions associated with regulatory activity, including regions traditionally detected by nuclease hypersensitivity assays.
Collapse
Affiliation(s)
- Paul G. Giresi
- Department of Biology and the Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-3280, USA
| | - Jonghwan Kim
- Institute for Cellular and Molecular Biology and Center for Systems and Synthetic Biology, University of Texas at Austin, Austin, Texas 78712-0159, USA
| | - Ryan M. McDaniell
- Institute for Cellular and Molecular Biology and Center for Systems and Synthetic Biology, University of Texas at Austin, Austin, Texas 78712-0159, USA
| | - Vishwanath R. Iyer
- Institute for Cellular and Molecular Biology and Center for Systems and Synthetic Biology, University of Texas at Austin, Austin, Texas 78712-0159, USA
| | - Jason D. Lieb
- Department of Biology and the Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-3280, USA
- Corresponding author.E-mail ; fax (919) 962-1625
| |
Collapse
|
99
|
Follows GA, Dhami P, Göttgens B, Bruce AW, Campbell PJ, Dillon SC, Smith AM, Koch C, Donaldson IJ, Scott MA, Dunham I, Janes ME, Vetrie D, Green AR. Identifying gene regulatory elements by genomic microarray mapping of DNaseI hypersensitive sites. Genome Res 2006; 16:1310-9. [PMID: 16963707 PMCID: PMC1581440 DOI: 10.1101/gr.5373606] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The identification of cis-regulatory elements is central to understanding gene transcription. Hypersensitivity of cis-regulatory elements to digestion with DNaseI remains the gold-standard approach to locating such elements. Traditional methods used to identify DNaseI hypersensitive sites are cumbersome and can only be applied to short stretches of DNA at defined locations. Here we report the development of a novel genomic array-based approach to DNaseI hypersensitive site mapping (ADHM) that permits precise, large-scale identification of such sites from as few as 5 million cells. Using ADHM we identified all previously recognized hematopoietic regulatory elements across 200 kb of the mouse T-cell acute lymphocytic leukemia-1 (Tal1) locus, and, in addition, identified two novel elements within the locus, which show transcriptional regulatory activity. We further validated the ADHM protocol by mapping the DNaseI hypersensitive sites across 250 kb of the human TAL1 locus in CD34+ primary stem/progenitor cells and K562 cells and by mapping the previously known DNaseI hypersensitive sites across 240 kb of the human alpha-globin locus in K562 cells. ADHM provides a powerful approach to identifying DNaseI hypersensitive sites across large genomic regions.
Collapse
Affiliation(s)
- George A Follows
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, CB2 2XY, United Kingdom.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
100
|
KELLY BB, HEDLUND E, KIM C, ISHIGURO H, ISACSON O, CHIKARAISHI DM, KIM KS, FENG G. A tyrosine hydroxylase-yellow fluorescent protein knock-in reporter system labeling dopaminergic neurons reveals potential regulatory role for the first intron of the rodent tyrosine hydroxylase gene. Neuroscience 2006; 142:343-54. [PMID: 16876957 PMCID: PMC2610443 DOI: 10.1016/j.neuroscience.2006.06.032] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2006] [Revised: 06/14/2006] [Accepted: 06/16/2006] [Indexed: 11/20/2022]
Abstract
Degeneration of the dopaminergic neurons of the substantia nigra is a hallmark of Parkinson's disease. To facilitate the study of the differentiation and maintenance of this population of dopaminergic neurons both in vivo and in vitro, we generated a knock-in reporter line in which the yellow fluorescent protein (YFP) replaced the first exon and the first intron of the tyrosine hydroxylase (TH) gene in one allele by homologous recombination. Expression of YFP under the direct control of the entire endogenous 5' upstream region of the TH gene was predicted to closely match expression of TH from the wild type allele, thus marking functional dopaminergic neurons. We found that YFP was expressed in dopaminergic neurons differentiated in vitro from the knock-in mouse embryonic stem cell line and in dopaminergic brain regions in knock-in mice. Surprisingly, however, YFP expression did not overlap completely with TH expression, and the degree of overlap varied in different TH-expressing brain regions. Thus, the reporter gene did not identify functional TH-expressing cells with complete accuracy. A DNaseI hypersensitivity assay revealed a cluster of hypersensitivity sites in the first intron of the TH gene, which was deleted by insertion of the reporter gene, suggesting that this region may contain cis-acting regulatory sequences. Our results suggest that the first intron of the rodent TH gene may be important for accurate expression of TH.
Collapse
Affiliation(s)
- B. B. KELLY
- Department of Neurobiology, Box 3209, Duke University Medical Center, Durham, NC 27710, USA
| | - E. HEDLUND
- Udall Parkinson’s Disease Research Center of Excellence, McLean Hospital/Harvard Medical School, MA 02478, USA
- Molecular Neurobiology Laboratory, McLean Hospital/Harvard Medical School, Belmont, MA 02478, USA
- Neuroregeneration Laboratory, McLean Hospital/Harvard Medical School, Belmont, MA 02478, USA
| | - C. KIM
- Udall Parkinson’s Disease Research Center of Excellence, McLean Hospital/Harvard Medical School, MA 02478, USA
- Molecular Neurobiology Laboratory, McLean Hospital/Harvard Medical School, Belmont, MA 02478, USA
| | - H. ISHIGURO
- Carna Bioscience, KIBC 511, 5-5-2, Minatojima-Minamimachi, Chuo-ku, Kobe 650-0047, Hyogo, Japan
| | - O. ISACSON
- Udall Parkinson’s Disease Research Center of Excellence, McLean Hospital/Harvard Medical School, MA 02478, USA
- Neuroregeneration Laboratory, McLean Hospital/Harvard Medical School, Belmont, MA 02478, USA
| | - D. M. CHIKARAISHI
- Department of Neurobiology, Box 3209, Duke University Medical Center, Durham, NC 27710, USA
| | - K.-S. KIM
- Udall Parkinson’s Disease Research Center of Excellence, McLean Hospital/Harvard Medical School, MA 02478, USA
- Molecular Neurobiology Laboratory, McLean Hospital/Harvard Medical School, Belmont, MA 02478, USA
| | - G. FENG
- Department of Neurobiology, Box 3209, Duke University Medical Center, Durham, NC 27710, USA
- Department of Pathology, Duke University Medical Center, Durham, NC 27710, USA
- Correspondence to: G. Feng, Department of Neurobiology, Box 3209, Duke University Medical Center, Durham, NC 27710, USA. Tel: +1-919-668-1657; fax: +1-919-668-1891. E-mail address: (G. Feng)
| |
Collapse
|