51
|
Jankowski A, Tiuryn J, Prabhakar S. Romulus: robust multi-state identification of transcription factor binding sites from DNase-seq data. Bioinformatics 2016; 32:2419-26. [PMID: 27153645 PMCID: PMC4978937 DOI: 10.1093/bioinformatics/btw209] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2015] [Accepted: 04/12/2016] [Indexed: 12/24/2022] Open
Abstract
Motivation: Computational prediction of transcription factor (TF) binding sites in the genome remains a challenging task. Here, we present Romulus, a novel computational method for identifying individual TF binding sites from genome sequence information and cell-type–specific experimental data, such as DNase-seq. It combines the strengths of previous approaches, and improves robustness by reducing the number of free parameters in the model by an order of magnitude. Results: We show that Romulus significantly outperforms existing methods across three sources of DNase-seq data, by assessing the performance of these tools against ChIP-seq profiles. The difference was particularly significant when applied to binding site prediction for low-information-content motifs. Our method is capable of inferring multiple binding modes for a single TF, which differ in their DNase I cut profile. Finally, using the model learned by Romulus and ChIP-seq data, we introduce Binding in Closed Chromatin (BCC) as a quantitative measure of TF pioneer factor activity. Uniquely, our measure quantifies a defining feature of pioneer factors, namely their ability to bind closed chromatin. Availability and Implementation: Romulus is freely available as an R package at http://github.com/ajank/Romulus. Contact:ajank@mimuw.edu.pl Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Aleksander Jankowski
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, 02-097 Warszawa, Poland Computational and Systems Biology, Genome Institute of Singapore, Singapore 138672, Singapore
| | - Jerzy Tiuryn
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, 02-097 Warszawa, Poland
| | - Shyam Prabhakar
- Computational and Systems Biology, Genome Institute of Singapore, Singapore 138672, Singapore
| |
Collapse
|
52
|
Zang C, Wang T, Deng K, Li B, Hu S, Qin Q, Xiao T, Zhang S, Meyer CA, He HH, Brown M, Liu JS, Xie Y, Liu XS. High-dimensional genomic data bias correction and data integration using MANCIE. Nat Commun 2016; 7:11305. [PMID: 27072482 PMCID: PMC4833864 DOI: 10.1038/ncomms11305] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2015] [Accepted: 03/11/2016] [Indexed: 12/24/2022] Open
Abstract
High-dimensional genomic data analysis is challenging due to noises and biases in high-throughput experiments. We present a computational method matrix analysis and normalization by concordant information enhancement (MANCIE) for bias correction and data integration of distinct genomic profiles on the same samples. MANCIE uses a Bayesian-supported principal component analysis-based approach to adjust the data so as to achieve better consistency between sample-wise distances in the different profiles. MANCIE can improve tissue-specific clustering in ENCODE data, prognostic prediction in Molecular Taxonomy of Breast Cancer International Consortium and The Cancer Genome Atlas data, copy number and expression agreement in Cancer Cell Line Encyclopedia data, and has broad applications in cross-platform, high-dimensional data integration.
Collapse
Affiliation(s)
- Chongzhi Zang
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02215, USA
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
| | - Tao Wang
- Quantitative Biomedical Research Center, Department of Clinical Sciences, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA
- Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA
| | - Ke Deng
- Center for Statistical Science, Tsinghua University, Beijing 100084, China
| | - Bo Li
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02215, USA
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
- Department of Statistics, Harvard University, Cambridge, Massachusetts 02138, USA
| | - Sheng'en Hu
- Department of Bioinformatics, School of Life Sciences, Tongji University, Shanghai 200092, China
| | - Qian Qin
- Department of Bioinformatics, School of Life Sciences, Tongji University, Shanghai 200092, China
| | - Tengfei Xiao
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02215, USA
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, Massachusetts 02215, USA
| | - Shihua Zhang
- National Center for Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
| | - Clifford A. Meyer
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02215, USA
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
| | - Housheng Hansen He
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02215, USA
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, Massachusetts 02215, USA
- Department of Medical Biophysics, University of Toronto, Toronto, Ontatio M5G 1L7, Canada
| | - Myles Brown
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, Massachusetts 02215, USA
| | - Jun S. Liu
- Department of Statistics, Harvard University, Cambridge, Massachusetts 02138, USA
| | - Yang Xie
- Quantitative Biomedical Research Center, Department of Clinical Sciences, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA
- Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA
- Simons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA
| | - X. Shirley Liu
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02215, USA
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
| |
Collapse
|
53
|
Vierstra J, Stamatoyannopoulos JA. Genomic footprinting. Nat Methods 2016; 13:213-21. [DOI: 10.1038/nmeth.3768] [Citation(s) in RCA: 76] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Accepted: 01/13/2016] [Indexed: 01/08/2023]
|
54
|
Sos BC, Fung HL, Gao DR, Osothprarop TF, Kia A, He MM, Zhang K. Characterization of chromatin accessibility with a transposome hypersensitive sites sequencing (THS-seq) assay. Genome Biol 2016; 17:20. [PMID: 26846207 PMCID: PMC4743176 DOI: 10.1186/s13059-016-0882-7] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2015] [Accepted: 01/18/2016] [Indexed: 12/22/2022] Open
Abstract
Chromatin accessibility captures in vivo protein-chromosome binding status, and is considered an informative proxy for protein-DNA interactions. DNase I and Tn5 transposase assays require thousands to millions of fresh cells for comprehensive chromatin mapping. Applying Tn5 tagmentation to hundreds of cells results in sparse chromatin maps. We present a transposome hypersensitive sites sequencing assay for highly sensitive characterization of chromatin accessibility. Linear amplification of accessible DNA ends with in vitro transcription, coupled with an engineered Tn5 super-mutant, demonstrates improved sensitivity on limited input materials, and accessibility of small regions near distal enhancers, compared with ATAC-seq.
Collapse
Affiliation(s)
- Brandon Chin Sos
- Department of Bioengineering, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA.,Biomedical Sciences Graduate Program, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA
| | - Ho-Lim Fung
- Department of Bioengineering, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA
| | - Derek Rui Gao
- Department of Bioengineering, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA
| | | | - Amirali Kia
- Illumina Inc, 5200 Illumina Way, San Diego, CA, USA
| | - Molly Min He
- Illumina Inc, 5200 Illumina Way, San Diego, CA, USA
| | - Kun Zhang
- Department of Bioengineering, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA. .,Biomedical Sciences Graduate Program, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA.
| |
Collapse
|
55
|
Bery A, Mérot Y, Rétaux S. Genes expressed in mouse cortical progenitors are enriched in Pax, Lhx, and Sox transcription factor putative binding sites. Brain Res 2015; 1633:37-51. [PMID: 26721689 DOI: 10.1016/j.brainres.2015.12.022] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Revised: 11/25/2015] [Accepted: 12/14/2015] [Indexed: 11/19/2022]
Abstract
Considerable progress has been made in the understanding of molecular and cellular mechanisms controlling the development of the mammalian cortex. The proliferative and neurogenic properties of cortical progenitors located in the ventricular germinal zone start being understood. Little is known however on the cis-regulatory control that finely tunes gene expression in these progenitors. Here, we undertook an in silico-based approach to address this question, followed by some functional validation. Using the Eurexpress database, we established a list of 30 genes specifically expressed in the cortical germinal zone, we selected mouse/human conserved non-coding elements (CNEs) around these genes and we performed motif-enrichment search in these CNEs. We found an over-representation of motifs corresponding to binding sites for Pax, Sox, and Lhx transcription factors, often found as pairs and located within 100bp windows. A small subset of CNEs (n=7) was tested for enhancer activity, by ex-vivo and in utero electroporation assays. Two showed strong enhancer activity in the germinal zone progenitors. Mutagenesis experiments on a selected CNE showed the functional importance of the Pax, Sox, and Lhx TFBS for conferring enhancer activity to the CNE. Overall, from a cis-regulatory viewpoint, our data suggest an input from Pax, Sox and Lhx transcription factors to orchestrate corticogenesis. These results are discussed with regards to the known functional roles of Pax6, Sox2 and Lhx2 in cortical development.
Collapse
Affiliation(s)
- Amandine Bery
- DECA Group, Institut des Neurosciences Paris-Saclay, Université Paris-Saclay, Université Paris-Sud, CNRS, UMR 9197, 91198 Gif-sur-Yvette, France.
| | - Yohann Mérot
- DECA Group, Institut des Neurosciences Paris-Saclay, Université Paris-Saclay, Université Paris-Sud, CNRS, UMR 9197, 91198 Gif-sur-Yvette, France
| | - Sylvie Rétaux
- DECA Group, Institut des Neurosciences Paris-Saclay, Université Paris-Saclay, Université Paris-Sud, CNRS, UMR 9197, 91198 Gif-sur-Yvette, France.
| |
Collapse
|
56
|
Piper J, Assi SA, Cauchy P, Ladroue C, Cockerill PN, Bonifer C, Ott S. Wellington-bootstrap: differential DNase-seq footprinting identifies cell-type determining transcription factors. BMC Genomics 2015; 16:1000. [PMID: 26608661 PMCID: PMC4658755 DOI: 10.1186/s12864-015-2081-4] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2015] [Accepted: 10/13/2015] [Indexed: 12/04/2022] Open
Abstract
Background The analysis of differential gene expression is a fundamental tool to relate gene regulation with specific biological processes. Differential binding of transcription factors (TFs) can drive differential gene expression. While DNase-seq data can provide global snapshots of TF binding, tools for detecting differential binding from pairs of DNase-seq data sets are lacking. Results In order to link expression changes with changes in TF binding we introduce the concept of differential footprinting alongside a computational tool. We demonstrate that differential footprinting is associated with differential gene expression and can be used to define cell types by their specific TF occupancy patterns. Conclusions Our new tool, Wellington-bootstrap, will enable the detection of differential TF binding facilitating the study of gene regulatory systems. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2081-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jason Piper
- Warwick Systems Biology Centre, University of Warwick, Coventry, CV4 7AL, UK.,Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, Institute of Biomedical Research, University of Birmingham, Birmingham, B15 2TT, UK
| | - Salam A Assi
- Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, Institute of Biomedical Research, University of Birmingham, Birmingham, B15 2TT, UK
| | - Pierre Cauchy
- Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, Institute of Biomedical Research, University of Birmingham, Birmingham, B15 2TT, UK
| | - Christophe Ladroue
- Department of Computer Science, University of Warwick, Coventry, CV4 7AL, UK
| | - Peter N Cockerill
- Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, Institute of Biomedical Research, University of Birmingham, Birmingham, B15 2TT, UK.
| | - Constanze Bonifer
- Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, Institute of Biomedical Research, University of Birmingham, Birmingham, B15 2TT, UK.
| | - Sascha Ott
- Warwick Systems Biology Centre, University of Warwick, Coventry, CV4 7AL, UK.
| |
Collapse
|
57
|
Ren B, Yue F. Transcriptional Enhancers: Bridging the Genome and Phenome. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2015; 80:17-26. [PMID: 26582789 DOI: 10.1101/sqb.2015.80.027219] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Enhancers play a major role in animal development by modulating spatiotemporal expression of genes. They interact with sequence-specific transcriptional regulators in response to internal and external cues to bring about transcriptional changes, thus serving as the critical link between an organism's genome and its phenotypic traits. Deciphering the biology of enhancers is a key to understanding the genetic basis of common human diseases. Although a large number of candidate enhancers have been annotated through genome-wide analyses of chromatin accessibility, transcription factor binding, and histone modification in diverse cell types, efforts to characterize their biological roles in human diseases have only begun. Recent experiments have suggested a role for the three-dimensional chromatin architecture in regulation of gene expression by enhancers.
Collapse
Affiliation(s)
- Bing Ren
- Ludwig Institute for Cancer Research, University of California, San Diego School of Medicine, Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, and Moores Cancer Center, La Jolla, California 92093-0653
| | - Feng Yue
- Department of Biochemistry and Molecular Biology and Institute for Personalized Medicine, Pennsylvania State University College of Medicine, Hershey, Pennsylvania 17033
| |
Collapse
|
58
|
Ugarte F, Sousae R, Cinquin B, Martin EW, Krietsch J, Sanchez G, Inman M, Tsang H, Warr M, Passegué E, Larabell CA, Forsberg EC. Progressive Chromatin Condensation and H3K9 Methylation Regulate the Differentiation of Embryonic and Hematopoietic Stem Cells. Stem Cell Reports 2015; 5:728-740. [PMID: 26489895 PMCID: PMC4649257 DOI: 10.1016/j.stemcr.2015.09.009] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Revised: 09/10/2015] [Accepted: 09/11/2015] [Indexed: 01/09/2023] Open
Abstract
Epigenetic regulation serves as the basis for stem cell differentiation into distinct cell types, but it is unclear how global epigenetic changes are regulated during this process. Here, we tested the hypothesis that global chromatin organization affects the lineage potential of stem cells and that manipulation of chromatin dynamics influences stem cell function. Using nuclease sensitivity assays, we found a progressive decrease in chromatin digestion among pluripotent embryonic stem cells (ESCs), multipotent hematopoietic stem cells (HSCs), and mature hematopoietic cells. Quantitative high-resolution microscopy revealed that ESCs contain significantly more euchromatin than HSCs, with a further reduction in mature cells. Increased cellular maturation also led to heterochromatin localization to the nuclear periphery. Functionally, prevention of heterochromatin formation by inhibition of the histone methyltransferase G9A resulted in delayed HSC differentiation. Our results demonstrate global chromatin rearrangements during stem cell differentiation and that heterochromatin formation by H3K9 methylation regulates HSC differentiation.
Collapse
Affiliation(s)
- Fernando Ugarte
- Institute for the Biology of Stem Cells, Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Rebekah Sousae
- Institute for the Biology of Stem Cells, Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Bertrand Cinquin
- National Center for X-ray Tomography, UCSF, San Francisco, CA 94143, USA; Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Eric W Martin
- Institute for the Biology of Stem Cells, Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Jana Krietsch
- Institute for the Biology of Stem Cells, Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Gabriela Sanchez
- Institute for the Biology of Stem Cells, Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Margaux Inman
- Institute for the Biology of Stem Cells, Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Herman Tsang
- Institute for the Biology of Stem Cells, Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Matthew Warr
- The Eli and Edythe Broad Center for Regenerative Medicine and Stem Cell Research, Medicine Department, Hem/Onc Division, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Emmanuelle Passegué
- The Eli and Edythe Broad Center for Regenerative Medicine and Stem Cell Research, Medicine Department, Hem/Onc Division, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Carolyn A Larabell
- National Center for X-ray Tomography, UCSF, San Francisco, CA 94143, USA; Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - E Camilla Forsberg
- Institute for the Biology of Stem Cells, Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
| |
Collapse
|
59
|
Hall JB, Cooke Bailey JN, Hoffman JD, Pericak-Vance MA, Scott WK, Kovach JL, Schwartz SG, Agarwal A, Brantley MA, Haines JL, Bush WS. Estimating cumulative pathway effects on risk for age-related macular degeneration using mixed linear models. BMC Bioinformatics 2015; 16:329. [PMID: 26467978 PMCID: PMC4606903 DOI: 10.1186/s12859-015-0760-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2015] [Accepted: 10/03/2015] [Indexed: 11/13/2022] Open
Abstract
Background Age-related macular degeneration (AMD) is the leading cause of irreversible visual loss in the elderly in developed countries and typically affects more than 10 % of individuals over age 80. AMD has a large genetic component, with heritability estimated to be between 45 % and 70 %. Numerous variants have been identified and implicate various molecular mechanisms and pathways for AMD pathogenesis but those variants only explain a portion of AMD’s heritability. The goal of our study was to estimate the cumulative genetic contribution of common variants on AMD risk for multiple pathways related to the etiology of AMD, including angiogenesis, antioxidant activity, apoptotic signaling, complement activation, inflammatory response, response to nicotine, oxidative phosphorylation, and the tricarboxylic acid cycle. While these mechanisms have been associated with AMD in literature, the overall extent of the contribution to AMD risk for each is unknown. Methods In a case–control dataset with 1,813 individuals genotyped for over 600,000 SNPs we used Genome-wide Complex Trait Analysis (GCTA) to estimate the proportion of AMD risk explained by SNPs in genes associated with each pathway. SNPs within a 50 kb region flanking each gene were also assessed, as well as more distant, putatively regulatory SNPs, based on DNaseI hypersensitivity data from ocular tissue in the ENCODE project. Results We found that 19 previously associated AMD risk SNPs contributed to 13.3 % of the risk for AMD in our dataset, while the remaining genotyped SNPs contributed to 36.7 % of AMD risk. Adjusting for the 19 risk SNPs, the complement activation and inflammatory response pathways still explained a statistically significant proportion of additional risk for AMD (9.8 % and 17.9 %, respectively), with other pathways showing no significant effects (0.3 % – 4.4 %). Discussion Our results show that SNPs associated with complement activation and inflammation significantly contribute to AMD risk, separately from the risk explained by the 19 known risk SNPs. We found that SNPs within 50 kb regions flanking genes explained additional risk beyond genic SNPs, suggesting a potential regulatory role, but that more distant SNPs explained less than 0.5 % additional risk for each pathway. Conclusions From these analyses we find that the impact of complement SNPs on risk for AMD extends beyond the established genome-wide significant SNPs. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0760-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jacob B Hall
- Graduate Program in Human Genetics, Vanderbilt University, Nashville, TN, USA.
| | - Jessica N Cooke Bailey
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA.
| | - Joshua D Hoffman
- Graduate Program in Human Genetics, Vanderbilt University, Nashville, TN, USA.
| | - Margaret A Pericak-Vance
- John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA.
| | - William K Scott
- John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA.
| | - Jaclyn L Kovach
- Department of Ophthalmology at Bascom Palmer Eye Institute, University of Miami Miller School of Medicine, Miami, FL, USA.
| | - Stephen G Schwartz
- Department of Ophthalmology at Bascom Palmer Eye Institute, University of Miami Miller School of Medicine, Miami, FL, USA.
| | - Anita Agarwal
- Department of Ophthalmology and Visual Sciences, Vanderbilt University, Nashville, TN, USA.
| | - Milam A Brantley
- Department of Ophthalmology and Visual Sciences, Vanderbilt University, Nashville, TN, USA.
| | - Jonathan L Haines
- Institute for Computational Biology, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA.
| | - William S Bush
- Institute for Computational Biology, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA.
| |
Collapse
|
60
|
Mijušković M, Chou YF, Gigi V, Lindsay CR, Shestova O, Lewis SM, Roth DB. Off-Target V(D)J Recombination Drives Lymphomagenesis and Is Escalated by Loss of the Rag2 C Terminus. Cell Rep 2015; 12:1842-52. [PMID: 26365182 DOI: 10.1016/j.celrep.2015.08.034] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2015] [Revised: 07/07/2015] [Accepted: 08/08/2015] [Indexed: 11/29/2022] Open
Abstract
Genome-wide analysis of thymic lymphomas from Tp53(-/-) mice with wild-type or C-terminally truncated Rag2 revealed numerous off-target, RAG-mediated DNA rearrangements. A significantly higher fraction of these errors mutated known and suspected oncogenes/tumor suppressor genes than did sporadic rearrangements (p < 0.0001). This tractable mouse model recapitulates recent findings in human pre-B ALL and allows comparison of wild-type and mutant RAG2. Recurrent, RAG-mediated deletions affected Notch1, Pten, Ikzf1, Jak1, Phlda1, Trat1, and Agpat9. Rag2 truncation substantially increased the frequency of off-target V(D)J recombination. The data suggest that interactions between Rag2 and a specific chromatin modification, H3K4me3, support V(D)J recombination fidelity. Oncogenic effects of off-target rearrangements created by this highly regulated recombinase may need to be considered in design of site-specific nucleases engineered for genome modification.
Collapse
Affiliation(s)
- Martina Mijušković
- Department of Pathology and Laboratory Medicine, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Abramson Family Cancer Research Institute, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Department of Genetics and Epidemiology, The Institute of Cancer Research, 15 Cotswold Road, Sutton, Surrey SM2 5NG, UK
| | - Yi-Fan Chou
- Department of Pathology and Laboratory Medicine, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Abramson Family Cancer Research Institute, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Vered Gigi
- Department of Pathology and Laboratory Medicine, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Abramson Family Cancer Research Institute, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; The Boston Consulting Group, 1735 Market Street, Philadelphia, PA 19103, USA
| | - Cory R Lindsay
- Department of Pathology and Laboratory Medicine, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Abramson Family Cancer Research Institute, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Cardeza Foundation for Hematological Research, Thomas Jefferson University, 394 Jefferson Alumni Hall, 1020 Locust Street, Philadelphia, PA 19104, USA
| | - Olga Shestova
- Department of Pathology and Laboratory Medicine, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Abramson Family Cancer Research Institute, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Translational Research Program, Abramson Family Research Cancer Institute, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Susanna M Lewis
- Department of Pathology and Laboratory Medicine, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Abramson Family Cancer Research Institute, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| | - David B Roth
- Department of Pathology and Laboratory Medicine, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Abramson Family Cancer Research Institute, Raymond and Ruth Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| |
Collapse
|
61
|
Lercher L, Raj R, Patel NA, Price J, Mohammed S, Robinson CV, Schofield CJ, Davis BG. Generation of a synthetic GlcNAcylated nucleosome reveals regulation of stability by H2A-Thr101 GlcNAcylation. Nat Commun 2015; 6:7978. [PMID: 26305776 PMCID: PMC4560749 DOI: 10.1038/ncomms8978] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Accepted: 07/02/2015] [Indexed: 02/06/2023] Open
Abstract
O-GlcNAcylation is a newly discovered histone modification implicated in transcriptional regulation, but no structural information on the physical effect of GlcNAcylation on chromatin exists. Here, we generate synthetic, pure GlcNAcylated histones and nucleosomes and reveal that GlcNAcylation can modulate structure through direct destabilization of H2A/H2B dimers in the nucleosome, thus promoting an 'open' chromatin state. The results suggest that a plausible molecular basis for one role of histone O-GlcNAcylation in epigenetic regulation is to lower the barrier for RNA polymerase passage and hence increase transcription.
Collapse
Affiliation(s)
- Lukas Lercher
- Department of Chemistry, University of Oxford, Chemistry Research Laboratory, Mansfield Road, Oxford OX1 3TA, UK
| | - Ritu Raj
- Department of Chemistry, University of Oxford, Chemistry Research Laboratory, Mansfield Road, Oxford OX1 3TA, UK
| | - Nisha A. Patel
- Department of Chemistry, University of Oxford, Physical and Theoretical Chemistry Laboratory, South Parks Road, Oxford OX1 3QZ, UK
| | - Joshua Price
- Department of Chemistry, University of Oxford, Chemistry Research Laboratory, Mansfield Road, Oxford OX1 3TA, UK
| | - Shabaz Mohammed
- Department of Chemistry, University of Oxford, Chemistry Research Laboratory, Mansfield Road, Oxford OX1 3TA, UK
| | - Carol V. Robinson
- Department of Chemistry, University of Oxford, Physical and Theoretical Chemistry Laboratory, South Parks Road, Oxford OX1 3QZ, UK
| | - Christopher J. Schofield
- Department of Chemistry, University of Oxford, Chemistry Research Laboratory, Mansfield Road, Oxford OX1 3TA, UK
| | - Benjamin G. Davis
- Department of Chemistry, University of Oxford, Chemistry Research Laboratory, Mansfield Road, Oxford OX1 3TA, UK
| |
Collapse
|
62
|
Bilgin Sonay T, Carvalho T, Robinson MD, Greminger MP, Krützen M, Comas D, Highnam G, Mittelman D, Sharp A, Marques-Bonet T, Wagner A. Tandem repeat variation in human and great ape populations and its impact on gene expression divergence. Genome Res 2015; 25:1591-9. [PMID: 26290536 PMCID: PMC4617956 DOI: 10.1101/gr.190868.115] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2015] [Accepted: 08/14/2015] [Indexed: 12/20/2022]
Abstract
Tandem repeats (TRs) are stretches of DNA that are highly variable in length and mutate rapidly. They are thus an important source of genetic variation. This variation is highly informative for population and conservation genetics. It has also been associated with several pathological conditions and with gene expression regulation. However, genome-wide surveys of TR variation in humans and closely related species have been scarce due to technical difficulties derived from short-read technology. Here we explored the genome-wide diversity of TRs in a panel of 83 human and nonhuman great ape genomes, in a total of six different species, and studied their impact on gene expression evolution. We found that population diversity patterns can be efficiently captured with short TRs (repeat unit length, 1–5 bp). We examined the potential evolutionary role of TRs in gene expression differences between humans and primates by using 30,275 larger TRs (repeat unit length, 2–50 bp). Genes that contained TRs in the promoters, in their 3′ untranslated region, in introns, and in exons had higher expression divergence than genes without repeats in the regions. Polymorphic small repeats (1–5 bp) had also higher expression divergence compared with genes with fixed or no TRs in the gene promoters. Our findings highlight the potential contribution of TRs to human evolution through gene regulation.
Collapse
Affiliation(s)
- Tugce Bilgin Sonay
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, CH-805 Zurich, Switzerland; The Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Tiago Carvalho
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - Mark D Robinson
- The Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland; Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Maja P Greminger
- Evolutionary Genetics Group, Anthropological Institute and Museum, University of Zurich, CH-8057 Zurich, Switzerland
| | - Michael Krützen
- Evolutionary Genetics Group, Anthropological Institute and Museum, University of Zurich, CH-8057 Zurich, Switzerland
| | - David Comas
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - Gareth Highnam
- Department of Biological Science and Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia 24061, USA
| | - David Mittelman
- Department of Biological Science and Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia 24061, USA
| | - Andrew Sharp
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai School, New York, New York 10029, USA
| | - Tomàs Marques-Bonet
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003 Barcelona, Spain; Centro Nacional de Análisis Genómico (CNAG), PCB, Barcelona, 08028 Catalonia, Spain; Catalan Institution for Research and Advanced Studies (ICREA), 08010 Barcelona, Spain
| | - Andreas Wagner
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, CH-805 Zurich, Switzerland; The Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland; The Santa Fe Institute, Santa Fe, New Mexico 87501, USA
| |
Collapse
|
63
|
Tumor necrosis factor-α confers cardioprotection through ectopic expression of keratins K8 and K18. Nat Med 2015; 21:1076-84. [PMID: 26280121 DOI: 10.1038/nm.3925] [Citation(s) in RCA: 77] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Accepted: 07/15/2015] [Indexed: 12/18/2022]
Abstract
Tumor necrosis factor-α (TNF-α), one of the major stress-induced proinflammatory cytokines, is upregulated in the heart after tissue injury, and its sustained expression can contribute to the development of heart failure. Whether TNF-α also exerts cytoprotective effects in heart failure is not known. Here we provide evidence for a cardioprotective function of TNF-α in a genetic heart failure model, desmin-deficient mice. The cardioprotective effects of TNF-α are a consequence of nuclear factor-κB (NF-κB)-mediated ectopic expression in cardiomyocytes of keratin 8 (K8) and keratin 18 (K18), two epithelial-specific intermediate filament proteins. In cardiomyocytes, K8 and K18 (K8/K18) formed an alternative cytoskeletal network that localized mainly at intercalated discs (IDs) and conferred cardioprotection by maintaining normal ID structure and mitochondrial integrity and function. Ectopic induction of K8/K18 expression in cardiomyocytes also occurred in other genetic and experimental models of heart failure. Loss of the K8/K18 network resulted in a maladaptive cardiac phenotype following transverse aortic constriction. In human failing myocardium, where TNF-α expression is upregulated, K8/K18 were also ectopically expressed and localized primarily at IDs, which did not contain detectable amounts of desmin. Thus, TNF-α- and NF-κB-mediated formation of an alternative, stress-induced intermediate filament cytoskeleton has cardioprotective function in mice and potentially in humans.
Collapse
|
64
|
Stavreva DA, Coulon A, Baek S, Sung MH, John S, Stixova L, Tesikova M, Hakim O, Miranda T, Hawkins M, Stamatoyannopoulos JA, Chow CC, Hager GL. Dynamics of chromatin accessibility and long-range interactions in response to glucocorticoid pulsing. Genome Res 2015; 25:845-57. [PMID: 25677181 PMCID: PMC4448681 DOI: 10.1101/gr.184168.114] [Citation(s) in RCA: 66] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2014] [Accepted: 02/05/2015] [Indexed: 12/20/2022]
Abstract
Although physiological steroid levels are often pulsatile (ultradian), the genomic effects of this pulsatility are poorly understood. By utilizing glucocorticoid receptor (GR) signaling as a model system, we uncovered striking spatiotemporal relationships between receptor loading, lifetimes of the DNase I hypersensitivity sites (DHSs), long-range interactions, and gene regulation. We found that hormone-induced DHSs were enriched within ± 50 kb of GR-responsive genes and displayed a broad spectrum of lifetimes upon hormone withdrawal. These lifetimes dictate the strength of the DHS interactions with gene targets and contribute to gene regulation from a distance. Our results demonstrate that pulsatile and constant hormone stimulations induce unique, treatment-specific patterns of gene and regulatory element activation. These modes of activation have implications for corticosteroid function in vivo and for steroid therapies in various clinical settings.
Collapse
Affiliation(s)
- Diana A Stavreva
- Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| | - Antoine Coulon
- Laboratory of Biological Modeling, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, Maryland 20892, USA
| | - Songjoon Baek
- Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| | - Myong-Hee Sung
- Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| | - Sam John
- Laboratory of Genome Integrity, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| | - Lenka Stixova
- Department of Molecular Cytology and Cytometry, Institute of Biophysics, Academy of Sciences of the Czech Republic, 612 65 Brno, Czech Republic
| | - Martina Tesikova
- Department of Biosciences, University of Oslo, 0316 Oslo, Norway
| | - Ofir Hakim
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan 5290002, Israel
| | - Tina Miranda
- Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| | - Mary Hawkins
- Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| | | | - Carson C Chow
- Laboratory of Biological Modeling, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, Maryland 20892, USA
| | - Gordon L Hager
- Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| |
Collapse
|
65
|
Wang C, Lv Y, Wang B, Yin C, Lin Y, Pan L. Survey of protein-DNA interactions in Aspergillus oryzae on a genomic scale. Nucleic Acids Res 2015; 43:4429-46. [PMID: 25883143 PMCID: PMC4482085 DOI: 10.1093/nar/gkv334] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 03/31/2015] [Indexed: 01/23/2023] Open
Abstract
The genome-scale delineation of in vivo protein–DNA interactions is key to understanding genome function. Only ∼5% of transcription factors (TFs) in the Aspergillus genus have been identified using traditional methods. Although the Aspergillus oryzae genome contains >600 TFs, knowledge of the in vivo genome-wide TF-binding sites (TFBSs) in aspergilli remains limited because of the lack of high-quality antibodies. We investigated the landscape of in vivo protein–DNA interactions across the A. oryzae genome through coupling the DNase I digestion of intact nuclei with massively parallel sequencing and the analysis of cleavage patterns in protein–DNA interactions at single-nucleotide resolution. The resulting map identified overrepresented de novo TF-binding motifs from genomic footprints, and provided the detailed chromatin remodeling patterns and the distribution of digital footprints near transcription start sites. The TFBSs of 19 known Aspergillus TFs were also identified based on DNase I digestion data surrounding potential binding sites in conjunction with TF binding specificity information. We observed that the cleavage patterns of TFBSs were dependent on the orientation of TF motifs and independent of strand orientation, consistent with the DNA shape features of binding motifs with flanking sequences.
Collapse
Affiliation(s)
- Chao Wang
- School of Bioscience and Bioengineering, South China University of Technology, Guangzhou, Guangdong, 510006, China
| | - Yangyong Lv
- School of Bioscience and Bioengineering, South China University of Technology, Guangzhou, Guangdong, 510006, China
| | - Bin Wang
- School of Bioscience and Bioengineering, South China University of Technology, Guangzhou, Guangdong, 510006, China
| | - Chao Yin
- School of Bioscience and Bioengineering, South China University of Technology, Guangzhou, Guangdong, 510006, China
| | - Ying Lin
- School of Bioscience and Bioengineering, South China University of Technology, Guangzhou, Guangdong, 510006, China
| | - Li Pan
- School of Bioscience and Bioengineering, South China University of Technology, Guangzhou, Guangdong, 510006, China
| |
Collapse
|
66
|
High resolution mapping of enhancer-promoter interactions. PLoS One 2015; 10:e0122420. [PMID: 25970635 PMCID: PMC4430501 DOI: 10.1371/journal.pone.0122420] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Accepted: 02/20/2015] [Indexed: 01/19/2023] Open
Abstract
RNA Polymerase II ChIA-PET data has revealed enhancers that are active in a profiled cell type and the genes that the enhancers regulate through chromatin interactions. The most commonly used computational method for analyzing ChIA-PET data, the ChIA-PET Tool, discovers interaction anchors at a spatial resolution that is insufficient to accurately identify individual enhancers. We introduce Germ, a computational method that estimates the likelihood that any two narrowly defined genomic locations are jointly occupied by RNA Polymerase II. Germ takes a blind deconvolution approach to simultaneously estimate the likelihood of RNA Polymerase II occupation as well as a model of the arrangement of read alignments relative to locations occupied by RNA Polymerase II. Both types of information are utilized to estimate the likelihood that RNA Polymerase II jointly occupies any two genomic locations. We apply Germ to RNA Polymerase II ChIA-PET data from embryonic stem cells to identify the genomic locations that are jointly occupied along with transcription start sites. We show that these genomic locations align more closely with features of active enhancers measured by ChIP-Seq than the locations identified using the ChIA-PET Tool. We also apply Germ to RNA Polymerase II ChIA-PET data from motor neuron progenitors. Based on the Germ results, we observe that a combination of cell type specific and cell type independent regulatory interactions are utilized by cells to regulate gene expression.
Collapse
|
67
|
Provenzi L, Fumagalli M, Sirgiovanni I, Giorda R, Pozzoli U, Morandi F, Beri S, Menozzi G, Mosca F, Borgatti R, Montirosso R. Pain-related stress during the Neonatal Intensive Care Unit stay and SLC6A4 methylation in very preterm infants. Front Behav Neurosci 2015; 9:99. [PMID: 25941480 PMCID: PMC4403508 DOI: 10.3389/fnbeh.2015.00099] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Accepted: 04/04/2015] [Indexed: 11/13/2022] Open
Abstract
Very preterm (VPT) infants need long-lasting hospitalization in the Neonatal Intensive Care Unit (NICU) during which they are daily exposed to pain-related stress. Alterations of DNA methylation at the promoter region of the SLC6A4 have been associated with early adverse experiences in infants. The main aim of the present work was to investigate the association between level of exposure to pain-related stress during hospitalization and changes in SLC6A4 DNA methylation at NICU discharge in VPT infants. In order to exclude the potential effect of birth status (i.e., preterm vs. full-term birth) on SLC6A4 methylation, we preliminarily assessed SLC6A4 epigenetic differences between VPT and full-term (FT) infants at birth. Fifty-six VPT and thirty-two FT infants participated in the study. The level of exposure to pain-related stress was quantified on the basis of the amount of skin-breaking procedures to which they were exposed. VPT infants were divided in two sub-groups: low-pain exposure (LPE, N = 25) and high-pain exposure (HPE, N = 31). DNA methylation was evaluated at birth for both VPT and FT infants, assessing 20 CpG sites within the SLC6A4 promoter region. The same CpG sites were re-evaluated for variations in DNA methylation at NICU discharge in LPE and HPE VPT infants. No differences in SLC6A4 CpG sites' methylation emerged between FT and VPT infants at birth. Methylation at CpG sites 5 and 6 significantly increased from birth to NICU discharge only for HPE VPT infants. Findings show that preterm birth per se is not associated with epigenetic alterations of the SLC6A4, whereas higher levels of pain-related stress exposure during NICU stay might alter the transcriptional functionality of the serotonin transporter gene.
Collapse
Affiliation(s)
- Livio Provenzi
- 0-3 Center for the Study of Social Emotional Development of the at-Risk Infant, Scientific Institute, IRCCS Eugenio Medea Bosisio Parini (LC), Italy
| | - Monica Fumagalli
- NICU, Department of Clinical Sciences and Community Health, Università degli Studi di Milano, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico Milan, Italy
| | - Ida Sirgiovanni
- NICU, Department of Clinical Sciences and Community Health, Università degli Studi di Milano, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico Milan, Italy
| | - Roberto Giorda
- Molecular Biology Lab, Scientific Institute, IRCCS Eugenio Medea Bosisio Parini (LC), Italy
| | - Uberto Pozzoli
- Bioinformatic Lab, Scientific Institute, IRCCS Eugenio Medea Bosisio Parini (LC), Italy
| | | | - Silvana Beri
- Molecular Biology Lab, Scientific Institute, IRCCS Eugenio Medea Bosisio Parini (LC), Italy
| | - Giorgia Menozzi
- Bioinformatic Lab, Scientific Institute, IRCCS Eugenio Medea Bosisio Parini (LC), Italy
| | - Fabio Mosca
- NICU, Department of Clinical Sciences and Community Health, Università degli Studi di Milano, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico Milan, Italy
| | - Renato Borgatti
- Neuropsychiatry and Neurorehabilitation Unit, Scientific Institute, IRCCS Eugenio Medea Bosisio Parini (LC), Italy
| | - Rosario Montirosso
- 0-3 Center for the Study of Social Emotional Development of the at-Risk Infant, Scientific Institute, IRCCS Eugenio Medea Bosisio Parini (LC), Italy
| |
Collapse
|
68
|
Wilken MS, Brzezinski JA, La Torre A, Siebenthall K, Thurman R, Sabo P, Sandstrom RS, Vierstra J, Canfield TK, Hansen RS, Bender MA, Stamatoyannopoulos J, Reh TA. DNase I hypersensitivity analysis of the mouse brain and retina identifies region-specific regulatory elements. Epigenetics Chromatin 2015; 8:8. [PMID: 25972927 PMCID: PMC4429822 DOI: 10.1186/1756-8935-8-8] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2014] [Accepted: 01/27/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The brain, spinal cord, and neural retina comprise the central nervous system (CNS) of vertebrates. Understanding the regulatory mechanisms that underlie the enormous cell-type diversity of the CNS is a significant challenge. Whole-genome mapping of DNase I-hypersensitive sites (DHSs) has been used to identify cis-regulatory elements in many tissues. We have applied this approach to the mouse CNS, including developing and mature neural retina, whole brain, and two well-characterized brain regions, the cerebellum and the cerebral cortex. RESULTS For the various regions and developmental stages of the CNS that we analyzed, there were approximately the same number of DHSs; however, there were many DHSs unique to each CNS region and developmental stage. Many of the DHSs are likely to mark enhancers that are specific to the specific CNS region and developmental stage. We validated the DNase I mapping approach for identification of CNS enhancers using the existing VISTA Browser database and with in vivo and in vitro electroporation of the retina. Analysis of transcription factor consensus sites within the DHSs shows distinct region-specific profiles of transcriptional regulators particular to each region. Clustering developmentally dynamic DHSs in the retina revealed enrichment of developmental stage-specific transcriptional regulators. Additionally, we found reporter gene activity in the retina driven from several previously uncharacterized regulatory elements surrounding the neurodevelopmental gene Otx2. Identification of DHSs shared between mouse and human showed region-specific differences in the evolution of cis-regulatory elements. CONCLUSIONS Overall, our results demonstrate the potential of genome-wide DNase I mapping to cis-regulatory questions regarding the regional diversity within the CNS. These data represent an extensive catalogue of potential cis-regulatory elements within the CNS that display region and temporal specificity, as well as a set of DHSs common to CNS tissues. Further examination of evolutionary conservation of DHSs between CNS regions and different species may reveal important cis-regulatory elements in the evolution of the mammalian CNS.
Collapse
Affiliation(s)
- Matthew S Wilken
- Department of Biological Structure, University of Washington, 1959 NE Pacific Street, Box 357420, Seattle, WA 98195 USA ; Molecular and Cellular Biology Program, University of Washington, MCB Program Office, T-466 Health Sciences Building, Box 357275, Seattle, WA 98195 USA
| | - Joseph A Brzezinski
- Department of Biological Structure, University of Washington, 1959 NE Pacific Street, Box 357420, Seattle, WA 98195 USA ; Department of Ophthalmology, University of Colorado School of Medicine, 1675 Aurora Court, Aurora, CO 80045 USA
| | - Anna La Torre
- Department of Biological Structure, University of Washington, 1959 NE Pacific Street, Box 357420, Seattle, WA 98195 USA
| | - Kyle Siebenthall
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Robert Thurman
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Peter Sabo
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Richard S Sandstrom
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Theresa K Canfield
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - R Scott Hansen
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Michael A Bender
- Department of Pediatrics, University of Washington, 1959 NE Pacific St, Health Sciences Building, Seattle, WA Box 356320, 98195 USA ; Clinical Research Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109 USA
| | - John Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Thomas A Reh
- Department of Biological Structure, University of Washington, 1959 NE Pacific Street, Box 357420, Seattle, WA 98195 USA
| |
Collapse
|
69
|
Babaei S, Akhtar W, de Jong J, Reinders M, de Ridder J. 3D hotspots of recurrent retroviral insertions reveal long-range interactions with cancer genes. Nat Commun 2015; 6:6381. [PMID: 25721899 PMCID: PMC4351571 DOI: 10.1038/ncomms7381] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2014] [Accepted: 01/26/2015] [Indexed: 11/09/2022] Open
Abstract
Genomically distal mutations can contribute to the deregulation of cancer genes by engaging in chromatin interactions. To study this, we overlay viral cancer-causing insertions obtained in a murine retroviral insertional mutagenesis screen with genome-wide chromatin conformation capture data. Here we find that insertions tend to cluster in 3D hotspots within the nucleus. The identified hotspots are significantly enriched for known cancer genes, and bear the expected characteristics of bona fide regulatory interactions, such as enrichment for transcription factor-binding sites. In addition, we observe a striking pattern of mutual exclusive integration. This is an indication that insertions in these loci target the same gene, either in their linear genomic vicinity or in their 3D spatial vicinity. Our findings shed new light on the repertoire of targets obtained from insertional mutagenesis screening and underline the importance of considering the genome as a 3D structure when studying effects of genomic perturbations.
Collapse
Affiliation(s)
- Sepideh Babaei
- Delft Bioinformatics Lab, Faculty of Electrical Engineering Mathematics and Computer Science, Delft University of Technology, Mekelweg 4, 2628 CD Delft, The Netherlands
| | - Waseem Akhtar
- Division of Molecular Genetics, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Johann de Jong
- Division of Molecular Carcinogenesis, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Marcel Reinders
- Delft Bioinformatics Lab, Faculty of Electrical Engineering Mathematics and Computer Science, Delft University of Technology, Mekelweg 4, 2628 CD Delft, The Netherlands
| | - Jeroen de Ridder
- Delft Bioinformatics Lab, Faculty of Electrical Engineering Mathematics and Computer Science, Delft University of Technology, Mekelweg 4, 2628 CD Delft, The Netherlands
| |
Collapse
|
70
|
Drillon G, Audit B, Argoul F, Arneodo A. Ubiquitous human 'master' origins of replication are encoded in the DNA sequence via a local enrichment in nucleosome excluding energy barriers. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2015; 27:064102. [PMID: 25563930 DOI: 10.1088/0953-8984/27/6/064102] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
As the elementary building block of eukaryotic chromatin, the nucleosome is at the heart of the compromise between the necessity of compacting DNA in the cell nucleus and the required accessibility to regulatory proteins. The recent availability of genome-wide experimental maps of nucleosome positions for many different organisms and cell types has provided an unprecedented opportunity to elucidate to what extent the DNA sequence conditions the primary structure of chromatin and in turn participates in the chromatin-mediated regulation of nuclear functions, such as gene expression and DNA replication. In this study, we use in vivo and in vitro genome-wide nucleosome occupancy data together with the set of nucleosome-free regions (NFRs) predicted by a physical model of nucleosome formation based on sequence-dependent bending properties of the DNA double-helix, to investigate the role of intrinsic nucleosome occupancy in the regulation of the replication spatio-temporal programme in human. We focus our analysis on the so-called replication U/N-domains that were shown to cover about half of the human genome in the germline (skew-N domains) as well as in embryonic stem cells, somatic and HeLa cells (mean replication timing U-domains). The 'master' origins of replication (MaOris) that border these megabase-sized U/N-domains were found to be specified by a few hundred kb wide regions that are hyper-sensitive to DNase I cleavage, hypomethylated, and enriched in epigenetic marks involved in transcription regulation, the hallmarks of localized open chromatin structures. Here we show that replication U/N-domain borders that are conserved in all considered cell lines have an environment highly enriched in nucleosome-excluding-energy barriers, suggesting that these ubiquitous MaOris have been selected during evolution. In contrast, MaOris that are cell-type-specific are mainly regulated epigenetically and are no longer favoured by a local abundance of intrinsic NFRs encoded in the DNA sequence. At the smaller few hundred bp scale of gene promoters, CpG-rich promoters of housekeeping genes found nearby ubiquitous MaOris as well as CpG-poor promoters of tissue-specific genes found nearby cell-type-specific MaOris, both correspond to in vivo NFRs that are not coded as nucleosome-excluding-energy barriers. Whereas the former promoters are likely to correspond to high occupancy transcription factor binding regions, the latter are an illustration that gene regulation in human is typically cell-type-specific.
Collapse
Affiliation(s)
- Guénola Drillon
- Université de Lyon, F-69000 Lyon, France. Laboratoire de Physique, CNRS UMR 5672, École Normale Supérieure de Lyon, F-69007 Lyon, France
| | | | | | | |
Collapse
|
71
|
Wan J, Oliver VF, Wang G, Zhu H, Zack DJ, Merbs SL, Qian J. Characterization of tissue-specific differential DNA methylation suggests distinct modes of positive and negative gene expression regulation. BMC Genomics 2015; 16:49. [PMID: 25652663 PMCID: PMC4331481 DOI: 10.1186/s12864-015-1271-4] [Citation(s) in RCA: 110] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2014] [Accepted: 01/22/2015] [Indexed: 01/20/2023] Open
Abstract
Background DNA methylation plays an important role in regulating gene expression during many biological processes. However, the mechanism of DNA-methylation-dependent gene regulation is not fully understood. Here, we explore two possible DNA methylation regulatory mechanisms with opposite modes of gene expression regulation. Results By comparing the genome-wide methylation and expression patterns in different tissues, we find that majority of tissue-specific differentially methylated regions (T-DMRs) are negatively correlated with expression of their associated genes (negative T-DMRs), consistent with the classical dogma that DNA methylation suppresses gene expression; however, a significant portion of T-DMRs are positively correlated with gene expression (positive T-DMRs). We observe that the positive T-DMRs have similar genomic location as negative T-DMRs, except that the positive T-DMRs are more enriched in the promoter regions. Both positive and negative T-DMRs are enriched in DNase I hypersensitivity sites (DHSs), suggesting that both are likely to be functional. The CpG sites of both positive and negative T-DMRs are also more evolutionarily conserved than the genomic background. Interestingly, the putative target genes of the positive T-DMR are enriched for negative regulators such as transcriptional repressors, suggesting a novel mode of indirect DNA methylation inhibition of expression through transcriptional repressors. Likewise, two distinct sets of DNA sequence motifs exist for positive and negative T-DMRs, suggesting that two distinct sets of transcription factors (TFs) are involved in positive and negative regulation mediated by DNA methylation. Conclusions We find both negative and positive association between T-DMRs and gene expression, which implies the existence of two different mechanisms of DNA methylation-dependent gene regulation. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1271-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jun Wan
- Department of Ophthalmology, Wilmer Institute, Johns Hopkins University School of Medicine, Baltimore, MA, USA.
| | - Verity F Oliver
- Department of Ophthalmology, Wilmer Institute, Johns Hopkins University School of Medicine, Baltimore, MA, USA.
| | - Guohua Wang
- Department of Ophthalmology, Wilmer Institute, Johns Hopkins University School of Medicine, Baltimore, MA, USA.
| | - Heng Zhu
- Department of Pharmacology and Molecular Science, Johns Hopkins University School of Medicine, Baltimore, MA, USA.
| | - Donald J Zack
- Department of Ophthalmology, Wilmer Institute, Johns Hopkins University School of Medicine, Baltimore, MA, USA. .,Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MA, USA. .,Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MA, USA. .,Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MA, USA. .,Institut de la Vision, Université Pierre et Marie Curie, 17 rue Moreau, Paris, France.
| | - Shannath L Merbs
- Department of Ophthalmology, Wilmer Institute, Johns Hopkins University School of Medicine, Baltimore, MA, USA.
| | - Jiang Qian
- Department of Ophthalmology, Wilmer Institute, Johns Hopkins University School of Medicine, Baltimore, MA, USA.
| |
Collapse
|
72
|
Abstract
Genomic regions associated with regulatory proteins are known to be highly sensitive to DNase I digestion and are termed DNase I hypersensitive sites (DHSs). DHSs can be identified by DNase I digestion followed by high-throughput DNA sequencing (DNase-seq). DNase-seq has become a powerful technique for genome-wide mapping of chromatin accessibility in eukaryotes with a sequenced genome. We have developed a DNase-seq procedure in plants. This procedure was adapted from the protocol originally developed for mammalian cell lines. It includes plant nuclei isolation, digestion of purified nuclei with DNase I, recovery of DNase-trimmed DNA fragments, DNase-seq library development, Illumina sequencing and data analysis. We also introduce a barcoding system for library preparation. We have conducted DNase-seq in both Arabidopsis thaliana and rice, and developed genome-wide open chromatin maps in both species. These DHS datasets have been used to detect footprints from regulatory protein binding and to reveal genome-wide nucleosome positioning patterns.
Collapse
Affiliation(s)
- Wenli Zhang
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI, 53706-1580, USA
| | | |
Collapse
|
73
|
Chen PB, Zhu LJ, Hainer SJ, McCannell KN, Fazzio TG. Unbiased chromatin accessibility profiling by RED-seq uncovers unique features of nucleosome variants in vivo. BMC Genomics 2014; 15:1104. [PMID: 25494698 PMCID: PMC4378318 DOI: 10.1186/1471-2164-15-1104] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Accepted: 12/10/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Differential accessibility of DNA to nuclear proteins underlies the regulation of numerous cellular processes. Although DNA accessibility is primarily determined by the presence or absence of nucleosomes, differences in nucleosome composition or dynamics may also regulate accessibility. Methods for mapping nucleosome positions and occupancies genome-wide (MNase-seq) have uncovered the nucleosome landscapes of many different cell types and organisms. Conversely, methods specialized for the detection of large nucleosome-free regions of chromatin (DNase-seq, FAIRE-seq) have uncovered numerous gene regulatory elements. However, these methods are less successful in measuring the accessibility of DNA sequences within nucelosome arrays. RESULTS Here we probe the genome-wide accessibility of multiple cell types in an unbiased manner using restriction endonuclease digestion of chromatin coupled to deep sequencing (RED-seq). Using this method, we identified differences in chromatin accessibility between populations of cells, not only in nucleosome-depleted regions of the genome (e.g., enhancers and promoters), but also within the majority of the genome that is packaged into nucleosome arrays. Furthermore, we identified both large differences in chromatin accessibility in distinct cell lineages and subtle but significant changes during differentiation of mouse embryonic stem cells (ESCs). Most significantly, using RED-seq, we identified differences in accessibility among nucleosomes harboring well-studied histone variants, and show that these differences depend on factors required for their deposition. CONCLUSIONS Using an unbiased method to probe chromatin accessibility genome-wide, we uncover unique features of chromatin structure that are not observed using more widely-utilized methods. We demonstrate that different types of nucleosomes within mammalian cells exhibit different degrees of accessibility. These findings provide significant insight into the regulation of DNA accessibility.
Collapse
Affiliation(s)
| | | | | | | | - Thomas G Fazzio
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| |
Collapse
|
74
|
Mourad R, Hsu PY, Juan L, Shen C, Koneru P, Lin H, Liu Y, Nephew K, Huang TH, Li L. Estrogen induces global reorganization of chromatin structure in human breast cancer cells. PLoS One 2014; 9:e113354. [PMID: 25470140 PMCID: PMC4255042 DOI: 10.1371/journal.pone.0113354] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2014] [Accepted: 10/22/2014] [Indexed: 12/21/2022] Open
Abstract
In the cell nucleus, each chromosome is confined to a chromosome territory. This spatial organization of chromosomes plays a crucial role in gene regulation and genome stability. An additional level of organization has been discovered at the chromosome scale: the spatial segregation into open and closed chromatins to form two genome-wide compartments. Although considerable progress has been made in our knowledge of chromatin organization, a fundamental issue remains the understanding of its dynamics, especially in cancer. To address this issue, we performed genome-wide mapping of chromatin interactions (Hi-C) over the time after estrogen stimulation of breast cancer cells. To biologically interpret these interactions, we integrated with estrogen receptor (ERα) binding events, gene expression and epigenetic marks. We show that gene-rich chromosomes as well as areas of open and highly transcribed chromatins are rearranged to greater spatial proximity, thus enabling genes to share transcriptional machinery and regulatory elements. At a smaller scale, differentially interacting loci are enriched for cancer proliferation and estrogen-related genes. Moreover, these loci are correlated with higher ERα binding events and gene expression. Taken together these results reveal the role of a hormone - estrogen - on genome organization, and its effect on gene regulation in cancer.
Collapse
Affiliation(s)
- Raphaël Mourad
- Department of Medical and Molecular Genetics, Center for Computational Biology and Bioinformatics, Indiana School of Medicine, Indiana University, Indianapolis, IN, 46202, United States of America
| | - Pei-Yin Hsu
- Departments of Molecular Medicine/Institute of Biotechnology, University of Texas Health Science Center, San Antonio, TX, 78245, United States of America
| | - Liran Juan
- Department of Medical and Molecular Genetics, Center for Computational Biology and Bioinformatics, Indiana School of Medicine, Indiana University, Indianapolis, IN, 46202, United States of America
| | - Changyu Shen
- Department of Biostatistics, Center for Computational Biology and Bioinformatics, Indiana School of Medicine, Indiana University, Indianapolis, IN, 46202, United States of America
| | - Prasad Koneru
- Department of Medical and Molecular Genetics, Center for Computational Biology and Bioinformatics, Indiana School of Medicine, Indiana University, Indianapolis, IN, 46202, United States of America
| | - Hai Lin
- Department of Medical and Molecular Genetics, Center for Computational Biology and Bioinformatics, Indiana School of Medicine, Indiana University, Indianapolis, IN, 46202, United States of America
| | - Yunlong Liu
- Department of Medical and Molecular Genetics, Center for Computational Biology and Bioinformatics, Indiana School of Medicine, Indiana University, Indianapolis, IN, 46202, United States of America
| | - Kenneth Nephew
- Laboratory of Ovarian Cancer Epigenomics, Indiana University, Bloomington, IN, 47405, United States of America
| | - Tim H. Huang
- Departments of Molecular Medicine/Institute of Biotechnology, University of Texas Health Science Center, San Antonio, TX, 78245, United States of America
| | - Lang Li
- Department of Medical and Molecular Genetics, Center for Computational Biology and Bioinformatics, Indiana School of Medicine, Indiana University, Indianapolis, IN, 46202, United States of America
- * E-mail:
| |
Collapse
|
75
|
Tsompana M, Buck MJ. Chromatin accessibility: a window into the genome. Epigenetics Chromatin 2014; 7:33. [PMID: 25473421 PMCID: PMC4253006 DOI: 10.1186/1756-8935-7-33] [Citation(s) in RCA: 261] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Accepted: 11/05/2014] [Indexed: 01/09/2023] Open
Abstract
Transcriptional activation throughout the eukaryotic lineage has been tightly linked with disruption of nucleosome organization at promoters, enhancers, silencers, insulators and locus control regions due to transcription factor binding. Regulatory DNA thus coincides with open or accessible genomic sites of remodeled chromatin. Current chromatin accessibility assays are used to separate the genome by enzymatic or chemical means and isolate either the accessible or protected locations. The isolated DNA is then quantified using a next-generation sequencing platform. Wide application of these assays has recently focused on the identification of the instrumental epigenetic changes responsible for differential gene expression, cell proliferation, functional diversification and disease development. Here we discuss the limitations and advantages of current genome-wide chromatin accessibility assays with especial attention on experimental precautions and sequence data analysis. We conclude with our perspective on future improvements necessary for moving the field of chromatin profiling forward.
Collapse
Affiliation(s)
- Maria Tsompana
- New York State Center of Excellence in Bioinformatics and Life Sciences, State University of New York at Buffalo, 701 Ellicott St, Buffalo, NY 14203 USA
| | - Michael J Buck
- New York State Center of Excellence in Bioinformatics and Life Sciences, State University of New York at Buffalo, 701 Ellicott St, Buffalo, NY 14203 USA ; Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY USA
| |
Collapse
|
76
|
Fortini BK, Tring S, Plummer SJ, Edlund CK, Moreno V, Bresalier RS, Barry EL, Church TR, Figueiredo JC, Casey G. Multiple functional risk variants in a SMAD7 enhancer implicate a colorectal cancer risk haplotype. PLoS One 2014; 9:e111914. [PMID: 25375357 PMCID: PMC4223076 DOI: 10.1371/journal.pone.0111914] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2014] [Accepted: 10/01/2014] [Indexed: 12/31/2022] Open
Abstract
Genome-wide association studies (GWAS) of colorectal cancer (CRC) have led to the identification of a number of common variants associated with modest risk. Several risk variants map within the vicinity of TGFβ/BMP signaling pathway genes, including rs4939827 within an intron of SMAD7 at 18q21.1. A previous study implicated a novel SNP (novel 1 or rs58920878) as a functional variant within an enhancer element in SMAD7 intron 4. In this study, we show that four SNPs including novel 1 (rs6507874, rs6507875, rs8085824, and rs58920878) in linkage disequilibrium (LD) with the index SNP rs4939827 demonstrate allele-specific enhancer effects in a large, multi-component enhancer of SMAD7. All four SNPs demonstrate allele-specific protein binding to nuclear extracts of CRC cell lines. Furthermore, some of the risk-associated alleles correlate with increased expression of SMAD7 in normal colon tissues. Finally, we show that the enhancer is responsive to BMP4 stimulation. Taken together, we propose that the associated CRC risk at 18q21.1 is due to four functional variants that regulate SMAD7 expression and potentially perturb a BMP negative feedback loop in TGFβ/BMP signaling pathways.
Collapse
Affiliation(s)
- Barbara K. Fortini
- Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
- * E-mail:
| | - Stephanie Tring
- Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Sarah J. Plummer
- Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Christopher K. Edlund
- Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Victor Moreno
- Cancer Prevention and Control Program, Catalan Institute of Oncology, CIBERESP and University of Barcelona, Hospitalet de Llobregat, Barcelona, Spain
| | - Robert S. Bresalier
- Department of Gastroenterology, Hepatology and Nutrition, University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America
| | - Elizabeth L. Barry
- Department of Community and Family Medicine, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, United States of America
| | - Timothy R. Church
- Division of Environmental Health Sciences, University of Minnesota School of Public Health, Minneapolis, Minnesota, United States of America
| | - Jane C. Figueiredo
- Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Graham Casey
- Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| |
Collapse
|
77
|
Ryan NM, Morris SW, Porteous DJ, Taylor MS, Evans KL. SuRFing the genomics wave: an R package for prioritising SNPs by functionality. Genome Med 2014; 6:79. [PMID: 25400697 PMCID: PMC4224693 DOI: 10.1186/s13073-014-0079-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2014] [Accepted: 09/26/2014] [Indexed: 12/16/2022] Open
Abstract
Identifying functional non-coding variants is one of the greatest unmet challenges in genetics. To help address this, we introduce an R package, SuRFR, which integrates functional annotation and prior biological knowledge to prioritise candidate functional variants. SuRFR is publicly available, modular, flexible, fast, and simple to use. We demonstrate that SuRFR performs with high sensitivity and specificity and provide a widely applicable and scalable benchmarking dataset for model training and validation. Website: http://www.cgem.ed.ac.uk/resources/
Collapse
Affiliation(s)
- Niamh M Ryan
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, The University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU UK
| | - Stewart W Morris
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, The University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU UK
| | - David J Porteous
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, The University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU UK ; Centre for Cognitive Ageing and Cognitive Epidemiology, The University of Edinburgh, 7 George Square, Edinburgh, EH8 9JZ UK
| | - Martin S Taylor
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, The University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU UK
| | - Kathryn L Evans
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, The University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU UK ; Centre for Cognitive Ageing and Cognitive Epidemiology, The University of Edinburgh, 7 George Square, Edinburgh, EH8 9JZ UK
| |
Collapse
|
78
|
Abstract
The widespread adoption of short-read DNA sequencing as a digital epigenomic readout platform has motivated the development of genome-wide tools that achieve base-pair resolution. New methods for footprinting and affinity purification of nucleosomes, RNA polymerases, chromatin remodellers and transcription factors have increased the resolution of epigenomic profiling by two orders of magnitude, leading to new insights into how the chromatin landscape affects gene regulation. These digital epigenomic tools have also been applied to directly profile both turnover kinetics and transcription in situ. In this Review, we describe how these new genome-wide tools allow interrogation of diverse aspects of the epigenome.
Collapse
|
79
|
Luyten A, Zang C, Liu XS, Shivdasani RA. Active enhancers are delineated de novo during hematopoiesis, with limited lineage fidelity among specified primary blood cells. Genes Dev 2014; 28:1827-39. [PMID: 25128499 PMCID: PMC4197967 DOI: 10.1101/gad.240101.114] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Tissues may adopt diverse strategies to establish specific transcriptional programs in daughter lineages. In blood, it is unclear when chromatin becomes permissive for transcription of genes expressed in distinct lineages. Luyten et al. present genome-wide analyses on chromatin dynamics and transcription factor binding at successive stages in primary mouse blood cell differentiation. The results reveal surprising features of enhancer delineation in a self-renewing tissue, distinct from the strategy applied in self-renewing intestinal crypts. This study provides a useful foundation to consider classical observations on cellular reprogramming and lineage plasticity in hematopoiesis. Tissues may adopt diverse strategies to establish specific transcriptional programs in daughter lineages. In intestinal crypts, enhancers for genes expressed in both major cell types appear broadly permissive in stem and specified progenitor cells. In blood, another self-renewing tissue, it is unclear when chromatin becomes permissive for transcription of genes expressed in distinct terminal lineages. Using chromatin immunoprecipitation (ChIP) combined with deep sequencing (ChIP-seq) to profile activating histone marks, we studied enhancer dynamics in primary mouse blood stem, progenitor, and specified cells. Stem and multipotent progenitor cells show scant H3K4me2 marking at enhancers bound by specific transcription factors in their committed progeny. Rather, enhancers are modulated dynamically and serially, with substantial loss and gain of H3K4me2, at each cellular transition. Quantitative analysis of these dynamics accurately modeled hematopoiesis according to Waddington’s notion of epigenotypes. Delineation of enhancers in terminal blood lineages coincides with cell specification, and enhancers active in single lineages show well-positioned H3K4me2- and H3K27ac-marked nucleosomes and DNaseI hypersensitivity in other cell types, revealing limited lineage fidelity. These findings demonstrate that enhancer chronology in blood cells differs markedly from that in intestinal crypts. Chromatin dynamics in hematopoiesis provide a useful foundation to consider classical observations such as cellular reprogramming and multilineage locus priming.
Collapse
Affiliation(s)
- Annouck Luyten
- Department of Medical Oncology, Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
| | - Chongzhi Zang
- Department of Biostatistics and Computational Biology, Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Harvard School of Public Health, Boston, Massachusetts 02215, USA
| | - X Shirley Liu
- Department of Biostatistics and Computational Biology, Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Harvard School of Public Health, Boston, Massachusetts 02215, USA;
| | - Ramesh A Shivdasani
- Department of Medical Oncology, Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA; Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA; Harvard Stem Cell Institute, Cambridge, Massachusetts 02138, USA
| |
Collapse
|
80
|
Collier SP, Henderson MA, Tossberg JT, Aune TM. Regulation of the Th1 genomic locus from Ifng through Tmevpg1 by T-bet. THE JOURNAL OF IMMUNOLOGY 2014; 193:3959-65. [PMID: 25225667 DOI: 10.4049/jimmunol.1401099] [Citation(s) in RCA: 92] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Long noncoding RNAs (lncRNAs), critical regulators of protein-coding genes, are likely to be coexpressed with neighboring protein-coding genes in the genome. How the genome integrates signals to achieve coexpression of lncRNA genes and neighboring protein-coding genes is not well understood. The lncRNA Tmevpg1 (NeST, Ifng-AS1) is critical for Th1-lineage-specific expression of Ifng and is coexpressed with Ifng. In this study, we show that T-bet guides epigenetic remodeling of Tmevpg1 proximal and distal enhancers, leading to recruitment of stimulus-inducible transcription factors, NF-κB and Ets-1, to the locus. Activities of Tmevpg1-specific enhancers and Tmevpg1 transcription are dependent upon NF-κB. Thus, we propose that T-bet stimulates epigenetic remodeling of Tmevpg1-specific enhancers and Ifng-specific enhancers to achieve Th1-lineage-specific expression of Ifng.
Collapse
Affiliation(s)
- Sarah P Collier
- Department of Pathology, Microbiology, and Immunology, Vanderbilt University School of Medicine, Nashville, TN 37232; and
| | - Melodie A Henderson
- Department of Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232
| | - John T Tossberg
- Department of Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232
| | - Thomas M Aune
- Department of Pathology, Microbiology, and Immunology, Vanderbilt University School of Medicine, Nashville, TN 37232; and Department of Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232
| |
Collapse
|
81
|
Selective transcriptional regulation by Myc in cellular growth control and lymphomagenesis. Nature 2014; 511:488-492. [PMID: 25043028 DOI: 10.1038/nature13537] [Citation(s) in RCA: 355] [Impact Index Per Article: 35.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2013] [Accepted: 05/28/2014] [Indexed: 12/30/2022]
Abstract
The c-myc proto-oncogene product, Myc, is a transcription factor that binds thousands of genomic loci. Recent work suggested that rather than up- and downregulating selected groups of genes, Myc targets all active promoters and enhancers in the genome (a phenomenon termed 'invasion') and acts as a general amplifier of transcription. However, the available data did not readily discriminate between direct and indirect effects of Myc on RNA biogenesis. We addressed this issue with genome-wide chromatin immunoprecipitation and RNA expression profiles during B-cell lymphomagenesis in mice, in cultured B cells and fibroblasts. Consistent with long-standing observations, we detected general increases in total RNA or messenger RNA copies per cell (hereby termed 'amplification') when comparing actively proliferating cells with control quiescent cells: this was true whether cells were stimulated by mitogens (requiring endogenous Myc for a proliferative response) or by deregulated, oncogenic Myc activity. RNA amplification and promoter/enhancer invasion by Myc were separable phenomena that could occur without one another. Moreover, whether or not associated with RNA amplification, Myc drove the differential expression of distinct subsets of target genes. Hence, although having the potential to interact with all active or poised regulatory elements in the genome, Myc does not directly act as a global transcriptional amplifier. Instead, our results indicate that Myc activates and represses transcription of discrete gene sets, leading to changes in cellular state that can in turn feed back on global RNA production and turnover.
Collapse
|
82
|
Zhang W, Zhang T, Wu Y, Jiang J. Open Chromatin in Plant Genomes. Cytogenet Genome Res 2014; 143:18-27. [DOI: 10.1159/000362827] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
|
83
|
Single-molecule dynamics of enhanceosome assembly in embryonic stem cells. Cell 2014; 156:1274-1285. [PMID: 24630727 PMCID: PMC4040518 DOI: 10.1016/j.cell.2014.01.062] [Citation(s) in RCA: 422] [Impact Index Per Article: 42.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2013] [Revised: 12/16/2013] [Accepted: 01/27/2014] [Indexed: 11/21/2022]
Abstract
Enhancer-binding pluripotency regulators (Sox2 and Oct4) play a seminal role in embryonic stem (ES) cell-specific gene regulation. Here, we combine in vivo and in vitro single-molecule imaging, transcription factor (TF) mutagenesis, and ChIP-exo mapping to determine how TFs dynamically search for and assemble on their cognate DNA target sites. We find that enhanceosome assembly is hierarchically ordered with kinetically favored Sox2 engaging the target DNA first, followed by assisted binding of Oct4. Sox2/Oct4 follow a trial-and-error sampling mechanism involving 84-97 events of 3D diffusion (3.3-3.7 s) interspersed with brief nonspecific collisions (0.75-0.9 s) before acquiring and dwelling at specific target DNA (12.0-14.6 s). Sox2 employs a 3D diffusion-dominated search mode facilitated by 1D sliding along open DNA to efficiently locate targets. Our findings also reveal fundamental aspects of gene and developmental regulation by fine-tuning TF dynamics and influence of the epigenome on target search parameters.
Collapse
|
84
|
Iversen ES, Lipton G, Clyde MA, Monteiro ANA. Functional annotation signatures of disease susceptibility loci improve SNP association analysis. BMC Genomics 2014; 15:398. [PMID: 24886216 PMCID: PMC4041996 DOI: 10.1186/1471-2164-15-398] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Accepted: 05/13/2014] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Genetic association studies are conducted to discover genetic loci that contribute to an inherited trait, identify the variants behind these associations and ascertain their functional role in determining the phenotype. To date, functional annotations of the genetic variants have rarely played more than an indirect role in assessing evidence for association. Here, we demonstrate how these data can be systematically integrated into an association study's analysis plan. RESULTS We developed a Bayesian statistical model for the prior probability of phenotype-genotype association that incorporates data from past association studies and publicly available functional annotation data regarding the susceptibility variants under study. The model takes the form of a binary regression of association status on a set of annotation variables whose coefficients were estimated through an analysis of associated SNPs in the GWAS Catalog (GC). The functional predictors examined included measures that have been demonstrated to correlate with the association status of SNPs in the GC and some whose utility in this regard is speculative: summaries of the UCSC Human Genome Browser ENCODE super-track data, dbSNP function class, sequence conservation summaries, proximity to genomic variants in the Database of Genomic Variants and known regulatory elements in the Open Regulatory Annotation database, PolyPhen-2 probabilities and RegulomeDB categories. Because we expected that only a fraction of the annotations would contribute to predicting association, we employed a penalized likelihood method to reduce the impact of non-informative predictors and evaluated the model's ability to predict GC SNPs not used to construct the model. We show that the functional data alone are predictive of a SNP's presence in the GC. Further, using data from a genome-wide study of ovarian cancer, we demonstrate that their use as prior data when testing for association is practical at the genome-wide scale and improves power to detect associations. CONCLUSIONS We show how diverse functional annotations can be efficiently combined to create 'functional signatures' that predict the a priori odds of a variant's association to a trait and how these signatures can be integrated into a standard genome-wide-scale association analysis, resulting in improved power to detect truly associated variants.
Collapse
Affiliation(s)
- Edwin S Iversen
- Department of Statistical Science, Duke University, Box 90251, 27708-0251 Durham, NC, USA.
| | | | | | | |
Collapse
|
85
|
Rodrigueza WV, Woolliscroft MJ, Ebrahim AS, Forgey R, McGovren PJ, Endert G, Wagner A, Holewa D, Aboukameel A, Gill RD, Bisgaier CL, Messmann RA, Whitehead CE, Izbicka E, Streeper R, Wick MC, Stiegler G, Stein CA, Monsma D, Webb C, Sooch MP, Panzner S, Mohammad R, Goodwin NC, Al-Katib A. Development and antitumor activity of a BCL-2 targeted single-stranded DNA oligonucleotide. Cancer Chemother Pharmacol 2014; 74:151-66. [PMID: 24832107 PMCID: PMC4077254 DOI: 10.1007/s00280-014-2476-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2013] [Accepted: 04/23/2014] [Indexed: 12/31/2022]
Abstract
PNT100 is a 24-base, chemically unmodified DNA oligonucleotide sequence that is complementary to a region upstream of the BCL-2 gene. Exposure of tumor cells to PNT100 results in suppression of proliferation and cell death by a process called DNA interference. PNT2258 is PNT100 that is encapsulated in protective amphoteric liposomes developed to efficiently encapsulate the PNT100 oligonucleotide, provide enhanced serum stability, optimized pharmacokinetic properties and antitumor activity of the nanoparticle both in vivo and in vitro. PNT2258 demonstrates broad antitumor activity against BCL-2-driven WSU-DLCL2 lymphoma, highly resistant A375 melanoma, PC-3 prostate, and Daudi-Burkitt’s lymphoma xenografts. The sequence specificity of PNT100 was demonstrated against three control sequences (scrambled, mismatched, and reverse complement) all encapsulated in a lipid formulation with identical particle characteristics, and control sequences did not demonstrate antiproliferative activity in vivo or in vitro. PNT2258 is currently undergoing clinical testing to evaluate safety and antitumor activity in patients with recurrent or refractory non-Hodgkin’s lymphoma and additional studies are planned.
Collapse
MESH Headings
- 5' Flanking Region/drug effects
- Animals
- Antineoplastic Agents/administration & dosage
- Antineoplastic Agents/pharmacokinetics
- Antineoplastic Agents/pharmacology
- Antineoplastic Agents/therapeutic use
- Antineoplastic Combined Chemotherapy Protocols/administration & dosage
- Antineoplastic Combined Chemotherapy Protocols/metabolism
- Antineoplastic Combined Chemotherapy Protocols/pharmacokinetics
- Antineoplastic Combined Chemotherapy Protocols/therapeutic use
- Cell Line, Tumor
- Cell Survival/drug effects
- DNA, Antisense/administration & dosage
- DNA, Antisense/pharmacokinetics
- DNA, Antisense/pharmacology
- DNA, Antisense/therapeutic use
- DNA, Single-Stranded/administration & dosage
- DNA, Single-Stranded/pharmacokinetics
- DNA, Single-Stranded/pharmacology
- DNA, Single-Stranded/therapeutic use
- Drug Compounding
- Drug Stability
- Female
- Gene Silencing/drug effects
- Liposomes
- Mice
- Mice, Inbred BALB C
- Mice, Nude
- Mice, SCID
- Neoplasms/blood
- Neoplasms/drug therapy
- Oligodeoxyribonucleotides/chemistry
- Oligodeoxyribonucleotides/pharmacokinetics
- Oligodeoxyribonucleotides/pharmacology
- Oligodeoxyribonucleotides/therapeutic use
- Pharmaceutical Vehicles
- Proto-Oncogene Proteins c-bcl-2/antagonists & inhibitors
- Proto-Oncogene Proteins c-bcl-2/genetics
- Proto-Oncogene Proteins c-bcl-2/metabolism
- Random Allocation
- Xenograft Model Antitumor Assays
Collapse
|
86
|
Koohy H, Down TA, Spivakov M, Hubbard T. A comparison of peak callers used for DNase-Seq data. PLoS One 2014; 9:e96303. [PMID: 24810143 PMCID: PMC4014496 DOI: 10.1371/journal.pone.0096303] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2013] [Accepted: 04/06/2014] [Indexed: 11/19/2022] Open
Abstract
Genome-wide profiling of open chromatin regions using DNase I and high-throughput sequencing (DNase-seq) is an increasingly popular approach for finding and studying regulatory elements. A variety of algorithms have been developed to identify regions of open chromatin from raw sequence-tag data, which has motivated us to assess and compare their performance. In this study, four published, publicly available peak calling algorithms used for DNase-seq data analysis (F-seq, Hotspot, MACS and ZINBA) are assessed at a range of signal thresholds on two published DNase-seq datasets for three cell types. The results were benchmarked against an independent dataset of regulatory regions derived from ENCODE in vivo transcription factor binding data for each particular cell type. The level of overlap between peak regions reported by each algorithm and this ENCODE-derived reference set was used to assess sensitivity and specificity of the algorithms. Our study suggests that F-seq has a slightly higher sensitivity than the next best algorithms. Hotspot and the ChIP-seq oriented method, MACS, both perform competitively when used with their default parameters. However the generic peak finder ZINBA appears to be less sensitive than the other three. We also assess accuracy of each algorithm over a range of signal thresholds. In particular, we show that the accuracy of F-Seq can be considerably improved by using a threshold setting that is different from the default value.
Collapse
Affiliation(s)
- Hashem Koohy
- The Babraham Institute, Babraham Research Campus, Cambridge, United Kingdom
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom
| | - Thomas A. Down
- The Babraham Institute, Babraham Research Campus, Cambridge, United Kingdom
| | - Mikhail Spivakov
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom
| | - Tim Hubbard
- The Babraham Institute, Babraham Research Campus, Cambridge, United Kingdom
| |
Collapse
|
87
|
The histone lysine demethylase JMJD3/KDM6B is recruited to p53 bound promoters and enhancer elements in a p53 dependent manner. PLoS One 2014; 9:e96545. [PMID: 24797517 PMCID: PMC4010471 DOI: 10.1371/journal.pone.0096545] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2014] [Accepted: 04/09/2014] [Indexed: 11/19/2022] Open
Abstract
The JmjC domain-containing protein JMJD3/KDM6B catalyses the demethylation of H3K27me3 and H3K27me2. JMJD3 appears to be highly regulated at the transcriptional level and is upregulated in response to diverse stimuli such as differentiation inducers and stress signals. Accordingly, JMJD3 has been linked to the regulation of different biological processes such as differentiation of embryonic stem cells, inflammatory responses in macrophages, and induction of cellular senescence via regulation of the INK4A-ARF locus. Here we show here that JMJD3 interacts with the tumour suppressor protein p53. We find that the interaction is dependent on the p53 tetramerization domain. Following DNA damage, JMJD3 is transcriptionally upregulated and by performing genome-wide mapping of JMJD3, we demonstrate that it binds genes involved in basic cellular processes, as well as genes regulating cell cycle, response to stress and apoptosis. Moreover, we find that JMJD3 binding sites show significant overlap with p53 bound promoters and enhancer elements. The binding of JMJD3 to p53 target sites is increased in response to DNA damage, and we demonstrate that the recruitment of JMJD3 to these sites is dependent on p53 expression. Therefore, we propose a model in which JMJD3 is recruited to p53 responsive elements via its interaction with p53 and speculate that JMJD3 could act as a fail-safe mechanism to remove low levels of H3K27me3 and H3K27me2 to allow for efficient acetylation of H3K27.
Collapse
|
88
|
Spieler D, Kaffe M, Knauf F, Bessa J, Tena JJ, Giesert F, Schormair B, Tilch E, Lee H, Horsch M, Czamara D, Karbalai N, von Toerne C, Waldenberger M, Gieger C, Lichtner P, Claussnitzer M, Naumann R, Müller-Myhsok B, Torres M, Garrett L, Rozman J, Klingenspor M, Gailus-Durner V, Fuchs H, Hrabě de Angelis M, Beckers J, Hölter SM, Meitinger T, Hauck SM, Laumen H, Wurst W, Casares F, Gómez-Skarmeta JL, Winkelmann J. Restless legs syndrome-associated intronic common variant in Meis1 alters enhancer function in the developing telencephalon. Genome Res 2014; 24:592-603. [PMID: 24642863 PMCID: PMC3975059 DOI: 10.1101/gr.166751.113] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Genome-wide association studies (GWAS) identified the MEIS1 locus for Restless Legs Syndrome (RLS), but causal single nucleotide polymorphisms (SNPs) and their functional relevance remain unknown. This locus contains a large number of highly conserved noncoding regions (HCNRs) potentially functioning as cis-regulatory modules. We analyzed these HCNRs for allele-dependent enhancer activity in zebrafish and mice and found that the risk allele of the lead SNP rs12469063 reduces enhancer activity in the Meis1 expression domain of the murine embryonic ganglionic eminences (GE). CREB1 binds this enhancer and rs12469063 affects its binding in vitro. In addition, MEIS1 target genes suggest a role in the specification of neuronal progenitors in the GE, and heterozygous Meis1-deficient mice exhibit hyperactivity, resembling the RLS phenotype. Thus, in vivo and in vitro analysis of a common SNP with small effect size showed allele-dependent function in the prospective basal ganglia representing the first neurodevelopmental region implicated in RLS.
Collapse
Affiliation(s)
- Derek Spieler
- Institute of Human Genetics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
89
|
The desmosomal protein desmoglein 1 aids recovery of epidermal differentiation after acute UV light exposure. J Invest Dermatol 2014; 134:2154-2162. [PMID: 24594668 PMCID: PMC4102640 DOI: 10.1038/jid.2014.124] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2014] [Accepted: 02/18/2014] [Indexed: 12/20/2022]
Abstract
Epidermal structure is damaged by exposure to UV light, but the molecular mechanisms governing structural repair are largely unknown. UVB (290-320 nm wavelengths) exposure before induction of differentiation reduced expression of differentiation-associated proteins, including desmoglein 1 (Dsg1), desmocollin 1 (Dsc1), and keratins 1 and 10 (K1/K10), in a dose-dependent manner in normal human epidermal keratinocytes (NHEKs). The UVB-induced reduction in both Dsg1 transcript and protein was associated with reduced binding of the p63 transcription factor to previously unreported enhancer regulatory regions of the Dsg1 gene. As Dsg1 promotes epidermal differentiation in addition to participating in cell-cell adhesion, the role of Dsg1 in aiding differentiation after UVB damage was tested. Compared with controls, depleting Dsg1 via short hairpin RNA resulted in further reduction of Dsc1 and K1/K10 expression in monolayer NHEK cultures and in abnormal epidermal architecture in organotypic skin models recovering from UVB exposure. Ectopic expression of Dsg1 in keratinocyte monolayers rescued the UVB-induced differentiation defect. Treatment of UVB-exposed monolayer or organotypic cultures with trichostatin A, a histone deacetylase inhibitor, partially restored differentiation marker expression, suggesting a potential therapeutic strategy for reversing UV-induced impairment of epidermal differentiation after acute sun exposure.
Collapse
|
90
|
Kieffer-Kwon KR, Tang Z, Mathe E, Qian J, Sung MH, Li G, Resch W, Baek S, Pruett N, Grøntved L, Vian L, Nelson S, Zare H, Hakim O, Reyon D, Yamane A, Nakahashi H, Kovalchuk AL, Zou J, Joung JK, Sartorelli V, Wei CL, Ruan X, Hager GL, Ruan Y, Casellas R. Interactome maps of mouse gene regulatory domains reveal basic principles of transcriptional regulation. Cell 2014; 155:1507-20. [PMID: 24360274 DOI: 10.1016/j.cell.2013.11.039] [Citation(s) in RCA: 221] [Impact Index Per Article: 22.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2013] [Revised: 11/01/2013] [Accepted: 11/25/2013] [Indexed: 12/31/2022]
Abstract
A key finding of the ENCODE project is that the enhancer landscape of mammalian cells undergoes marked alterations during ontogeny. However, the nature and extent of these changes are unclear. As part of the NIH Mouse Regulome Project, we here combined DNaseI hypersensitivity, ChIP-seq, and ChIA-PET technologies to map the promoter-enhancer interactomes of pluripotent ES cells and differentiated B lymphocytes. We confirm that enhancer usage varies widely across tissues. Unexpectedly, we find that this feature extends to broadly transcribed genes, including Myc and Pim1 cell-cycle regulators, which associate with an entirely different set of enhancers in ES and B cells. By means of high-resolution CpG methylomes, genome editing, and digital footprinting, we show that these enhancers recruit lineage-determining factors. Furthermore, we demonstrate that the turning on and off of enhancers during development correlates with promoter activity. We propose that organisms rely on a dynamic enhancer landscape to control basic cellular functions in a tissue-specific manner.
Collapse
Affiliation(s)
| | - Zhonghui Tang
- The Jackson Laboratory for Genomic Medicine, and Department of Genetic and Development Biology, University of Connecticut, 400 Farmington, CT 06030, USA
| | - Ewy Mathe
- Genomics and Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Jason Qian
- Genomics and Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Myong-Hee Sung
- Laboratory of Receptor Biology and Gene Expression, NCI, National Institutes of Health, Bethesda, MD 20892, USA
| | - Guoliang Li
- The Jackson Laboratory for Genomic Medicine, and Department of Genetic and Development Biology, University of Connecticut, 400 Farmington, CT 06030, USA
| | - Wolfgang Resch
- Genomics and Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Songjoon Baek
- Laboratory of Receptor Biology and Gene Expression, NCI, National Institutes of Health, Bethesda, MD 20892, USA
| | - Nathanael Pruett
- Genomics and Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Lars Grøntved
- Laboratory of Receptor Biology and Gene Expression, NCI, National Institutes of Health, Bethesda, MD 20892, USA
| | - Laura Vian
- Genomics and Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Steevenson Nelson
- Genomics and Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Hossein Zare
- Laboratory of Muscle Stem Cells and Gene Regulation, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Ofir Hakim
- Bar-Ilan University, Ramat-Gan 5290002, Israel
| | - Deepak Reyon
- Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA; Department of Pathology, Harvard Medical School, Boston, MA 02115 USA
| | - Arito Yamane
- Genomics and Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Hirotaka Nakahashi
- Genomics and Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Alexander L Kovalchuk
- Laboratory of Immunogenetics, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, MD 20852, USA
| | - Jizhong Zou
- Laboratory of Stem Cell Biology, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - J Keith Joung
- Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA; Department of Pathology, Harvard Medical School, Boston, MA 02115 USA
| | - Vittorio Sartorelli
- Laboratory of Muscle Stem Cells and Gene Regulation, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
| | - Chia-Lin Wei
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Xiaoan Ruan
- The Jackson Laboratory for Genomic Medicine, and Department of Genetic and Development Biology, University of Connecticut, 400 Farmington, CT 06030, USA
| | - Gordon L Hager
- Laboratory of Receptor Biology and Gene Expression, NCI, National Institutes of Health, Bethesda, MD 20892, USA
| | - Yijun Ruan
- The Jackson Laboratory for Genomic Medicine, and Department of Genetic and Development Biology, University of Connecticut, 400 Farmington, CT 06030, USA
| | - Rafael Casellas
- Genomics and Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA; Center of Cancer Research, NCI, National Institutes of Health, Bethesda, MD 20892, USA.
| |
Collapse
|
91
|
John S, Sabo PJ, Canfield TK, Lee K, Vong S, Weaver M, Wang H, Vierstra J, Reynolds AP, Thurman RE, Stamatoyannopoulos JA. Genome-scale mapping of DNase I hypersensitivity. CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 2014; Chapter 27:Unit 21.27. [PMID: 23821440 DOI: 10.1002/0471142727.mb2127s103] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
DNase I-seq is a global and high-resolution method that uses the nonspecific endonuclease DNase I to map chromatin accessibility. These accessible regions, designated as DNase I hypersensitive sites (DHSs), define the regulatory features, (e.g., promoters, enhancers, insulators, and locus control regions) of complex genomes. In this unit, methods are described for nuclei isolation, digestion of nuclei with limiting concentrations of DNase I, and the biochemical fractionation of DNase I hypersensitive sites in preparation for high-throughput sequencing. DNase I-seq is an unbiased and robust method that is not predicated on an a priori understanding of regulatory patterns or chromatin features.
Collapse
Affiliation(s)
- Sam John
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
92
|
Marcovitz A, Levy Y. Obstacles may facilitate and direct DNA search by proteins. Biophys J 2013; 104:2042-50. [PMID: 23663847 DOI: 10.1016/j.bpj.2013.03.030] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2013] [Revised: 03/17/2013] [Accepted: 03/20/2013] [Indexed: 12/28/2022] Open
Abstract
DNA recognition by DNA-binding proteins (DBPs), which is a pivotal event in most gene regulatory processes, is often preceded by an extensive search for the correct site. A facilitated diffusion process in which a DBP combines three-dimensional diffusion in solution with one-dimensional sliding along DNA has been suggested to explain how proteins can locate their target sites on DNA much faster than predicted by three-dimensional diffusion alone. Although experimental and theoretical studies have recently advanced understanding of the biophysical principles underlying the search mechanism, the process under in vivo cellular conditions is poorly understood. In this study, we used various computational approaches to explore how the presence of obstacle proteins on the DNA influences search efficiency. At a low obstacle occupancy (i.e., when few obstacles occupy sites on the DNA), sliding by the searching DBP may be confined, which may impair search efficiency. The obstacles, however, can be bypassed during hopping events, and the number of bypasses is larger for higher obstacle occupancies. Dynamism on the part of the obstacles may even further facilitate search kinetics. Our study shows that the nature and efficiency of the search process may be governed not only by the intrinsic properties of the DBP and the salt concentration of the medium, but also by the in vivo association of DNA with other macromolecular obstacles, their location, and occupancy.
Collapse
Affiliation(s)
- Amir Marcovitz
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | | |
Collapse
|
93
|
Iwata M, Sandstrom RS, Delrow JJ, Stamatoyannopoulos JA, Torok-Storb B. Functionally and phenotypically distinct subpopulations of marrow stromal cells are fibroblast in origin and induce different fates in peripheral blood monocytes. Stem Cells Dev 2013; 23:729-40. [PMID: 24131213 DOI: 10.1089/scd.2013.0300] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Marrow stromal cells constitute a heterogeneous population of cells, typically isolated after expansion in culture. In vivo, stromal cells often exist in close proximity or in direct contact with monocyte-derived macrophages, yet their interaction with monocytes is largely unexplored. In this report, isolated CD146(+) and CD146(-) stromal cells, as well as immortalized cell lines representative of each (designated HS27a and HS5, respectively), were shown by global DNase I hypersensitive site mapping and principal coordinate analysis to have a lineage association with marrow fibroblasts. Gene expression profiles generated for the CD146(+) and CD146(-) cell lines indicate significant differences in their respective transcriptomes, which translates into differences in secreted factors. Consequently, the conditioned media (CM) from these two populations induce different fates in peripheral blood monocytes. Monocytes incubated in CD146(+) CM acquire a tissue macrophage phenotype, whereas monocytes incubated in CM from CD146(-) cells express markers associated with pre-dendritic cells. Importantly, when CD14(+) monocytes are cultured in contact with the CD146(+) cells, the combined cell populations, assayed as a unit, show increased levels of transcripts associated with organismal development and hematopoietic regulation. In contrast, the gene expression profile from cocultures of monocytes and CD146(-) cells does not differ from that obtained when monocytes are cultured with CD146(-) CM. These in vitro results show that the CD146(+) marrow stromal cells together with monocytes increase the expression of genes relevant to hematopoietic regulation. In vivo relevance of these data is suggested by immunohistochemistry of marrow biopsies showing juxtaposed CD146(+) cells and CD68(+) cells associated with these upregulated proteins.
Collapse
Affiliation(s)
- Mineo Iwata
- 1 Fred Hutchinson Cancer Research Center , Seattle, Washington
| | | | | | | | | |
Collapse
|
94
|
Biancolella M, Fortini BK, Tring S, Plummer SJ, Mendoza-Fandino GA, Hartiala J, Hitchler MJ, Yan C, Schumacher FR, Conti DV, Edlund CK, Noushmehr H, Coetzee SG, Bresalier RS, Ahnen DJ, Barry EL, Berman BP, Rice JC, Coetzee GA, Casey G. Identification and characterization of functional risk variants for colorectal cancer mapping to chromosome 11q23.1. Hum Mol Genet 2013; 23:2198-209. [PMID: 24256810 DOI: 10.1093/hmg/ddt584] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Genome-wide association studies of colorectal cancer (CRC) have identified a number of common variants associated with modest risk, including rs3802842 at chromosome 11q23.1. Several genes map to this region but rs3802842 does not map to any known transcribed or regulatory sequences. We reasoned, therefore, that rs3802842 is not the functional single-nucleotide polymorphism (SNP), but is in linkage disequilibrium (LD) with a functional SNP(s). We performed ChIP-seq for histone modifications in SW480 and HCT-116 CRC cells, and incorporated ChIP-seq and DNase I hypersensitivity data available through ENCODE within a 137-kb genomic region containing rs3802842 on 11q23.1. We identified SNP rs10891246 in LD with rs3802842 that mapped within a bidirectional promoter region of genes C11orf92 and C11orf93. Following mutagenesis to the risk allele, the promoter demonstrated lower levels of reporter gene expression. A second SNP rs7130173 was identified in LD with rs3802842 that mapped to a candidate enhancer region, which showed strong unidirectional activity in both HCT-116 and SW480 CRC cells. The risk allele of rs7130173 demonstrated reduced enhancer activity compared with the common allele, and reduced nuclear protein binding affinity in electromobility shift assays compared with the common allele suggesting differential transcription factor (TF) binding. SNPs rs10891246 and rs7130173 are on the same haplotype, and expression quantitative trait loci (eQTL) analyses of neighboring genes implicate C11orf53, C11orf92 and C11orf93 as candidate target genes. These data imply that rs10891246 and rs7130173 are functional SNPs mapping to 11q23.1 and that C11orf53, C11orf92 and C11orf93 represent novel candidate target genes involved in CRC etiology.
Collapse
|
95
|
Vierstra J, Wang H, John S, Sandstrom R, Stamatoyannopoulos JA. Coupling transcription factor occupancy to nucleosome architecture with DNase-FLASH. Nat Methods 2013; 11:66-72. [PMID: 24185839 DOI: 10.1038/nmeth.2713] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2013] [Accepted: 10/04/2013] [Indexed: 12/16/2022]
Abstract
It is currently not possible to resolve the genome-wide relationship of transcription factors (TFs) and nucleosomes at the level of individual chromatin templates despite rapidly increasing data on TF and nucleosome occupancy in the human genome. Here we describe DNase I-released fragment-length analysis of hypersensitivity (DNase-FLASH), an approach that directly couples mapping of TF occupancy, via quantification of DNA microfragments released from individual TF recognition sites in regulatory DNA, to the surrounding nucleosome architecture, via analysis of larger DNA fragments, in a single assay. DNase-FLASH enables coupling of individual TF footprints to nucleosome occupancy, identifying TFs that precisely demarcate the regulatory DNA-nucleosome interface.
Collapse
Affiliation(s)
- Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Hao Wang
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Sam John
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - John A Stamatoyannopoulos
- 1] Department of Genome Sciences, University of Washington, Seattle, Washington, USA. [2] Department of Medicine, Division of Oncology, University of Washington, Seattle, Washington, USA
| |
Collapse
|
96
|
Lo KA, Labadorf A, Kennedy NJ, Han MS, Yap YS, Matthews B, Xin X, Sun L, Davis RJ, Lodish HF, Fraenkel E. Analysis of in vitro insulin-resistance models and their physiological relevance to in vivo diet-induced adipose insulin resistance. Cell Rep 2013; 5:259-70. [PMID: 24095730 DOI: 10.1016/j.celrep.2013.08.039] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2013] [Revised: 07/12/2013] [Accepted: 08/23/2013] [Indexed: 12/15/2022] Open
Abstract
Diet-induced obesity (DIO) predisposes individuals to insulin resistance, and adipose tissue has a major role in the disease. Insulin resistance can be induced in cultured adipocytes by a variety of treatments, but what aspects of the in vivo responses are captured by these models remains unknown. We use global RNA sequencing to investigate changes induced by TNF-α, hypoxia, dexamethasone, high insulin, and a combination of TNF-α and hypoxia, comparing the results to the changes in white adipose tissue from DIO mice. We found that different in vitro models capture distinct features of DIO adipose insulin resistance, and a combined treatment of TNF-α and hypoxia is most able to mimic the in vivo changes. Using genome-wide DNase I hypersensitivity followed by sequencing, we further examined the transcriptional regulation of TNF-α-induced insulin resistance, and we found that C/EPBβ is a potential key regulator of adipose insulin resistance.
Collapse
Affiliation(s)
- Kinyui Alice Lo
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
97
|
Piper J, Elze MC, Cauchy P, Cockerill PN, Bonifer C, Ott S. Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data. Nucleic Acids Res 2013; 41:e201. [PMID: 24071585 PMCID: PMC3834841 DOI: 10.1093/nar/gkt850] [Citation(s) in RCA: 150] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
The expression of eukaryotic genes is regulated by cis-regulatory elements such as promoters and enhancers, which bind sequence-specific DNA-binding proteins. One of the great challenges in the gene regulation field is to characterise these elements. This involves the identification of transcription factor (TF) binding sites within regulatory elements that are occupied in a defined regulatory context. Digestion with DNase and the subsequent analysis of regions protected from cleavage (DNase footprinting) has for many years been used to identify specific binding sites occupied by TFs at individual cis-elements with high resolution. This methodology has recently been adapted for high-throughput sequencing (DNase-seq). In this study, we describe an imbalance in the DNA strand-specific alignment information of DNase-seq data surrounding protein–DNA interactions that allows accurate prediction of occupied TF binding sites. Our study introduces a novel algorithm, Wellington, which considers the imbalance in this strand-specific information to efficiently identify DNA footprints. This algorithm significantly enhances specificity by reducing the proportion of false positives and requires significantly fewer predictions than previously reported methods to recapitulate an equal amount of ChIP-seq data. We also provide an open-source software package, pyDNase, which implements the Wellington algorithm to interface with DNase-seq data and expedite analyses.
Collapse
Affiliation(s)
- Jason Piper
- Warwick Systems Biology Centre, University of Warwick, Coventry, CV4 7AL, United Kingdom, School of Cancer Sciences, Institute of Biomedical Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, United Kingdom, Department of Statistics, University of Warwick, Coventry, CV4 7AL, United Kingdom and School of Immunity and Infection, Institute of Biomedical Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, United Kingdom
| | | | | | | | | | | |
Collapse
|
98
|
Chai X, Nagarajan S, Kim K, Lee K, Choi JK. Regulation of the boundaries of accessible chromatin. PLoS Genet 2013; 9:e1003778. [PMID: 24068952 PMCID: PMC3772044 DOI: 10.1371/journal.pgen.1003778] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2013] [Accepted: 07/23/2013] [Indexed: 11/24/2022] Open
Abstract
Regulatory regions maintain nucleosome-depleted, open chromatin status but simultaneously require the presence of nucleosomes for specific histone modifications. It remains unclear how these can be achieved for proper regulatory function. Here we demonstrate that nucleosomes positioned within accessible chromatin regions near the boundaries provide platforms for histone modifications while preventing the occlusion of regulatory elements. These boundary nucleosomes were particularly enriched for active or poised regulatory marks in human, such as histone acetylations, H3K4 methylations, H3K9me3, H3K79me2, and H4K20me1. Additionally, we found that based on a genome-wide profiling of ∼100 recombinant yeast strains, the location of open chromatin borders tends to vary mostly within 150 bp upon genetic perturbation whereas this positional variation increases in proportion to the sequence preferences of the underlying DNA for nucleosome formation. More than 40% of the local boundary shifts were associated with genetic variation in cis- or trans-acting factors. A sizeable fraction of the identified genetic factors was also associated with nearby gene expression, which was correlated with the distance between the transcription start site (tss) and the boundary that faces the tss. Taken together, the variation in the width of accessible chromatin regions may arise in conjunction with the modulation of the boundary nucleosomes by post-translational modifications or by chromatin regulators and in association with the activity of nearby gene transcription. Open chromatin formation and regulation are intimately coupled with nucleosome remodelling and modification. Regulatory regions such as promoters and enhancers maintain nucleosome-free, open chromatin states whilst at the same time the presence of nucleosomes is required for specific histone modifications. In this work, we carried out detailed analyses of our data of open chromatin maps for ∼100 different yeast strains and whole-genome nucleosome occupancy along with the public data of open chromatin and nucleosome positioning in human generated in the ENCODE project. We observed nucleosomes positioned within accessible chromatin regions near their boundaries. These boundary nucleosomes appeared to carry various histone methylations without hampering the binding of DNA regulators and sequence preferences for these nucleosomes were associated with variation in the width of accessible chromatin. The end positions of open chromatin domains, particularly with high intrinsic preferences for nucleosome formation, were more flexible than the middle point, changing mostly within 150 bp upon genetic perturbation. By using quantitative trait loci (QTL) mapping, we identified genetic variants that are associated with the variation in the width of open chromatin and examined its relationship with nearby gene expression.
Collapse
Affiliation(s)
- Xiaoran Chai
- Genome Institute of Singapore, Singapore, Republic of Singapore
| | | | - Kwoneel Kim
- Department of Bio and Brain Engineering, KAIST, Daejeon, Korea
| | - Kibaick Lee
- Department of Bio and Brain Engineering, KAIST, Daejeon, Korea
| | - Jung Kyoon Choi
- Genome Institute of Singapore, Singapore, Republic of Singapore
- Department of Bio and Brain Engineering, KAIST, Daejeon, Korea
- * E-mail:
| |
Collapse
|
99
|
Koohy H, Down TA, Hubbard TJ. Chromatin accessibility data sets show bias due to sequence specificity of the DNase I enzyme. PLoS One 2013; 8:e69853. [PMID: 23922824 PMCID: PMC3724795 DOI: 10.1371/journal.pone.0069853] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2013] [Accepted: 06/12/2013] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND DNase I is an enzyme which cuts duplex DNA at a rate that depends strongly upon its chromatin environment. In combination with high-throughput sequencing (HTS) technology, it can be used to infer genome-wide landscapes of open chromatin regions. Using this technology, systematic identification of hundreds of thousands of DNase I hypersensitive sites (DHS) per cell type has been possible, and this in turn has helped to precisely delineate genomic regulatory compartments. However, to date there has been relatively little investigation into possible biases affecting this data. RESULTS We report a significant degree of sequence preference spanning sites cut by DNase I in a number of published data sets. The two major protocols in current use each show a different pattern, but for a given protocol the pattern of sequence specificity seems to be quite consistent. The patterns are substantially different from biases seen in other types of HTS data sets, and in some cases the most constrained position lies outside the sequenced fragment, implying that this constraint must relate to the digestion process rather than events occurring during library preparation or sequencing. CONCLUSIONS DNase I is a sequence-specific enzyme, with a specificity that may depend on experimental conditions. This sequence specificity is not taken into account by existing pipelines for identifying open chromatin regions. Care must be taken when interpreting DNase I results, especially when looking at the precise locations of the reads. Future studies may be able to improve the sensitivity and precision of chromatin state measurement by compensating for sequence bias.
Collapse
Affiliation(s)
- Hashem Koohy
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom.
| | | | | |
Collapse
|
100
|
Smith RP, Riesenfeld SJ, Holloway AK, Li Q, Murphy KK, Feliciano NM, Orecchia L, Oksenberg N, Pollard KS, Ahituv N. A compact, in vivo screen of all 6-mers reveals drivers of tissue-specific expression and guides synthetic regulatory element design. Genome Biol 2013; 14:R72. [PMID: 23867016 PMCID: PMC4054837 DOI: 10.1186/gb-2013-14-7-r72] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2013] [Revised: 03/08/2013] [Accepted: 07/18/2013] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND Large-scale annotation efforts have improved our ability to coarsely predict regulatory elements throughout vertebrate genomes. However, it is unclear how complex spatiotemporal patterns of gene expression driven by these elements emerge from the activity of short, transcription factor binding sequences. RESULTS We describe a comprehensive promoter extension assay in which the regulatory potential of all 6 base-pair (bp) sequences was tested in the context of a minimal promoter. To enable this large-scale screen, we developed algorithms that use a reverse-complement aware decomposition of the de Bruijn graph to design a library of DNA oligomers incorporating every 6-bp sequence exactly once. Our library multiplexes all 4,096 unique 6-mers into 184 double-stranded 15-bp oligomers, which is sufficiently compact for in vivo testing. We injected each multiplexed construct into zebrafish embryos and scored GFP expression in 15 tissues at two developmental time points. Twenty-seven constructs produced consistent expression patterns, with the majority doing so in only one tissue. Functional sequences are enriched near biologically relevant genes, match motifs for developmental transcription factors, and are required for enhancer activity. By concatenating tissue-specific functional sequences, we generated completely synthetic enhancers for the notochord, epidermis, spinal cord, forebrain and otic lateral line, and show that short regulatory sequences do not always function modularly. CONCLUSIONS This work introduces a unique in vivo catalog of short, functional regulatory sequences and demonstrates several important principles of regulatory element organization. Furthermore, we provide resources for designing compact, reverse-complement aware k-mer libraries.
Collapse
Affiliation(s)
- Robin P Smith
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| | - Samantha J Riesenfeld
- Gladstone Institutes, University of California San Francisco, 1650 Owens St, San Francisco, CA 94158, USA
| | - Alisha K Holloway
- Gladstone Institutes, University of California San Francisco, 1650 Owens St, San Francisco, CA 94158, USA
| | - Qiang Li
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Current address: Institute for Pediatrics, Translational Research Center for Development and Disease, Children's Hospital of Fudan University, Shanghai, 201102, China
| | - Karl K Murphy
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| | - Natalie M Feliciano
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| | - Lorenzo Orecchia
- Division of Biostatistics, University of California San Francisco, 1650 Owens St, CA 94158, USA
| | - Nir Oksenberg
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| | - Katherine S Pollard
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Gladstone Institutes, University of California San Francisco, 1650 Owens St, San Francisco, CA 94158, USA
- Department of Mathematics, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| |
Collapse
|