1
|
Gobbo F, Zingariello M, Verachi P, Falchi M, Arciprete F, Martelli F, Peli A, Mazzarini M, Vierstra J, Mead-Harvey C, Dueck AC, Sarli G, Nava S, Sgalla G, Richeldi L, Migliaccio AR. GATA1-defective immune-megakaryocytes as possible drivers of idiopathic pulmonary fibrosis. bioRxiv 2023:2023.06.20.542249. [PMID: 37425686 PMCID: PMC10327123 DOI: 10.1101/2023.06.20.542249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
Idiopathic pulmonary fibrosis (IPF) is a progressive fibrotic lung disorder with limited therapeutic options. Insufficient understanding of driver mutations and poor fidelity of currently available animal models has limited the development of effective therapies. Since GATA1 deficient megakaryocytes sustain myelofibrosis, we hypothesized that they may also induce fibrosis in lungs. We discovered that lungs from IPF patients and Gata1low mice contain numerous GATA1negative immune-poised megakaryocytes that, in mice, have defective RNA-seq profiling and increased TGF-β1, CXCL1 and P-selectin content. With age, Gata1low mice develop fibrosis in lungs. Development of lung fibrosis in this model is prevented by P-selectin deletion and rescued by P-selectin, TGF-β1 or CXCL1 inhibition. Mechanistically, P-selectin inhibition decreases TGF-β1 and CXCL1 content and increases GATA1positive megakaryocytes while TGF-β1 or CXCL1 inhibition decreased CXCL1 only. In conclusion, Gata1low mice are a novel genetic-driven model for IPF and provide a link between abnormal immune-megakaryocytes and lung fibrosis.
Collapse
Affiliation(s)
- Francesca Gobbo
- Department of Veterinary Medical Sciences, University of Bologna, Ozzano dell’Emilia (Bologna) 40064, Italy
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
| | - Maria Zingariello
- Unit of Microscopic and Ultrastructural Anatomy, Department of Medicine, University Campus Bio-Medico, Rome 00128, Italy
| | - Paola Verachi
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
| | - Mario Falchi
- National Center HIV/AIDS Research, Istituto Superiore di Sanita, Rome 00161, Italy
| | - Francesca Arciprete
- Unit of Microscopic and Ultrastructural Anatomy, Department of Medicine, University Campus Bio-Medico, Rome 00128, Italy
| | - Fabrizio Martelli
- National Center for Preclinical and Clinical Research and Evaluation of Pharmaceutical Drugs, Istituto Superiore di Sanita, Rome 00161, Italy
| | - Angelo Peli
- Department for Life Quality Studies, University of Bologna, Rimini Campus, Rimini 47921, Italy
| | - Maria Mazzarini
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
- Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| | - Jeff Vierstra
- Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| | - Carolyn Mead-Harvey
- Mayo Clinic, Department of Quantitative Health Sciences, Division of Clinical Trials and Biostatistics, Scottsdale, AZ 85259, USA
| | - Amylou C. Dueck
- Mayo Clinic, Department of Quantitative Health Sciences, Division of Clinical Trials and Biostatistics, Scottsdale, AZ 85259, USA
| | - Giuseppe Sarli
- Department of Veterinary Medical Sciences, University of Bologna, Ozzano dell’Emilia (Bologna) 40064, Italy
| | - Stefano Nava
- IRCCS Azienda Ospedaliero-Universitaria di Bologna, Respiratory and Critical Care Unit, Bologna 40138, Italy
- Department of Medical and Surgical Sciences, University of Bologna, Bologna 40138, Italy
| | - Giacomo Sgalla
- Department of Medical and Surgical Sciences Fondazione Policlinico Universitario “A. Gemelli” IRCCS, Universita Cattolica del Sacro Cuore, Roma 00168, Italy
| | - Luca Richeldi
- Department of Medical and Surgical Sciences Fondazione Policlinico Universitario “A. Gemelli” IRCCS, Universita Cattolica del Sacro Cuore, Roma 00168, Italy
| | - Anna Rita Migliaccio
- Unit of Microscopic and Ultrastructural Anatomy, Department of Medicine, University Campus Bio-Medico, Rome 00128, Italy
- Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| |
Collapse
|
2
|
Georgolopoulos G, Psatha N, Iwata M, Nishida A, Som T, Yiangou M, Stamatoyannopoulos JA, Vierstra J. Discrete regulatory modules instruct hematopoietic lineage commitment and differentiation. Nat Commun 2021; 12:6790. [PMID: 34815405 PMCID: PMC8611072 DOI: 10.1038/s41467-021-27159-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Accepted: 10/20/2021] [Indexed: 11/08/2022] Open
Abstract
Lineage commitment and differentiation is driven by the concerted action of master transcriptional regulators at their target chromatin sites. Multiple efforts have characterized the key transcription factors (TFs) that determine the various hematopoietic lineages. However, the temporal interactions between individual TFs and their chromatin targets during differentiation and how these interactions dictate lineage commitment remains poorly understood. Here we perform dense, daily, temporal profiling of chromatin accessibility (DNase I-seq) and gene expression changes (total RNA-seq) along ex vivo human erythropoiesis to comprehensively define developmentally regulated DNase I hypersensitive sites (DHSs) and transcripts. We link both distal DHSs to their target gene promoters and individual TFs to their target DHSs, revealing that the regulatory landscape is organized in distinct sequential regulatory modules that regulate lineage restriction and maturation. Finally, direct comparison of transcriptional dynamics (bulk and single-cell) and lineage potential between erythropoiesis and megakaryopoiesis uncovers differential fate commitment dynamics between the two lineages as they exit the stem and progenitor stage. Collectively, these data provide insights into the temporally regulated synergy of the cis- and the trans-regulatory components underlying hematopoietic lineage commitment and differentiation.
Collapse
Affiliation(s)
- Grigorios Georgolopoulos
- Altius Institute for Biomedical Sciences, Seattle, WA, USA.
- Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece.
| | | | - Mineo Iwata
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Andrew Nishida
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Tannishtha Som
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Minas Yiangou
- Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - John A Stamatoyannopoulos
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Division of Oncology, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Jeff Vierstra
- Altius Institute for Biomedical Sciences, Seattle, WA, USA.
| |
Collapse
|
3
|
Meuleman W, Muratov A, Rynes E, Halow J, Lee K, Bates D, Diegel M, Dunn D, Neri F, Teodosiadis A, Reynolds A, Haugen E, Nelson J, Johnson A, Frerker M, Buckley M, Sandstrom R, Vierstra J, Kaul R, Stamatoyannopoulos J. Index and biological spectrum of human DNase I hypersensitive sites. Nature 2020; 584:244-251. [PMID: 32728217 PMCID: PMC7422677 DOI: 10.1038/s41586-020-2559-3] [Citation(s) in RCA: 134] [Impact Index Per Article: 33.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Accepted: 07/01/2020] [Indexed: 01/08/2023]
Abstract
DNase I hypersensitive sites (DHSs) are generic markers of regulatory DNA1–5 and contain genetic variations associated with diseases and phenotypic traits6–8. We created high-resolution maps of DHSs from 733 human biosamples encompassing 438 cell and tissue types and states, and integrated these to delineate and numerically index approximately 3.6 million DHSs within the human genome sequence, providing a common coordinate system for regulatory DNA. Here we show that these maps highly resolve the cis-regulatory compartment of the human genome, which encodes unexpectedly diverse cell- and tissue-selective regulatory programs at very high density. These programs can be captured comprehensively by a simple vocabulary that enables the assignment to each DHS of a regulatory barcode that encapsulates its tissue manifestations, and global annotation of protein-coding and non-coding RNA genes in a manner orthogonal to gene expression. Finally, we show that sharply resolved DHSs markedly enhance the genetic association and heritability signals of diseases and traits. Rather than being confined to a small number of distal elements or promoters, we find that genetic signals converge on congruently regulated sets of DHSs that decorate entire gene bodies. Together, our results create a universal, extensible coordinate system and vocabulary for human regulatory DNA marked by DHSs, and provide a new global perspective on the architecture of human gene regulation. High-resolution maps of DNase I hypersensitive sites from 733 human biosamples are used to identify and index regulatory DNA within the human genome.
Collapse
Affiliation(s)
| | | | - Eric Rynes
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Jessica Halow
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Kristen Lee
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Daniel Bates
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Morgan Diegel
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Douglas Dunn
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Fidencio Neri
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | | | - Alex Reynolds
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Eric Haugen
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Jemma Nelson
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Audra Johnson
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Mark Frerker
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | | | | | - Jeff Vierstra
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Rajinder Kaul
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - John Stamatoyannopoulos
- Altius Institute for Biomedical Sciences, Seattle, WA, USA. .,Department of Genome Sciences, University of Washington, Seattle, WA, USA. .,Division of Oncology, Department of Medicine, University of Washington, Seattle, WA, USA.
| |
Collapse
|
4
|
Breeze CE, Reynolds AP, van Dongen J, Dunham I, Lazar J, Neph S, Vierstra J, Bourque G, Teschendorff AE, Stamatoyannopoulos JA, Beck S. eFORGE v2.0: updated analysis of cell type-specific signal in epigenomic data. Bioinformatics 2020; 35:4767-4769. [PMID: 31161210 PMCID: PMC6853678 DOI: 10.1093/bioinformatics/btz456] [Citation(s) in RCA: 68] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2018] [Revised: 04/24/2019] [Accepted: 05/29/2019] [Indexed: 12/31/2022] Open
Abstract
SUMMARY The Illumina Infinium EPIC BeadChip is a new high-throughput array for DNA methylation analysis, extending the earlier 450k array by over 400 000 new sites. Previously, a method named eFORGE was developed to provide insights into cell type-specific and cell-composition effects for 450k data. Here, we present a significantly updated and improved version of eFORGE that can analyze both EPIC and 450k array data. New features include analysis of chromatin states, transcription factor motifs and DNase I footprints, providing tools for epigenome-wide association study interpretation and epigenome editing. AVAILABILITY AND IMPLEMENTATION eFORGE v2.0 is implemented as a web tool available from https://eforge.altiusinstitute.org and https://eforge-tf.altiusinstitute.org/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Charles E Breeze
- Medical Genomics Group, UCL Cancer Institute, University College London, London WC1E 6BT, UK.,Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| | - Alex P Reynolds
- Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| | - Jenny van Dongen
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam 1081BT, The Netherlands
| | - Ian Dunham
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge CB10 1SD, UK
| | - John Lazar
- Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| | - Shane Neph
- Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| | - Jeff Vierstra
- Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| | - Guillaume Bourque
- Department of Human Genetics, McGill University and Génome Québec Innovation Center, Montréal H3A 0G1, Canada
| | - Andrew E Teschendorff
- CAS Key Lab of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.,Statistical Genomics Group, UCL Cancer Institute, University College London, London WC1E 6BT, UK
| | | | - Stephan Beck
- Medical Genomics Group, UCL Cancer Institute, University College London, London WC1E 6BT, UK
| |
Collapse
|
5
|
Vierstra J, Lazar J, Sandstrom R, Halow J, Lee K, Bates D, Diegel M, Dunn D, Neri F, Haugen E, Rynes E, Reynolds A, Nelson J, Johnson A, Frerker M, Buckley M, Kaul R, Meuleman W, Stamatoyannopoulos JA. Global reference mapping of human transcription factor footprints. Nature 2020; 583:729-736. [PMID: 32728250 PMCID: PMC7410829 DOI: 10.1038/s41586-020-2528-x] [Citation(s) in RCA: 152] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Accepted: 06/25/2020] [Indexed: 11/09/2022]
Abstract
Combinatorial binding of transcription factors to regulatory DNA underpins gene regulation in all organisms. Genetic variation in regulatory regions has been connected with diseases and diverse phenotypic traits1, but it remains challenging to distinguish variants that affect regulatory function2. Genomic DNase I footprinting enables the quantitative, nucleotide-resolution delineation of sites of transcription factor occupancy within native chromatin3-6. However, only a small fraction of such sites have been precisely resolved on the human genome sequence6. Here, to enable comprehensive mapping of transcription factor footprints, we produced high-density DNase I cleavage maps from 243 human cell and tissue types and states and integrated these data to delineate about 4.5 million compact genomic elements that encode transcription factor occupancy at nucleotide resolution. We map the fine-scale structure within about 1.6 million DNase I-hypersensitive sites and show that the overwhelming majority are populated by well-spaced sites of single transcription factor-DNA interaction. Cell-context-dependent cis-regulation is chiefly executed by wholesale modulation of accessibility at regulatory DNA rather than by differential transcription factor occupancy within accessible elements. We also show that the enrichment of genetic variants associated with diseases or phenotypic traits in regulatory regions1,7 is almost entirely attributable to variants within footprints, and that functional variants that affect transcription factor occupancy are nearly evenly partitioned between loss- and gain-of-function alleles. Unexpectedly, we find increased density of human genetic variation within transcription factor footprints, revealing an unappreciated driver of cis-regulatory evolution. Our results provide a framework for both global and nucleotide-precision analyses of gene regulatory mechanisms and functional genetic variation.
Collapse
Affiliation(s)
- Jeff Vierstra
- Altius Institute for Biomedical Sciences, Seattle, WA, USA.
| | - John Lazar
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | | | - Jessica Halow
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Kristen Lee
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Daniel Bates
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Morgan Diegel
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Douglas Dunn
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Fidencio Neri
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Eric Haugen
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Eric Rynes
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Alex Reynolds
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Jemma Nelson
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Audra Johnson
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Mark Frerker
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | | | - Rajinder Kaul
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | | | - John A Stamatoyannopoulos
- Altius Institute for Biomedical Sciences, Seattle, WA, USA.
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
- Division of Oncology, Department of Medicine, University of Washington, Seattle, WA, USA.
| |
Collapse
|
6
|
Georgolopoulos G, Iwata M, Psatha N, Yiangou M, Vierstra J. Unbiased phenotypic identification of functionally distinct hematopoietic progenitors. ACTA ACUST UNITED AC 2019; 26:4. [PMID: 31360678 PMCID: PMC6639971 DOI: 10.1186/s40709-019-0097-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Accepted: 07/06/2019] [Indexed: 12/14/2022]
Abstract
Background Hematopoiesis is a model-system for studying cellular development and differentiation. Phenotypic and functional characterization of hematopoietic progenitors has significantly aided our understanding of the mechanisms that govern fate choice, lineage specification and maturity. Methods for progenitor isolation have historically relied on complex flow-cytometric strategies based on nested, arbitrary gates within defined panels of immunophenotypic markers. The resulted populations are then functionally assessed, although functional homogeneity or absolute linkage between function and phenotype is not always achieved, thus distorting our view on progenitor biology. Method In this study, we present a protocol for unbiased phenotypic identification and functional characterization which combines index sorting and clonogenic assessment of individual progenitor cells. Single-cells are plated into custom media allowing multiple hematopoietic fates to emerge and are allowed to give rise to unilineage colonies or mixed. After colony identification, lineage potential is assigned to each progenitor and finally the indexed phenotype of the initial cell is recalled and a phenotype is assigned to each functional output. Conclusions Our approach overcomes the limitations of the current protocols expanding beyond the established cell-surface marker panels and abolishing the need for nested gating. Using this method we were able to resolve the relationships of myeloid progenitors according to the revised model of hematopoiesis, as well as identify a novel marker for erythroid progenitors. Finally, this protocol can be applied to the characterization of any progenitor cell with measurable function. Electronic supplementary material The online version of this article (10.1186/s40709-019-0097-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Grigorios Georgolopoulos
- 1Altius Institute for Biomedical Sciences, Seattle, WA 98121 USA.,2Department of Genetics, Development and Molecular Biology, School of Biology, Aristotle University of Thessaloniki, 541 24 Thessaloniki, Greece
| | - Mineo Iwata
- 1Altius Institute for Biomedical Sciences, Seattle, WA 98121 USA
| | - Nikoletta Psatha
- 1Altius Institute for Biomedical Sciences, Seattle, WA 98121 USA
| | - Minas Yiangou
- 2Department of Genetics, Development and Molecular Biology, School of Biology, Aristotle University of Thessaloniki, 541 24 Thessaloniki, Greece
| | - Jeff Vierstra
- 1Altius Institute for Biomedical Sciences, Seattle, WA 98121 USA
| |
Collapse
|
7
|
|
8
|
Maurano MT, Haugen E, Sandstrom R, Vierstra J, Shafer A, Kaul R, Stamatoyannopoulos JA. Erratum: Large-scale identification of sequence variants influencing human transcription factor occupancy in vivo. Nat Genet 2016; 48:101. [DOI: 10.1038/ng0116-101c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
9
|
He X, Tillo D, Vierstra J, Syed KS, Deng C, Ray GJ, Stamatoyannopoulos J, FitzGerald PC, Vinson C. Methylated Cytosines Mutate to Transcription Factor Binding Sites that Drive Tetrapod Evolution. Genome Biol Evol 2015; 7:3155-69. [PMID: 26507798 PMCID: PMC4994754 DOI: 10.1093/gbe/evv205] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
In mammals, the cytosine in CG dinucleotides is typically methylated producing
5-methylcytosine (5mC), a chemically less stable form of cytosine that can spontaneously
deaminate to thymidine resulting in a T•G mismatched base pair. Unlike other eukaryotes
that efficiently repair this mismatched base pair back to C•G, in mammals, 5mCG
deamination is mutagenic, sometimes producing TG dinucleotides, explaining the depletion
of CG dinucleotides in mammalian genomes. It was suggested that new TG dinucleotides
generate genetic diversity that may be critical for evolutionary change. We tested this
conjecture by examining the DNA sequence properties of regulatory sequences identified by
DNase I hypersensitive sites (DHSs) in human and mouse genomes. We hypothesized that the
new TG dinucleotides generate transcription factor binding sites (TFBS) that become
tissue-specific DHSs (TS-DHSs). We find that 8-mers containing the CG dinucleotide are
enriched in DHSs in both species. However, 8-mers containing a TG and no CG dinucleotide
are preferentially enriched in TS-DHSs when compared with 8-mers with neither a TG nor a
CG dinucleotide. The most enriched 8-mer with a TG and no CG dinucleotide in
tissue-specific regulatory regions in both genomes is the AP-1 motif
(TGAC/GTCAN), and we find evidence that
TG dinucleotides in the AP-1 motif arose from CG dinucleotides. Additional TS-DHS-enriched
TFBS containing the TG/CA dinucleotide are the E-Box motif
(GCAGCTGC), the NF-1 motif (GGCA—TGCC), and the
GR (glucocorticoid receptor) motif (G-ACA—TGT-C). Our results support the
suggestion that cytosine methylation is mutagenic in tetrapods producing TG dinucleotides
that create TFBS that drive evolution.
Collapse
Affiliation(s)
- Ximiao He
- Laboratory of Metabolism, National Cancer Institute, National Institutes of Health, Bethesda, Maryland
| | - Desiree Tillo
- Laboratory of Metabolism, National Cancer Institute, National Institutes of Health, Bethesda, Maryland
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington
| | - Khund-Sayeed Syed
- Laboratory of Metabolism, National Cancer Institute, National Institutes of Health, Bethesda, Maryland
| | - Callie Deng
- Laboratory of Metabolism, National Cancer Institute, National Institutes of Health, Bethesda, Maryland
| | - G Jordan Ray
- Laboratory of Metabolism, National Cancer Institute, National Institutes of Health, Bethesda, Maryland
| | | | - Peter C FitzGerald
- Genome Analysis Unit, Genetics Branch, National Cancer Institute, National Institutes of Health, Bethesda, Maryland
| | - Charles Vinson
- Laboratory of Metabolism, National Cancer Institute, National Institutes of Health, Bethesda, Maryland
| |
Collapse
|
10
|
Vierstra J, Reik A, Chang KH, Stehling-Sun S, Zhou Y, Hinkley SJ, Paschon DE, Zhang L, Psatha N, Bendana YR, O'Neil CM, Song AH, Mich AK, Liu PQ, Lee G, Bauer DE, Holmes MC, Orkin SH, Papayannopoulou T, Stamatoyannopoulos G, Rebar EJ, Gregory PD, Urnov FD, Stamatoyannopoulos JA. Functional footprinting of regulatory DNA. Nat Methods 2015; 12:927-30. [PMID: 26322838 PMCID: PMC5381659 DOI: 10.1038/nmeth.3554] [Citation(s) in RCA: 106] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Accepted: 07/30/2015] [Indexed: 12/19/2022]
Abstract
Regulatory regions harbor multiple transcription factor recognition sites; however, the contribution of individual sites to regulatory function remains challenging to define. We describe a facile approach that exploits the error-prone nature of genome editing-induced double-strand break repair to map functional elements within regulatory DNA at nucleotide resolution. We demonstrate the approach on a human erythroid enhancer, revealing single TF recognition sites that gate the majority of downstream regulatory function.
Collapse
Affiliation(s)
- Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Andreas Reik
- Sangamo BioSciences, Pt. Richmond, California, USA
| | - Kai-Hsin Chang
- Department of Medicine, Division of Hematology, University of Washington, Seattle, Washington, USA
| | - Sandra Stehling-Sun
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Yuanyue Zhou
- Sangamo BioSciences, Pt. Richmond, California, USA
| | | | | | - Lei Zhang
- Sangamo BioSciences, Pt. Richmond, California, USA
| | - Nikoletta Psatha
- Department of Medicine, Division of Hematology, University of Washington, Seattle, Washington, USA
| | | | | | | | | | - Pei-Qi Liu
- Sangamo BioSciences, Pt. Richmond, California, USA
| | - Gary Lee
- Sangamo BioSciences, Pt. Richmond, California, USA
| | - Daniel E Bauer
- Boston Children's Hospital, Division of Hematology/Oncology, Boston, Massachusetts, USA
| | | | - Stuart H Orkin
- Boston Children's Hospital, Division of Hematology/Oncology, Boston, Massachusetts, USA
| | - Thalia Papayannopoulou
- Department of Medicine, Division of Hematology, University of Washington, Seattle, Washington, USA
| | - George Stamatoyannopoulos
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, Washington, USA
| | | | | | | | - John A Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA.,Department of Medicine, Division of Oncology, University of Washington, Seattle, Washington, USA
| |
Collapse
|
11
|
Reik A, Chang KH, Vierstra J, Psatha N, Stehling-Sun S, Zhou Y, O'Neil CM, Mich A, Miller JC, Lee G, Tan S, Jiang H, Stamatoyannopoulos G, Papayannopoulou T, Rebar EJ, Gregory PD, Stamatoyannopoulos JA, Urnov FD. 53. From GWAS To the Clinic: Genome-Editing the Human BCL11A Erythroid Enhancer for Fetal Globin Elevation in the Hemoglobinopathies. Mol Ther 2015. [DOI: 10.1016/s1525-0016(16)33658-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
12
|
Mayer A, Iulio J, Maleri S, Eser U, Reynolds A, Vierstra J, Sandstrom R, Stamatoyannopoulos J, Churchman LS. High Resolution Architecture of Human Transcriptional Activity Revealed by Native Elongating Transcript Sequencing. FASEB J 2015. [DOI: 10.1096/fasebj.29.1_supplement.562.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Andreas Mayer
- Department of Genetics Harvard Medical SchoolBostonMAUnited States
| | - Julia Iulio
- Department of Genetics Harvard Medical SchoolBostonMAUnited States
| | - Seth Maleri
- Department of Genetics Harvard Medical SchoolBostonMAUnited States
| | - Umut Eser
- Department of Genetics Harvard Medical SchoolBostonMAUnited States
| | - Alex Reynolds
- Department of Genome SciencesUniversity of WashingtonSeattleWashingtonUnited States
| | - Jeff Vierstra
- Department of Genome SciencesUniversity of WashingtonSeattleWashingtonUnited States
| | - Richard Sandstrom
- Department of Genome SciencesUniversity of WashingtonSeattleWashingtonUnited States
| | | | | |
Collapse
|
13
|
Wilken MS, Brzezinski JA, La Torre A, Siebenthall K, Thurman R, Sabo P, Sandstrom RS, Vierstra J, Canfield TK, Hansen RS, Bender MA, Stamatoyannopoulos J, Reh TA. DNase I hypersensitivity analysis of the mouse brain and retina identifies region-specific regulatory elements. Epigenetics Chromatin 2015; 8:8. [PMID: 25972927 PMCID: PMC4429822 DOI: 10.1186/1756-8935-8-8] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2014] [Accepted: 01/27/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The brain, spinal cord, and neural retina comprise the central nervous system (CNS) of vertebrates. Understanding the regulatory mechanisms that underlie the enormous cell-type diversity of the CNS is a significant challenge. Whole-genome mapping of DNase I-hypersensitive sites (DHSs) has been used to identify cis-regulatory elements in many tissues. We have applied this approach to the mouse CNS, including developing and mature neural retina, whole brain, and two well-characterized brain regions, the cerebellum and the cerebral cortex. RESULTS For the various regions and developmental stages of the CNS that we analyzed, there were approximately the same number of DHSs; however, there were many DHSs unique to each CNS region and developmental stage. Many of the DHSs are likely to mark enhancers that are specific to the specific CNS region and developmental stage. We validated the DNase I mapping approach for identification of CNS enhancers using the existing VISTA Browser database and with in vivo and in vitro electroporation of the retina. Analysis of transcription factor consensus sites within the DHSs shows distinct region-specific profiles of transcriptional regulators particular to each region. Clustering developmentally dynamic DHSs in the retina revealed enrichment of developmental stage-specific transcriptional regulators. Additionally, we found reporter gene activity in the retina driven from several previously uncharacterized regulatory elements surrounding the neurodevelopmental gene Otx2. Identification of DHSs shared between mouse and human showed region-specific differences in the evolution of cis-regulatory elements. CONCLUSIONS Overall, our results demonstrate the potential of genome-wide DNase I mapping to cis-regulatory questions regarding the regional diversity within the CNS. These data represent an extensive catalogue of potential cis-regulatory elements within the CNS that display region and temporal specificity, as well as a set of DHSs common to CNS tissues. Further examination of evolutionary conservation of DHSs between CNS regions and different species may reveal important cis-regulatory elements in the evolution of the mammalian CNS.
Collapse
Affiliation(s)
- Matthew S Wilken
- Department of Biological Structure, University of Washington, 1959 NE Pacific Street, Box 357420, Seattle, WA 98195 USA ; Molecular and Cellular Biology Program, University of Washington, MCB Program Office, T-466 Health Sciences Building, Box 357275, Seattle, WA 98195 USA
| | - Joseph A Brzezinski
- Department of Biological Structure, University of Washington, 1959 NE Pacific Street, Box 357420, Seattle, WA 98195 USA ; Department of Ophthalmology, University of Colorado School of Medicine, 1675 Aurora Court, Aurora, CO 80045 USA
| | - Anna La Torre
- Department of Biological Structure, University of Washington, 1959 NE Pacific Street, Box 357420, Seattle, WA 98195 USA
| | - Kyle Siebenthall
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Robert Thurman
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Peter Sabo
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Richard S Sandstrom
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Theresa K Canfield
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - R Scott Hansen
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Michael A Bender
- Department of Pediatrics, University of Washington, 1959 NE Pacific St, Health Sciences Building, Seattle, WA Box 356320, 98195 USA ; Clinical Research Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109 USA
| | - John Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Foege Building S-250, 3720 15th Ave NE, Box 355065, Seattle, WA 98195 USA
| | - Thomas A Reh
- Department of Biological Structure, University of Washington, 1959 NE Pacific Street, Box 357420, Seattle, WA 98195 USA
| |
Collapse
|
14
|
Yue F, Cheng Y, Breschi A, Vierstra J, Wu W, Ryba T, Sandstrom R, Ma Z, Davis C, Pope BD, Shen Y, Pervouchine DD, Djebali S, Thurman RE, Kaul R, Rynes E, Kirilusha A, Marinov GK, Williams BA, Trout D, Amrhein H, Fisher-Aylor K, Antoshechkin I, DeSalvo G, See LH, Fastuca M, Drenkow J, Zaleski C, Dobin A, Prieto P, Lagarde J, Bussotti G, Tanzer A, Denas O, Li K, Bender MA, Zhang M, Byron R, Groudine MT, McCleary D, Pham L, Ye Z, Kuan S, Edsall L, Wu YC, Rasmussen MD, Bansal MS, Kellis M, Keller CA, Morrissey CS, Mishra T, Jain D, Dogan N, Harris RS, Cayting P, Kawli T, Boyle AP, Euskirchen G, Kundaje A, Lin S, Lin Y, Jansen C, Malladi VS, Cline MS, Erickson DT, Kirkup VM, Learned K, Sloan CA, Rosenbloom KR, Lacerda de Sousa B, Beal K, Pignatelli M, Flicek P, Lian J, Kahveci T, Lee D, Kent WJ, Ramalho Santos M, Herrero J, Notredame C, Johnson A, Vong S, Lee K, Bates D, Neri F, Diegel M, Canfield T, Sabo PJ, Wilken MS, Reh TA, Giste E, Shafer A, Kutyavin T, Haugen E, Dunn D, Reynolds AP, Neph S, Humbert R, Hansen RS, De Bruijn M, Selleri L, Rudensky A, Josefowicz S, Samstein R, Eichler EE, Orkin SH, Levasseur D, Papayannopoulou T, Chang KH, Skoultchi A, Gosh S, Disteche C, Treuting P, Wang Y, Weiss MJ, Blobel GA, Cao X, Zhong S, Wang T, Good PJ, Lowdon RF, Adams LB, Zhou XQ, Pazin MJ, Feingold EA, Wold B, Taylor J, Mortazavi A, Weissman SM, Stamatoyannopoulos JA, Snyder MP, Guigo R, Gingeras TR, Gilbert DM, Hardison RC, Beer MA, Ren B. A comparative encyclopedia of DNA elements in the mouse genome. Nature 2015; 515:355-64. [PMID: 25409824 PMCID: PMC4266106 DOI: 10.1038/nature13992] [Citation(s) in RCA: 1135] [Impact Index Per Article: 126.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Accepted: 10/24/2014] [Indexed: 12/11/2022]
Abstract
The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Collapse
Affiliation(s)
- Feng Yue
- 1] Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA. [2] Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, Pennsylvania 17033, USA
| | - Yong Cheng
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Alessandra Breschi
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Weisheng Wu
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Tyrone Ryba
- Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306-4295, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Zhihai Ma
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Carrie Davis
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - Benjamin D Pope
- Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306-4295, USA
| | - Yin Shen
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Dmitri D Pervouchine
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Sarah Djebali
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Robert E Thurman
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Rajinder Kaul
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Eric Rynes
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Anthony Kirilusha
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Georgi K Marinov
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Brian A Williams
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Diane Trout
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Henry Amrhein
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Katherine Fisher-Aylor
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Igor Antoshechkin
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Gilberto DeSalvo
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Lei-Hoon See
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - Meagan Fastuca
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - Jorg Drenkow
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - Chris Zaleski
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - Alex Dobin
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - Pablo Prieto
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Julien Lagarde
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Giovanni Bussotti
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Andrea Tanzer
- 1] Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain. [2] Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Waehringerstrasse 17/3/303, A-1090 Vienna, Austria
| | - Olgert Denas
- Departments of Biology and Mathematics and Computer Science, Emory University, O. Wayne Rollins Research Center, 1510 Clifton Road NE, Atlanta, Georgia 30322, USA
| | - Kanwei Li
- Departments of Biology and Mathematics and Computer Science, Emory University, O. Wayne Rollins Research Center, 1510 Clifton Road NE, Atlanta, Georgia 30322, USA
| | - M A Bender
- 1] Department of Pediatrics, University of Washington, Seattle, Washington 98195, USA. [2] Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Miaohua Zhang
- Basic Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Rachel Byron
- Basic Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Mark T Groudine
- 1] Basic Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA. [2] Department of Radiation Oncology, University of Washington, Seattle, Washington 98195, USA
| | - David McCleary
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Long Pham
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Zhen Ye
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Samantha Kuan
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Lee Edsall
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Yi-Chieh Wu
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
| | - Matthew D Rasmussen
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
| | - Mukul S Bansal
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
| | - Manolis Kellis
- 1] Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA. [2] Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Cheryl A Keller
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Christapher S Morrissey
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Tejaswini Mishra
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Deepti Jain
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Nergiz Dogan
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Robert S Harris
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Philip Cayting
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Trupti Kawli
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Alan P Boyle
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Ghia Euskirchen
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Anshul Kundaje
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Shin Lin
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Yiing Lin
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Camden Jansen
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, California 92697, USA
| | - Venkat S Malladi
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Melissa S Cline
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Drew T Erickson
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Vanessa M Kirkup
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Katrina Learned
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Cricket A Sloan
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Kate R Rosenbloom
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Beatriz Lacerda de Sousa
- Departments of Obstetrics/Gynecology and Pathology, and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California 94143, USA
| | - Kathryn Beal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Miguel Pignatelli
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jin Lian
- Yale University, Department of Genetics, PO Box 208005, 333 Cedar Street, New Haven, Connecticut 06520-8005, USA
| | - Tamer Kahveci
- Computer &Information Sciences &Engineering, University of Florida, Gainesville, Florida 32611, USA
| | - Dongwon Lee
- McKusick-Nathans Institute of Genetic Medicine and Department of Biomedical Engineering, Johns Hopkins University, 733 N. Broadway, BRB 573 Baltimore, Maryland 21205, USA
| | - W James Kent
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Miguel Ramalho Santos
- Departments of Obstetrics/Gynecology and Pathology, and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California 94143, USA
| | - Javier Herrero
- 1] European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK. [2] Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London WC1E 6DD, UK
| | - Cedric Notredame
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Audra Johnson
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Matthew S Wilken
- Department of Biological Structure, University of Washington, HSB I-516, 1959 NE Pacific Street, Seattle, Washington 98195, USA
| | - Thomas A Reh
- Department of Biological Structure, University of Washington, HSB I-516, 1959 NE Pacific Street, Seattle, Washington 98195, USA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Anthony Shafer
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Tanya Kutyavin
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Alex P Reynolds
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Richard Humbert
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - R Scott Hansen
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Marella De Bruijn
- MRC Molecular Haemotology Unit, University of Oxford, Oxford OX3 9DS, UK
| | - Licia Selleri
- Department of Cell and Developmental Biology, Weill Cornell Medical College, New York, New York 10065, USA
| | - Alexander Rudensky
- HHMI and Ludwig Center at Memorial Sloan Kettering Cancer Center, Immunology Program, Memorial Sloan Kettering Cancer Canter, New York, New York 10065, USA
| | - Steven Josefowicz
- HHMI and Ludwig Center at Memorial Sloan Kettering Cancer Center, Immunology Program, Memorial Sloan Kettering Cancer Canter, New York, New York 10065, USA
| | - Robert Samstein
- HHMI and Ludwig Center at Memorial Sloan Kettering Cancer Center, Immunology Program, Memorial Sloan Kettering Cancer Canter, New York, New York 10065, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Stuart H Orkin
- Dana Farber Cancer Institute, Harvard Medical School, Cambridge, Massachusetts 02138, USA
| | - Dana Levasseur
- University of Iowa Carver College of Medicine, Department of Internal Medicine, Iowa City, Iowa 52242, USA
| | - Thalia Papayannopoulou
- Division of Hematology, Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Kai-Hsin Chang
- University of Iowa Carver College of Medicine, Department of Internal Medicine, Iowa City, Iowa 52242, USA
| | - Arthur Skoultchi
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York 10461, USA
| | - Srikanta Gosh
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York 10461, USA
| | - Christine Disteche
- Department of Pathology, University of Washington, Seattle, Washington 98195, USA
| | - Piper Treuting
- Department of Comparative Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Yanli Wang
- Bioinformatics and Genomics program, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Mitchell J Weiss
- Department of Hematology, St Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
| | - Gerd A Blobel
- 1] Division of Hematology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA. [2] Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Xiaoyi Cao
- Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Sheng Zhong
- Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Ting Wang
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Peter J Good
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Rebecca F Lowdon
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Leslie B Adams
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Xiao-Qiao Zhou
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Michael J Pazin
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Elise A Feingold
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Barbara Wold
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - James Taylor
- Departments of Biology and Mathematics and Computer Science, Emory University, O. Wayne Rollins Research Center, 1510 Clifton Road NE, Atlanta, Georgia 30322, USA
| | - Ali Mortazavi
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, California 92697, USA
| | - Sherman M Weissman
- Yale University, Department of Genetics, PO Box 208005, 333 Cedar Street, New Haven, Connecticut 06520-8005, USA
| | | | - Michael P Snyder
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Roderic Guigo
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Thomas R Gingeras
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - David M Gilbert
- Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306-4295, USA
| | - Ross C Hardison
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Michael A Beer
- McKusick-Nathans Institute of Genetic Medicine and Department of Biomedical Engineering, Johns Hopkins University, 733 N. Broadway, BRB 573 Baltimore, Maryland 21205, USA
| | - Bing Ren
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | | |
Collapse
|
15
|
Stergachis AB, Neph S, Sandstrom R, Haugen E, Reynolds AP, Zhang M, Byron R, Canfield T, Stelhing-Sun S, Lee K, Thurman RE, Vong S, Bates D, Neri F, Diegel M, Giste E, Dunn D, Vierstra J, Hansen RS, Johnson AK, Sabo PJ, Wilken MS, Reh TA, Treuting PM, Kaul R, Groudine M, Bender MA, Borenstein E, Stamatoyannopoulos JA. Conservation of trans-acting circuitry during mammalian regulatory evolution. Nature 2015; 515:365-70. [PMID: 25409825 PMCID: PMC4405208 DOI: 10.1038/nature13972] [Citation(s) in RCA: 176] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2014] [Accepted: 10/15/2014] [Indexed: 12/27/2022]
Abstract
The basic body plan and major physiological axes have been highly conserved during mammalian evolution, yet only a small fraction of the human genome sequence appears to be subject to evolutionary constraint. To quantify cis- versus trans-acting contributions to mammalian regulatory evolution, we performed genomic DNase I footprinting of the mouse genome across 25 cell and tissue types, collectively defining ∼8.6 million transcription factor (TF) occupancy sites at nucleotide resolution. Here we show that mouse TF footprints conjointly encode a regulatory lexicon that is ∼95% similar with that derived from human TF footprints. However, only ∼20% of mouse TF footprints have human orthologues. Despite substantial turnover of the cis-regulatory landscape, nearly half of all pairwise regulatory interactions connecting mouse TF genes have been maintained in orthologous human cell types through evolutionary innovation of TF recognition sequences. Furthermore, the higher-level organization of mouse TF-to-TF connections into cellular network architectures is nearly identical with human. Our results indicate that evolutionary selection on mammalian gene regulation is targeted chiefly at the level of trans-regulatory circuitry, enabling and potentiating cis-regulatory plasticity. Mouse genomic footprinting reveals conservation of transcription factor (TF) recognition repertoires and trans-regulatory circuitry despite massive turnover of DNA elements that contact TFs in vivo. Having generated genomic DNase I footprinting data of the mouse genome across 25 cell and tissue types, these authors use these data to quantify cis-versus-trans regulatory contributions to mammalian regulatory evolution. They describe more than 600 motifs that collectively are over 95% similar to that recognized in vivo by human transcription factors (TFs). Despite substantial turnover of the cis-regulatory landscape around each TF gene, nearly half of all pairwise regulatory interactions connecting mouse TF genes have been maintained in orthologous human cell types through evolutionary innovation of TF recognition sequences. Conservation between mouse and human TF regulatory networks is particularly similar at the highest organization level. The work was performed as part of the mouse ENCODE project.
Collapse
Affiliation(s)
- Andrew B Stergachis
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Alex P Reynolds
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Miaohua Zhang
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Rachel Byron
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Sandra Stelhing-Sun
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Robert E Thurman
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - R Scott Hansen
- 1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA [2] Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Audra K Johnson
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Matthew S Wilken
- Department of Biological Structure, University of Washington, Seattle, Washington 98195, USA
| | - Thomas A Reh
- Department of Biological Structure, University of Washington, Seattle, Washington 98195, USA
| | - Piper M Treuting
- Department of Comparative Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Rajinder Kaul
- 1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA [2] Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Mark Groudine
- 1] Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA [2] Division of Radiation Oncology, University of Washington, Seattle, Washington 98195, USA
| | - M A Bender
- 1] Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA [2] Department of Pediatrics, University of Washington, Seattle, Washington 98195, USA
| | - Elhanan Borenstein
- 1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA [2] Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98102, USA [3] Santa Fe Institute, Santa Fe, New Mexico 87501, USA
| | - John A Stamatoyannopoulos
- 1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA [2] Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| |
Collapse
|
16
|
Vierstra J, Rynes E, Sandstrom R, Zhang M, Canfield T, Hansen RS, Stehling-Sun S, Sabo PJ, Byron R, Humbert R, Thurman RE, Johnson AK, Vong S, Lee K, Bates D, Neri F, Diegel M, Giste E, Haugen E, Dunn D, Wilken MS, Josefowicz S, Samstein R, Chang KH, Eichler EE, De Bruijn M, Reh TA, Skoultchi A, Rudensky A, Orkin SH, Papayannopoulou T, Treuting PM, Selleri L, Kaul R, Groudine M, Bender MA, Stamatoyannopoulos JA. Mouse regulatory DNA landscapes reveal global principles of cis-regulatory evolution. Science 2014; 346:1007-12. [PMID: 25411453 PMCID: PMC4337786 DOI: 10.1126/science.1246426] [Citation(s) in RCA: 186] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
To study the evolutionary dynamics of regulatory DNA, we mapped >1.3 million deoxyribonuclease I-hypersensitive sites (DHSs) in 45 mouse cell and tissue types, and systematically compared these with human DHS maps from orthologous compartments. We found that the mouse and human genomes have undergone extensive cis-regulatory rewiring that combines branch-specific evolutionary innovation and loss with widespread repurposing of conserved DHSs to alternative cell fates, and that this process is mediated by turnover of transcription factor (TF) recognition elements. Despite pervasive evolutionary remodeling of the location and content of individual cis-regulatory regions, within orthologous mouse and human cell types the global fraction of regulatory DNA bases encoding recognition sites for each TF has been strictly conserved. Our findings provide new insights into the evolutionary forces shaping mammalian regulatory DNA landscapes.
Collapse
Affiliation(s)
- Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Eric Rynes
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Miaohua Zhang
- Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - R Scott Hansen
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Sandra Stehling-Sun
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Rachel Byron
- Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Richard Humbert
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Robert E Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Audra K Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Matthew S Wilken
- Department of Biological Structure, University of Washington, Seattle, WA 98195, USA
| | - Steven Josefowicz
- Immunology Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA. Howard Hughes Medical Institute
| | - Robert Samstein
- Immunology Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA. Howard Hughes Medical Institute
| | - Kai-Hsin Chang
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA. Howard Hughes Medical Institute
| | - Marella De Bruijn
- Medical Research Council (MRC) Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Oxford OX3 9DS, UK
| | - Thomas A Reh
- Department of Biological Structure, University of Washington, Seattle, WA 98195, USA
| | - Arthur Skoultchi
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Alexander Rudensky
- Immunology Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA. Howard Hughes Medical Institute
| | - Stuart H Orkin
- Howard Hughes Medical Institute. Division of Hematology/Oncology, Children's Hospital Boston and Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Stem Cell Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Thalia Papayannopoulou
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Piper M Treuting
- Department of Comparative Medicine, University of Washington, Seattle, WA 98195, USA
| | - Licia Selleri
- Department of Cell and Developmental Biology, Weill Medical College of Cornell University, New York, NY 10065, USA
| | - Rajinder Kaul
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA. Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Mark Groudine
- Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA. Department of Radiation Oncology, University of Washington, Seattle, WA 98109, USA
| | - M A Bender
- Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA. Department of Pediatrics, University of Washington, Seattle, WA 98195, USA
| | - John A Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA. Division of Oncology, Department of Medicine, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
17
|
John S, Sabo PJ, Canfield TK, Lee K, Vong S, Weaver M, Wang H, Vierstra J, Reynolds AP, Thurman RE, Stamatoyannopoulos JA. Genome-scale mapping of DNase I hypersensitivity. Curr Protoc Mol Biol 2014; Chapter 27:Unit 21.27. [PMID: 23821440 DOI: 10.1002/0471142727.mb2127s103] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
DNase I-seq is a global and high-resolution method that uses the nonspecific endonuclease DNase I to map chromatin accessibility. These accessible regions, designated as DNase I hypersensitive sites (DHSs), define the regulatory features, (e.g., promoters, enhancers, insulators, and locus control regions) of complex genomes. In this unit, methods are described for nuclei isolation, digestion of nuclei with limiting concentrations of DNase I, and the biochemical fractionation of DNase I hypersensitive sites in preparation for high-throughput sequencing. DNase I-seq is an unbiased and robust method that is not predicated on an a priori understanding of regulatory patterns or chromatin features.
Collapse
Affiliation(s)
- Sam John
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Vierstra J, Wang H, John S, Sandstrom R, Stamatoyannopoulos JA. Coupling transcription factor occupancy to nucleosome architecture with DNase-FLASH. Nat Methods 2013; 11:66-72. [PMID: 24185839 DOI: 10.1038/nmeth.2713] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2013] [Accepted: 10/04/2013] [Indexed: 12/16/2022]
Abstract
It is currently not possible to resolve the genome-wide relationship of transcription factors (TFs) and nucleosomes at the level of individual chromatin templates despite rapidly increasing data on TF and nucleosome occupancy in the human genome. Here we describe DNase I-released fragment-length analysis of hypersensitivity (DNase-FLASH), an approach that directly couples mapping of TF occupancy, via quantification of DNA microfragments released from individual TF recognition sites in regulatory DNA, to the surrounding nucleosome architecture, via analysis of larger DNA fragments, in a single assay. DNase-FLASH enables coupling of individual TF footprints to nucleosome occupancy, identifying TFs that precisely demarcate the regulatory DNA-nucleosome interface.
Collapse
Affiliation(s)
- Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Hao Wang
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Sam John
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - John A Stamatoyannopoulos
- 1] Department of Genome Sciences, University of Washington, Seattle, Washington, USA. [2] Department of Medicine, Division of Oncology, University of Washington, Seattle, Washington, USA
| |
Collapse
|
19
|
Bauer DE, Kamran SC, Lessard S, Xu J, Fujiwara Y, Lin C, Shao Z, Canver MC, Smith EC, Pinello L, Sabo PJ, Vierstra J, Voit RA, Yuan GC, Porteus MH, Stamatoyannopoulos JA, Lettre G, Orkin SH. An erythroid enhancer of BCL11A subject to genetic variation determines fetal hemoglobin level. Science 2013; 342:253-7. [PMID: 24115442 PMCID: PMC4018826 DOI: 10.1126/science.1242088] [Citation(s) in RCA: 448] [Impact Index Per Article: 40.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Genome-wide association studies (GWASs) have ascertained numerous trait-associated common genetic variants, frequently localized to regulatory DNA. We found that common genetic variation at BCL11A associated with fetal hemoglobin (HbF) level lies in noncoding sequences decorated by an erythroid enhancer chromatin signature. Fine-mapping uncovers a motif-disrupting common variant associated with reduced transcription factor (TF) binding, modestly diminished BCL11A expression, and elevated HbF. The surrounding sequences function in vivo as a developmental stage-specific, lineage-restricted enhancer. Genome engineering reveals the enhancer is required in erythroid but not B-lymphoid cells for BCL11A expression. These findings illustrate how GWASs may expose functional variants of modest impact within causal elements essential for appropriate gene expression. We propose the GWAS-marked BCL11A enhancer represents an attractive target for therapeutic genome engineering for the β-hemoglobinopathies.
Collapse
Affiliation(s)
- Daniel E. Bauer
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, 02115
- Harvard Medical School, Boston, MA, 02115
| | - Sophia C. Kamran
- Harvard Medical School, Boston, MA, 02115
- Howard Hughes Medical Institute, Boston, MA, 02115
| | - Samuel Lessard
- Montreal Heart Institute and Université Montréal, Montreal, Quebec, H1T 1C8, Canada
| | - Jian Xu
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
- Harvard Medical School, Boston, MA, 02115
| | - Yuko Fujiwara
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
| | - Carrie Lin
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
| | - Zhen Shao
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
| | | | - Elenoe C. Smith
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
| | - Luca Pinello
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, 02115
| | - Peter J. Sabo
- Departments of Genome Sciences and Medicine, University of Washington, Seattle, WA, 98195
| | - Jeff Vierstra
- Departments of Genome Sciences and Medicine, University of Washington, Seattle, WA, 98195
| | - Richard A. Voit
- Department of Pediatrics, Stanford University, Palo Alto, CA, 94304
| | - Guo-Cheng Yuan
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, 02115
- Harvard School of Public Health, Boston, MA, 02115
| | | | | | - Guillaume Lettre
- Montreal Heart Institute and Université Montréal, Montreal, Quebec, H1T 1C8, Canada
| | - Stuart H. Orkin
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, 02115
- Harvard Medical School, Boston, MA, 02115
- Howard Hughes Medical Institute, Boston, MA, 02115
| |
Collapse
|
20
|
Vernot B, Stergachis AB, Maurano MT, Vierstra J, Neph S, Thurman RE, Stamatoyannopoulos JA, Akey JM. Personal and population genomics of human regulatory variation. Genome Res 2013; 22:1689-97. [PMID: 22955981 PMCID: PMC3431486 DOI: 10.1101/gr.134890.111] [Citation(s) in RCA: 91] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The characteristics and evolutionary forces acting on regulatory variation in humans remains elusive because of the difficulty in defining functionally important noncoding DNA. Here, we combine genome-scale maps of regulatory DNA marked by DNase I hypersensitive sites (DHSs) from 138 cell and tissue types with whole-genome sequences of 53 geographically diverse individuals in order to better delimit the patterns of regulatory variation in humans. We estimate that individuals likely harbor many more functionally important variants in regulatory DNA compared with protein-coding regions, although they are likely to have, on average, smaller effect sizes. Moreover, we demonstrate that there is significant heterogeneity in the level of functional constraint in regulatory DNA among different cell types. We also find marked variability in functional constraint among transcription factor motifs in regulatory DNA, with sequence motifs for major developmental regulators, such as HOX proteins, exhibiting levels of constraint comparable to protein-coding regions. Finally, we perform a genome-wide scan of recent positive selection and identify hundreds of novel substrates of adaptive regulatory evolution that are enriched for biologically interesting pathways such as melanogenesis and adipocytokine signaling. These data and results provide new insights into patterns of regulatory variation in individuals and populations and demonstrate that a large proportion of functionally important variation lies beyond the exome.
Collapse
Affiliation(s)
- Benjamin Vernot
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang H, Vernot B, Garg K, Sandstrom R, Bates D, Canfield TK, Diegel M, Dunn D, Ebersol AK, Frum T, Giste E, Harding L, Johnson AK, Johnson EM, Kutyavin T, Lajoie B, Lee BK, Lee K, London D, Lotakis D, Neph S, Neri F, Nguyen ED, Reynolds AP, Roach V, Safi A, Sanchez ME, Sanyal A, Shafer A, Simon JM, Song L, Vong S, Weaver M, Zhang Z, Zhang Z, Lenhard B, Tewari M, Dorschner MO, Hansen RS, Navas PA, Stamatoyannopoulos G, Iyer VR, Lieb JD, Sunyaev SR, Akey JM, Sabo PJ, Kaul R, Furey TS, Dekker J, Crawford GE, Stamatoyannopoulos JA. The accessible chromatin landscape of the human genome. Nature 2012; 489:75-82. [PMID: 22955617 PMCID: PMC3721348 DOI: 10.1038/nature11232] [Citation(s) in RCA: 1898] [Impact Index Per Article: 158.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2011] [Accepted: 05/15/2012] [Indexed: 02/07/2023]
Abstract
DNase I hypersensitive sites (DHSs) are markers of regulatory DNA and have underpinned the discovery of all classes of cis-regulatory elements including enhancers, promoters, insulators, silencers and locus control regions. Here we present the first extensive map of human DHSs identified through genome-wide profiling in 125 diverse cell and tissue types. We identify ∼2.9 million DHSs that encompass virtually all known experimentally validated cis-regulatory sequences and expose a vast trove of novel elements, most with highly cell-selective regulation. Annotating these elements using ENCODE data reveals novel relationships between chromatin accessibility, transcription, DNA methylation and regulatory factor occupancy patterns. We connect ∼580,000 distal DHSs with their target promoters, revealing systematic pairing of different classes of distal DHSs and specific promoter types. Patterning of chromatin accessibility at many regulatory regions is organized with dozens to hundreds of co-activated elements, and the transcellular DNase I sensitivity pattern at a given region can predict cell-type-specific functional behaviours. The DHS landscape shows signatures of recent functional evolutionary constraint. However, the DHS compartment in pluripotent and immortalized cells exhibits higher mutation rates than that in highly differentiated cells, exposing an unexpected link between chromatin accessibility, proliferative potential and patterns of human variation.
Collapse
Affiliation(s)
- Robert E. Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Eric Rynes
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Richard Humbert
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | | | - Hao Wang
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Benjamin Vernot
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Kavita Garg
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Abigail K. Ebersol
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Tristan Frum
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Lisa Harding
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Audra K. Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Ericka M. Johnson
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Tanya Kutyavin
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Bryan Lajoie
- Program in Gene Function, University of Massachusetts Medical School, Worcester, MA
| | - Bum-Kyu Lee
- Institute for Cellular and Molecular Biology, University of Texas, Austin, TX
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Darin London
- Institute for Genome Sciences and Policy, Duke University, Durham, NC
| | - Dimitra Lotakis
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Eric D. Nguyen
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Alex P. Reynolds
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Vaughn Roach
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Alexias Safi
- Institute for Genome Sciences and Policy, Duke University, Durham, NC
| | - Minerva E. Sanchez
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Amartya Sanyal
- Program in Gene Function, University of Massachusetts Medical School, Worcester, MA
| | - Anthony Shafer
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Jeremy M. Simon
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Lingyun Song
- Institute for Genome Sciences and Policy, Duke University, Durham, NC
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Molly Weaver
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Zhancheng Zhang
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Zhuzhu Zhang
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Boris Lenhard
- Bergen Center for Computational Science, University of Bergen, Bergen, Norway
| | - Muneesh Tewari
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Michael O. Dorschner
- Dept. of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA
| | - R. Scott Hansen
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Patrick A. Navas
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | | | - Vishwanath R. Iyer
- Institute for Cellular and Molecular Biology, University of Texas, Austin, TX
| | - Jason D. Lieb
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Shamil R. Sunyaev
- Dept. of Medicine, Division of Genetics, Brigham & Women’s Hospital and Harvard Medical School, Boston, MA
| | - Joshua M. Akey
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Peter J. Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Rajinder Kaul
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Terrence S. Furey
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Job Dekker
- Program in Gene Function, University of Massachusetts Medical School, Worcester, MA
| | | | - John A. Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Seattle, WA
- Department of Medicine, Division of Oncology, University of Washington, Seattle, WA
| |
Collapse
|
22
|
Neph S, Vierstra J, Stergachis AB, Reynolds AP, Haugen E, Vernot B, Thurman RE, Sandstrom R, Johnson AK, Maurano MT, Humbert R, Rynes E, Wang H, Vong S, Lee K, Bates D, Diegel M, Roach V, Dunn D, Neri J, Schafer A, Hansen RS, Kutyavin T, Giste E, Weaver M, Canfield T, Sabo P, Zhang M, Balasundaram G, Byron R, MacCoss MJ, Akey JM, Bender M, Groudine M, Kaul R, Stamatoyannopoulos JA. An expansive human regulatory lexicon encoded in transcription factor footprints. Nature 2012; 489:83-90. [PMID: 22955618 PMCID: PMC3736582 DOI: 10.1038/nature11212] [Citation(s) in RCA: 566] [Impact Index Per Article: 47.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2011] [Accepted: 05/10/2012] [Indexed: 01/04/2023]
Abstract
Regulatory factor binding to genomic DNA protects the underlying sequence from cleavage by DNase I, leaving nucleotide-resolution footprints. Using genomic DNase I footprinting across 41 diverse cell and tissue types, we detected 45 million transcription factor occupancy events within regulatory regions, representing differential binding to 8.4 million distinct short sequence elements. Here we show that this small genomic sequence compartment, roughly twice the size of the exome, encodes an expansive repertoire of conserved recognition sequences for DNA-binding proteins that nearly doubles the size of the human cis-regulatory lexicon. We find that genetic variants affecting allelic chromatin states are concentrated in footprints, and that these elements are preferentially sheltered from DNA methylation. High-resolution DNase I cleavage patterns mirror nucleotide-level evolutionary conservation and track the crystallographic topography of protein-DNA interfaces, indicating that transcription factor structure has been evolutionarily imprinted on the human genome sequence. We identify a stereotyped 50-base-pair footprint that precisely defines the site of transcript origination within thousands of human promoters. Finally, we describe a large collection of novel regulatory factor recognition motifs that are highly conserved in both sequence and function, and exhibit cell-selective occupancy patterns that closely parallel major regulators of development, differentiation and pluripotency.
Collapse
Affiliation(s)
- Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | | | - Alex P. Reynolds
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Benjamin Vernot
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Robert E. Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Audra K. Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Matthew T. Maurano
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Richard Humbert
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Eric Rynes
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Hao Wang
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Vaughn Roach
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Jun Neri
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Anthony Schafer
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - R. Scott Hansen
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195
| | - Tanya Kutyavin
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Molly Weaver
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Peter Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Miaohua Zhang
- Basic Sciences Division, Fred Hutchison Cancer Research Center, Seattle, WA 98109
| | | | - Rachel Byron
- Basic Sciences Division, Fred Hutchison Cancer Research Center, Seattle, WA 98109
| | - Michael J. MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Joshua M. Akey
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Michael Bender
- Basic Sciences Division, Fred Hutchison Cancer Research Center, Seattle, WA 98109
| | - Mark Groudine
- Basic Sciences Division, Fred Hutchison Cancer Research Center, Seattle, WA 98109
| | - Rajinder Kaul
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195
| | - John A. Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
- Division of Oncology, Deparment of Medicine, University of Washington, Seattle, WA 98195
| |
Collapse
|
23
|
Neph S, Kuehn MS, Reynolds AP, Haugen E, Thurman RE, Johnson AK, Rynes E, Maurano MT, Vierstra J, Thomas S, Sandstrom R, Humbert R, Stamatoyannopoulos JA. BEDOPS: high-performance genomic feature operations. ACTA ACUST UNITED AC 2012; 28:1919-20. [PMID: 22576172 DOI: 10.1093/bioinformatics/bts277] [Citation(s) in RCA: 567] [Impact Index Per Article: 47.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
UNLABELLED The large and growing number of genome-wide datasets highlights the need for high-performance feature analysis and data comparison methods, in addition to efficient data storage and retrieval techniques. We introduce BEDOPS, a software suite for common genomic analysis tasks which offers improved flexibility, scalability and execution time characteristics over previously published packages. The suite includes a utility to compress large inputs into a lossless format that can provide greater space savings and faster data extractions than alternatives. AVAILABILITY http://code.google.com/p/bedops/ includes binaries, source and documentation.
Collapse
Affiliation(s)
- Shane Neph
- Department of Genome Sciences and Department of Medicine, University of Washington, Seattle, Washington, DC 98195, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|