1
Breeze CE, Reynolds AP, van Dongen J, Dunham I, Lazar J, Neph S, Vierstra J, Bourque G, Teschendorff AE, Stamatoyannopoulos JA, Beck S. eFORGE v2.0: updated analysis of cell type-specific signal in epigenomic data. Bioinformatics 2020; 35:4767-4769. [PMID: 31161210 PMCID: PMC6853678 DOI: 10.1093/bioinformatics/btz456] [Citation(s) in RCA: 68] [Impact Index Per Article: 17.0] [Received: 11/11/2018] [Revised: 04/24/2019] [Accepted: 05/29/2019] [Indexed: 12/31/2022]
Abstract
SUMMARY: The Illumina Infinium EPIC BeadChip is a new high-throughput array for DNA methylation analysis, extending the earlier 450k array by over 400 000 new sites. Previously, a method named eFORGE was developed to provide insights into cell type-specific and cell-composition effects for 450k data. Here, we present a significantly updated and improved version of eFORGE that can analyze both EPIC and 450k array data. New features include analysis of chromatin states, transcription factor motifs and DNase I footprints, providing tools for epigenome-wide association study interpretation and epigenome editing.
AVAILABILITY AND IMPLEMENTATION: eFORGE v2.0 is implemented as a web tool available from https://eforge.altiusinstitute.org and https://eforge-tf.altiusinstitute.org/.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Charles E Breeze
  - Medical Genomics Group, UCL Cancer Institute, University College London, London WC1E 6BT, UK
  - Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
- Alex P Reynolds
  - Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
- Jenny van Dongen
  - Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam 1081BT, The Netherlands
- Ian Dunham
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge CB10 1SD, UK
- John Lazar
  - Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
- Shane Neph
  - Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
- Jeff Vierstra
  - Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
- Guillaume Bourque
  - Department of Human Genetics, McGill University and Génome Québec Innovation Center, Montréal H3A 0G1, Canada
- Andrew E Teschendorff
  - CAS Key Lab of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
  - Statistical Genomics Group, UCL Cancer Institute, University College London, London WC1E 6BT, UK
- Stephan Beck
  - Medical Genomics Group, UCL Cancer Institute, University College London, London WC1E 6BT, UK
2
Abstract
The bulk of modern genomics research includes, in part, analyses of large data sets, such as those derived from high resolution, high-throughput experiments, that make computations challenging. The BEDOPS toolkit offers a broad spectrum of fundamental analysis capabilities to query, operate on, and compare quantitatively genomic data sets of any size and number. The toolkit facilitates the construction of complex analysis pipelines that remain efficient in both memory and time by chaining together combinations of its complementary components. The principal utilities accept raw or compressed data in a flexible format, and they provide built-in features to expedite parallel computations.
Affiliation(s)
- Shane Neph
  - Department of Genome Sciences, Altius Institute for Biomedical Sciences, 2211 Elliott Avenue, Seattle, WA, 98195, USA
- Alex P Reynolds
  - Department of Genome Sciences, Altius Institute for Biomedical Sciences, 2211 Elliott Avenue, Seattle, WA, 98195, USA
- M Scott Kuehn
  - Opower Inc., 760 Market Street, San Francisco, CA, 94102, USA
- John A Stamatoyannopoulos
  - Altius Institute for Biomedical Sciences, 2211 Elliott Avenue, Seattle, WA, 98195, USA
  - Department of Medicine, University of Washington, Seattle, WA, USA
  - Department of Genome Sciences, University of Washington, Seattle, WA, USA
3
Yue F, Cheng Y, Breschi A, Vierstra J, Wu W, Ryba T, Sandstrom R, Ma Z, Davis C, Pope BD, Shen Y, Pervouchine DD, Djebali S, Thurman RE, Kaul R, Rynes E, Kirilusha A, Marinov GK, Williams BA, Trout D, Amrhein H, Fisher-Aylor K, Antoshechkin I, DeSalvo G, See LH, Fastuca M, Drenkow J, Zaleski C, Dobin A, Prieto P, Lagarde J, Bussotti G, Tanzer A, Denas O, Li K, Bender MA, Zhang M, Byron R, Groudine MT, McCleary D, Pham L, Ye Z, Kuan S, Edsall L, Wu YC, Rasmussen MD, Bansal MS, Kellis M, Keller CA, Morrissey CS, Mishra T, Jain D, Dogan N, Harris RS, Cayting P, Kawli T, Boyle AP, Euskirchen G, Kundaje A, Lin S, Lin Y, Jansen C, Malladi VS, Cline MS, Erickson DT, Kirkup VM, Learned K, Sloan CA, Rosenbloom KR, Lacerda de Sousa B, Beal K, Pignatelli M, Flicek P, Lian J, Kahveci T, Lee D, Kent WJ, Ramalho Santos M, Herrero J, Notredame C, Johnson A, Vong S, Lee K, Bates D, Neri F, Diegel M, Canfield T, Sabo PJ, Wilken MS, Reh TA, Giste E, Shafer A, Kutyavin T, Haugen E, Dunn D, Reynolds AP, Neph S, Humbert R, Hansen RS, De Bruijn M, Selleri L, Rudensky A, Josefowicz S, Samstein R, Eichler EE, Orkin SH, Levasseur D, Papayannopoulou T, Chang KH, Skoultchi A, Gosh S, Disteche C, Treuting P, Wang Y, Weiss MJ, Blobel GA, Cao X, Zhong S, Wang T, Good PJ, Lowdon RF, Adams LB, Zhou XQ, Pazin MJ, Feingold EA, Wold B, Taylor J, Mortazavi A, Weissman SM, Stamatoyannopoulos JA, Snyder MP, Guigo R, Gingeras TR, Gilbert DM, Hardison RC, Beer MA, Ren B. A comparative encyclopedia of DNA elements in the mouse genome. Nature 2015; 515:355-64. [PMID: 25409824 PMCID: PMC4266106 DOI: 10.1038/nature13992] [Citation(s) in RCA: 1135] [Impact Index Per Article: 126.1] [Received: 02/03/2014] [Accepted: 10/24/2014] [Indexed: 12/11/2022]
Abstract
The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Affiliation(s)
- Feng Yue
  - Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
  - Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, Pennsylvania 17033, USA
- Yong Cheng
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Alessandra Breschi
  - Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Jeff Vierstra
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Weisheng Wu
  - Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Tyrone Ryba
  - Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306-4295, USA
- Richard Sandstrom
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Zhihai Ma
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Carrie Davis
  - Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Benjamin D Pope
  - Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306-4295, USA
- Yin Shen
  - Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
- Dmitri D Pervouchine
  - Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Sarah Djebali
  - Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Robert E Thurman
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Rajinder Kaul
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Eric Rynes
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Anthony Kirilusha
  - Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Georgi K Marinov
  - Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Brian A Williams
  - Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Diane Trout
  - Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Henry Amrhein
  - Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Katherine Fisher-Aylor
  - Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Igor Antoshechkin
  - Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Gilberto DeSalvo
  - Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Lei-Hoon See
  - Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Meagan Fastuca
  - Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Jorg Drenkow
  - Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Chris Zaleski
  - Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Alex Dobin
  - Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Pablo Prieto
  - Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Julien Lagarde
  - Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Giovanni Bussotti
  - Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Andrea Tanzer
  - Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
  - Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Waehringerstrasse 17/3/303, A-1090 Vienna, Austria
- Olgert Denas
  - Departments of Biology and Mathematics and Computer Science, Emory University, O. Wayne Rollins Research Center, 1510 Clifton Road NE, Atlanta, Georgia 30322, USA
- Kanwei Li
  - Departments of Biology and Mathematics and Computer Science, Emory University, O. Wayne Rollins Research Center, 1510 Clifton Road NE, Atlanta, Georgia 30322, USA
- M A Bender
  - Department of Pediatrics, University of Washington, Seattle, Washington 98195, USA
  - Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
- Miaohua Zhang
  - Basic Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
- Rachel Byron
  - Basic Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
- Mark T Groudine
  - Basic Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
  - Department of Radiation Oncology, University of Washington, Seattle, Washington 98195, USA
- David McCleary
  - Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
- Long Pham
  - Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
- Zhen Ye
  - Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
- Samantha Kuan
  - Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
- Lee Edsall
  - Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
- Yi-Chieh Wu
  - Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
- Matthew D Rasmussen
  - Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
- Mukul S Bansal
  - Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
- Manolis Kellis
  - Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
  - Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Cheryl A Keller
  - Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Christapher S Morrissey
  - Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Tejaswini Mishra
  - Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Deepti Jain
  - Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Nergiz Dogan
  - Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Robert S Harris
  - Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Philip Cayting
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Trupti Kawli
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Alan P Boyle
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Ghia Euskirchen
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Anshul Kundaje
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Shin Lin
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Yiing Lin
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Camden Jansen
  - Department of Developmental and Cell Biology, University of California, Irvine, Irvine, California 92697, USA
- Venkat S Malladi
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Melissa S Cline
  - Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
- Drew T Erickson
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Vanessa M Kirkup
  - Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
- Katrina Learned
  - Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
- Cricket A Sloan
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Kate R Rosenbloom
  - Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
- Beatriz Lacerda de Sousa
  - Departments of Obstetrics/Gynecology and Pathology, and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California 94143, USA
- Kathryn Beal
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Miguel Pignatelli
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Paul Flicek
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Jin Lian
  - Yale University, Department of Genetics, PO Box 208005, 333 Cedar Street, New Haven, Connecticut 06520-8005, USA
- Tamer Kahveci
  - Computer & Information Sciences & Engineering, University of Florida, Gainesville, Florida 32611, USA
- Dongwon Lee
  - McKusick-Nathans Institute of Genetic Medicine and Department of Biomedical Engineering, Johns Hopkins University, 733 N. Broadway, BRB 573 Baltimore, Maryland 21205, USA
- W James Kent
  - Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
- Miguel Ramalho Santos
  - Departments of Obstetrics/Gynecology and Pathology, and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California 94143, USA
- Javier Herrero
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
  - Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London WC1E 6DD, UK
- Cedric Notredame
  - Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Audra Johnson
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Shinny Vong
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Kristen Lee
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Daniel Bates
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Fidencio Neri
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Morgan Diegel
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Theresa Canfield
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Peter J Sabo
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Matthew S Wilken
  - Department of Biological Structure, University of Washington, HSB I-516, 1959 NE Pacific Street, Seattle, Washington 98195, USA
- Thomas A Reh
  - Department of Biological Structure, University of Washington, HSB I-516, 1959 NE Pacific Street, Seattle, Washington 98195, USA
- Erika Giste
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Anthony Shafer
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Tanya Kutyavin
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Eric Haugen
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Douglas Dunn
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Alex P Reynolds
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Shane Neph
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Richard Humbert
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- R Scott Hansen
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Marella De Bruijn
  - MRC Molecular Haematology Unit, University of Oxford, Oxford OX3 9DS, UK
- Licia Selleri
  - Department of Cell and Developmental Biology, Weill Cornell Medical College, New York, New York 10065, USA
- Alexander Rudensky
  - HHMI and Ludwig Center at Memorial Sloan Kettering Cancer Center, Immunology Program, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA
- Steven Josefowicz
  - HHMI and Ludwig Center at Memorial Sloan Kettering Cancer Center, Immunology Program, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA
- Robert Samstein
  - HHMI and Ludwig Center at Memorial Sloan Kettering Cancer Center, Immunology Program, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA
- Evan E Eichler
  - Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Stuart H Orkin
  - Dana Farber Cancer Institute, Harvard Medical School, Cambridge, Massachusetts 02138, USA
- Dana Levasseur
  - University of Iowa Carver College of Medicine, Department of Internal Medicine, Iowa City, Iowa 52242, USA
- Thalia Papayannopoulou
  - Division of Hematology, Department of Medicine, University of Washington, Seattle, Washington 98195, USA
- Kai-Hsin Chang
  - University of Iowa Carver College of Medicine, Department of Internal Medicine, Iowa City, Iowa 52242, USA
- Arthur Skoultchi
  - Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York 10461, USA
- Srikanta Gosh
  - Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York 10461, USA
- Christine Disteche
  - Department of Pathology, University of Washington, Seattle, Washington 98195, USA
- Piper Treuting
  - Department of Comparative Medicine, University of Washington, Seattle, Washington 98195, USA
- Yanli Wang
  - Bioinformatics and Genomics program, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Mitchell J Weiss
  - Department of Hematology, St Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
- Gerd A Blobel
  - Division of Hematology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
  - Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
- Xiaoyi Cao
  - Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA
- Sheng Zhong
  - Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA
- Ting Wang
  - Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63108, USA
- Peter J Good
  - NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
- Rebecca F Lowdon
  - NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
- Leslie B Adams
  - NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
- Xiao-Qiao Zhou
  - NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
- Michael J Pazin
  - NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
- Elise A Feingold
  - NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
- Barbara Wold
  - Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- James Taylor
  - Departments of Biology and Mathematics and Computer Science, Emory University, O. Wayne Rollins Research Center, 1510 Clifton Road NE, Atlanta, Georgia 30322, USA
- Ali Mortazavi
  - Department of Developmental and Cell Biology, University of California, Irvine, Irvine, California 92697, USA
- Sherman M Weissman
  - Yale University, Department of Genetics, PO Box 208005, 333 Cedar Street, New Haven, Connecticut 06520-8005, USA
- Michael P Snyder
  - Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Roderic Guigo
  - Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Thomas R Gingeras
  - Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- David M Gilbert
  - Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306-4295, USA
- Ross C Hardison
  - Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Michael A Beer
  - McKusick-Nathans Institute of Genetic Medicine and Department of Biomedical Engineering, Johns Hopkins University, 733 N. Broadway, BRB 573 Baltimore, Maryland 21205, USA
- Bing Ren
  - Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
4
Stergachis AB, Neph S, Sandstrom R, Haugen E, Reynolds AP, Zhang M, Byron R, Canfield T, Stelhing-Sun S, Lee K, Thurman RE, Vong S, Bates D, Neri F, Diegel M, Giste E, Dunn D, Vierstra J, Hansen RS, Johnson AK, Sabo PJ, Wilken MS, Reh TA, Treuting PM, Kaul R, Groudine M, Bender MA, Borenstein E, Stamatoyannopoulos JA. Conservation of trans-acting circuitry during mammalian regulatory evolution. Nature 2015; 515:365-70. [PMID: 25409825 PMCID: PMC4405208 DOI: 10.1038/nature13972] [Citation(s) in RCA: 176] [Impact Index Per Article: 19.6] [Received: 02/21/2014] [Accepted: 10/15/2014] [Indexed: 12/27/2022]
Abstract
The basic body plan and major physiological axes have been highly conserved during mammalian evolution, yet only a small fraction of the human genome sequence appears to be subject to evolutionary constraint. To quantify cis- versus trans-acting contributions to mammalian regulatory evolution, we performed genomic DNase I footprinting of the mouse genome across 25 cell and tissue types, collectively defining ∼8.6 million transcription factor (TF) occupancy sites at nucleotide resolution. Here we show that mouse TF footprints conjointly encode a regulatory lexicon that is ∼95% similar with that derived from human TF footprints. However, only ∼20% of mouse TF footprints have human orthologues. Despite substantial turnover of the cis-regulatory landscape, nearly half of all pairwise regulatory interactions connecting mouse TF genes have been maintained in orthologous human cell types through evolutionary innovation of TF recognition sequences. Furthermore, the higher-level organization of mouse TF-to-TF connections into cellular network architectures is nearly identical with human. Our results indicate that evolutionary selection on mammalian gene regulation is targeted chiefly at the level of trans-regulatory circuitry, enabling and potentiating cis-regulatory plasticity.
Mouse genomic footprinting reveals conservation of transcription factor (TF) recognition repertoires and trans-regulatory circuitry despite massive turnover of DNA elements that contact TFs in vivo.
Having generated genomic DNase I footprinting data of the mouse genome across 25 cell and tissue types, the authors use these data to quantify cis-versus-trans regulatory contributions to mammalian regulatory evolution. They describe more than 600 motifs that collectively are over 95% similar to those recognized in vivo by human transcription factors (TFs). Despite substantial turnover of the cis-regulatory landscape around each TF gene, nearly half of all pairwise regulatory interactions connecting mouse TF genes have been maintained in orthologous human cell types through evolutionary innovation of TF recognition sequences. Conservation between mouse and human TF regulatory networks is particularly strong at the highest level of organization. The work was performed as part of the mouse ENCODE project.
Affiliation(s)
- Andrew B Stergachis
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Alex P Reynolds
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Miaohua Zhang
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Rachel Byron
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Sandra Stelhing-Sun
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Robert E Thurman
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - R Scott Hansen
- [1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA [2] Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Audra K Johnson
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Matthew S Wilken
- Department of Biological Structure, University of Washington, Seattle, Washington 98195, USA
| | - Thomas A Reh
- Department of Biological Structure, University of Washington, Seattle, Washington 98195, USA
| | - Piper M Treuting
- Department of Comparative Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Rajinder Kaul
- [1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA [2] Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Mark Groudine
- [1] Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA [2] Division of Radiation Oncology, University of Washington, Seattle, Washington 98195, USA
| | - M A Bender
- [1] Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA [2] Department of Pediatrics, University of Washington, Seattle, Washington 98195, USA
| | - Elhanan Borenstein
- [1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA [2] Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98102, USA [3] Santa Fe Institute, Santa Fe, New Mexico 87501, USA
| | - John A Stamatoyannopoulos
- [1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA [2] Department of Medicine, University of Washington, Seattle, Washington 98195, USA
|
5
|
Sullivan AM, Arsovski AA, Lempe J, Bubb KL, Weirauch MT, Sabo PJ, Sandstrom R, Thurman RE, Neph S, Reynolds AP, Stergachis AB, Vernot B, Johnson AK, Haugen E, Sullivan ST, Thompson A, Neri FV, Weaver M, Diegel M, Mnaimneh S, Yang A, Hughes TR, Nemhauser JL, Queitsch C, Stamatoyannopoulos JA. Mapping and dynamics of regulatory DNA and transcription factor networks in A. thaliana. Cell Rep 2014; 8:2015-2030. [PMID: 25220462 DOI: 10.1016/j.celrep.2014.08.019] [Citation(s) in RCA: 159] [Impact Index Per Article: 15.9] [Received: 12/04/2013] [Revised: 05/20/2014] [Accepted: 08/07/2014] [Indexed: 01/23/2023]
Abstract
Our understanding of gene regulation in plants is constrained by our limited knowledge of plant cis-regulatory DNA and its dynamics. We mapped DNase I hypersensitive sites (DHSs) in A. thaliana seedlings and used genomic footprinting to delineate ∼700,000 sites of in vivo transcription factor (TF) occupancy at nucleotide resolution. We show that variation associated with 72 diverse quantitative phenotypes localizes within DHSs. TF footprints encode an extensive cis-regulatory lexicon subject to recent evolutionary pressures, and widespread TF binding within exons may have shaped codon usage patterns. The architecture of A. thaliana TF regulatory networks is strikingly similar to that of animals in spite of diverged regulatory repertoires. We analyzed regulatory landscape dynamics during heat shock and photomorphogenesis, disclosing thousands of environmentally sensitive elements and enabling mapping of key TF regulatory circuits underlying these fundamental responses. Our results provide an extensive resource for the study of A. thaliana gene regulation and functional biology.
Affiliation(s)
| | - Andrej A Arsovski
- Department of Biology, University of Washington, Seattle, WA 98195, USA
| | - Janne Lempe
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Kerry L Bubb
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Matthew T Weirauch
- Center for Autoimmune Genomics and Etiology (CAGE) and Divisions of Biomedical Informatics and Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Robert E Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Alex P Reynolds
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Andrew B Stergachis
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Benjamin Vernot
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Audra K Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Shawn T Sullivan
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Agnieszka Thompson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Fidencio V Neri
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Molly Weaver
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Sanie Mnaimneh
- Donnelly Centre and Department of Molecular Genetics, University of Toronto, Toronto ON M5S 3E1, Canada
| | - Ally Yang
- Donnelly Centre and Department of Molecular Genetics, University of Toronto, Toronto ON M5S 3E1, Canada
| | - Timothy R Hughes
- Donnelly Centre and Department of Molecular Genetics, University of Toronto, Toronto ON M5S 3E1, Canada; Canadian Institute for Advanced Research (CIFAR) Program in Genetic Networks, Toronto ON M5G 1Z8, Canada
| | | | - Christine Queitsch
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
|
6
|
John S, Sabo PJ, Canfield TK, Lee K, Vong S, Weaver M, Wang H, Vierstra J, Reynolds AP, Thurman RE, Stamatoyannopoulos JA. Genome-scale mapping of DNase I hypersensitivity. Curr Protoc Mol Biol 2014; Chapter 27:Unit 21.27. [PMID: 23821440 DOI: 10.1002/0471142727.mb2127s103] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Indexed: 11/11/2022]
Abstract
DNase I-seq is a global and high-resolution method that uses the nonspecific endonuclease DNase I to map chromatin accessibility. These accessible regions, designated as DNase I hypersensitive sites (DHSs), define the regulatory features (e.g., promoters, enhancers, insulators, and locus control regions) of complex genomes. In this unit, methods are described for nuclei isolation, digestion of nuclei with limiting concentrations of DNase I, and the biochemical fractionation of DNase I hypersensitive sites in preparation for high-throughput sequencing. DNase I-seq is an unbiased and robust method that is not predicated on an a priori understanding of regulatory patterns or chromatin features.
Affiliation(s)
- Sam John
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
|
7
|
Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang H, Vernot B, Garg K, Sandstrom R, Bates D, Canfield TK, Diegel M, Dunn D, Ebersol AK, Frum T, Giste E, Harding L, Johnson AK, Johnson EM, Kutyavin T, Lajoie B, Lee BK, Lee K, London D, Lotakis D, Neph S, Neri F, Nguyen ED, Reynolds AP, Roach V, Safi A, Sanchez ME, Sanyal A, Shafer A, Simon JM, Song L, Vong S, Weaver M, Zhang Z, Zhang Z, Lenhard B, Tewari M, Dorschner MO, Hansen RS, Navas PA, Stamatoyannopoulos G, Iyer VR, Lieb JD, Sunyaev SR, Akey JM, Sabo PJ, Kaul R, Furey TS, Dekker J, Crawford GE, Stamatoyannopoulos JA. The accessible chromatin landscape of the human genome. Nature 2012; 489:75-82. [PMID: 22955617 PMCID: PMC3721348 DOI: 10.1038/nature11232] [Citation(s) in RCA: 1898] [Impact Index Per Article: 158.2] [Received: 12/15/2011] [Accepted: 05/15/2012] [Indexed: 02/07/2023]
Abstract
DNase I hypersensitive sites (DHSs) are markers of regulatory DNA and have underpinned the discovery of all classes of cis-regulatory elements including enhancers, promoters, insulators, silencers and locus control regions. Here we present the first extensive map of human DHSs identified through genome-wide profiling in 125 diverse cell and tissue types. We identify ∼2.9 million DHSs that encompass virtually all known experimentally validated cis-regulatory sequences and expose a vast trove of novel elements, most with highly cell-selective regulation. Annotating these elements using ENCODE data reveals novel relationships between chromatin accessibility, transcription, DNA methylation and regulatory factor occupancy patterns. We connect ∼580,000 distal DHSs with their target promoters, revealing systematic pairing of different classes of distal DHSs and specific promoter types. Patterning of chromatin accessibility at many regulatory regions is organized with dozens to hundreds of co-activated elements, and the transcellular DNase I sensitivity pattern at a given region can predict cell-type-specific functional behaviours. The DHS landscape shows signatures of recent functional evolutionary constraint. However, the DHS compartment in pluripotent and immortalized cells exhibits higher mutation rates than that in highly differentiated cells, exposing an unexpected link between chromatin accessibility, proliferative potential and patterns of human variation.
Affiliation(s)
- Robert E. Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Eric Rynes
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Richard Humbert
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | | | - Hao Wang
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Benjamin Vernot
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Kavita Garg
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Abigail K. Ebersol
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Tristan Frum
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Lisa Harding
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Audra K. Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Ericka M. Johnson
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Tanya Kutyavin
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Bryan Lajoie
- Program in Gene Function, University of Massachusetts Medical School, Worcester, MA
| | - Bum-Kyu Lee
- Institute for Cellular and Molecular Biology, University of Texas, Austin, TX
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Darin London
- Institute for Genome Sciences and Policy, Duke University, Durham, NC
| | - Dimitra Lotakis
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Eric D. Nguyen
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Alex P. Reynolds
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Vaughn Roach
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Alexias Safi
- Institute for Genome Sciences and Policy, Duke University, Durham, NC
| | - Minerva E. Sanchez
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Amartya Sanyal
- Program in Gene Function, University of Massachusetts Medical School, Worcester, MA
| | - Anthony Shafer
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Jeremy M. Simon
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Lingyun Song
- Institute for Genome Sciences and Policy, Duke University, Durham, NC
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Molly Weaver
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Zhancheng Zhang
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Zhuzhu Zhang
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Boris Lenhard
- Bergen Center for Computational Science, University of Bergen, Bergen, Norway
| | - Muneesh Tewari
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Michael O. Dorschner
- Dept. of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA
| | - R. Scott Hansen
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Patrick A. Navas
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | | | - Vishwanath R. Iyer
- Institute for Cellular and Molecular Biology, University of Texas, Austin, TX
| | - Jason D. Lieb
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Shamil R. Sunyaev
- Dept. of Medicine, Division of Genetics, Brigham & Women’s Hospital and Harvard Medical School, Boston, MA
| | - Joshua M. Akey
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Peter J. Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Rajinder Kaul
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Terrence S. Furey
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Job Dekker
- Program in Gene Function, University of Massachusetts Medical School, Worcester, MA
| | | | - John A. Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Seattle, WA
- Department of Medicine, Division of Oncology, University of Washington, Seattle, WA
|
8
|
Neph S, Vierstra J, Stergachis AB, Reynolds AP, Haugen E, Vernot B, Thurman RE, Sandstrom R, Johnson AK, Maurano MT, Humbert R, Rynes E, Wang H, Vong S, Lee K, Bates D, Diegel M, Roach V, Dunn D, Neri J, Schafer A, Hansen RS, Kutyavin T, Giste E, Weaver M, Canfield T, Sabo P, Zhang M, Balasundaram G, Byron R, MacCoss MJ, Akey JM, Bender M, Groudine M, Kaul R, Stamatoyannopoulos JA. An expansive human regulatory lexicon encoded in transcription factor footprints. Nature 2012; 489:83-90. [PMID: 22955618 PMCID: PMC3736582 DOI: 10.1038/nature11212] [Citation(s) in RCA: 566] [Impact Index Per Article: 47.2] [Received: 12/11/2011] [Accepted: 05/10/2012] [Indexed: 01/04/2023]
Abstract
Regulatory factor binding to genomic DNA protects the underlying sequence from cleavage by DNase I, leaving nucleotide-resolution footprints. Using genomic DNase I footprinting across 41 diverse cell and tissue types, we detected 45 million transcription factor occupancy events within regulatory regions, representing differential binding to 8.4 million distinct short sequence elements. Here we show that this small genomic sequence compartment, roughly twice the size of the exome, encodes an expansive repertoire of conserved recognition sequences for DNA-binding proteins that nearly doubles the size of the human cis-regulatory lexicon. We find that genetic variants affecting allelic chromatin states are concentrated in footprints, and that these elements are preferentially sheltered from DNA methylation. High-resolution DNase I cleavage patterns mirror nucleotide-level evolutionary conservation and track the crystallographic topography of protein-DNA interfaces, indicating that transcription factor structure has been evolutionarily imprinted on the human genome sequence. We identify a stereotyped 50-base-pair footprint that precisely defines the site of transcript origination within thousands of human promoters. Finally, we describe a large collection of novel regulatory factor recognition motifs that are highly conserved in both sequence and function, and exhibit cell-selective occupancy patterns that closely parallel major regulators of development, differentiation and pluripotency.
Affiliation(s)
- Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | | | - Alex P. Reynolds
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Benjamin Vernot
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Robert E. Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Audra K. Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Matthew T. Maurano
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Richard Humbert
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Eric Rynes
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Hao Wang
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Vaughn Roach
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Jun Neri
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Anthony Schafer
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - R. Scott Hansen
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195
| | - Tanya Kutyavin
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Molly Weaver
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Peter Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Miaohua Zhang
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| | | | - Rachel Byron
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| | - Michael J. MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Joshua M. Akey
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
| | - Michael Bender
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| | - Mark Groudine
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| | - Rajinder Kaul
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195
| | - John A. Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Seattle, WA 98195
- Division of Oncology, Department of Medicine, University of Washington, Seattle, WA 98195
|
9
|
Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, Wang H, Reynolds AP, Sandstrom R, Qu H, Brody J, Shafer A, Neri F, Lee K, Kutyavin T, Stehling-Sun S, Johnson AK, Canfield TK, Giste E, Diegel M, Bates D, Hansen RS, Neph S, Sabo PJ, Heimfeld S, Raubitschek A, Ziegler S, Cotsapas C, Sotoodehnia N, Glass I, Sunyaev SR, Kaul R, Stamatoyannopoulos JA. Systematic localization of common disease-associated variation in regulatory DNA. Science 2012; 337:1190-5. [PMID: 22955828 DOI: 10.1126/science.1222794] [Citation(s) in RCA: 2409] [Impact Index Per Article: 200.8] [Indexed: 12/11/2022]
Abstract
Genome-wide association studies have identified many noncoding variants associated with common diseases and traits. We show that these variants are concentrated in regulatory DNA marked by deoxyribonuclease I (DNase I) hypersensitive sites (DHSs). Eighty-eight percent of such DHSs are active during fetal development and are enriched in variants associated with gestational exposure-related phenotypes. We identified distant gene targets for hundreds of variant-containing DHSs that may explain phenotype associations. Disease-associated variants systematically perturb transcription factor recognition sequences, frequently alter allelic chromatin states, and form regulatory networks. We also demonstrated tissue-selective enrichment of more weakly disease-associated variants within DHSs and the de novo identification of pathogenic cell types for Crohn's disease, multiple sclerosis, and an electrocardiogram trait, without prior knowledge of physiological mechanisms. Our results suggest pervasive involvement of regulatory DNA variation in common human disease and provide pathogenic insights into diverse disorders.
Affiliation(s)
- Matthew T Maurano
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
|
10
|
Neph S, Kuehn MS, Reynolds AP, Haugen E, Thurman RE, Johnson AK, Rynes E, Maurano MT, Vierstra J, Thomas S, Sandstrom R, Humbert R, Stamatoyannopoulos JA. BEDOPS: high-performance genomic feature operations. Bioinformatics 2012; 28:1919-20. [PMID: 22576172 DOI: 10.1093/bioinformatics/bts277] [Citation(s) in RCA: 567] [Impact Index Per Article: 47.3] [Indexed: 11/12/2022]
Abstract
The large and growing number of genome-wide datasets highlights the need for high-performance feature analysis and data comparison methods, in addition to efficient data storage and retrieval techniques. We introduce BEDOPS, a software suite for common genomic analysis tasks which offers improved flexibility, scalability and execution time characteristics over previously published packages. The suite includes a utility to compress large inputs into a lossless format that can provide greater space savings and faster data extractions than alternatives. AVAILABILITY http://code.google.com/p/bedops/ includes binaries, source and documentation.
Affiliation(s)
- Shane Neph
- Department of Genome Sciences and Department of Medicine, University of Washington, Seattle, WA 98195, USA.
|
11
|
Abstract
Cranial MRI was obtained in 13 of a group of 57 children receiving long-term parenteral nutrition, who were being investigated for hypermanganesaemia. Increased signal intensity on T1-weighted images has been reported in adult patients on long-term parenteral nutrition and with encephalopathy following chronic manganese exposure in arc welding. It has been postulated that these changes are due to deposition of the paramagnetic trace element manganese. In excess, manganese is hepato- and neurotoxic, and we present the correlation of whole blood manganese levels with imaging findings. The age range of our patients was 6 months to 10 years, and the duration of therapy 3 months to 10 years. In 7 children we found characteristic increased signal intensity on T1-weighted images, with no abnormality on T2-weighted images. All patients had elevated whole blood manganese levels, suggesting that the basis for this abnormality is indeed deposition of manganese within the tissues.
Affiliation(s)
- G Quaghebeur
- Department of Neuroradiology, National Hospital for Neurology and Neurosurgery, London, UK
|
12
|
Abstract
BACKGROUND In patients receiving long-term parenteral nutrition (PN), cholestatic disease and nervous system disorders have been associated with high blood concentrations of manganese. In such patients, the normal homoeostatic mechanisms of the liver and gut are bypassed and the requirement for this trace element is not known; nor has it been certain whether hypermanganesaemia causes the cholestasis or vice versa. We explored the direction of effect by serial tests of liver function after withdrawal of manganese supplements from children receiving long-term PN. We also examined the relation between blood manganese concentrations and brain lesions, as indicated by clinical examination and magnetic resonance imaging (MRI). METHODS From a combined group of 57 children receiving PN we identified 11 with the combination of hypermanganesaemia and cholestasis; one also had a movement disorder. Manganese supplements were reduced in the first three and withdrawn in the remainder. MRI was done in two of these children. We also looked at manganese concentrations and MRI scans in six children who had received PN for more than 2 years without developing liver disease. FINDINGS In the hypermanganesaemia/cholestasis group, four of the 11 patients died. In the seven survivors baseline whole-blood manganese was 615-1840 nmol/L, and after 4 months it had declined by a median of 643 nmol/L (p < 0.01). Over the same interval total bilirubin declined by a median of 70 μmol/L (p < 0.05). Two of these children had movement disorders, one of whom survived to have an MRI scan; this showed, with T1-weighted images, bilateral symmetrically increased signal intensity in the globus pallidus and subthalamic nuclei. Such changes were also seen in five other children: one from the hypermanganesaemia/cholestasis group and four of six in the long-term PN group without liver disease (in all of whom blood manganese was above normal).
INTERPRETATION The cholestasis complicating PN is multifactorial, but these results add to the evidence that manganese contributes. In view of the additional hazard of basal ganglia damage from high manganese levels in children receiving long-term PN, we recommend a low dose regimen of not more than 0.018 μmol/kg per 24 h together with regular examination of the nervous system.
Affiliation(s)
- J M Fell
- Department of Gastroenterology, Great Ormond Street Hospital for Children NHS Trust, London, UK
|
13
|
Young RJ, Critchley JA, Young KK, Freebairn RC, Reynolds AP, Lolin YI. Fatal acute hepatorenal failure following potassium permanganate ingestion. Hum Exp Toxicol 1996; 15:259-61. [PMID: 8839216 DOI: 10.1177/096032719601500313] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.2] [Indexed: 02/02/2023]
Abstract
Potassium permanganate (KMnO4), a powerful oxidizing agent, is readily available without prescription. Tissue contact produces coagulation necrosis and the lethal consequences of oral ingestion are well described, with most deaths because of airway oedema and obstruction or circulatory collapse. Whilst systemic toxicity is reported, its mechanism is unclear. We describe a case of suicidal ingestion of KMnO4 followed by acute hepatorenal toxicity resulting in the death of the patient. The clinical course bore close resemblance to that of severe paracetamol overdose. We discuss the pathogenesis of the systemic toxicity of KMnO4 and postulate that it is due to oxidative injury from free radicals generated by the absorbed permanganate ion. We recommend that N-acetyl cysteine be given within the first few hours to all patients with potassium permanganate poisoning.
Affiliation(s)
- R J Young
- Department of Anaesthesia and Intensive Care, Chinese University of Hong Kong
|
14
|
Abstract
The current practice of providing manganese supplementation to neonates on long-term parenteral nutrition is leading to a high incidence of hypermanganesaemia. Magnetic resonance imaging (MRI) studies in adults on long-term manganese parenteral nutrition have shown changes in T1-weighted MRI images and similar findings in a neonate receiving trace element supplementation are reported here. Whole blood manganese concentration in the infant was 1740 nmol/l (or 8.3 times upper reference limit). In all neonates on long-term parenteral nutrition with evidence of cholestatic liver disease so far investigated, the whole blood manganese concentrations were > 360 nmol/l (reference range 73-210). Manganese supplementation to patients on long-term parenteral nutrition requires reappraisal, particularly in those who develop cholestatic liver disease associated with parenteral nutrition.
Affiliation(s)
- A P Reynolds
- Great Ormond Street Hospital for Children NHS Trust, London
|
15
|
Bjarnason I, Maxton D, Reynolds AP, Catt S, Peters TJ, Menzies IS. Comparison of four markers of intestinal permeability in control subjects and patients with coeliac disease. Scand J Gastroenterol 1994; 29:630-9. [PMID: 7939400 DOI: 10.3109/00365529409092484] [Citation(s) in RCA: 59] [Impact Index Per Article: 2.0] [Indexed: 02/04/2023]
Abstract
BACKGROUND Controversy surrounds the issue of intestinal permeability in patients with coeliac disease, polyethylene glycol 400 indicating reduced and di-/mono-saccharide urine excretion ratios and 51Cr-labeled ethylenediaminetetraacetic acid (EDTA) indicating increased permeability. METHODS We assessed the suitability of polyethylene glycol 400, L-rhamnose, lactulose, and 51Cr-EDTA as markers of intestinal permeability by assessing urine excretions after simultaneous intravenous instillation of these markers and after oral administration in normals and patients with coeliac disease. RESULTS After intravenous administration the 24-h urine excretion of polyethylene glycol 400, L-rhamnose, lactulose, and 51Cr-EDTA was 40%, 72%, 93%, and 97%, respectively. There was no significant difference between controls and patients with coeliac disease. Oral administration of the markers in an iso- and hyper-osmolar test solution demonstrates reduced permeation due to an osmotic retention effect of lactulose. In contrast, hyperosmolar glycerol increases permeation of all markers except L-rhamnose. Timing of urines and altering osmolarity are important for the behavior of individual markers but do not enhance the discrimination between controls and patients when the differential urine excretion of lactulose/L-rhamnose is used. The sensitivity of the urine excretion ratio of lactulose/L-rhamnose was comparable to that of 51Cr-EDTA used by itself. Whereas lactulose/L-rhamnose and 51Cr-EDTA showed increased intestinal permeability in coeliac disease, the permeation of polyethylene glycol was reduced. Permeation of the markers did not correlate significantly with jejunal histology. CONCLUSIONS Correlations of marker permeation rates with test dose osmolarity in controls and patients with coeliac disease show a variable lack of conformity, suggesting that the markers may permeate the intestine by different routes, which are affected to a different extent in coeliac disease.
Affiliation(s)
- I Bjarnason
- Dept. of Clinical Biochemistry, King's College School of Medicine and Dentistry, London, United Kingdom
|
16
|
Reynolds AP, Akagi H, Hjelm NM. Reliability of immunoassays of cyclosporin A in blood. Clin Chem 1992; 38:1508-9. [PMID: 1643728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/28/2022]
|
17
|
|
18
|
Affiliation(s)
- A P Reynolds
- Department of Clinical Biochemistry, Hospital for Sick Children, London, UK
|
19
|
Maxton DG, Bjarnason I, Reynolds AP, Catt SD, Peters TJ, Menzies IS. Lactulose, 51Cr-labelled ethylenediaminetetra-acetate, L-rhamnose and polyethyleneglycol 400 [corrected] as probe markers for assessment in vivo of human intestinal permeability. Clin Sci (Lond) 1986; 71:71-80. [PMID: 3086024 DOI: 10.1042/cs0710071] [Citation(s) in RCA: 163] [Impact Index Per Article: 4.3] [Indexed: 01/04/2023]
Abstract
The urinary excretion of lactulose, 51Cr-labelled ethylenediaminetetra-acetate (51Cr-EDTA), L-rhamnose and polyethyleneglycol 400 (PEG-400) has been measured after intravenous and oral administration in healthy volunteers. Intestinal permeation of the probes was compared after their ingestion in iso-osmolar, hyperosmolar and cetrimide-containing test solutions. Urinary recovery of lactulose and 51Cr-EDTA after intravenous administration reached 75% by 5 h, and exceeded 90% at 24 h, and these values were 62 and 72%, respectively, for L-rhamnose. Recovery of PEG-400, however, varied with the relative molecular mass (Mr) of each polymer from 25.9 to 68.5% in 24 h. Intestinal permeation of ingested lactulose and 51Cr-EDTA was low, but that of L-rhamnose was 45-fold, and that of PEG-400 100-fold, greater. Permeation of lactulose and 51Cr-EDTA was markedly increased by cetrimide and hyperosmolar stress, whereas that of L-rhamnose showed little change. PEG-400 permeation was not affected by cetrimide, but was slightly increased by hyperosmolar stress. The 5 h permeation of lactulose, but not of L-rhamnose or PEG-400, correlated with that of 51Cr-EDTA (r = 0.98, P < 0.001). These findings are compatible with three distinct pathways of unmediated mucosal permeation, L-rhamnose (radius < 0.4 nm) passing mainly through small aqueous 'pores' of high incidence, lactulose and 51Cr-EDTA (radius > 0.5 nm) through larger aqueous 'channels' of low incidence susceptible to cetrimide and hyperosmolar stress, and PEG-400, which has appreciable lipid solubility, by partition through cell membrane lipid as well as the aqueous 'pores'.
|
20
|
Möhler H, Patel AJ, Johnson AL, Reynolds AP, Balázs R. Metabolic compartmentation in the brain: effects of a central nervous system depressant, 1-hydroxy-3-amino-pyrrolidone-2. J Neurochem 1975; 24:865-73. [PMID: 1141896 DOI: 10.1111/j.1471-4159.1975.tb03649.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Indexed: 12/25/2022]
|
21
|
Reynolds AP, Gallagher BB. The effect of hexafluorodiethyl ether (flurothyl) on the metabolism of rat brain amino acids labelled by (U-14C)glucose. Life Sci 1973; 13:87-95. [PMID: 4751414 DOI: 10.1016/0024-3205(73)90183-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Indexed: 01/12/2023]
|
22
|
Adlard BP, Dobbing J, Lynch A, Balázs R, Reynolds AP. Effect of undernutrition in early life on glutamate decarboxylase activity in the adult brain. Biochem J 1972; 130:12P. [PMID: 4655412 PMCID: PMC1174353 DOI: 10.1042/bj1300012pa] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.0] [Indexed: 01/11/2023]
|
23
|
Reynolds AP, Watkins JC. The effect of strychnine and of electrical stimulation on the labelling of γ-aminobutyric acid and other free amino acids from (U-14C)glucose in the spinal cord of the nembutalized rat. Brain Res 1972; 36:343-51. [PMID: 5009642 DOI: 10.1016/0006-8993(72)90739-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Indexed: 01/13/2023]
|