1
Sullivan AM, Arsovski AA, Thompson A, Sandstrom R, Thurman RE, Neph S, Johnson AK, Sullivan ST, Sabo PJ, Neri FV, Weaver M, Diegel M, Nemhauser JL, Stamatoyannopoulos JA, Bubb KL, Queitsch C. Mapping and Dynamics of Regulatory DNA in Maturing Arabidopsis thaliana Siliques. Front Plant Sci 2019; 10:1434. [PMID: 31798605 PMCID: PMC6868056 DOI: 10.3389/fpls.2019.01434] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Received: 06/21/2019] [Accepted: 10/16/2019] [Indexed: 05/04/2023]
Abstract
The genome is reprogrammed during development to produce diverse cell types, largely through altered expression and activity of key transcription factors. The accessibility and critical functions of epidermal cells have made them a model for connecting transcriptional events to development in a range of model systems. In Arabidopsis thaliana and many other plants, fertilization triggers differentiation of specialized epidermal seed coat cells that have a unique morphology caused by large extracellular deposits of polysaccharides. Here, we used DNase I-seq to generate regulatory landscapes of A. thaliana seeds at two critical time points in seed coat maturation (4 and 7 DPA), enriching for seed coat cells with the INTACT method. We found over 3,000 developmentally dynamic regulatory DNA elements and explored their relationship with nearby gene expression. The dynamic regulatory elements were enriched for motifs for several transcription factor families, most notably the TCP family at the earlier time point and the MYB family at the later one. To assess the extent to which the observed regulatory sites in seeds added to previously known regulatory sites in A. thaliana, we compared our data to 11 other data sets generated with 7-day-old seedlings for diverse tissues and conditions. Surprisingly, over a quarter of the regulatory, i.e., accessible, bases observed in seeds were novel. Notably, plant regulatory landscapes from different tissues, cell types, or developmental stages were more dynamic than those generated from bulk tissue in response to environmental perturbations, highlighting the importance of extending studies of regulatory DNA to single tissues and cell types during development.
Affiliation(s)
- Andrej A. Arsovski
- Department of Biology, University of Washington, Seattle, WA, United States
- Agnieszka Thompson
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Robert E. Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Audra K. Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Shawn T. Sullivan
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Peter J. Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Fidencio V. Neri
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Molly Weaver
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- Kerry L. Bubb
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
- *Correspondence: Kerry L. Bubb,
- Christine Queitsch
- Department of Genome Sciences, University of Washington, Seattle, WA, United States
2
Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, Ziller MJ, Amin V, Whitaker JW, Schultz MD, Ward LD, Sarkar A, Quon G, Sandstrom RS, Eaton ML, Wu YC, Pfenning AR, Wang X, Claussnitzer M, Liu Y, Coarfa C, Harris RA, Shoresh N, Epstein CB, Gjoneska E, Leung D, Xie W, Hawkins RD, Lister R, Hong C, Gascard P, Mungall AJ, Moore R, Chuah E, Tam A, Canfield TK, Hansen RS, Kaul R, Sabo PJ, Bansal MS, Carles A, Dixon JR, Farh KH, Feizi S, Karlic R, Kim AR, Kulkarni A, Li D, Lowdon R, Elliott G, Mercer TR, Neph SJ, Onuchic V, Polak P, Rajagopal N, Ray P, Sallari RC, Siebenthall KT, Sinnott-Armstrong NA, Stevens M, Thurman RE, Wu J, Zhang B, Zhou X, Beaudet AE, Boyer LA, De Jager PL, Farnham PJ, Fisher SJ, Haussler D, Jones SJM, Li W, Marra MA, McManus MT, Sunyaev S, Thomson JA, Tlsty TD, Tsai LH, Wang W, Waterland RA, Zhang MQ, Chadwick LH, Bernstein BE, Costello JF, Ecker JR, Hirst M, Meissner A, Milosavljevic A, Ren B, Stamatoyannopoulos JA, Wang T, Kellis M. Integrative analysis of 111 reference human epigenomes. Nature 2015; 518:317-30. [PMID: 25693563 PMCID: PMC4530010 DOI: 10.1038/nature14248] [Citation(s) in RCA: 3993] [Impact Index Per Article: 443.7] [Received: 04/10/2014] [Accepted: 01/21/2015] [Indexed: 02/06/2023]
Abstract
The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.
Affiliation(s)
- Anshul Kundaje
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [3] Department of Genetics, Department of Computer Science, 300 Pasteur Dr., Lane Building, L301, Stanford, California 94305-5120, USA
- Wouter Meuleman
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Jason Ernst
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [3] Department of Biological Chemistry, University of California, Los Angeles, 615 Charles E Young Dr South, Los Angeles, California 90095, USA
- Misha Bilenky
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia V5Z 1L3, Canada
- Angela Yen
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Alireza Heravi-Moussavi
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia V5Z 1L3, Canada
- Pouya Kheradpour
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Zhizhuo Zhang
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Jianrong Wang
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Michael J Ziller
- [1] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [2] Department of Stem Cell and Regenerative Biology, 7 Divinity Ave, Cambridge, Massachusetts 02138, USA
- Viren Amin
- Epigenome Center, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
- John W Whitaker
- Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA
- Matthew D Schultz
- Genomic Analysis Laboratory, Howard Hughes Medical Institute & The Salk Institute for Biological Studies, 10010 N. Torrey Pines Road, La Jolla, California 92037, USA
- Lucas D Ward
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Abhishek Sarkar
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Gerald Quon
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Richard S Sandstrom
- Department of Genome Sciences, University of Washington, 3720 15th Ave. NE, Seattle, Washington 98195, USA
- Matthew L Eaton
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Yi-Chieh Wu
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Andreas R Pfenning
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Xinchen Wang
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [3] Biology Department, Massachusetts Institute of Technology, 31 Ames St, Cambridge, Massachusetts 02142, USA
- Melina Claussnitzer
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Yaping Liu
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Cristian Coarfa
- Epigenome Center, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
- R Alan Harris
- Epigenome Center, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
- Noam Shoresh
- The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Charles B Epstein
- The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Elizabeta Gjoneska
- [1] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [2] The Picower Institute for Learning and Memory, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, 43 Vassar St, Cambridge, Massachusetts 02139, USA
- Danny Leung
- [1] Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA. [2] Ludwig Institute for Cancer Research, 9500 Gilman Drive, La Jolla, California 92093, USA
- Wei Xie
- [1] Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA. [2] Ludwig Institute for Cancer Research, 9500 Gilman Drive, La Jolla, California 92093, USA
- R David Hawkins
- [1] Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA. [2] Ludwig Institute for Cancer Research, 9500 Gilman Drive, La Jolla, California 92093, USA
- Ryan Lister
- Genomic Analysis Laboratory, Howard Hughes Medical Institute & The Salk Institute for Biological Studies, 10010 N. Torrey Pines Road, La Jolla, California 92037, USA
- Chibo Hong
- Department of Neurosurgery, Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, 1450 3rd Street, San Francisco, California 94158, USA
- Philippe Gascard
- Department of Pathology, University of California San Francisco, 513 Parnassus Avenue, San Francisco, California 94143-0511, USA
- Andrew J Mungall
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia V5Z 1L3, Canada
- Richard Moore
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia V5Z 1L3, Canada
- Eric Chuah
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia V5Z 1L3, Canada
- Angela Tam
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia V5Z 1L3, Canada
- Theresa K Canfield
- Department of Genome Sciences, University of Washington, 3720 15th Ave. NE, Seattle, Washington 98195, USA
- R Scott Hansen
- Department of Medicine, Division of Medical Genetics, University of Washington, 2211 Elliot Avenue, Seattle, Washington 98121, USA
- Rajinder Kaul
- Department of Medicine, Division of Medical Genetics, University of Washington, 2211 Elliot Avenue, Seattle, Washington 98121, USA
- Peter J Sabo
- Department of Genome Sciences, University of Washington, 3720 15th Ave. NE, Seattle, Washington 98195, USA
- Mukul S Bansal
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [3] Department of Computer Science & Engineering, University of Connecticut, 371 Fairfield Way, Storrs, Connecticut 06269, USA
- Annaick Carles
- Department of Microbiology and Immunology and Centre for High-Throughput Biology, University of British Columbia, 2125 East Mall, Vancouver, British Columbia V6T 1Z4, Canada
- Jesse R Dixon
- [1] Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA. [2] Ludwig Institute for Cancer Research, 9500 Gilman Drive, La Jolla, California 92093, USA
- Kai-How Farh
- The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Soheil Feizi
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Rosa Karlic
- Bioinformatics Group, Department of Molecular Biology, Division of Biology, Faculty of Science, University of Zagreb, Horvatovac 102a, 10000 Zagreb, Croatia
- Ah-Ram Kim
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Ashwinikumar Kulkarni
- Department of Molecular and Cell Biology, Center for Systems Biology, The University of Texas, Dallas, NSERL, RL10, 800 W Campbell Road, Richardson, Texas 75080, USA
- Daofeng Li
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University in St Louis, 4444 Forest Park Ave, St Louis, Missouri 63108, USA
- Rebecca Lowdon
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University in St Louis, 4444 Forest Park Ave, St Louis, Missouri 63108, USA
- GiNell Elliott
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University in St Louis, 4444 Forest Park Ave, St Louis, Missouri 63108, USA
- Tim R Mercer
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Queensland 4072, Australia
- Shane J Neph
- Department of Genome Sciences, University of Washington, 3720 15th Ave. NE, Seattle, Washington 98195, USA
- Vitor Onuchic
- Epigenome Center, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
- Paz Polak
- [1] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [2] Brigham & Women's Hospital, 75 Francis Street, Boston, Massachusetts 02115, USA
- Nisha Rajagopal
- [1] Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA. [2] Ludwig Institute for Cancer Research, 9500 Gilman Drive, La Jolla, California 92093, USA
- Pradipta Ray
- Department of Molecular and Cell Biology, Center for Systems Biology, The University of Texas, Dallas, NSERL, RL10, 800 W Campbell Road, Richardson, Texas 75080, USA
- Richard C Sallari
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Kyle T Siebenthall
- Department of Genome Sciences, University of Washington, 3720 15th Ave. NE, Seattle, Washington 98195, USA
- Nicholas A Sinnott-Armstrong
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
- Michael Stevens
- [1] Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University in St Louis, 4444 Forest Park Ave, St Louis, Missouri 63108, USA. [2] Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, Missouri 63130, USA
- Robert E Thurman
- Department of Genome Sciences, University of Washington, 3720 15th Ave. NE, Seattle, Washington 98195, USA
- Jie Wu
- [1] Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794-3600, USA. [2] Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
- Bo Zhang
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University in St Louis, 4444 Forest Park Ave, St Louis, Missouri 63108, USA
- Xin Zhou
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University in St Louis, 4444 Forest Park Ave, St Louis, Missouri 63108, USA
- Arthur E Beaudet
- Molecular and Human Genetics Department, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
- Laurie A Boyer
- Biology Department, Massachusetts Institute of Technology, 31 Ames St, Cambridge, Massachusetts 02142, USA
- Philip L De Jager
- [1] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [2] Brigham & Women's Hospital, 75 Francis Street, Boston, Massachusetts 02115, USA. [3] Harvard Medical School, 25 Shattuck St, Boston, Massachusetts 02115, USA
- Peggy J Farnham
- Department of Biochemistry, Keck School of Medicine, University of Southern California, 1450 Biggy Street, Los Angeles, California 90089-9601, USA
- Susan J Fisher
- ObGyn, Reproductive Sciences, University of California San Francisco, 35 Medical Center Way, San Francisco, California 94143, USA
- David Haussler
- Center for Biomolecular Sciences and Engineering, University of Santa Cruz, 1156 High Street, Santa Cruz, California 95064, USA
- Steven J M Jones
- [1] Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia V5Z 1L3, Canada. [2] Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, British Columbia V5A 1S6, Canada. [3] Department of Medical Genetics, University of British Columbia, 2329 West Mall, Vancouver, BC, Canada, V6T 1Z4
- Wei Li
- Dan L. Duncan Cancer Center, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
- Marco A Marra
- [1] Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia V5Z 1L3, Canada. [2] Department of Medical Genetics, University of British Columbia, 2329 West Mall, Vancouver, BC, Canada, V6T 1Z4
- Michael T McManus
- Department of Microbiology and Immunology, Diabetes Center, University of California, San Francisco, 513 Parnassus Ave, San Francisco, California 94143-0534, USA
- Shamil Sunyaev
- [1] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [2] Brigham & Women's Hospital, 75 Francis Street, Boston, Massachusetts 02115, USA. [3] Harvard Medical School, 25 Shattuck St, Boston, Massachusetts 02115, USA
- James A Thomson
- [1] University of Wisconsin, Madison, Wisconsin 53715, USA. [2] Morgridge Institute for Research, 330 N. Orchard Street, Madison, Wisconsin 53707, USA
- Thea D Tlsty
- Department of Pathology, University of California San Francisco, 513 Parnassus Avenue, San Francisco, California 94143-0511, USA
- Li-Huei Tsai
- [1] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [2] The Picower Institute for Learning and Memory, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, 43 Vassar St, Cambridge, Massachusetts 02139, USA
- Wei Wang
- Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA
- Robert A Waterland
- USDA/ARS Children's Nutrition Research Center, Baylor College of Medicine, 1100 Bates Street, Houston, Texas 77030, USA
- Michael Q Zhang
- [1] Department of Molecular and Cell Biology, Center for Systems Biology, The University of Texas, Dallas, NSERL, RL10, 800 W Campbell Road, Richardson, Texas 75080, USA. [2] Bioinformatics Division, Center for Synthetic and Systems Biology, TNLIST, Tsinghua University, Beijing 100084, China
- Lisa H Chadwick
- National Institute of Environmental Health Sciences, 111 T.W. Alexander Drive, Research Triangle Park, North Carolina 27709, USA
- Bradley E Bernstein
- [1] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [2] Massachusetts General Hospital, 55 Fruit St, Boston, Massachusetts 02114, USA. [3] Howard Hughes Medical Institute, 4000 Jones Bridge Road, Chevy Chase, Maryland 20815-6789, USA
- Joseph F Costello
- Department of Neurosurgery, Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, 1450 3rd Street, San Francisco, California 94158, USA
- Joseph R Ecker
- Genomic Analysis Laboratory, Howard Hughes Medical Institute & The Salk Institute for Biological Studies, 10010 N. Torrey Pines Road, La Jolla, California 92037, USA
- Martin Hirst
- [1] Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia V5Z 1L3, Canada. [2] Department of Microbiology and Immunology and Centre for High-Throughput Biology, University of British Columbia, 2125 East Mall, Vancouver, British Columbia V6T 1Z4, Canada
- Alexander Meissner
- [1] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA. [2] Department of Stem Cell and Regenerative Biology, 7 Divinity Ave, Cambridge, Massachusetts 02138, USA
- Bing Ren
- [1] Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA. [2] Ludwig Institute for Cancer Research, 9500 Gilman Drive, La Jolla, California 92093, USA
- John A Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, 3720 15th Ave. NE, Seattle, Washington 98195, USA
- Ting Wang
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University in St Louis, 4444 Forest Park Ave, St Louis, Missouri 63108, USA
- Manolis Kellis
- [1] Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA. [2] The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, Massachusetts 02142, USA
3
Yue F, Cheng Y, Breschi A, Vierstra J, Wu W, Ryba T, Sandstrom R, Ma Z, Davis C, Pope BD, Shen Y, Pervouchine DD, Djebali S, Thurman RE, Kaul R, Rynes E, Kirilusha A, Marinov GK, Williams BA, Trout D, Amrhein H, Fisher-Aylor K, Antoshechkin I, DeSalvo G, See LH, Fastuca M, Drenkow J, Zaleski C, Dobin A, Prieto P, Lagarde J, Bussotti G, Tanzer A, Denas O, Li K, Bender MA, Zhang M, Byron R, Groudine MT, McCleary D, Pham L, Ye Z, Kuan S, Edsall L, Wu YC, Rasmussen MD, Bansal MS, Kellis M, Keller CA, Morrissey CS, Mishra T, Jain D, Dogan N, Harris RS, Cayting P, Kawli T, Boyle AP, Euskirchen G, Kundaje A, Lin S, Lin Y, Jansen C, Malladi VS, Cline MS, Erickson DT, Kirkup VM, Learned K, Sloan CA, Rosenbloom KR, Lacerda de Sousa B, Beal K, Pignatelli M, Flicek P, Lian J, Kahveci T, Lee D, Kent WJ, Ramalho Santos M, Herrero J, Notredame C, Johnson A, Vong S, Lee K, Bates D, Neri F, Diegel M, Canfield T, Sabo PJ, Wilken MS, Reh TA, Giste E, Shafer A, Kutyavin T, Haugen E, Dunn D, Reynolds AP, Neph S, Humbert R, Hansen RS, De Bruijn M, Selleri L, Rudensky A, Josefowicz S, Samstein R, Eichler EE, Orkin SH, Levasseur D, Papayannopoulou T, Chang KH, Skoultchi A, Gosh S, Disteche C, Treuting P, Wang Y, Weiss MJ, Blobel GA, Cao X, Zhong S, Wang T, Good PJ, Lowdon RF, Adams LB, Zhou XQ, Pazin MJ, Feingold EA, Wold B, Taylor J, Mortazavi A, Weissman SM, Stamatoyannopoulos JA, Snyder MP, Guigo R, Gingeras TR, Gilbert DM, Hardison RC, Beer MA, Ren B. A comparative encyclopedia of DNA elements in the mouse genome. Nature 2014; 515:355-64. [PMID: 25409824 PMCID: PMC4266106 DOI: 10.1038/nature13992] [Citation(s) in RCA: 1135] [Impact Index Per Article: 126.1] [Received: 02/03/2014] [Accepted: 10/24/2014] [Indexed: 12/11/2022]
Abstract
The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Affiliation(s)
- Feng Yue
- [1] Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA. [2] Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, Pennsylvania 17033, USA
- Yong Cheng
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Alessandra Breschi
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Weisheng Wu
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Tyrone Ryba
- Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306-4295, USA
- Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Zhihai Ma
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
- Carrie Davis
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Benjamin D Pope
- Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306-4295, USA
- Yin Shen
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
- Dmitri D Pervouchine
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Sarah Djebali
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
- Robert E Thurman
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Rajinder Kaul
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Eric Rynes
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Anthony Kirilusha
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Georgi K Marinov
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Brian A Williams
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Diane Trout
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Henry Amrhein
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Katherine Fisher-Aylor
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Igor Antoshechkin
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Gilberto DeSalvo
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Lei-Hoon See
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Meagan Fastuca
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Jorg Drenkow
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Chris Zaleski
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Alex Dobin
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
- Pablo Prieto
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Julien Lagarde
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Giovanni Bussotti
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Andrea Tanzer
- [1] Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain. [2] Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Waehringerstrasse 17/3/303, A-1090 Vienna, Austria
| | - Olgert Denas
- Departments of Biology and Mathematics and Computer Science, Emory University, O. Wayne Rollins Research Center, 1510 Clifton Road NE, Atlanta, Georgia 30322, USA
| | - Kanwei Li
- Departments of Biology and Mathematics and Computer Science, Emory University, O. Wayne Rollins Research Center, 1510 Clifton Road NE, Atlanta, Georgia 30322, USA
| | - M A Bender
- [1] Department of Pediatrics, University of Washington, Seattle, Washington 98195, USA. [2] Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Miaohua Zhang
- Basic Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Rachel Byron
- Basic Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Mark T Groudine
- [1] Basic Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA. [2] Department of Radiation Oncology, University of Washington, Seattle, Washington 98195, USA
| | - David McCleary
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Long Pham
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Zhen Ye
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Samantha Kuan
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Lee Edsall
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Yi-Chieh Wu
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
| | - Matthew D Rasmussen
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
| | - Mukul S Bansal
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
| | - Manolis Kellis
- [1] Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA. [2] Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Cheryl A Keller
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Christapher S Morrissey
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Tejaswini Mishra
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Deepti Jain
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Nergiz Dogan
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Robert S Harris
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Philip Cayting
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Trupti Kawli
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Alan P Boyle
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Ghia Euskirchen
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Anshul Kundaje
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Shin Lin
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Yiing Lin
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Camden Jansen
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, California 92697, USA
| | - Venkat S Malladi
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Melissa S Cline
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Drew T Erickson
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Vanessa M Kirkup
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Katrina Learned
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Cricket A Sloan
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Kate R Rosenbloom
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Beatriz Lacerda de Sousa
- Departments of Obstetrics/Gynecology and Pathology, and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California 94143, USA
| | - Kathryn Beal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Miguel Pignatelli
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jin Lian
- Yale University, Department of Genetics, PO Box 208005, 333 Cedar Street, New Haven, Connecticut 06520-8005, USA
| | - Tamer Kahveci
- Computer & Information Sciences & Engineering, University of Florida, Gainesville, Florida 32611, USA
| | - Dongwon Lee
- McKusick-Nathans Institute of Genetic Medicine and Department of Biomedical Engineering, Johns Hopkins University, 733 N. Broadway, BRB 573 Baltimore, Maryland 21205, USA
| | - W James Kent
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California 95064, USA
| | - Miguel Ramalho Santos
- Departments of Obstetrics/Gynecology and Pathology, and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California 94143, USA
| | - Javier Herrero
- [1] European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK. [2] Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London WC1E 6DD, UK
| | - Cedric Notredame
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Audra Johnson
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Matthew S Wilken
- Department of Biological Structure, University of Washington, HSB I-516, 1959 NE Pacific Street, Seattle, Washington 98195, USA
| | - Thomas A Reh
- Department of Biological Structure, University of Washington, HSB I-516, 1959 NE Pacific Street, Seattle, Washington 98195, USA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Anthony Shafer
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Tanya Kutyavin
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Alex P Reynolds
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Richard Humbert
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - R Scott Hansen
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Marella De Bruijn
- MRC Molecular Haematology Unit, University of Oxford, Oxford OX3 9DS, UK
| | - Licia Selleri
- Department of Cell and Developmental Biology, Weill Cornell Medical College, New York, New York 10065, USA
| | - Alexander Rudensky
- HHMI and Ludwig Center at Memorial Sloan Kettering Cancer Center, Immunology Program, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA
| | - Steven Josefowicz
- HHMI and Ludwig Center at Memorial Sloan Kettering Cancer Center, Immunology Program, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA
| | - Robert Samstein
- HHMI and Ludwig Center at Memorial Sloan Kettering Cancer Center, Immunology Program, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Stuart H Orkin
- Dana Farber Cancer Institute, Harvard Medical School, Cambridge, Massachusetts 02138, USA
| | - Dana Levasseur
- University of Iowa Carver College of Medicine, Department of Internal Medicine, Iowa City, Iowa 52242, USA
| | - Thalia Papayannopoulou
- Division of Hematology, Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Kai-Hsin Chang
- University of Iowa Carver College of Medicine, Department of Internal Medicine, Iowa City, Iowa 52242, USA
| | - Arthur Skoultchi
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York 10461, USA
| | - Srikanta Gosh
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York 10461, USA
| | - Christine Disteche
- Department of Pathology, University of Washington, Seattle, Washington 98195, USA
| | - Piper Treuting
- Department of Comparative Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Yanli Wang
- Bioinformatics and Genomics program, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Mitchell J Weiss
- Department of Hematology, St Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
| | - Gerd A Blobel
- [1] Division of Hematology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA. [2] Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Xiaoyi Cao
- Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Sheng Zhong
- Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA
| | - Ting Wang
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Peter J Good
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Rebecca F Lowdon
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Leslie B Adams
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Xiao-Qiao Zhou
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Michael J Pazin
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Elise A Feingold
- NHGRI, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Barbara Wold
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - James Taylor
- Departments of Biology and Mathematics and Computer Science, Emory University, O. Wayne Rollins Research Center, 1510 Clifton Road NE, Atlanta, Georgia 30322, USA
| | - Ali Mortazavi
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, California 92697, USA
| | - Sherman M Weissman
- Yale University, Department of Genetics, PO Box 208005, 333 Cedar Street, New Haven, Connecticut 06520-8005, USA
| | - Michael P Snyder
- Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, California 94305, USA
| | - Roderic Guigo
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Doctor Aiguader, 88, 08003 Barcelona, Catalonia, Spain
| | - Thomas R Gingeras
- Functional Genomics, Cold Spring Harbor Laboratory, Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - David M Gilbert
- Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306-4295, USA
| | - Ross C Hardison
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Michael A Beer
- McKusick-Nathans Institute of Genetic Medicine and Department of Biomedical Engineering, Johns Hopkins University, 733 N. Broadway, BRB 573 Baltimore, Maryland 21205, USA
| | - Bing Ren
- Ludwig Institute for Cancer Research and University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA
|
4
|
Stergachis AB, Neph S, Sandstrom R, Haugen E, Reynolds AP, Zhang M, Byron R, Canfield T, Stelhing-Sun S, Lee K, Thurman RE, Vong S, Bates D, Neri F, Diegel M, Giste E, Dunn D, Vierstra J, Hansen RS, Johnson AK, Sabo PJ, Wilken MS, Reh TA, Treuting PM, Kaul R, Groudine M, Bender MA, Borenstein E, Stamatoyannopoulos JA. Conservation of trans-acting circuitry during mammalian regulatory evolution. Nature 2014; 515:365-70. [PMID: 25409825 PMCID: PMC4405208 DOI: 10.1038/nature13972] [Citation(s) in RCA: 176] [Impact Index Per Article: 19.6] [Received: 02/21/2014] [Accepted: 10/15/2014] [Indexed: 12/27/2022]
Abstract
The basic body plan and major physiological axes have been highly conserved during mammalian evolution, yet only a small fraction of the human genome sequence appears to be subject to evolutionary constraint. To quantify cis- versus trans-acting contributions to mammalian regulatory evolution, we performed genomic DNase I footprinting of the mouse genome across 25 cell and tissue types, collectively defining ∼8.6 million transcription factor (TF) occupancy sites at nucleotide resolution. Here we show that mouse TF footprints conjointly encode a regulatory lexicon that is ∼95% similar to that derived from human TF footprints. However, only ∼20% of mouse TF footprints have human orthologues. Despite substantial turnover of the cis-regulatory landscape, nearly half of all pairwise regulatory interactions connecting mouse TF genes have been maintained in orthologous human cell types through evolutionary innovation of TF recognition sequences. Furthermore, the higher-level organization of mouse TF-to-TF connections into cellular network architectures is nearly identical to that of human. Our results indicate that evolutionary selection on mammalian gene regulation is targeted chiefly at the level of trans-regulatory circuitry, enabling and potentiating cis-regulatory plasticity.
Mouse genomic footprinting reveals conservation of transcription factor (TF) recognition repertoires and trans-regulatory circuitry despite massive turnover of DNA elements that contact TFs in vivo.
Having generated genomic DNase I footprinting data for the mouse genome across 25 cell and tissue types, these authors use the data to quantify cis- versus trans-regulatory contributions to mammalian regulatory evolution. They describe more than 600 motifs that collectively are over 95% similar to those recognized in vivo by human transcription factors (TFs). Despite substantial turnover of the cis-regulatory landscape around each TF gene, nearly half of all pairwise regulatory interactions connecting mouse TF genes have been maintained in orthologous human cell types through evolutionary innovation of TF recognition sequences. Conservation between mouse and human TF regulatory networks is particularly strong at the highest level of organization. The work was performed as part of the mouse ENCODE project.
Affiliation(s)
- Andrew B Stergachis
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Alex P Reynolds
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Miaohua Zhang
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Rachel Byron
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Sandra Stelhing-Sun
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Robert E Thurman
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - R Scott Hansen
- [1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA. [2] Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Audra K Johnson
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Matthew S Wilken
- Department of Biological Structure, University of Washington, Seattle, Washington 98195, USA
| | - Thomas A Reh
- Department of Biological Structure, University of Washington, Seattle, Washington 98195, USA
| | - Piper M Treuting
- Department of Comparative Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Rajinder Kaul
- [1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA. [2] Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Mark Groudine
- [1] Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA. [2] Division of Radiation Oncology, University of Washington, Seattle, Washington 98195, USA
| | - M A Bender
- [1] Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA. [2] Department of Pediatrics, University of Washington, Seattle, Washington 98195, USA
| | - Elhanan Borenstein
- [1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA. [2] Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98102, USA. [3] Santa Fe Institute, Santa Fe, New Mexico 87501, USA
| | - John A Stamatoyannopoulos
- [1] Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA. [2] Department of Medicine, University of Washington, Seattle, Washington 98195, USA
|
5
|
Vierstra J, Rynes E, Sandstrom R, Zhang M, Canfield T, Hansen RS, Stehling-Sun S, Sabo PJ, Byron R, Humbert R, Thurman RE, Johnson AK, Vong S, Lee K, Bates D, Neri F, Diegel M, Giste E, Haugen E, Dunn D, Wilken MS, Josefowicz S, Samstein R, Chang KH, Eichler EE, De Bruijn M, Reh TA, Skoultchi A, Rudensky A, Orkin SH, Papayannopoulou T, Treuting PM, Selleri L, Kaul R, Groudine M, Bender MA, Stamatoyannopoulos JA. Mouse regulatory DNA landscapes reveal global principles of cis-regulatory evolution. Science 2014; 346:1007-12. [PMID: 25411453 PMCID: PMC4337786 DOI: 10.1126/science.1246426] [Citation(s) in RCA: 186] [Impact Index Per Article: 18.6] [Indexed: 12/14/2022]
Abstract
To study the evolutionary dynamics of regulatory DNA, we mapped >1.3 million deoxyribonuclease I-hypersensitive sites (DHSs) in 45 mouse cell and tissue types, and systematically compared these with human DHS maps from orthologous compartments. We found that the mouse and human genomes have undergone extensive cis-regulatory rewiring that combines branch-specific evolutionary innovation and loss with widespread repurposing of conserved DHSs to alternative cell fates, and that this process is mediated by turnover of transcription factor (TF) recognition elements. Despite pervasive evolutionary remodeling of the location and content of individual cis-regulatory regions, within orthologous mouse and human cell types the global fraction of regulatory DNA bases encoding recognition sites for each TF has been strictly conserved. Our findings provide new insights into the evolutionary forces shaping mammalian regulatory DNA landscapes.
Affiliation(s)
- Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Eric Rynes
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Miaohua Zhang
- Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - R Scott Hansen
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Sandra Stehling-Sun
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Rachel Byron
- Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Richard Humbert
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Robert E Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Audra K Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Matthew S Wilken
- Department of Biological Structure, University of Washington, Seattle, WA 98195, USA
| | - Steven Josefowicz
- Immunology Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA. Howard Hughes Medical Institute
| | - Robert Samstein
- Immunology Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA. Howard Hughes Medical Institute
| | - Kai-Hsin Chang
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA. Howard Hughes Medical Institute
| | - Marella De Bruijn
- Medical Research Council (MRC) Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Oxford OX3 9DS, UK
| | - Thomas A Reh
- Department of Biological Structure, University of Washington, Seattle, WA 98195, USA
| | - Arthur Skoultchi
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Alexander Rudensky
- Immunology Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA. Howard Hughes Medical Institute
| | - Stuart H Orkin
- Howard Hughes Medical Institute. Division of Hematology/Oncology, Children's Hospital Boston and Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Stem Cell Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Thalia Papayannopoulou
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Piper M Treuting
- Department of Comparative Medicine, University of Washington, Seattle, WA 98195, USA
| | - Licia Selleri
- Department of Cell and Developmental Biology, Weill Medical College of Cornell University, New York, NY 10065, USA
| | - Rajinder Kaul
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA. Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Mark Groudine
- Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA. Department of Radiation Oncology, University of Washington, Seattle, WA 98109, USA
| | - M A Bender
- Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA. Department of Pediatrics, University of Washington, Seattle, WA 98195, USA
| | - John A Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA. Division of Oncology, Department of Medicine, University of Washington, Seattle, WA 98195, USA.
|
6
|
Sullivan AM, Arsovski AA, Lempe J, Bubb KL, Weirauch MT, Sabo PJ, Sandstrom R, Thurman RE, Neph S, Reynolds AP, Stergachis AB, Vernot B, Johnson AK, Haugen E, Sullivan ST, Thompson A, Neri FV, Weaver M, Diegel M, Mnaimneh S, Yang A, Hughes TR, Nemhauser JL, Queitsch C, Stamatoyannopoulos JA. Mapping and dynamics of regulatory DNA and transcription factor networks in A. thaliana. Cell Rep 2014; 8:2015-2030. [PMID: 25220462 DOI: 10.1016/j.celrep.2014.08.019] [Citation(s) in RCA: 159] [Impact Index Per Article: 15.9] [Received: 12/04/2013] [Revised: 05/20/2014] [Accepted: 08/07/2014] [Indexed: 01/23/2023] Open
Abstract
Our understanding of gene regulation in plants is constrained by our limited knowledge of plant cis-regulatory DNA and its dynamics. We mapped DNase I hypersensitive sites (DHSs) in A. thaliana seedlings and used genomic footprinting to delineate ∼700,000 sites of in vivo transcription factor (TF) occupancy at nucleotide resolution. We show that variation associated with 72 diverse quantitative phenotypes localizes within DHSs. TF footprints encode an extensive cis-regulatory lexicon subject to recent evolutionary pressures, and widespread TF binding within exons may have shaped codon usage patterns. The architecture of A. thaliana TF regulatory networks is strikingly similar to that of animals in spite of diverged regulatory repertoires. We analyzed regulatory landscape dynamics during heat shock and photomorphogenesis, disclosing thousands of environmentally sensitive elements and enabling mapping of key TF regulatory circuits underlying these fundamental responses. Our results provide an extensive resource for the study of A. thaliana gene regulation and functional biology.
Affiliation(s)
| | - Andrej A Arsovski
- Department of Biology, University of Washington, Seattle, WA 98195, USA
| | - Janne Lempe
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Kerry L Bubb
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Matthew T Weirauch
- Center for Autoimmune Genomics and Etiology (CAGE) and Divisions of Biomedical Informatics and Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Robert E Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Alex P Reynolds
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Andrew B Stergachis
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Benjamin Vernot
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Audra K Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Shawn T Sullivan
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Agnieszka Thompson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Fidencio V Neri
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Molly Weaver
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Sanie Mnaimneh
- Donnelly Centre and Department of Molecular Genetics, University of Toronto, Toronto ON M5S 3E1, Canada
| | - Ally Yang
- Donnelly Centre and Department of Molecular Genetics, University of Toronto, Toronto ON M5S 3E1, Canada
| | - Timothy R Hughes
- Donnelly Centre and Department of Molecular Genetics, University of Toronto, Toronto ON M5S 3E1, Canada; Canadian Institute for Advanced Research (CIFAR) Program in Genetic Networks, Toronto ON M5G 1Z8, Canada
| | | | - Christine Queitsch
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
|
7
|
John S, Sabo PJ, Canfield TK, Lee K, Vong S, Weaver M, Wang H, Vierstra J, Reynolds AP, Thurman RE, Stamatoyannopoulos JA. Genome-scale mapping of DNase I hypersensitivity. Curr Protoc Mol Biol 2014; Chapter 27:Unit 21.27. [PMID: 23821440 DOI: 10.1002/0471142727.mb2127s103] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Indexed: 11/11/2022]
Abstract
DNase I-seq is a global and high-resolution method that uses the nonspecific endonuclease DNase I to map chromatin accessibility. These accessible regions, designated as DNase I hypersensitive sites (DHSs), define the regulatory features (e.g., promoters, enhancers, insulators, and locus control regions) of complex genomes. In this unit, methods are described for nuclei isolation, digestion of nuclei with limiting concentrations of DNase I, and the biochemical fractionation of DNase I hypersensitive sites in preparation for high-throughput sequencing. DNase I-seq is an unbiased and robust method that is not predicated on an a priori understanding of regulatory patterns or chromatin features.
Affiliation(s)
- Sam John
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
|
8
|
Bauer DE, Kamran SC, Lessard S, Xu J, Fujiwara Y, Lin C, Shao Z, Canver MC, Smith EC, Pinello L, Sabo PJ, Vierstra J, Voit RA, Yuan GC, Porteus MH, Stamatoyannopoulos JA, Lettre G, Orkin SH. An erythroid enhancer of BCL11A subject to genetic variation determines fetal hemoglobin level. Science 2013; 342:253-7. [PMID: 24115442 PMCID: PMC4018826 DOI: 10.1126/science.1242088] [Citation(s) in RCA: 448] [Impact Index Per Article: 40.7] [Indexed: 12/11/2022]
Abstract
Genome-wide association studies (GWASs) have ascertained numerous trait-associated common genetic variants, frequently localized to regulatory DNA. We found that common genetic variation at BCL11A associated with fetal hemoglobin (HbF) level lies in noncoding sequences decorated by an erythroid enhancer chromatin signature. Fine-mapping uncovers a motif-disrupting common variant associated with reduced transcription factor (TF) binding, modestly diminished BCL11A expression, and elevated HbF. The surrounding sequences function in vivo as a developmental stage-specific, lineage-restricted enhancer. Genome engineering reveals the enhancer is required in erythroid but not B-lymphoid cells for BCL11A expression. These findings illustrate how GWASs may expose functional variants of modest impact within causal elements essential for appropriate gene expression. We propose the GWAS-marked BCL11A enhancer represents an attractive target for therapeutic genome engineering for the β-hemoglobinopathies.
Affiliation(s)
- Daniel E. Bauer
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, 02115
- Harvard Medical School, Boston, MA, 02115
| | - Sophia C. Kamran
- Harvard Medical School, Boston, MA, 02115
- Howard Hughes Medical Institute, Boston, MA, 02115
| | - Samuel Lessard
- Montreal Heart Institute and Université Montréal, Montreal, Quebec, H1T 1C8, Canada
| | - Jian Xu
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
- Harvard Medical School, Boston, MA, 02115
| | - Yuko Fujiwara
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
| | - Carrie Lin
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
| | - Zhen Shao
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
| | | | - Elenoe C. Smith
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
| | - Luca Pinello
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, 02115
| | - Peter J. Sabo
- Departments of Genome Sciences and Medicine, University of Washington, Seattle, WA, 98195
| | - Jeff Vierstra
- Departments of Genome Sciences and Medicine, University of Washington, Seattle, WA, 98195
| | - Richard A. Voit
- Department of Pediatrics, Stanford University, Palo Alto, CA, 94304
| | - Guo-Cheng Yuan
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, 02115
- Harvard School of Public Health, Boston, MA, 02115
| | | | | | - Guillaume Lettre
- Montreal Heart Institute and Université Montréal, Montreal, Quebec, H1T 1C8, Canada
| | - Stuart H. Orkin
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, 02115
- Harvard Medical School, Boston, MA, 02115
- Howard Hughes Medical Institute, Boston, MA, 02115
|
9
|
Xiong Q, Zhang Z, Chang KH, Qu H, Wang H, Qi H, Li Y, Ruan X, Yang Y, Yang Y, Li Y, Sandstrom R, Sabo PJ, Li Q, Stamatoyannopoulos G, Stamatoyannopoulos JA, Fang X. Comprehensive characterization of erythroid-specific enhancers in the genomic regions of human Krüppel-like factors. BMC Genomics 2013; 14:587. [PMID: 23985037 PMCID: PMC3846580 DOI: 10.1186/1471-2164-14-587] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Received: 03/18/2013] [Accepted: 08/23/2013] [Indexed: 11/10/2022] Open
Abstract
Background: Mapping of DNase I hypersensitive sites (DHSs) is a powerful tool to experimentally identify cis-regulatory elements (CREs). Among CREs, enhancers are abundant and predominantly act in driving cell-specific gene expression. Krüppel-like factors (KLFs) are a family of eukaryotic transcription factors. Several KLFs have been demonstrated to play important roles in hematopoiesis. However, transcriptional regulation of KLFs via CREs, particularly enhancers, in erythroid cells has been poorly understood. Results: In this study, 23 erythroid-specific or putative erythroid-specific DHSs were identified by DNase-seq in the genomic regions of 17 human KLFs, and their enhancer activities were evaluated using dual-luciferase reporter (DLR) assay. Of the 23 erythroid-specific DHSs, the enhancer activities of 15 DHSs were comparable to that of the classical enhancer HS2 in driving minimal promoter (minP). Fifteen DHSs, some overlapping those that increased minP activities, acted as enhancers when driving the corresponding KLF promoters (KLF-Ps) in erythroid cells; of these, 10 DHSs were finally characterized as erythroid-specific KLF enhancers. These 10 erythroid-specific KLF enhancers were further confirmed using chromatin immunoprecipitation coupled to sequencing (ChIP-seq) data-based bioinformatic and biochemical analyses. Conclusion: Our present findings provide a feasible strategy to extensively identify gene- and cell-specific enhancers from DHSs obtained by high-throughput sequencing, which will help reveal the transcriptional regulation and biological functions of genes in some specific cells.
Affiliation(s)
- Qian Xiong
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, P.R. China.
|
10
|
Lazarovici A, Zhou T, Shafer A, Machado ACD, Sandstrom R, Sabo PJ, Lu Y, Rohs R, Stamatoyannopoulos JA, Bussemaker HJ. 103 Probing DNA shape and methylation state on a genomic scale with DNase I. J Biomol Struct Dyn 2013. [DOI: 10.1080/07391102.2013.786345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/26/2022]
|
11
|
Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang H, Vernot B, Garg K, Sandstrom R, Bates D, Canfield TK, Diegel M, Dunn D, Ebersol AK, Frum T, Giste E, Harding L, Johnson AK, Johnson EM, Kutyavin T, Lajoie B, Lee BK, Lee K, London D, Lotakis D, Neph S, Neri F, Nguyen ED, Reynolds AP, Roach V, Safi A, Sanchez ME, Sanyal A, Shafer A, Simon JM, Song L, Vong S, Weaver M, Zhang Z, Zhang Z, Lenhard B, Tewari M, Dorschner MO, Hansen RS, Navas PA, Stamatoyannopoulos G, Iyer VR, Lieb JD, Sunyaev SR, Akey JM, Sabo PJ, Kaul R, Furey TS, Dekker J, Crawford GE, Stamatoyannopoulos JA. The accessible chromatin landscape of the human genome. Nature 2012; 489:75-82. [PMID: 22955617 PMCID: PMC3721348 DOI: 10.1038/nature11232] [Citation(s) in RCA: 1898] [Impact Index Per Article: 158.2] [Received: 12/15/2011] [Accepted: 05/15/2012] [Indexed: 02/07/2023]
Abstract
DNase I hypersensitive sites (DHSs) are markers of regulatory DNA and have underpinned the discovery of all classes of cis-regulatory elements including enhancers, promoters, insulators, silencers and locus control regions. Here we present the first extensive map of human DHSs identified through genome-wide profiling in 125 diverse cell and tissue types. We identify ∼2.9 million DHSs that encompass virtually all known experimentally validated cis-regulatory sequences and expose a vast trove of novel elements, most with highly cell-selective regulation. Annotating these elements using ENCODE data reveals novel relationships between chromatin accessibility, transcription, DNA methylation and regulatory factor occupancy patterns. We connect ∼580,000 distal DHSs with their target promoters, revealing systematic pairing of different classes of distal DHSs and specific promoter types. Patterning of chromatin accessibility at many regulatory regions is organized with dozens to hundreds of co-activated elements, and the transcellular DNase I sensitivity pattern at a given region can predict cell-type-specific functional behaviours. The DHS landscape shows signatures of recent functional evolutionary constraint. However, the DHS compartment in pluripotent and immortalized cells exhibits higher mutation rates than that in highly differentiated cells, exposing an unexpected link between chromatin accessibility, proliferative potential and patterns of human variation.
Affiliation(s)
- Robert E. Thurman
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Eric Rynes
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Richard Humbert
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Jeff Vierstra
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | - Eric Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | | | - Hao Wang
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Benjamin Vernot
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Kavita Garg
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Daniel Bates
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | - Morgan Diegel
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Douglas Dunn
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Abigail K. Ebersol
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Tristan Frum
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Erika Giste
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Lisa Harding
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Audra K. Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Ericka M. Johnson
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Tanya Kutyavin
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Bryan Lajoie
- Program in Gene Function, University of Massachusetts Medical School, Worcester, MA
| | - Bum-Kyu Lee
- Institute for Cellular and Molecular Biology, University of Texas, Austin, TX
| | - Kristen Lee
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Darin London
- Institute for Genome Sciences and Policy, Duke University, Durham, NC
| | - Dimitra Lotakis
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Shane Neph
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Fidencio Neri
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Eric D. Nguyen
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Alex P. Reynolds
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Vaughn Roach
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Alexias Safi
- Institute for Genome Sciences and Policy, Duke University, Durham, NC
| | - Minerva E. Sanchez
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Amartya Sanyal
- Program in Gene Function, University of Massachusetts Medical School, Worcester, MA
| | - Anthony Shafer
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Jeremy M. Simon
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Lingyun Song
- Institute for Genome Sciences and Policy, Duke University, Durham, NC
| | - Shinny Vong
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Molly Weaver
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Zhancheng Zhang
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Zhuzhu Zhang
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Boris Lenhard
- Bergen Center for Computational Science, University of Bergen, Bergen, Norway
| | - Muneesh Tewari
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Michael O. Dorschner
- Dept. of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA
| | - R. Scott Hansen
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Patrick A. Navas
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | | | - Vishwanath R. Iyer
- Institute for Cellular and Molecular Biology, University of Texas, Austin, TX
| | - Jason D. Lieb
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Shamil R. Sunyaev
- Dept. of Medicine, Division of Genetics, Brigham & Women’s Hospital and Harvard Medical School, Boston, MA
| | - Joshua M. Akey
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Peter J. Sabo
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Rajinder Kaul
- Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA
| | - Terrence S. Furey
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Job Dekker
- Program in Gene Function, University of Massachusetts Medical School, Worcester, MA
| | | | - John A. Stamatoyannopoulos
- Department of Genome Sciences, University of Washington, Seattle, WA
- Department of Medicine, Division of Oncology, University of Washington, Seattle, WA
|
12
|
Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, Wang H, Reynolds AP, Sandstrom R, Qu H, Brody J, Shafer A, Neri F, Lee K, Kutyavin T, Stehling-Sun S, Johnson AK, Canfield TK, Giste E, Diegel M, Bates D, Hansen RS, Neph S, Sabo PJ, Heimfeld S, Raubitschek A, Ziegler S, Cotsapas C, Sotoodehnia N, Glass I, Sunyaev SR, Kaul R, Stamatoyannopoulos JA. Systematic localization of common disease-associated variation in regulatory DNA. Science 2012; 337:1190-5. [PMID: 22955828 DOI: 10.1126/science.1222794] [Citation(s) in RCA: 2409] [Impact Index Per Article: 200.8] [Indexed: 12/11/2022]
Abstract
Genome-wide association studies have identified many noncoding variants associated with common diseases and traits. We show that these variants are concentrated in regulatory DNA marked by deoxyribonuclease I (DNase I) hypersensitive sites (DHSs). Eighty-eight percent of such DHSs are active during fetal development and are enriched in variants associated with gestational exposure-related phenotypes. We identified distant gene targets for hundreds of variant-containing DHSs that may explain phenotype associations. Disease-associated variants systematically perturb transcription factor recognition sequences, frequently alter allelic chromatin states, and form regulatory networks. We also demonstrated tissue-selective enrichment of more weakly disease-associated variants within DHSs and the de novo identification of pathogenic cell types for Crohn's disease, multiple sclerosis, and an electrocardiogram trait, without prior knowledge of physiological mechanisms. Our results suggest pervasive involvement of regulatory DNA variation in common human disease and provide pathogenic insights into diverse disorders.
Affiliation(s)
- Matthew T Maurano
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
|
13
|
Stamatoyannopoulos JA, Snyder M, Hardison R, Ren B, Gingeras T, Gilbert DM, Groudine M, Bender M, Kaul R, Canfield T, Giste E, Johnson A, Zhang M, Balasundaram G, Byron R, Roach V, Sabo PJ, Sandstrom R, Stehling AS, Thurman RE, Weissman SM, Cayting P, Hariharan M, Lian J, Cheng Y, Landt SG, Ma Z, Wold BJ, Dekker J, Crawford GE, Keller CA, Wu W, Morrissey C, Kumar SA, Mishra T, Jain D, Byrska-Bishop M, Blankenberg D, Lajoie BR, Jain G, Sanyal A, Chen KB, Denas O, Taylor J, Blobel GA, Weiss MJ, Pimkin M, Deng W, Marinov GK, Williams BA, Fisher-Aylor KI, Desalvo G, Kiralusha A, Trout D, Amrhein H, Mortazavi A, Edsall L, McCleary D, Kuan S, Shen Y, Yue F, Ye Z, Davis CA, Zaleski C, Jha S, Xue C, Dobin A, Lin W, Fastuca M, Wang H, Guigo R, Djebali S, Lagarde J, Ryba T, Sasaki T, Malladi VS, Cline MS, Kirkup VM, Learned K, Rosenbloom KR, Kent WJ, Feingold EA, Good PJ, Pazin M, Lowdon RF, Adams LB. An encyclopedia of mouse DNA elements (Mouse ENCODE). Genome Biol 2012; 13:418. [PMID: 22889292 PMCID: PMC3491367 DOI: 10.1186/gb-2012-13-8-418] [Citation(s) in RCA: 343] [Impact Index Per Article: 28.6] [Indexed: 12/04/2022] Open
Abstract
To complement the human Encyclopedia of DNA Elements (ENCODE) project and to enable a broad range of mouse genomics efforts, the Mouse ENCODE Consortium is applying the same experimental pipelines developed for human ENCODE to annotate the mouse genome.
Affiliation(s)
- John A Stamatoyannopoulos
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | - Michael Snyder
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| | - Ross Hardison
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Bing Ren
- Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, University of California San Diego, La Jolla, California, USA
| | - Thomas Gingeras
- Dept. of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - David M Gilbert
- Department of Biological Science, Florida State University, Tallahassee, Florida, USA
| | - Mark Groudine
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Michael Bender
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Rajinder Kaul
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | - Theresa Canfield
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | - Erica Giste
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | - Audra Johnson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | - Mia Zhang
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Gayathri Balasundaram
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Rachel Byron
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Vaughan Roach
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | - Peter J Sabo
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | - A Sandra Stehling
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | - Robert E Thurman
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
| | | | - Philip Cayting
- Department of Genetics, Yale University, New Haven, Connecticut, USA
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, USA
| | - Manoj Hariharan
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| | - Jin Lian
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA
| | - Yong Cheng
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| | - Stephen G Landt
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| | - Zhihai Ma
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| | - Barbara J Wold
- Div. of Biology, California Institute of Technology, Pasadena, California, USA
| | - Job Dekker
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts, USA
| | - Gregory E Crawford
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, USA
- Department of Pediatrics, Duke University, Durham, North Carolina, USA
| | - Cheryl A Keller
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Weisheng Wu
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Christopher Morrissey
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Swathi A Kumar
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Tejaswini Mishra
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Deepti Jain
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Marta Byrska-Bishop
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Daniel Blankenberg
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Bryan R Lajoie
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| | - Gaurav Jain
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts, USA
| | - Amartya Sanyal
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts, USA
| | - Kaun-Bei Chen
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, USA
| | - Olgert Denas
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, USA
| | - James Taylor
- Department of Mathematics and Computer Science, Emory University, Atlanta, Georgia, USA
| | - Gerd A Blobel
- Div. of Hematology, Children's Hospital of Philadelphia, Abramson Research Center, Philadelphia, Pennsylvania, USA
| | - Mitchell J Weiss
- Div. of Hematology, Children's Hospital of Philadelphia, Abramson Research Center, Philadelphia, Pennsylvania, USA
| | - Max Pimkin
- Div. of Hematology, Children's Hospital of Philadelphia, Abramson Research Center, Philadelphia, Pennsylvania, USA
| | - Wulan Deng
- Div. of Hematology, Children's Hospital of Philadelphia, Abramson Research Center, Philadelphia, Pennsylvania, USA
| | - Georgi K Marinov
- Div. of Biology, California Institute of Technology, Pasadena, California, USA
| | - Brian A Williams
- Div. of Biology, California Institute of Technology, Pasadena, California, USA
| | | | - Gilberto Desalvo
- Div. of Biology, California Institute of Technology, Pasadena, California, USA
| | - Anthony Kiralusha
- Div. of Biology, California Institute of Technology, Pasadena, California, USA
| | - Diane Trout
- Div. of Biology, California Institute of Technology, Pasadena, California, USA
| | - Henry Amrhein
- Div. of Biology, California Institute of Technology, Pasadena, California, USA
| | - Ali Mortazavi
- Dept. of Developmental and Cell Biology, University of California Irvine, Irvine, California, USA
| | - Lee Edsall
- Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, University of California San Diego, La Jolla, California, USA
| | - David McCleary
- Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, University of California San Diego, La Jolla, California, USA
| | - Samantha Kuan
- Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, University of California San Diego, La Jolla, California, USA
| | - Yin Shen
- Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, University of California San Diego, La Jolla, California, USA
| | - Feng Yue
- Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, University of California San Diego, La Jolla, California, USA
| | - Zhen Ye
- Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, University of California San Diego, La Jolla, California, USA
| | - Carrie A Davis
- Dept. of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Chris Zaleski
- Dept. of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Sonali Jha
- Dept. of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Chenghai Xue
- Dept. of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Alex Dobin
- Dept. of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Wei Lin
- Dept. of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Meagan Fastuca
- Dept. of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Huaien Wang
- Dept. of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Roderic Guigo
- Division of Bioinformatics and Genomics, Center for Genomic Regulation, Barcelona, Catalunya, Spain
| | - Sarah Djebali
- Division of Bioinformatics and Genomics, Center for Genomic Regulation, Barcelona, Catalunya, Spain
| | - Julien Lagarde
- Division of Bioinformatics and Genomics, Center for Genomic Regulation, Barcelona, Catalunya, Spain
| | - Tyrone Ryba
- Department of Biological Science, Florida State University, Tallahassee, Florida, USA
| | - Takayo Sasaki
- Department of Biological Science, Florida State University, Tallahassee, Florida, USA
| | - Venkat S Malladi
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California, USA
| | - Melissa S Cline
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California, USA
| | - Vanessa M Kirkup
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California, USA
| | - Katrina Learned
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California, USA
| | - Kate R Rosenbloom
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California, USA
| | - W James Kent
- Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, California, USA
| | - Elise A Feingold
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Peter J Good
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Michael Pazin
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Rebecca F Lowdon
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Leslie B Adams
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
|
14
|
Ganis JJ, Hsia N, Trompouki E, de Jong JLO, DiBiase A, Lambert JS, Jia Z, Sabo PJ, Weaver M, Sandstrom R, Stamatoyannopoulos JA, Zhou Y, Zon LI. Zebrafish globin switching occurs in two developmental stages and is controlled by the LCR. Dev Biol 2012; 366:185-94. [PMID: 22537494 DOI: 10.1016/j.ydbio.2012.03.021] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.8] [Received: 10/25/2011] [Revised: 02/20/2012] [Accepted: 03/19/2012] [Indexed: 02/02/2023]
Abstract
Globin gene switching is a complex, highly regulated process allowing expression of distinct globin genes at specific developmental stages. Here, for the first time, we have characterized all of the zebrafish globins based on the completed genomic sequence. Two distinct chromosomal loci, termed major (chromosome 3) and minor (chromosome 12), harbor the globin genes containing α/β pairs in a 5'-3' to 3'-5' orientation. Both these loci share synteny with the mammalian α-globin locus. Zebrafish globin expression was assayed during development and demonstrated two globin switches, similar to human development. A conserved regulatory element, the locus control region (LCR), was revealed by analyzing DNase I hypersensitive sites, H3K4 trimethylation marks and GATA1 binding sites. Surprisingly, the position of these sites with relation to the globin genes is evolutionarily conserved, despite a lack of overall sequence conservation. Motifs within the zebrafish LCR include CACCC, GATA, and NFE2 sites, suggesting functional interactions with known transcription factors but not the same LCR architecture. Functional homology to the mammalian α-LCR MCS-R2 region was confirmed by robust and specific reporter expression in erythrocytes of transgenic zebrafish. Our studies provide a comprehensive characterization of the zebrafish globin loci and clarify the regulation of globin switching.
Affiliation(s)
- Jared J Ganis
- Stem Cell Program and Division of Hematology/Oncology, Children's Hospital and Dana Farber Cancer Institute, and Harvard Stem Cell Institute, Harvard Medical School, 1 Blackfan Cir., Karp 7, Boston, MA 02115, USA.
|
15
|
Thomas S, Li XY, Sabo PJ, Sandstrom R, Thurman RE, Canfield TK, Giste E, Fisher W, Hammonds A, Celniker SE, Biggin MD, Stamatoyannopoulos JA. Dynamic reprogramming of chromatin accessibility during Drosophila embryo development. Genome Biol 2011; 12:R43. [PMID: 21569360 PMCID: PMC3219966 DOI: 10.1186/gb-2011-12-5-r43] [Citation(s) in RCA: 126] [Impact Index Per Article: 9.7] [Received: 02/08/2011] [Revised: 03/21/2011] [Accepted: 05/11/2011] [Indexed: 12/22/2022]
Abstract
Background: The development of complex organisms is believed to involve progressive restrictions in cellular fate. Understanding the scope and features of chromatin dynamics during embryogenesis, and identifying regulatory elements important for directing developmental processes remain key goals of developmental biology. Results: We used in vivo DNaseI sensitivity to map the locations of regulatory elements, and explore the changing chromatin landscape during the first 11 hours of Drosophila embryonic development. We identified thousands of conserved, developmentally dynamic, distal DNaseI hypersensitive sites associated with spatial and temporal expression patterning of linked genes and with large regions of chromatin plasticity. We observed a nearly uniform balance between developmentally up- and down-regulated DNaseI hypersensitive sites. Analysis of promoter chromatin architecture revealed a novel role for classical core promoter sequence elements in directing temporally regulated chromatin remodeling. Another unexpected feature of the chromatin landscape was the presence of localized accessibility over many protein-coding regions, subsets of which were developmentally regulated or associated with the transcription of genes with prominent maternal RNA contributions in the blastoderm. Conclusions: Our results provide a global view of the rich and dynamic chromatin landscape of early animal development, as well as novel insights into the organization of developmentally regulated chromatin features.
Affiliation(s)
- Sean Thomas
- Department of Genome Sciences, University of Washington, Foege S310A, 1705 NE Pacific Street, Box 355065, Seattle, WA 98195, USA
|
16
|
Li XY, Thomas S, Sabo PJ, Eisen MB, Stamatoyannopoulos JA, Biggin MD. The role of chromatin accessibility in directing the widespread, overlapping patterns of Drosophila transcription factor binding. Genome Biol 2011; 12:R34. [PMID: 21473766 PMCID: PMC3218860 DOI: 10.1186/gb-2011-12-4-r34] [Citation(s) in RCA: 180] [Impact Index Per Article: 13.8] [Received: 03/19/2011] [Accepted: 04/07/2011] [Indexed: 12/11/2022]
Abstract
Background: In Drosophila embryos, many biochemically and functionally unrelated transcription factors bind quantitatively to highly overlapping sets of genomic regions, with much of the lowest levels of binding being incidental, non-functional interactions on DNA. The primary biochemical mechanisms that drive these genome-wide occupancy patterns have yet to be established. Results: Here we use data resulting from the DNaseI digestion of isolated embryo nuclei to provide a biophysical measure of the degree to which proteins can access different regions of the genome. We show that the in vivo binding patterns of 21 developmental regulators are quantitatively correlated with DNA accessibility in chromatin. Furthermore, we find that levels of factor occupancy in vivo correlate much more with the degree of chromatin accessibility than with occupancy predicted from in vitro affinity measurements using purified protein and naked DNA. Within accessible regions, however, the intrinsic affinity of the factor for DNA does play a role in determining net occupancy, with even weak affinity recognition sites contributing. Finally, we show that programmed changes in chromatin accessibility between different developmental stages correlate with quantitative alterations in factor binding. Conclusions: Based on these and other results, we propose a general mechanism to explain the widespread, overlapping DNA binding by animal transcription factors. In this view, transcription factors are expressed at sufficiently high concentrations in cells such that they can occupy their recognition sequences in highly accessible chromatin without the aid of physical cooperative interactions with other proteins, leading to highly overlapping, graded binding of unrelated factors.
Affiliation(s)
- Xiao-Yong Li
- Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road MS 84-171, Berkeley, CA 94720, USA
|
17
|
Hakim O, Sung MH, Voss TC, Splinter E, John S, Sabo PJ, Thurman RE, Stamatoyannopoulos JA, de Laat W, Hager GL. Diverse gene reprogramming events occur in the same spatial clusters of distal regulatory elements. Genome Res 2011; 21:697-706. [PMID: 21471403 DOI: 10.1101/gr.111153.110] [Citation(s) in RCA: 114] [Impact Index Per Article: 8.8] [Indexed: 12/20/2022]
Abstract
The spatial organization of genes in the interphase nucleus plays an important role in establishment and regulation of gene expression. Contradicting results have been reported to date, with little consensus about the dynamics of nuclear organization and the features of the contact loci. In this study, we investigated the properties and dynamics of genomic loci that are in contact with glucocorticoid receptor (GR)-responsive loci. We took a systematic approach, combining genome-wide interaction profiling by the chromosome conformation capture on chip (4C) technology with expression, protein occupancy, and chromatin accessibility profiles. This approach allowed a comprehensive analysis of how distinct features of the linear genome are organized in the three-dimensional nuclear space in the context of rapid gene regulation. We found that the transcriptional response to GR occurs without dramatic nuclear reorganization. Moreover, contrary to the view of transcription-driven organization, even genes with opposite transcriptional responses colocalize. Regions contacting GR-regulated genes are not particularly enriched for GR-regulated loci or for any functional group of genes, suggesting that these subnuclear environments are not organized to respond to a specific factor. The contact regions are, however, highly enriched for DNase I-hypersensitive sites that comprehensively mark cell-type-specific regulatory sites. These findings indicate that the nucleus is pre-organized in a conformation allowing rapid transcriptional reprogramming, and this organization is significantly correlated with cell-type-specific chromatin sites accessible to regulatory factors. Numerous open chromatin loci may be arranged in nuclear domains that are poised to respond to diverse signals in general and to permit efficient gene regulation.
Affiliation(s)
- Ofir Hakim
- Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892-5055, USA
|
18
|
Kaplan T, Li XY, Sabo PJ, Thomas S, Stamatoyannopoulos JA, Biggin MD, Eisen MB. Quantitative models of the mechanisms that control genome-wide patterns of transcription factor binding during early Drosophila development. PLoS Genet 2011; 7:e1001290. [PMID: 21304941 PMCID: PMC3033374 DOI: 10.1371/journal.pgen.1001290] [Citation(s) in RCA: 139] [Impact Index Per Article: 10.7] [Received: 11/04/2010] [Accepted: 01/01/2011] [Indexed: 01/01/2023]
Abstract
Transcription factors that drive complex patterns of gene expression during animal development bind to thousands of genomic regions, with quantitative differences in binding across bound regions mediating their activity. While we now have tools to characterize the DNA affinities of these proteins and to precisely measure their genome-wide distribution in vivo, our understanding of the forces that determine where, when, and to what extent they bind remains primitive. Here we use a thermodynamic model of transcription factor binding to evaluate the contribution of different biophysical forces to the binding of five regulators of early embryonic anterior-posterior patterning in Drosophila melanogaster. Predictions based on DNA sequence and in vitro protein-DNA affinities alone achieve a correlation of ∼0.4 with experimental measurements of in vivo binding. Incorporating cooperativity and competition among the five factors, and accounting for spatial patterning by modeling binding in every nucleus independently, had little effect on prediction accuracy. A major source of error was the prediction of binding events that do not occur in vivo, which we hypothesized reflected reduced accessibility of chromatin. To test this, we incorporated experimental measurements of genome-wide DNA accessibility into our model, effectively restricting predicted binding to regions of open chromatin. This dramatically improved our predictions to a correlation of 0.6-0.9 for various factors across known target genes. Finally, we used our model to quantify the roles of DNA sequence, accessibility, and binding competition and cooperativity. Our results show that, in regions of open chromatin, binding can be predicted almost exclusively by the sequence specificity of individual factors, with a minimal role for protein interactions. 
We suggest that a combination of experimentally determined chromatin accessibility data and simple computational models of transcription factor binding may be used to predict the binding landscape of any animal transcription factor with significant precision.
Affiliation(s)
- Tommy Kaplan
- Department of Molecular and Cell Biology, California Institute of Quantitative Biosciences, University of California Berkeley, Berkeley, California, United States of America
| | - Xiao-Yong Li
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, California, United States of America
| | - Peter J. Sabo
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Sean Thomas
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | | | - Mark D. Biggin
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | - Michael B. Eisen
- Department of Molecular and Cell Biology, California Institute of Quantitative Biosciences, University of California Berkeley, Berkeley, California, United States of America
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, California, United States of America
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
|
19
|
John S, Sabo PJ, Thurman RE, Sung MH, Biddie SC, Johnson TA, Hager GL, Stamatoyannopoulos JA. Chromatin accessibility pre-determines glucocorticoid receptor binding patterns. Nat Genet 2011; 43:264-8. [PMID: 21258342 PMCID: PMC6386452 DOI: 10.1038/ng.759] [Citation(s) in RCA: 691] [Impact Index Per Article: 53.2] [Received: 08/30/2010] [Accepted: 12/29/2010] [Indexed: 12/25/2022]
Abstract
Development, differentiation, and response to environmental stimuli are characterized by sequential changes in cellular state initiated by the de novo binding of regulated transcriptional factors to their cognate genomic sites. The mechanism whereby a given regulatory factor selects a limited number of in vivo targets from myriads of potential genomic binding sites is undetermined. Here we show that up to 95% of induced de novo genomic binding by the glucocorticoid receptor, a paradigmatic ligand-activated transcription factor, is targeted to pre-existing foci of accessible chromatin. Factor binding invariably potentiates chromatin accessibility. Cell-selective glucocorticoid receptor genomic occupancy patterns appear to be comprehensively pre-determined by cell-specific differences in baseline chromatin accessibility patterns, with secondary contributions from local sequence features. The results define a novel framework for understanding regulatory factor-genome interactions, and provide a molecular basis for the tissue-selectivity of steroid pharmaceuticals and other agents that intersect the living genome.
Affiliation(s)
- Sam John
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
|
20
|
Kharchenko PV, Alekseyenko AA, Schwartz YB, Minoda A, Riddle NC, Ernst J, Sabo PJ, Larschan E, Gorchakov AA, Gu T, Linder-Basso D, Plachetka A, Shanower G, Tolstorukov MY, Luquette LJ, Xi R, Jung YL, Park RW, Bishop EP, Canfield TK, Sandstrom R, Thurman RE, MacAlpine DM, Stamatoyannopoulos JA, Kellis M, Elgin SCR, Kuroda MI, Pirrotta V, Karpen GH, Park PJ. Comprehensive analysis of the chromatin landscape in Drosophila melanogaster. Nature 2010; 471:480-5. [PMID: 21179089 PMCID: PMC3109908 DOI: 10.1038/nature09725] [Citation(s) in RCA: 647] [Impact Index Per Article: 46.2] [Received: 09/02/2010] [Accepted: 12/06/2010] [Indexed: 12/17/2022]
Abstract
Chromatin is composed of DNA and a variety of modified histones and non-histone proteins, which have an impact on cell differentiation, gene regulation and other key cellular processes. Here we present a genome-wide chromatin landscape for Drosophila melanogaster based on eighteen histone modifications, summarized by nine prevalent combinatorial patterns. Integrative analysis with other data (non-histone chromatin proteins, DNase I hypersensitivity, GRO-Seq reads produced by engaged polymerase, short/long RNA products) reveals discrete characteristics of chromosomes, genes, regulatory elements and other functional domains. We find that active genes display distinct chromatin signatures that are correlated with disparate gene lengths, exon patterns, regulatory functions and genomic contexts. We also demonstrate a diversity of signatures among Polycomb targets that include a subset with paused polymerase. This systematic profiling and integrative analysis of chromatin signatures provides insights into how genomic elements are regulated, and will serve as a resource for future experimental investigations of genome structure and function.
Affiliation(s)
- Peter V Kharchenko
- Center for Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
|
21
|
Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO, Sandstrom R, Bernstein B, Bender MA, Groudine M, Gnirke A, Stamatoyannopoulos J, Mirny LA, Lander ES, Dekker J. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 2009; 326:289-93. [PMID: 19815776 DOI: 10.1126/science.1181369] [Citation(s) in RCA: 5314] [Impact Index Per Article: 354.3] [Indexed: 12/12/2022]
Abstract
We describe Hi-C, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. We constructed spatial proximity maps of the human genome with Hi-C at a resolution of 1 megabase. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene-rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free, polymer conformation that enables maximally dense packing while preserving the ability to easily fold and unfold any genomic locus. The fractal globule is distinct from the more commonly used globular equilibrium model. Our results demonstrate the power of Hi-C to map the dynamic conformations of whole genomes.
Affiliation(s)
- Erez Lieberman-Aiden
- Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), MA 02139, USA
|
22
|
Sekimata M, Pérez-Melgosa M, Miller SA, Weinmann AS, Sabo PJ, Sandstrom R, Dorschner MO, Stamatoyannopoulos JA, Wilson CB. CCCTC-binding factor and the transcription factor T-bet orchestrate T helper 1 cell-specific structure and function at the interferon-gamma locus. Immunity 2009; 31:551-64. [PMID: 19818655 PMCID: PMC2810421 DOI: 10.1016/j.immuni.2009.08.021] [Citation(s) in RCA: 117] [Impact Index Per Article: 7.8] [Received: 03/27/2009] [Revised: 07/20/2009] [Accepted: 08/17/2009] [Indexed: 12/17/2022]
Abstract
How cell type-specific differences in chromatin conformation are achieved and their contribution to gene expression are incompletely understood. Here we identify a cryptic upstream orchestrator of interferon-gamma (IFNG) transcription, which is embedded within the human IL26 gene, composed of a single CCCTC-binding factor (CTCF) binding site and retained in all mammals, even surviving near-complete evolutionary deletion of the equivalent gene encoding IL-26 in rodents. CTCF and cohesins occupy this element in vivo in a cell type-nonspecific manner. This element is juxtaposed to two other sites located within the first intron and downstream of Ifng, where CTCF, cohesins, and the transcription factor T-bet bind in a T helper 1 (Th1) cell-specific manner. These interactions, close proximity of other elements within the locus to each other and to the gene encoding interferon-gamma, and robust murine Ifng expression are dependent on CTCF and T-bet. The results demonstrate that cooperation between architectural (CTCF) and transcriptional enhancing (T-bet) factors and the elements to which they bind is required for proper Th1 cell-specific expression of Ifng.
Affiliation(s)
- Masayuki Sekimata
- Department of Immunology, University of Washington School of Medicine, Seattle WA, 98195 USA
| | - Mercedes Pérez-Melgosa
- Department of Immunology, University of Washington School of Medicine, Seattle WA, 98195 USA
| | - Sara A. Miller
- Molecular and Cellular Biology Graduate Program, University of Washington School of Medicine, Seattle WA, 98195 USA
| | - Amy S. Weinmann
- Department of Immunology, University of Washington School of Medicine, Seattle WA, 98195 USA
| | - Peter J. Sabo
- Department of Genome Sciences, University of Washington School of Medicine, Seattle WA, 98195 USA
| | - Richard Sandstrom
- Department of Genome Sciences, University of Washington School of Medicine, Seattle WA, 98195 USA
| | - Michael O. Dorschner
- Department of Genome Sciences, University of Washington School of Medicine, Seattle WA, 98195 USA
| | - John A. Stamatoyannopoulos
- Department of Genome Sciences, University of Washington School of Medicine, Seattle WA, 98195 USA
- Department of Medicine, University of Washington School of Medicine, Seattle WA, 98195 USA
| | - Christopher B. Wilson
- Department of Immunology, University of Washington School of Medicine, Seattle WA, 98195 USA
- Department of Pediatrics, University of Washington School of Medicine, Seattle WA, 98195 USA
|
23
|
Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO, Sandstrom R, Bernstein B, Bender MA, Groudine M, Gnirke A, Stamatoyannopoulos J, Mirny LA, Lander ES, Dekker J. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 2009. [PMID: 19815776 DOI: 10.1126/science.1181369/suppl_file/lieberman-aiden.som.pdf] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 04/01/2023]
|
24
|
Attanasio C, Reymond A, Humbert R, Lyle R, Kuehn MS, Neph S, Sabo PJ, Goldy J, Weaver M, Haydock A, Lee K, Dorschner M, Dermitzakis ET, Antonarakis SE, Stamatoyannopoulos JA. Assaying the regulatory potential of mammalian conserved non-coding sequences in human cells. Genome Biol 2008; 9:R168. [PMID: 19055709 PMCID: PMC2646272 DOI: 10.1186/gb-2008-9-12-r168] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Received: 06/09/2008] [Revised: 09/24/2008] [Accepted: 12/02/2008] [Indexed: 01/26/2023]
Abstract
The fraction of experimentally active conserved non-coding sequences within any given cell type is low, so classical assays are unlikely to expose their potential. Background: Conserved non-coding sequences in the human genome are approximately tenfold more abundant than known genes, and have been hypothesized to mark the locations of cis-regulatory elements. However, the global contribution of conserved non-coding sequences to the transcriptional regulation of human genes is currently unknown. Deeply conserved elements shared between humans and teleost fish predominantly flank genes active during morphogenesis and are enriched for positive transcriptional regulatory elements. However, such deeply conserved elements account for <1% of the conserved non-coding sequences in the human genome, which are predominantly mammalian. Results: We explored the regulatory potential of a large sample of these 'common' conserved non-coding sequences using a variety of classic assays, including chromatin remodeling, and enhancer/repressor and promoter activity. When tested across diverse human model cell types, we find that the fraction of experimentally active conserved non-coding sequences within any given cell type is low (approximately 5%), and that this proportion increases only modestly when considered collectively across cell types. Conclusions: The results suggest that classic assays of cis-regulatory potential are unlikely to expose the functional potential of the substantial majority of mammalian conserved non-coding sequences in the human genome.
Affiliation(s)
- Catia Attanasio
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1 rue Michel Servet, 1211, Geneva 4, Switzerland.
|
25
|
John S, Sabo PJ, Johnson TA, Sung MH, Biddie SC, Lightman SL, Voss TC, Davis SR, Meltzer PS, Stamatoyannopoulos JA, Hager GL. Interaction of the glucocorticoid receptor with the chromatin landscape. Mol Cell 2008; 29:611-24. [PMID: 18342607 DOI: 10.1016/j.molcel.2008.02.010] [Citation(s) in RCA: 264] [Impact Index Per Article: 16.5] [Received: 06/29/2007] [Revised: 10/22/2007] [Accepted: 02/27/2008] [Indexed: 11/18/2022]
Abstract
The generality and spectrum of chromatin-remodeling requirements for nuclear receptor function are unknown. We have characterized glucocorticoid receptor (GR) binding events and chromatin structural transitions across GR-induced or -repressed genes. This analysis reveals that GR binding invariably occurs at nuclease-accessible sites (DHS). A remarkable diversity of mechanisms, however, render these sites available for GR binding. Accessibility of the GR binding sites is either constitutive or hormone inducible. Within each category, some DHS sites require the Brg1-containing Swi/Snf complex, but others are Brg1 independent, implicating a different remodeling complex. The H2A.Z histone variant is highly enriched at both inducible and constitutive DHS sites and is subject to exchange during hormone activation. The DHS profile is highly cell specific, implicating cell-selective organization of the chromatin landscape as a critical determinant of tissue-selective receptor function. Furthermore, the widespread requirement for chromatin remodeling supports the recent hypothesis that the rapid exchange of receptor proteins occurs during nucleosome reorganization.
Affiliation(s)
- Sam John
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD 20892-5055, USA
|
26
|
Liu M, Sabo PJ, Kuehn MS, Stamatoyannopoulos JA, Emery DW. Gammaretroviral vector integration preference for DNAse hypersensitive sites. Blood Cells Mol Dis 2008. [DOI: 10.1016/j.bcmd.2007.10.037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022]
|
27
|
Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, Kuehn MS, Taylor CM, Neph S, Koch CM, Asthana S, Malhotra A, Adzhubei I, Greenbaum JA, Andrews RM, Flicek P, Boyle PJ, Cao H, Carter NP, Clelland GK, Davis S, Day N, Dhami P, Dillon SC, Dorschner MO, Fiegler H, Giresi PG, Goldy J, Hawrylycz M, Haydock A, Humbert R, James KD, Johnson BE, Johnson EM, Frum TT, Rosenzweig ER, Karnani N, Lee K, Lefebvre GC, Navas PA, Neri F, Parker SCJ, Sabo PJ, Sandstrom R, Shafer A, Vetrie D, Weaver M, Wilcox S, Yu M, Collins FS, Dekker J, Lieb JD, Tullius TD, Crawford GE, Sunyaev S, Noble WS, Dunham I, Denoeud F, Reymond A, Kapranov P, Rozowsky J, Zheng D, Castelo R, Frankish A, Harrow J, Ghosh S, Sandelin A, Hofacker IL, Baertsch R, Keefe D, Dike S, Cheng J, Hirsch HA, Sekinger EA, Lagarde J, Abril JF, Shahab A, Flamm C, Fried C, Hackermüller J, Hertel J, Lindemeyer M, Missal K, Tanzer A, Washietl S, Korbel J, Emanuelsson O, Pedersen JS, Holroyd N, Taylor R, Swarbreck D, Matthews N, Dickson MC, Thomas DJ, Weirauch MT, Gilbert J, Drenkow J, Bell I, Zhao X, Srinivasan KG, Sung WK, Ooi HS, Chiu KP, Foissac S, Alioto T, Brent M, Pachter L, Tress ML, Valencia A, Choo SW, Choo CY, Ucla C, Manzano C, Wyss C, Cheung E, Clark TG, Brown JB, Ganesh M, Patel S, Tammana H, Chrast J, Henrichsen CN, Kai C, Kawai J, Nagalakshmi U, Wu J, Lian Z, Lian J, Newburger P, Zhang X, Bickel P, Mattick JS, Carninci P, Hayashizaki Y, Weissman S, Hubbard T, Myers RM, Rogers J, Stadler PF, Lowe TM, Wei CL, Ruan Y, Struhl K, Gerstein M, Antonarakis SE, Fu Y, Green ED, Karaöz U, Siepel A, Taylor J, Liefer LA, Wetterstrand KA, Good PJ, Feingold EA, Guyer MS, Cooper GM, Asimenos G, Dewey CN, Hou M, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Huang H, Zhang NR, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Seringhaus M, Church D, Rosenbloom K, Kent WJ, Stone EA, Batzoglou S, Goldman N, Hardison RC, Haussler D, Miller W, Sidow A, Trinklein ND, Zhang ZD, Barrera L, Stuart R, King DC, Ameur A, Enroth S, Bieda MC, Kim J, Bhinge AA, Jiang N, Liu J, Yao F, Vega VB, Lee CWH, Ng P, Shahab A, Yang A, Moqtaderi Z, Zhu Z, Xu X, Squazzo S, Oberley MJ, Inman D, Singer MA, Richmond TA, Munn KJ, Rada-Iglesias A, Wallerman O, Komorowski J, Fowler JC, Couttet P, Bruce AW, Dovey OM, Ellis PD, Langford CF, Nix DA, Euskirchen G, Hartman S, Urban AE, Kraus P, Van Calcar S, Heintzman N, Kim TH, Wang K, Qu C, Hon G, Luna R, Glass CK, Rosenfeld MG, Aldred SF, Cooper SJ, Halees A, Lin JM, Shulha HP, Zhang X, Xu M, Haidar JNS, Yu Y, Ruan Y, Iyer VR, Green RD, Wadelius C, Farnham PJ, Ren B, Harte RA, Hinrichs AS, Trumbower H, Clawson H, Hillman-Jackson J, Zweig AS, Smith K, Thakkapallayil A, Barber G, Kuhn RM, Karolchik D, Armengol L, Bird CP, de Bakker PIW, Kern AD, Lopez-Bigas N, Martin JD, Stranger BE, Woodroffe A, Davydov E, Dimas A, Eyras E, Hallgrímsdóttir IB, Huppert J, Zody MC, Abecasis GR, Estivill X, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VVB, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Koriabine M, Nefedov M, Osoegawa K, Yoshinaga Y, Zhu B, de Jong PJ. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 2007; 447:799-816. [PMID: 17571346 PMCID: PMC2212820 DOI: 10.1038/nature05874] [Citation(s) in RCA: 3782] [Impact Index Per Article: 222.5] [Indexed: 01/17/2023]
Abstract
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
|
28
|
Sabo PJ, Kuehn MS, Thurman R, Johnson BE, Johnson EM, Cao H, Yu M, Rosenzweig E, Goldy J, Haydock A, Weaver M, Shafer A, Lee K, Neri F, Humbert R, Singer MA, Richmond TA, Dorschner MO, McArthur M, Hawrylycz M, Green RD, Navas PA, Noble WS, Stamatoyannopoulos JA. Genome-scale mapping of DNase I sensitivity in vivo using tiling DNA microarrays. Nat Methods 2006; 3:511-8. [PMID: 16791208 DOI: 10.1038/nmeth890] [Citation(s) in RCA: 273] [Impact Index Per Article: 15.2] [Received: 04/12/2006] [Accepted: 05/22/2006] [Indexed: 11/09/2022]
Abstract
Localized accessibility of critical DNA sequences to the regulatory machinery is a key requirement for regulation of human genes. Here we describe a high-resolution, genome-scale approach for quantifying chromatin accessibility by measuring DNase I sensitivity as a continuous function of genome position using tiling DNA microarrays (DNase-array). We demonstrate this approach across 1% (approximately 30 Mb) of the human genome, wherein we localized 2,690 classical DNase I hypersensitive sites with high sensitivity and specificity, and also mapped larger-scale patterns of chromatin architecture. DNase I hypersensitive sites exhibit marked aggregation around transcriptional start sites (TSSs), though the majority mark nonpromoter functional elements. We also developed a computational approach for visualizing higher-order features of chromatin structure. This revealed that human chromatin organization is dominated by large (100-500 kb) 'superclusters' of DNase I hypersensitive sites, which encompass both gene-rich and gene-poor regions. DNase-array is a powerful and straightforward approach for systematic exposition of the cis-regulatory architecture of complex genomes.
Affiliation(s)
- Peter J Sabo
- Department of Genome Sciences, University of Washington, 1705 NE Pacific St., Box 357730, Seattle, Washington 98195, USA
|
29
|
Dorschner MO, Hawrylycz M, Humbert R, Wallace JC, Shafer A, Kawamoto J, Mack J, Hall R, Goldy J, Sabo PJ, Kohli A, Li Q, McArthur M, Stamatoyannopoulos JA. High-throughput localization of functional elements by quantitative chromatin profiling. Nat Methods 2004; 1:219-25. [PMID: 15782197 DOI: 10.1038/nmeth721] [Citation(s) in RCA: 109] [Impact Index Per Article: 5.5] [Received: 08/10/2004] [Accepted: 10/19/2004] [Indexed: 11/08/2022]
Abstract
Identification of functional, noncoding elements that regulate transcription in the context of complex genomes is a major goal of modern biology. Localization of functionality to specific sequences is a requirement for genetic and computational studies. Here, we describe a generic approach, quantitative chromatin profiling, that uses quantitative analysis of in vivo chromatin structure over entire gene loci to rapidly and precisely localize cis-regulatory sequences and other functional modalities encoded by DNase I hypersensitive sites. To demonstrate the accuracy of this approach, we analyzed approximately 300 kilobases of human genome sequence from diverse gene loci and cleanly delineated functional elements corresponding to a spectrum of classical cis-regulatory activities including enhancers, promoters, locus control regions and insulators as well as novel elements. Systematic, high-throughput identification of functional elements coinciding with DNase I hypersensitive sites will substantially expand our knowledge of transcriptional regulation and should simplify the search for noncoding genetic variation with phenotypic consequences.
Affiliation(s)
- Michael O Dorschner
- Department of Molecular Biology, Regulome, 2211 Elliott Avenue, Suite 600, Seattle, Washington 98121, USA
|
30
|
Sabo PJ, Hawrylycz M, Wallace JC, Humbert R, Yu M, Shafer A, Kawamoto J, Hall R, Mack J, Dorschner MO, McArthur M, Stamatoyannopoulos JA. Discovery of functional noncoding elements by digital analysis of chromatin structure. Proc Natl Acad Sci U S A 2004; 101:16837-42. [PMID: 15550541 PMCID: PMC534745 DOI: 10.1073/pnas.0407387101] [Citation(s) in RCA: 116] [Impact Index Per Article: 5.8] [Indexed: 11/18/2022] Open
Abstract
We developed a quantitative methodology, digital analysis of chromatin structure (DACS), for high-throughput, automated mapping of DNase I-hypersensitive sites and associated cis-regulatory sequences in the human and other complex genomes. We used 19/20-bp genomic DNA tags to localize individual DNase I cutting events in nuclear chromatin and produced approximately 257,000 tags from erythroid cells. Tags were mapped to the human genome, and a quantitative algorithm was applied to discriminate statistically significant clusters of independent DNase I cutting events. We show that such clusters identify both known regulatory sequences and previously unrecognized functional elements across the genome. We used in silico simulation to demonstrate that DACS is capable of efficient and accurate localization of the majority of DNase I-hypersensitive sites in the human genome without requiring an independent validation step. A unique feature of DACS is that it permits unbiased evaluation of the chromatin state of regulatory sequences from widely separated genomic loci. We found surprisingly large differences in the accessibility of distant regulatory sequences, suggesting the existence of a hierarchy of nuclear organization that escapes detection by conventional chromatin assays.
Affiliation(s)
- Peter J Sabo
- Department of Molecular Biology, Regulome, 2211 Elliott Avenue, Suite 600, Seattle, WA 98121, USA
|
31
|
Sabo PJ, Humbert R, Hawrylycz M, Wallace JC, Dorschner MO, McArthur M, Stamatoyannopoulos JA. Genome-wide identification of DNaseI hypersensitive sites using active chromatin sequence libraries. Proc Natl Acad Sci U S A 2004; 101:4537-42. [PMID: 15070753 PMCID: PMC384782 DOI: 10.1073/pnas.0400678101] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.6] [Indexed: 01/26/2023] Open
Abstract
Comprehensive identification of sequences that regulate transcription is one of the major goals of genome biology. Focal alteration in chromatin structure in vivo, detectable through hypersensitivity to DNaseI and other nucleases, is the sine qua non of a diverse cast of transcriptional regulatory elements including enhancers, promoters, insulators, and locus control regions. We developed an approach for genome-scale identification of DNaseI hypersensitive sites (HSs) via isolation and cloning of in vivo DNaseI cleavage sites to create libraries of active chromatin sequences (ACSs). Here, we describe analysis of >61,000 ACSs derived from erythroid cells. We observed peaks in the density of ACSs at the transcriptional start sites of known genes, at non-gene-associated CpG islands, and, to a lesser degree, at evolutionarily conserved noncoding sequences. Peaks in ACS density paralleled the distribution of DNaseI HSs. ACSs and DNaseI HSs were distributed between both expressed and nonexpressed genes, suggesting that a large proportion of genes reside within open chromatin domains. The results permit a quantitative approximation of the distribution of HSs and classical cis-regulatory sequences in the human genome.
Affiliation(s)
- Peter J Sabo
- Department of Molecular Biology, Regulome, Canal View Building, 551 North 34th Street, Seattle, WA 98103, USA
|
32
|
Brunkow ME, Gardner JC, Van Ness J, Paeper BW, Kovacevich BR, Proll S, Skonier JE, Zhao L, Sabo PJ, Fu Y, Alisch RS, Gillett L, Colbert T, Tacconi P, Galas D, Hamersma H, Beighton P, Mulligan J. Bone dysplasia sclerosteosis results from loss of the SOST gene product, a novel cystine knot-containing protein. Am J Hum Genet 2001; 68:577-89. [PMID: 11179006 PMCID: PMC1274471 DOI: 10.1086/318811] [Citation(s) in RCA: 688] [Impact Index Per Article: 29.9] [Received: 12/29/2000] [Accepted: 01/19/2001] [Indexed: 12/11/2022] Open
Abstract
Sclerosteosis is an autosomal recessive sclerosing bone dysplasia characterized by progressive skeletal overgrowth. The majority of affected individuals have been reported in the Afrikaner population of South Africa, where a high incidence of the disorder occurs as a result of a founder effect. Homozygosity mapping in Afrikaner families along with analysis of historical recombinants localized sclerosteosis to an interval of approximately 2 cM between the loci D17S1787 and D17S930 on chromosome 17q12-q21. Here we report two independent mutations in a novel gene, termed "SOST." Affected Afrikaners carry a nonsense mutation near the amino terminus of the encoded protein, whereas an unrelated affected person of Senegalese origin carries a splicing mutation within the single intron of the gene. The SOST gene encodes a protein that shares similarity with a class of cystine knot-containing factors including dan, cerberus, gremlin, prdc, and caronte. The specific and progressive effect on bone formation observed in individuals affected with sclerosteosis, along with the data presented in this study, together suggest that the SOST gene encodes an important new regulator of bone homeostasis.
|
33
|
Abstract
The live attenuated bacillus Calmette-Guérin (BCG) vaccine for the prevention of disease associated with Mycobacterium tuberculosis was derived from the closely related virulent tubercle bacillus, Mycobacterium bovis. Although the BCG vaccine has been one of the most widely used vaccines in the world for over 40 years, the genetic basis of BCG's attenuation has never been elucidated. We employed subtractive genomic hybridization to identify genetic differences between virulent M. bovis and M. tuberculosis and avirulent BCG. Three distinct genomic regions of difference (designated RD1 to RD3) were found to be deleted from BCG, and the precise junctions and DNA sequence of each deletion were determined. RD3, a 9.3-kb genomic segment present in virulent laboratory strains of M. bovis and M. tuberculosis, was absent from BCG and 84% of virulent clinical isolates. RD2, a 10.7-kb DNA segment containing a novel repetitive element and the previously identified mpt-64 gene, was conserved in all virulent laboratory and clinical tubercle bacilli tested and was deleted only from substrains derived from the original BCG Pasteur strain after 1925. Thus, the RD2 deletion occurred after the original derivation of BCG. RD1, a 9.5-kb DNA segment found to be deleted from all BCG substrains, was conserved in all virulent laboratory and clinical isolates of M. bovis and M. tuberculosis tested. The reintroduction of RD1 into BCG repressed the expression of at least 10 proteins and resulted in a protein expression profile almost identical to that of virulent M. bovis and M. tuberculosis, as determined by two-dimensional gel electrophoresis. These data indicate a role for RD1 in the regulation of multiple genetic loci, suggesting that the loss of virulence by BCG is due to a regulatory mutation. These findings may be applicable to the rational design of a new attenuated tuberculosis vaccine and the development of new diagnostic tests to distinguish BCG vaccination from tuberculosis infection.
Affiliation(s)
- G G Mahairas
- Laboratory of Tuberculosis and Molecular Microbiology, PathoGenesis Corp., Seattle, Washington 98119, USA
|
34
|
Sherman DR, Sabo PJ, Hickey MJ, Arain TM, Mahairas GG, Yuan Y, Barry CE, Stover CK. Disparate responses to oxidative stress in saprophytic and pathogenic mycobacteria. Proc Natl Acad Sci U S A 1995; 92:6625-9. [PMID: 7604044 PMCID: PMC41571 DOI: 10.1073/pnas.92.14.6625] [Citation(s) in RCA: 156] [Impact Index Per Article: 5.4] [Indexed: 01/26/2023] Open
Abstract
To persist in macrophages and in granulomatous caseous lesions, pathogenic mycobacteria must be equipped to withstand the action of toxic oxygen metabolites. In Gram-negative bacteria, the OxyR protein is a critical component of the oxidative stress response. OxyR is both a sensor of reactive oxygen species and a transcriptional activator, inducing expression of detoxifying enzymes such as catalase/hydroperoxidase and alkyl hydroperoxidase. We have characterized the responses of various mycobacteria to hydrogen peroxide both phenotypically and at the levels of gene and protein expression. Only the saprophytic Mycobacterium smegmatis induced a protective oxidative stress response analogous to the OxyR response of Gram-negative bacteria. Under similar conditions, the pathogenic mycobacteria exhibited a limited, nonprotective response, which in the case of Mycobacterium tuberculosis was restricted to induction of a single protein, KatG. We have also isolated DNA sequences homologous to oxyR and ahpC from M. tuberculosis and Mycobacterium avium. While the M. avium oxyR appears intact, the oxyR homologue of M. tuberculosis contains numerous deletions and frameshifts and is probably nonfunctional. Apparently the response of pathogenic mycobacteria to oxidative stress differs significantly from the inducible OxyR response of other bacteria.
Affiliation(s)
- D R Sherman
- Laboratory of Tuberculosis and Molecular Microbiology, PathoGenesis Corporation, Seattle, WA 98119, USA
|