1
|
Deleuze V, Soler E, Andrieu-Soler C. Protocol for efficient CRISPR-Cas9-mediated fluorescent tag knockin in hard-to-transfect erythroid cell lines. STAR Protoc 2024; 5:103016. [PMID: 38640065 PMCID: PMC11044133 DOI: 10.1016/j.xpro.2024.103016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 03/01/2024] [Accepted: 03/27/2024] [Indexed: 04/21/2024] Open
Abstract
Precise insertion of fluorescent tags by CRISPR-Cas9-mediated homologous recombination (HR) in mammalian genes is a powerful tool allowing to study gene function and protein gene products. Here, we present a protocol for efficient HR-mediated targeted insertion of fluorescent markers in the genome of hard-to-transfect erythroid cell lines MEL (mouse erythroleukemic) and MEDEP (mouse ES cell-derived erythroid progenitor line). We describe steps for plasmid construction, electroporation, amplification, and verification of genome editing. We then detail procedures for isolating positive clones and validating knockin clones. For complete details on the use and execution of this protocol, please refer to Deleuze et al.1.
Collapse
Affiliation(s)
- Virginie Deleuze
- IGMM University Montpellier, CNRS, Montpellier, France; Laboratory of Excellence GR-Ex, Université' de Paris, Paris, France
| | - Eric Soler
- IGMM University Montpellier, CNRS, Montpellier, France; Laboratory of Excellence GR-Ex, Université' de Paris, Paris, France.
| | - Charlotte Andrieu-Soler
- IGMM University Montpellier, CNRS, Montpellier, France; Laboratory of Excellence GR-Ex, Université' de Paris, Paris, France.
| |
Collapse
|
2
|
Deleuze V, Garcia L, Rouaisnel B, Salma M, Kinoo A, Andrieu-Soler C, Soler E. Efficient genome editing in erythroid cells unveils novel MYB target genes and regulatory functions. iScience 2023; 26:107641. [PMID: 37670779 PMCID: PMC10475484 DOI: 10.1016/j.isci.2023.107641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 07/09/2023] [Accepted: 08/11/2023] [Indexed: 09/07/2023] Open
Abstract
Targeted genome editing holds great promise in biology. However, efficient genome modification, including gene knock-in (KI), remains an unattained goal in multiple cell types and loci due to poor transfection efficiencies and low target genes expression, impeding the positive selection of recombined cells. Here, we describe a genome editing approach to achieve efficient gene targeting using hard to transfect erythroid cell lines. We demonstrate robust fluorescent protein KI efficiency in low expressed transcription factor (TF) genes (e.g., Myb or Zeb1). We further show the ability to target two independent loci in individual cells, exemplified by MYB-GFP and NuMA-Cherry double KI, allowing multicolor labeling of regulatory factors at physiological endogenous levels. Our KI tagging approach allowed us to perform genome-wide TF analysis at increased signal-to-noise ratios, and highlighted previously unidentified MYB target genes and pathways. Overall, we establish a versatile CRISPR-Cas9-based platform, offering attractive opportunities for the dissection of the erythroid differentiation process.
Collapse
Affiliation(s)
| | - Leonor Garcia
- IGMM, University Montpellier, CNRS, Montpellier, France
| | | | - Mohammad Salma
- IGMM, University Montpellier, CNRS, Montpellier, France
- Laboratory of Excellence GR-Ex, Université de Paris, Paris, France
| | - Alexia Kinoo
- IGMM, University Montpellier, CNRS, Montpellier, France
| | - Charlotte Andrieu-Soler
- IGMM, University Montpellier, CNRS, Montpellier, France
- Laboratory of Excellence GR-Ex, Université de Paris, Paris, France
| | - Eric Soler
- IGMM, University Montpellier, CNRS, Montpellier, France
- Laboratory of Excellence GR-Ex, Université de Paris, Paris, France
| |
Collapse
|
3
|
Yaschenko AE, Fenech M, Mazzoni-Putman S, Alonso JM, Stepanova AN. Deciphering the molecular basis of tissue-specific gene expression in plants: Can synthetic biology help? CURRENT OPINION IN PLANT BIOLOGY 2022; 68:102241. [PMID: 35700675 PMCID: PMC10605770 DOI: 10.1016/j.pbi.2022.102241] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 05/05/2022] [Accepted: 05/12/2022] [Indexed: 06/15/2023]
Abstract
Gene expression differences between distinct cell types are orchestrated by specific sets of transcription factors and epigenetic regulators acting upon the genome. In plants, the mechanisms underlying tissue-specific gene activity remain largely unexplored. Although transcriptional and epigenetic profiling of individual organs, tissues, and more recently, of single cells can easily detect the molecular signatures of different biological samples, how these unique cell identities are established at the mechanistic level is only beginning to be decoded. Computational methods, including machine learning, used in combination with experimental approaches, enable the identification and validation of candidate cis-regulatory elements driving cell-specific expression. Synthetic biology shows great promise not only as a means of testing candidate DNA motifs but also for establishing the general rules of nature driving promoter architecture and for the rational design of genetic circuits in research and agriculture to confer tissue-specific expression to genes or molecular pathways of interest.
Collapse
Affiliation(s)
- Anna E Yaschenko
- Department of Plant and Microbial Biology, Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA
| | - Mario Fenech
- Department of Plant and Microbial Biology, Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA
| | - Serina Mazzoni-Putman
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27695, USA
| | - Jose M Alonso
- Department of Plant and Microbial Biology, Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA
| | - Anna N Stepanova
- Department of Plant and Microbial Biology, Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA.
| |
Collapse
|
4
|
Farrell CM, Goldfarb T, Rangwala SH, Astashyn A, Ermolaeva OD, Hem V, Katz KS, Kodali VK, Ludwig F, Wallin CL, Pruitt KD, Murphy TD. RefSeq Functional Elements as experimentally assayed nongenic reference standards and functional interactions in human and mouse. Genome Res 2022; 32:175-188. [PMID: 34876495 PMCID: PMC8744684 DOI: 10.1101/gr.275819.121] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 12/02/2021] [Indexed: 11/25/2022]
Abstract
Eukaryotic genomes contain many nongenic elements that function in gene regulation, chromosome organization, recombination, repair, or replication, and mutation of those elements can affect genome function and cause disease. Although numerous epigenomic studies provide high coverage of gene regulatory regions, those data are not usually exposed in traditional genome annotation and can be difficult to access and interpret without field-specific expertise. The National Center for Biotechnology Information (NCBI) therefore provides RefSeq Functional Elements (RefSeqFEs), which represent experimentally validated human and mouse nongenic elements derived from the literature. The curated data set is comprised of richly annotated sequence records, descriptive records in the NCBI Gene database, reference genome feature annotation, and activity-based interactions between nongenic regions, target genes, and each other. The data set provides succinct functional details and transparent experimental evidence, leverages data from multiple experimental sources, is readily accessible and adaptable, and uses a flexible data model. The data have multiple uses for basic functional discovery, bioinformatics studies, genetic variant interpretation; as known positive controls for epigenomic data evaluation; and as reference standards for functional interactions. Comparisons to other gene regulatory data sets show that the RefSeqFE data set includes a wider range of feature types representing more areas of biology, but it is comparatively smaller and subject to data selection biases. RefSeqFEs thus provide an alternative and complementary resource for experimentally assayed functional elements, with future data set growth expected.
Collapse
Affiliation(s)
- Catherine M Farrell
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Tamara Goldfarb
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Sanjida H Rangwala
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Alexander Astashyn
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Olga D Ermolaeva
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Vichet Hem
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Kenneth S Katz
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Vamsi K Kodali
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Frank Ludwig
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Craig L Wallin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| |
Collapse
|
5
|
Xiang G, Keller CA, Heuston E, Giardine BM, An L, Wixom AQ, Miller A, Cockburn A, Sauria MEG, Weaver K, Lichtenberg J, Göttgens B, Li Q, Bodine D, Mahony S, Taylor J, Blobel GA, Weiss MJ, Cheng Y, Yue F, Hughes J, Higgs DR, Zhang Y, Hardison RC. An integrative view of the regulatory and transcriptional landscapes in mouse hematopoiesis. Genome Res 2020; 30:472-484. [PMID: 32132109 PMCID: PMC7111515 DOI: 10.1101/gr.255760.119] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Accepted: 02/21/2020] [Indexed: 01/29/2023]
Abstract
Thousands of epigenomic data sets have been generated in the past decade, but it is difficult for researchers to effectively use all the data relevant to their projects. Systematic integrative analysis can help meet this need, and the VISION project was established for validated systematic integration of epigenomic data in hematopoiesis. Here, we systematically integrated extensive data recording epigenetic features and transcriptomes from many sources, including individual laboratories and consortia, to produce a comprehensive view of the regulatory landscape of differentiating hematopoietic cell types in mouse. By using IDEAS as our integrative and discriminative epigenome annotation system, we identified and assigned epigenetic states simultaneously along chromosomes and across cell types, precisely and comprehensively. Combining nuclease accessibility and epigenetic states produced a set of more than 200,000 candidate cis-regulatory elements (cCREs) that efficiently capture enhancers and promoters. The transitions in epigenetic states of these cCREs across cell types provided insights into mechanisms of regulation, including decreases in numbers of active cCREs during differentiation of most lineages, transitions from poised to active or inactive states, and shifts in nuclease accessibility of CTCF-bound elements. Regression modeling of epigenetic states at cCREs and gene expression produced a versatile resource to improve selection of cCREs potentially regulating target genes. These resources are available from our VISION website to aid research in genomics and hematopoiesis.
Collapse
Affiliation(s)
- Guanjue Xiang
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Cheryl A Keller
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Elisabeth Heuston
- NHGRI Hematopoiesis Section, Genetics and Molecular Biology Branch, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Belinda M Giardine
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Lin An
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Alexander Q Wixom
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Amber Miller
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - April Cockburn
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Michael E G Sauria
- Departments of Biology and Computer Science, Johns Hopkins University, Baltimore, Maryland 20218, USA
| | - Kathryn Weaver
- Departments of Biology and Computer Science, Johns Hopkins University, Baltimore, Maryland 20218, USA
| | - Jens Lichtenberg
- NHGRI Hematopoiesis Section, Genetics and Molecular Biology Branch, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Berthold Göttgens
- Welcome and MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge CB2 1TN, United Kingdom
| | - Qunhua Li
- Department of Statistics, Program in Bioinformatics and Genomics, Center for Computational Biology and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - David Bodine
- NHGRI Hematopoiesis Section, Genetics and Molecular Biology Branch, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Shaun Mahony
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - James Taylor
- Departments of Biology and Computer Science, Johns Hopkins University, Baltimore, Maryland 20218, USA
| | - Gerd A Blobel
- Department of Pediatrics, Children's Hospital of Philadelphia and University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA
| | - Mitchell J Weiss
- Department of Hematology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
| | - Yong Cheng
- Department of Hematology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
| | - Feng Yue
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, Pennsylvania 17033, USA
| | - Jim Hughes
- MRC Weatherall Institute of Molecular Medicine, Oxford University, Oxford OX3 9DS, United Kingdom
| | - Douglas R Higgs
- MRC Weatherall Institute of Molecular Medicine, Oxford University, Oxford OX3 9DS, United Kingdom
| | - Yu Zhang
- Department of Statistics, Program in Bioinformatics and Genomics, Center for Computational Biology and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Ross C Hardison
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
6
|
Hardison RC, Zhang Y, Keller CA, Xiang G, Heuston EF, An L, Lichtenberg J, Giardine BM, Bodine D, Mahony S, Li Q, Yue F, Weiss MJ, Blobel GA, Taylor J, Hughes J, Higgs DR, Göttgens B. Systematic integration of GATA transcription factors and epigenomes via IDEAS paints the regulatory landscape of hematopoietic cells. IUBMB Life 2020; 72:27-38. [PMID: 31769130 PMCID: PMC6972633 DOI: 10.1002/iub.2195] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Accepted: 10/17/2019] [Indexed: 01/15/2023]
Abstract
Members of the GATA family of transcription factors play key roles in the differentiation of specific cell lineages by regulating the expression of target genes. Three GATA factors play distinct roles in hematopoietic differentiation. In order to better understand how these GATA factors function to regulate genes throughout the genome, we are studying the epigenomic and transcriptional landscapes of hematopoietic cells in a model-driven, integrative fashion. We have formed the collaborative multi-lab VISION project to conduct ValIdated Systematic IntegratiON of epigenomic data in mouse and human hematopoiesis. The epigenomic data included nuclease accessibility in chromatin, CTCF occupancy, and histone H3 modifications for 20 cell types covering hematopoietic stem cells, multilineage progenitor cells, and mature cells across the blood cell lineages of mouse. The analysis used the Integrative and Discriminative Epigenome Annotation System (IDEAS), which learns all common combinations of features (epigenetic states) simultaneously in two dimensions-along chromosomes and across cell types. The result is a segmentation that effectively paints the regulatory landscape in readily interpretable views, revealing constitutively active or silent loci as well as the loci specifically induced or repressed in each stage and lineage. Nuclease accessible DNA segments in active chromatin states were designated candidate cis-regulatory elements in each cell type, providing one of the most comprehensive registries of candidate hematopoietic regulatory elements to date. Applications of VISION resources are illustrated for the regulation of genes encoding GATA1, GATA2, GATA3, and Ikaros. VISION resources are freely available from our website http://usevision.org.
Collapse
Affiliation(s)
- Ross C. Hardison
- Departments of Biochemistry and Molecular Biology and of StatisticsThe Pennsylvania State University, University ParkPA
| | - Yu Zhang
- Departments of Biochemistry and Molecular Biology and of StatisticsThe Pennsylvania State University, University ParkPA
| | - Cheryl A. Keller
- Departments of Biochemistry and Molecular Biology and of StatisticsThe Pennsylvania State University, University ParkPA
| | - Guanjue Xiang
- Departments of Biochemistry and Molecular Biology and of StatisticsThe Pennsylvania State University, University ParkPA
| | - Elisabeth F. Heuston
- Genetics and Molecular Biology Branch, Hematopoiesis SectionNational Institutes of Health, NHGRIBethesdaMD
| | - Lin An
- Departments of Biochemistry and Molecular Biology and of StatisticsThe Pennsylvania State University, University ParkPA
| | - Jens Lichtenberg
- Genetics and Molecular Biology Branch, Hematopoiesis SectionNational Institutes of Health, NHGRIBethesdaMD
| | - Belinda M. Giardine
- Departments of Biochemistry and Molecular Biology and of StatisticsThe Pennsylvania State University, University ParkPA
| | - David Bodine
- Genetics and Molecular Biology Branch, Hematopoiesis SectionNational Institutes of Health, NHGRIBethesdaMD
| | - Shaun Mahony
- Departments of Biochemistry and Molecular Biology and of StatisticsThe Pennsylvania State University, University ParkPA
| | - Qunhua Li
- Departments of Biochemistry and Molecular Biology and of StatisticsThe Pennsylvania State University, University ParkPA
| | - Feng Yue
- Department of Biochemistry and Molecular BiologyThe Pennsylvania State University College of MedicineHershey, PA
| | - Mitchell J. Weiss
- Hematology DepartmentSt. Jude Children's Research HospitalMemphis, TN
| | | | - James Taylor
- Departments of Biology and of Computer ScienceJohns Hopkins UniversityBaltimore, MD
| | - Jim Hughes
- Laboratory of Gene RegulationWeatherall Institute of Molecular Medicine, Oxford UniversityOxfordUK
| | - Douglas R. Higgs
- Laboratory of Gene RegulationWeatherall Institute of Molecular Medicine, Oxford UniversityOxfordUK
| | - Berthold Göttgens
- Department of Hematology, Cambridge Institute for Medical ResearchUniversity of CambridgeCambridgeUK
| |
Collapse
|
7
|
Peterson TA, Mort M, Cooper DN, Radivojac P, Kann MG, Mooney SD. Regulatory Single-Nucleotide Variant Predictor Increases Predictive Performance of Functional Regulatory Variants. Hum Mutat 2016; 37:1137-1143. [PMID: 27406314 DOI: 10.1002/humu.23049] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Accepted: 06/28/2016] [Indexed: 12/20/2022]
Abstract
In silico methods for detecting functionally relevant genetic variants are important for identifying genetic markers of human inherited disease. Much research has focused on protein-coding variants since coding regions have well-defined physicochemical and functional properties. However, many bioinformatics tools are not applicable to variants outside coding regions. Here, we increase the classification performance of our regulatory single-nucleotide variant predictor (RSVP) for variants that cause regulatory abnormalities from an AUC of 0.90-0.97 by incorporating genomic regions identified by the ENCODE project into RSVP. RSVP is comparable to a recently published tool, Genome-Wide Annotation of Variants (GWAVA); both RSVP and GWAVA perform better on regulatory variants than a traditional variant predictor, combined annotation-dependent depletion (CADD). However, our method outperforms GWAVA on variants located at similar distances to the transcription start site as the positive set (AUC: 0.96) as compared with GWAVA (AUC: 0.71). Much of this disparity is due to RSVP's incorporation of features pertaining to the nearest gene (expression, GO terms, etc.), which are not included in GWAVA. Our findings hold out the promise of a framework for the assessment of all functional regulatory variants, providing a means to predict which rare or de novo variants are of pathogenic significance.
Collapse
Affiliation(s)
- Thomas A Peterson
- Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore, Maryland
| | - Matthew Mort
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, CF14 4XN, United Kingdom
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, CF14 4XN, United Kingdom
| | - Predrag Radivojac
- Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana
| | - Maricel G Kann
- Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore, Maryland
| | - Sean D Mooney
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington.
| |
Collapse
|
8
|
Insight into GATA1 transcriptional activity through interrogation of cis elements disrupted in human erythroid disorders. Proc Natl Acad Sci U S A 2016; 113:4434-9. [PMID: 27044088 DOI: 10.1073/pnas.1521754113] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Whole-exome sequencing has been incredibly successful in identifying causal genetic variants and has revealed a number of novel genes associated with blood and other diseases. One limitation of this approach is that it overlooks mutations in noncoding regulatory elements. Furthermore, the mechanisms by which mutations in transcriptionalcis-regulatory elements result in disease remain poorly understood. Here we used CRISPR/Cas9 genome editing to interrogate three such elements harboring mutations in human erythroid disorders, which in all cases are predicted to disrupt a canonical binding motif for the hematopoietic transcription factor GATA1. Deletions of as few as two to four nucleotides resulted in a substantial decrease (>80%) in target gene expression. Isolated deletions of the canonical GATA1 binding motif completely abrogated binding of the cofactor TAL1, which binds to a separate motif. Having verified the functionality of these three GATA1 motifs, we demonstrate strong evolutionary conservation of GATA1 motifs in regulatory elements proximal to other genes implicated in erythroid disorders, and show that targeted disruption of such elements results in altered gene expression. By modeling transcription factor binding patterns, we show that multiple transcription factors are associated with erythroid gene expression, and have created predictive maps modeling putative disruptions of their binding sites at key regulatory elements. Our study provides insight into GATA1 transcriptional activity and may prove a useful resource for investigating the pathogenicity of noncoding variants in human erythroid disorders.
Collapse
|
9
|
Holloway AK, Bruneau BG, Sukonnik T, Rubenstein JL, Pollard KS. Accelerated Evolution of Enhancer Hotspots in the Mammal Ancestor. Mol Biol Evol 2016; 33:1008-18. [PMID: 26715627 PMCID: PMC4776709 DOI: 10.1093/molbev/msv344] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Mammals have evolved remarkably different sensory, reproductive, metabolic, and skeletal systems. To explore the genetic basis for these differences, we developed a comparative genomics approach to scan whole-genome multiple sequence alignments to identify regions that evolved rapidly in an ancestral lineage but are conserved within extant species. This pattern suggests that ancestral changes in function were maintained in descendants. After applying this test to therian mammals, we identified 4,797 accelerated regions, many of which are noncoding and located near developmental transcription factors. We then used mouse transgenic reporter assays to test if noncoding accelerated regions are enhancers and to determine how therian-specific substitutions affect their activity in vivo. We discovered enhancers with expression specific to the therian version in brain regions involved in the hormonal control of milk ejection, uterine contractions, blood pressure, temperature, and visual processing. This work underscores the idea that changes in developmental gene expression are important for mammalian evolution, and it pinpoints candidate genes for unique aspects of mammalian biology.
Collapse
Affiliation(s)
- Alisha K Holloway
- Gladstone Institute of Cardiovascular Disease, San Francisco, CA Department of Epidemiology and Biostatistics, University of California, San Francisco, CA
| | - Benoit G Bruneau
- Gladstone Institute of Cardiovascular Disease, San Francisco, CA Department of Pediatrics and Cardiovascular Research Institute, University of California, San Francisco, CA
| | - Tatyana Sukonnik
- Gladstone Institute of Cardiovascular Disease, San Francisco, CA
| | - John L Rubenstein
- Nina Ireland Laboratory of Developmental Neurobiology and Department of Psychiatry, University of California, San Francisco
| | - Katherine S Pollard
- Gladstone Institute of Cardiovascular Disease, San Francisco, CA Department of Epidemiology and Biostatistics, University of California, San Francisco, CA Institute for Human Genetics, University of California, San Francisco, CA
| |
Collapse
|
10
|
Dogan N, Wu W, Morrissey CS, Chen KB, Stonestrom A, Long M, Keller CA, Cheng Y, Jain D, Visel A, Pennacchio LA, Weiss MJ, Blobel GA, Hardison RC. Occupancy by key transcription factors is a more accurate predictor of enhancer activity than histone modifications or chromatin accessibility. Epigenetics Chromatin 2015; 8:16. [PMID: 25984238 PMCID: PMC4432502 DOI: 10.1186/s13072-015-0009-5] [Citation(s) in RCA: 77] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Accepted: 04/02/2015] [Indexed: 12/12/2022] Open
Abstract
Background Regulated gene expression controls organismal development, and variation in regulatory patterns has been implicated in complex traits. Thus accurate prediction of enhancers is important for further understanding of these processes. Genome-wide measurement of epigenetic features, such as histone modifications and occupancy by transcription factors, is improving enhancer predictions, but the contribution of these features to prediction accuracy is not known. Given the importance of the hematopoietic transcription factor TAL1 for erythroid gene activation, we predicted candidate enhancers based on genomic occupancy by TAL1 and measured their activity. Contributions of multiple features to enhancer prediction were evaluated based on the results of these and other studies. Results TAL1-bound DNA segments were active enhancers at a high rate both in transient transfections of cultured cells (39 of 79, or 56%) and transgenic mice (43 of 66, or 65%). The level of binding signal for TAL1 or GATA1 did not help distinguish TAL1-bound DNA segments as active versus inactive enhancers, nor did the density of regulation-related histone modifications. A meta-analysis of results from this and other studies (273 tested predicted enhancers) showed that the presence of TAL1, GATA1, EP300, SMAD1, H3K4 methylation, H3K27ac, and CAGE tags at DNase hypersensitive sites gave the most accurate predictors of enhancer activity, with a success rate over 80% and a median threefold increase in activity. Chromatin accessibility assays and the histone modifications H3K4me1 and H3K27ac were sensitive for finding enhancers, but they have high false positive rates unless transcription factor occupancy is also included. Conclusions Occupancy by key transcription factors such as TAL1, GATA1, SMAD1, and EP300, along with evidence of transcription, improves the accuracy of enhancer predictions based on epigenetic features. Electronic supplementary material The online version of this article (doi:10.1186/s13072-015-0009-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nergiz Dogan
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, 304 Wartik Laboratory, University Park, PA 16802 USA
| | - Weisheng Wu
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, 304 Wartik Laboratory, University Park, PA 16802 USA ; Bioinformatics Core, Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109-2218 USA
| | - Christapher S Morrissey
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, 304 Wartik Laboratory, University Park, PA 16802 USA
| | - Kuan-Bei Chen
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, 304 Wartik Laboratory, University Park, PA 16802 USA
| | - Aaron Stonestrom
- Division of Hematology, The Children's Hospital of Philadelphia, 3401 Civic Center Boulevard, Philadelphia, PA 19104 USA ; Perelman School of Medicine at the University of Pennsylvania, 415 Curie Boulevard, Philadelphia, PA 19104 USA
| | - Maria Long
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, 304 Wartik Laboratory, University Park, PA 16802 USA
| | - Cheryl A Keller
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, 304 Wartik Laboratory, University Park, PA 16802 USA
| | - Yong Cheng
- Department of Genetics, Mail Stop-5120, Stanford University, Stanford, CA 94305 USA
| | - Deepti Jain
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, 304 Wartik Laboratory, University Park, PA 16802 USA
| | - Axel Visel
- Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Mailstop 84-171, Berkeley, CA 94720 USA ; DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598 USA
| | - Len A Pennacchio
- Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Mailstop 84-171, Berkeley, CA 94720 USA ; DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598 USA
| | - Mitchell J Weiss
- Department of Hematology, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN 38105 USA
| | - Gerd A Blobel
- Division of Hematology, The Children's Hospital of Philadelphia, 3401 Civic Center Boulevard, Philadelphia, PA 19104 USA ; Perelman School of Medicine at the University of Pennsylvania, 415 Curie Boulevard, Philadelphia, PA 19104 USA
| | - Ross C Hardison
- Center for Comparative Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, 304 Wartik Laboratory, University Park, PA 16802 USA
| |
Collapse
|
11
|
Laguna T, Notario L, Pippa R, Fontela MG, Vázquez BN, Maicas M, Aguilera-Montilla N, Corbí ÁL, Odero MD, Lauzurica P. New insights on the transcriptional regulation of CD69 gene through a potent enhancer located in the conserved non-coding sequence 2. Mol Immunol 2015; 66:171-9. [PMID: 25801305 DOI: 10.1016/j.molimm.2015.02.031] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Revised: 02/25/2015] [Accepted: 02/27/2015] [Indexed: 01/03/2023]
Abstract
The CD69 type II C-type lectin is one of the earliest indicators of leukocyte activation acting in lymphocyte migration and cytokine secretion. CD69 expression in hematopoietic lineage undergoes rapid changes depending on the cell-lineage, the activation state or the localization of the cell where it is expressed, suggesting a complex and tightly controlled regulation. Here we provide new insights on the transcriptional regulation of CD69 gene in mammal species. Through in silico studies, we analyzed several regulatory features of the 4 upstream conserved non-coding sequences (CNS 1-4) previously described, confirming a major function of CNS2 in the transcriptional regulation of CD69. In addition, multiple transcription binding sites are identified in the CNS2 region by DNA cross-species conservation analysis. By functional approaches we defined a core region of 226bp located within CNS2 as the main enhancer element of CD69 transcription in the hematopoietic cells analyzed. By chromatin immunoprecipitation, binding of RUNX1 to the core-CNS2 was shown in a T cell line. In addition, we found an activating but not essential role of RUNX1 in CD69 gene transcription by site-directed mutagenesis and RNA silencing, probably through the interaction with this potent enhancer specifically in the hematopoietic lineage. In summary, in this study we contribute with new evidences to the landscape of the transcriptional regulation of the CD69 gene.
Collapse
Affiliation(s)
- Teresa Laguna
- Activación Immunológica Group, Carlos III National Health Institute (ISCIII), Majadahonda, Spain
| | - Laura Notario
- Activación Immunológica Group, Carlos III National Health Institute (ISCIII), Majadahonda, Spain
| | - Raffaella Pippa
- Laboratory of Genetics, Division of Oncology, CIMA, University of Navarra, Pamplona, Spain
| | - Miguel G Fontela
- Activación Immunológica Group, Carlos III National Health Institute (ISCIII), Majadahonda, Spain
| | - Berta N Vázquez
- Activación Immunológica Group, Carlos III National Health Institute (ISCIII), Majadahonda, Spain
| | - Miren Maicas
- Laboratory of Genetics, Division of Oncology, CIMA, University of Navarra, Pamplona, Spain
| | - Noemí Aguilera-Montilla
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas (CSIC), Madrid, Spain
| | - Ángel L Corbí
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas (CSIC), Madrid, Spain
| | - María D Odero
- Laboratory of Genetics, Division of Oncology, CIMA, University of Navarra, Pamplona, Spain
| | - Pilar Lauzurica
- Activación Immunológica Group, Carlos III National Health Institute (ISCIII), Majadahonda, Spain.
| |
Collapse
|
12
|
Campagna DR, de Bie CI, Schmitz-Abe K, Sweeney M, Sendamarai AK, Schmidt PJ, Heeney MM, Yntema HG, Kannengiesser C, Grandchamp B, Niemeyer CM, Knoers NV, Swart S, Marron G, van Wijk R, Raymakers RA, May A, Markianos K, Bottomley SS, Swinkels DW, Fleming MD. X-linked sideroblastic anemia due to ALAS2 intron 1 enhancer element GATA-binding site mutations. Am J Hematol 2014; 89:315-9. [PMID: 24166784 PMCID: PMC3943703 DOI: 10.1002/ajh.23616] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2013] [Accepted: 10/21/2013] [Indexed: 02/06/2023]
Abstract
X-linked sideroblastic anemia (XLSA) is the most common form of congenital sideroblastic anemia. In affected males, it is uniformly associated with partial loss-of-function missense mutations in the erythroid-specific heme biosynthesis protein 5-aminolevulinate synthase 2 (ALAS2). Here, we report five families with XLSA owing to mutations in a GATA transcription factor binding site located in a transcriptional enhancer element in intron 1 of the ALAS2 gene. As such, this study defines a new class of mutations that should be evaluated in patients undergoing genetic testing for a suspected diagnosis of XLSA.
Collapse
Affiliation(s)
- Dean R. Campagna
- Department of Pathology, Boston Children's Hospital, Boston, MA, USA
| | - Charlotte I. de Bie
- Department of Medical Genetics, University Medical Centre, Utrecht, Utrecht, the Netherlands
| | - Klaus Schmitz-Abe
- Division of Genetics and Genomics, Department of Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Marion Sweeney
- Department of Haematology, Cardiff University School of Medicine, Heath Park, Cardiff, Wales
| | | | - Paul J. Schmidt
- Department of Pathology, Boston Children's Hospital, Boston, MA, USA
| | | | - Helger G. Yntema
- Department of Genetics, Radboud University Medical Centre, Nijmegen, the Netherlands
| | - Caroline Kannengiesser
- Laboratoire de Génétique Moléculaire, Unité fonctionnelle de génétique, Hôpital Xavier Bichat-Claude Bernard, Université Paris Diderot, insermU773, Paris, France
| | - Bernard Grandchamp
- Laboratoire de Génétique Moléculaire, Unité fonctionnelle de génétique, Hôpital Xavier Bichat-Claude Bernard, Université Paris Diderot, insermU773, Paris, France
| | - Charlotte M. Niemeyer
- Children's Hospital, University of Freiburg, Pediatric Hematology and Oncology Department, Freiburg, Germany
| | - Nine V.A.M. Knoers
- Department of Medical Genetics, University Medical Centre, Utrecht, Utrecht, the Netherlands
| | - Sonia Swart
- NGH NHS Trust, Northampton General Hospital, Clifton Ville, Northampton, England
| | - Gordon Marron
- Department of Haematology, Ninewells Hospital, Dundee, DD1 9SY, Scotland
| | - Richard van Wijk
- Department of Clinical Chemistry and Haematology, University Medical Centre Utrecht, Utrecht, the Netherlands
| | - Reinier A. Raymakers
- Department of Haematology, University Medical Centre, Utrecht, Utrecht, the Netherlands
| | - Alison May
- Department of Haematology, Cardiff University School of Medicine, Heath Park, Cardiff, Wales
| | - Kyriacos Markianos
- Division of Genetics and Genomics, Department of Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Sylvia S. Bottomley
- Department of Medicine, Hematology-Oncology Section, University of Oklahoma College of Medicine, Oklahoma City, OK, USA
| | - Dorine W. Swinkels
- Department of Laboratory Medicine, Radboud University Medical Centre, Nijmegen, the Netherlands
| | - Mark D. Fleming
- Department of Pathology, Boston Children's Hospital, Boston, MA, USA
| |
Collapse
|
13
|
Mitchell JA, Clay I, Umlauf D, Chen CY, Moir CA, Eskiw CH, Schoenfelder S, Chakalova L, Nagano T, Fraser P. Nuclear RNA sequencing of the mouse erythroid cell transcriptome. PLoS One 2012; 7:e49274. [PMID: 23209567 PMCID: PMC3510205 DOI: 10.1371/journal.pone.0049274] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Accepted: 10/08/2012] [Indexed: 12/31/2022] Open
Abstract
In addition to protein coding genes a substantial proportion of mammalian genomes are transcribed. However, most transcriptome studies investigate steady-state mRNA levels, ignoring a considerable fraction of the transcribed genome. In addition, steady-state mRNA levels are influenced by both transcriptional and posttranscriptional mechanisms, and thus do not provide a clear picture of transcriptional output. Here, using deep sequencing of nuclear RNAs (nucRNA-Seq) in parallel with chromatin immunoprecipitation sequencing (ChIP-Seq) of active RNA polymerase II, we compared the nuclear transcriptome of mouse anemic spleen erythroid cells with polymerase occupancy on a genome-wide scale. We demonstrate that unspliced transcripts quantified by nucRNA-seq correlate with primary transcript frequencies measured by RNA FISH, but differ from steady-state mRNA levels measured by poly(A)-enriched RNA-seq. Highly expressed protein coding genes showed good correlation between RNAPII occupancy and transcriptional output; however, genome-wide we observed a poor correlation between transcriptional output and RNAPII association. This poor correlation is due to intergenic regions associated with RNAPII which correspond with transcription factor bound regulatory regions and a group of stable, nuclear-retained long non-coding transcripts. In conclusion, sequencing the nuclear transcriptome provides an opportunity to investigate the transcriptional landscape in a given cell type through quantification of unspliced primary transcripts and the identification of nuclear-retained long non-coding RNAs.
Collapse
Affiliation(s)
- Jennifer A Mitchell
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Abstract
Differential gene expression is the fundamental mechanism underlying animal development and cell differentiation. However, it is a challenge to identify comprehensively and accurately the DNA sequences that are required to regulate gene expression: namely, cis-regulatory modules (CRMs). Three major features, either singly or in combination, are used to predict CRMs: clusters of transcription factor binding site motifs, non-coding DNA that is under evolutionary constraint and biochemical marks associated with CRMs, such as histone modifications and protein occupancy. The validation rates for predictions indicate that identifying diagnostic biochemical marks is the most reliable method, and understanding is enhanced by the analysis of motifs and conservation patterns within those predicted CRMs.
Collapse
|
15
|
Abstract
Many evolutionary studies over the past decade have estimated α(sel), the proportion of all nucleotides in the human genome that are subject to purifying selection because of their biological function. Most of these studies have estimated the nucleotide substitution rates from genome sequence alignments across many diverse mammals. Some α(sel) estimates will be affected by the heterogeneity of substitution rates in neutral sequence across the genome. Most will also be inaccurate if change in the functional sequence repertoire occurs rapidly relative to the separation of lineages that are being compared. Evidence gathered from both evolutionary and experimental analyses now indicate that rates of "turnover" of functional, predominantly noncoding, sequence are, indeed, high. They are sufficiently high that an estimated 50% of mouse constrained noncoding sequence is predicted not to be shared with rat, a closely related rodent. The rapidity of turnover results in, at least, a twofold underestimate of α(sel) by analyses that measure constraint across the eutherian phylogeny. Approaches that take account of turnover estimate that the steady-state value of α(sel) lies between 10% and 15%. Experimental studies corroborate the predicted rates of loss and gain of noncoding functional sites. These studies show the limitations inherent in the use of deep sequence conservation for identifying functional sequence. Experimental investigations focusing on lineage-specific, noncoding, and functional sequence are now essential if we are to appreciate the complete functional repertoire of the human genome.
Collapse
Affiliation(s)
- Chris P Ponting
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, United Kingdom.
| | | |
Collapse
|
16
|
Wu W, Cheng Y, Keller CA, Ernst J, Kumar SA, Mishra T, Morrissey C, Dorman CM, Chen KB, Drautz D, Giardine B, Shibata Y, Song L, Pimkin M, Crawford GE, Furey TS, Kellis M, Miller W, Taylor J, Schuster SC, Zhang Y, Chiaromonte F, Blobel GA, Weiss MJ, Hardison RC. Dynamics of the epigenetic landscape during erythroid differentiation after GATA1 restoration. Genome Res 2011; 21:1659-71. [PMID: 21795386 DOI: 10.1101/gr.125088.111] [Citation(s) in RCA: 100] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Interplays among lineage-specific nuclear proteins, chromatin modifying enzymes, and the basal transcription machinery govern cellular differentiation, but their dynamics of action and coordination with transcriptional control are not fully understood. Alterations in chromatin structure appear to establish a permissive state for gene activation at some loci, but they play an integral role in activation at other loci. To determine the predominant roles of chromatin states and factor occupancy in directing gene regulation during differentiation, we mapped chromatin accessibility, histone modifications, and nuclear factor occupancy genome-wide during mouse erythroid differentiation dependent on the master regulatory transcription factor GATA1. Notably, despite extensive changes in gene expression, the chromatin state profiles (proportions of a gene in a chromatin state dominated by activating or repressive histone modifications) and accessibility remain largely unchanged during GATA1-induced erythroid differentiation. In contrast, gene induction and repression are strongly associated with changes in patterns of transcription factor occupancy. Our results indicate that during erythroid differentiation, the broad features of chromatin states are established at the stage of lineage commitment, largely independently of GATA1. These determine permissiveness for expression, with subsequent induction or repression mediated by distinctive combinations of transcription factors.
Collapse
Affiliation(s)
- Weisheng Wu
- Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Nguyen TT, Foteinou PT, Calvano SE, Lowry SF, Androulakis IP. Computational identification of transcriptional regulators in human endotoxemia. PLoS One 2011; 6:e18889. [PMID: 21637747 PMCID: PMC3103499 DOI: 10.1371/journal.pone.0018889] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2010] [Accepted: 03/23/2011] [Indexed: 12/21/2022] Open
Abstract
One of the great challenges in the post-genomic era is to decipher the underlying principles governing the dynamics of biological responses. As modulating gene expression levels is among the key regulatory responses of an organism to changes in its environment, identifying biologically relevant transcriptional regulators and their putative regulatory interactions with target genes is an essential step towards studying the complex dynamics of transcriptional regulation. We present an analysis that integrates various computational and biological aspects to explore the transcriptional regulation of systemic inflammatory responses through a human endotoxemia model. Given a high-dimensional transcriptional profiling dataset from human blood leukocytes, an elementary set of temporal dynamic responses which capture the essence of a pro-inflammatory phase, a counter-regulatory response and a dysregulation in leukocyte bioenergetics has been extracted. Upon identification of these expression patterns, fourteen inflammation-specific gene batteries that represent groups of hypothetically ‘coregulated’ genes are proposed. Subsequently, statistically significant cis-regulatory modules (CRMs) are identified and decomposed into a list of critical transcription factors (34) that are validated largely on primary literature. Finally, our analysis further allows for the construction of a dynamic representation of the temporal transcriptional regulatory program across the host, deciphering possible combinatorial interactions among factors under which they might be active. Although much remains to be explored, this study has computationally identified key transcription factors and proposed a putative time-dependent transcriptional regulatory program associated with critical transcriptional inflammatory responses. These results provide a solid foundation for future investigations to elucidate the underlying transcriptional regulatory mechanisms under the host inflammatory response. Also, the assumption that coexpressed genes that are functionally relevant are more likely to share some common transcriptional regulatory mechanism seems to be promising, making the proposed framework become essential in unravelling context-specific transcriptional regulatory interactions underlying diverse mammalian biological processes.
Collapse
Affiliation(s)
- Tung T. Nguyen
- BioMaPS Institute for Quantitative Biology, Rutgers University, Piscataway, New Jersey, United States of America
| | - Panagiota T. Foteinou
- Department of Biomedical Engineering, Rutgers University, Piscataway, New Jersey, United States of America
| | - Steven E. Calvano
- Department of Surgery, Robert Wood Johnson Medical School, University of Medicine and Dentistry, New Jersey, New Brunswick, New Jersey, United States of America
| | - Stephen F. Lowry
- Department of Surgery, Robert Wood Johnson Medical School, University of Medicine and Dentistry, New Jersey, New Brunswick, New Jersey, United States of America
| | - Ioannis P. Androulakis
- Department of Biomedical Engineering, Rutgers University, Piscataway, New Jersey, United States of America
- Department of Surgery, Robert Wood Johnson Medical School, University of Medicine and Dentistry, New Jersey, New Brunswick, New Jersey, United States of America
- * E-mail:
| |
Collapse
|
18
|
When needles look like hay: how to find tissue-specific enhancers in model organism genomes. Dev Biol 2010; 350:239-54. [PMID: 21130761 DOI: 10.1016/j.ydbio.2010.11.026] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2010] [Revised: 11/11/2010] [Accepted: 11/22/2010] [Indexed: 01/22/2023]
Abstract
A major prerequisite for the investigation of tissue-specific processes is the identification of cis-regulatory elements. No generally applicable technique is available to distinguish them from any other type of genomic non-coding sequence. Therefore, researchers often have to identify these elements by elaborate in vivo screens, testing individual regions until the right one is found. Here, based on many examples from the literature, we summarize how functional enhancers have been isolated from other elements in the genome and how they have been characterized in transgenic animals. Covering computational and experimental studies, we provide an overview of the global properties of cis-regulatory elements, like their specific interactions with promoters and target gene distances. We describe conserved non-coding elements (CNEs) and their internal structure, nucleotide composition, binding site clustering and overlap, with a special focus on developmental enhancers. Conflicting data and unresolved questions on the nature of these elements are highlighted. Our comprehensive overview of the experimental shortcuts that have been found in the different model organism communities and the new field of high-throughput assays should help during the preparation phase of a screen for enhancers. The review is accompanied by a list of general guidelines for such a project.
Collapse
|
19
|
Su J, Teichmann SA, Down TA. Assessing computational methods of cis-regulatory module prediction. PLoS Comput Biol 2010; 6:e1001020. [PMID: 21152003 PMCID: PMC2996316 DOI: 10.1371/journal.pcbi.1001020] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2010] [Accepted: 10/29/2010] [Indexed: 01/02/2023] Open
Abstract
Computational methods attempting to identify instances of cis-regulatory modules (CRMs) in the genome face a challenging problem of searching for potentially interacting transcription factor binding sites while knowledge of the specific interactions involved remains limited. Without a comprehensive comparison of their performance, the reliability and accuracy of these tools remains unclear. Faced with a large number of different tools that address this problem, we summarized and categorized them based on search strategy and input data requirements. Twelve representative methods were chosen and applied to predict CRMs from the Drosophila CRM database REDfly, and across the human ENCODE regions. Our results show that the optimal choice of method varies depending on species and composition of the sequences in question. When discriminating CRMs from non-coding regions, those methods considering evolutionary conservation have a stronger predictive power than methods designed to be run on a single genome. Different CRM representations and search strategies rely on different CRM properties, and different methods can complement one another. For example, some favour homotypical clusters of binding sites, while others perform best on short CRMs. Furthermore, most methods appear to be sensitive to the composition and structure of the genome to which they are applied. We analyze the principal features that distinguish the methods that performed well, identify weaknesses leading to poor performance, and provide a guide for users. We also propose key considerations for the development and evaluation of future CRM-prediction methods. Transcriptional regulation involves multiple transcription factors binding to DNA sequences. A limited repertoire of transcription factors performs this complex regulatory step through various spatial and temporal interactions between themselves and their binding sites. These transcription factor binding interactions are clustered as distinct modules: cis-regulatory modules (CRMs). Computational methods attempting to identify instances of CRMs in the genome face a challenging problem because a majority of these interactions between transcription factors remain unknown. To investigate the reliability and accuracy of these methods, we chose twelve representative methods and applied them to predict CRMs on both the fly and human genomes. Our results show that the optimal choice of method varies depending on species and composition of the sequences in question. Different CRM representations and search strategies rely on different CRM properties, and different methods can complement one another. We provide a guide for users and key considerations for developers. We also expect that, along with new technology generating new types of genomic data, future CRM prediction methods will be able to reveal transcription binding interactions in three-dimensional space.
Collapse
Affiliation(s)
- Jing Su
- MRC Laboratory of Molecular Biology, Cambridge, United Kingdom
| | | | | |
Collapse
|
20
|
Kim DW, Choi SH, Kim RN, Kim SH, Paik SG, Nam SH, Kim DW, Kim A, Kang A, Park HS. Comparative genomic analysis of the false killer whale (Pseudorca crassidens) LMBR1 locus. Genome 2010; 53:658-66. [PMID: 20924415 DOI: 10.1139/g10-043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The sequencing and comparative genomic analysis of LMBR1 loci in mammals or other species, including human, would be very important in understanding evolutionary genetic changes underlying the evolution of limb development. In this regard, comparative genomic annotation of the false killer whale LMBR1 locus could shed new light on the evolution of limb development. We sequenced two false killer whale BAC clones, corresponding to 156 kb and 144 kb, respectively, harboring the tightly linked RNF32, LMBR1, and NOM1 genes. Our annotation of the false killer whale LMBR1 gene showed that it consists of 17 exons (1473 bp), in contrast to 18 exons (1596 bp) in human, and it displays 93.1% and 95.6% nucleotide and amino acid sequence similarity, respectively, compared with the human gene. In particular, we discovered that exon 10, deleted in the false killer whale LMBR1 gene, is present only in primates, and this fact strongly implies that exon 10 might be crucial in determining primate-specific limb development. ZRS and TFBS sequences have been well conserved across 11 species, suggesting that these regions could be involved in an important function of limb development and limb patterning. The neighboring gene RNF32 showed several lineage-conserved exons, such as exons 2 through 9 conserved in eutherian mammals, exons 3 through 9 conserved in mammals, and exons 5 through 9 conserved in vertebrates. The other neighboring gene, NOM1, had undergone a substitution (ATG→GTA) at the start codon, giving rise to a 36 bp shorter N-terminal sequence compared with the human sequence. Our comparative analysis of the false killer whale LMBR1 genomic locus provides important clues regarding the genetic regions that may play crucial roles in limb development and patterning.
Collapse
Affiliation(s)
- Dae-Won Kim
- Genome Resource Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon 305-806, Korea
| | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
A single cis element maintains repression of the key developmental regulator Gata2. PLoS Genet 2010; 6:e1001103. [PMID: 20838598 PMCID: PMC2936534 DOI: 10.1371/journal.pgen.1001103] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2009] [Accepted: 07/29/2010] [Indexed: 11/20/2022] Open
Abstract
In development, lineage-restricted transcription factors simultaneously promote differentiation while repressing alternative fates. Molecular dissection of this process has been challenging as transcription factor loci are regulated by many trans-acting factors functioning through dispersed cis elements. It is not understood whether these elements function collectively to confer transcriptional regulation, or individually to control specific aspects of activation or repression, such as initiation versus maintenance. Here, we have analyzed cis element regulation of the critical hematopoietic factor Gata2, which is expressed in early precursors and repressed as GATA-1 levels rise during terminal differentiation. We engineered mice lacking a single cis element −1.8 kb upstream of the Gata2 transcriptional start site. Although Gata2 is normally repressed in late-stage erythroblasts, the −1.8 kb mutation unexpectedly resulted in reactivated Gata2 transcription, blocked differentiation, and an aberrant lineage-specific gene expression pattern. Our findings demonstrate that the −1.8 kb site selectively maintains repression, confers a specific histone modification pattern and expels RNA Polymerase II from the locus. These studies reveal how an individual cis element establishes a normal developmental program via regulating specific steps in the mechanism by which a critical transcription factor is repressed. Different cell types are formed and maintained by proteins called transcription factors that directly bind to specific DNA sequences to activate or repress gene expression. While numerous DNA sequences bound by transcription factors are established, many questions remain unanswered regarding how they function at specific sites located at distinct chromosomal regions. As a model to study this process, we examined the regulation of a gene controlling red blood cell development, Gata2, by the transcription factor GATA1. In the DNA sequence upstream of Gata2, there are several sites that GATA1 is known to bind to; however, it is unclear whether these binding sites work together or independently to control expression of Gata2. To study this, we engineered mice to specifically remove one of these GATA1-binding sites. We found that removal of this single site reactivated expression of Gata2 in a specific stage of red blood cell development where Gata2 is normally not expressed, caused a block in differentiation of these cells, and changed the histone modification pattern specifically in the region upstream of Gata2. This work supports a model in which individual transcription factor binding sites within regions of multiple binding sites can independently and distinctly regulate gene expression during development.
Collapse
|
22
|
Wu DC, Paulson RF. Hypoxia regulates BMP4 expression in the murine spleen during the recovery from acute anemia. PLoS One 2010; 5:e11303. [PMID: 20585586 PMCID: PMC2892039 DOI: 10.1371/journal.pone.0011303] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2010] [Accepted: 05/29/2010] [Indexed: 12/01/2022] Open
Abstract
Background Bone marrow erythropoiesis is primarily homeostatic, producing new erythrocytes at a constant rate. However at times of acute anemia, new erythrocytes must be rapidly produced much faster than bone marrow steady state erythropoiesis. At these times stress erythropoiesis predominates. Stress erythropoiesis occurs in the fetal liver during embryogenesis and in the adult spleen and liver. In adult mice, stress erythropoiesis utilizes a specialized population of stress erythroid progenitors that are resident in the spleen. In response to acute anemia, these progenitors rapidly expand and differentiate in response to three signals, BMP4, SCF and hypoxia. In absence of acute anemic stress, two of these signals, BMP4 and hypoxia, are not present and the pathway is not active. The initiating event in the activation of this pathway is the up-regulation of BMP4 expression in the spleen. Methodology/Principal Findings In this paper we analyze the regulation of BMP4 expression in the spleen by hypoxia. Using stromal cell lines, we establish a role for hypoxia transcription factor HIFs (Hypoxia Inducible Factors) in the transcription of BMP4. We identified putative Hypoxia Responsive Elements (HREs) in the BMP4 gene using bioinformatics. Analysis of these elements showed that in vivo, Hif2α binds two cis regulatory sites in the BMP4 gene, which regulate BMP4 expression during the recovery from acute anemia. Conclusions and Significance These data show that hypoxia plays a key role in initiating the BMP4 dependent stress erythropoiesis pathway by regulating BMP4 expression.
Collapse
Affiliation(s)
- Dai-Chen Wu
- Graduate Program in Biochemistry, Microbiology and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Robert F. Paulson
- Graduate Program in Biochemistry, Microbiology and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
- Center for Molecular Immunology and Infectious Disease, The Pennsylvania State University, University Park, Pennsylvania, United States of America
- Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
23
|
Zhang Y, Wu W, Cheng Y, King DC, Harris RS, Taylor J, Chiaromonte F, Hardison RC. Primary sequence and epigenetic determinants of in vivo occupancy of genomic DNA by GATA1. Nucleic Acids Res 2010; 37:7024-38. [PMID: 19767611 PMCID: PMC2790884 DOI: 10.1093/nar/gkp747] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
DNA sequence motifs and epigenetic modifications contribute to specific binding by a transcription factor, but the extent to which each feature determines occupancy in vivo is poorly understood. We addressed this question in erythroid cells by identifying DNA segments occupied by GATA1 and measuring the level of trimethylation of histone H3 lysine 27 (H3K27me3) and monomethylation of H3 lysine 4 (H3K4me1) along a 66 Mb region of mouse chromosome 7. While 91% of the GATA1-occupied segments contain the consensus binding-site motif WGATAR, only ∼0.7% of DNA segments with such a motif are occupied. Using a discriminative motif enumeration method, we identified additional motifs predictive of occupancy given the presence of WGATAR. The specific motif variant AGATAA and occurrence of multiple WGATAR motifs are both strong discriminators. Combining motifs to pair a WGATAR motif with a binding site motif for GATA1, EKLF or SP1 improves discriminative power. Epigenetic modifications are also strong determinants, with the factor-bound segments highly enriched for H3K4me1 and depleted of H3K27me3. Combining primary sequence and epigenetic determinants captures 52% of the GATA1-occupied DNA segments and substantially increases the specificity, to one out of seven segments with the required motif combination and epigenetic signals being bound.
Collapse
Affiliation(s)
- Ying Zhang
- Center for Comparative Genomics and Bioinformatics, Huck Institutes of Life Sciences
| | | | | | | | | | | | | | | |
Collapse
|
24
|
Abstract
Determining the timing and molecular repertoire responsible for gene expression is fundamental to understanding a gene's function. Heritable differences in this character are increasingly regarded as explanatory for complex and common traits. For many known trait-predisposing genes, studies have sought to elucidate the associated logic behind gene regulation. However, there exist many challenges in deciphering these mechanisms. Among them, it is recognized that we have limited understanding of regulatory complexity, the current models of gene regulation have low specificity and any gene's regulatory logic is dependent on biological context. Addressing these limitations and defining the regulatory genome is an ongoing challenge for molecular biology. We discuss current efforts to define and annotate the regulatory genome by focusing on curation and text-mining activities. We further highlight the type of information and curation process for describing regulatory elements within the ORegAnno database ( www.oreganno.org ) and how the general standards for such information are changing.
Collapse
|
25
|
Cheng Y, Wu W, Ashok Kumar S, Yu D, Deng W, Tripic T, King DC, Chen KB, Zhang Y, Drautz D, Giardine B, Schuster SC, Miller W, Chiaromonte F, Zhang Y, Blobel GA, Weiss MJ, Hardison RC. Erythroid GATA1 function revealed by genome-wide analysis of transcription factor occupancy, histone modifications, and mRNA expression. Genome Res 2009; 19:2172-84. [PMID: 19887574 PMCID: PMC2792182 DOI: 10.1101/gr.098921.109] [Citation(s) in RCA: 175] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2009] [Accepted: 10/05/2009] [Indexed: 11/24/2022]
Abstract
The transcription factor GATA1 regulates an extensive program of gene activation and repression during erythroid development. However, the associated mechanisms, including the contributions of distal versus proximal cis-regulatory modules, co-occupancy with other transcription factors, and the effects of histone modifications, are poorly understood. We studied these problems genome-wide in a Gata1 knockout erythroblast cell line that undergoes GATA1-dependent terminal maturation, identifying 2616 GATA1-responsive genes and 15,360 GATA1-occupied DNA segments after restoration of GATA1. Virtually all occupied DNA segments have high levels of H3K4 monomethylation and low levels of H3K27me3 around the canonical GATA binding motif, regardless of whether the nearby gene is induced or repressed. Induced genes tend to be bound by GATA1 close to the transcription start site (most frequently in the first intron), have multiple GATA1-occupied segments that are also bound by TAL1, and show evolutionary constraint on the GATA1-binding site motif. In contrast, repressed genes are further away from GATA1-occupied segments, and a subset shows reduced TAL1 occupancy and increased H3K27me3 at the transcription start site. Our data expand the repertoire of GATA1 action in erythropoiesis by defining a new cohort of target genes and determining the spatial distribution of cis-regulatory modules throughout the genome. In addition, we begin to establish functional criteria and mechanisms that distinguish GATA1 activation from repression at specific target genes. More broadly, these studies illustrate how a "master regulator" transcription factor coordinates tissue differentiation through a panoply of DNA and protein interactions.
Collapse
Affiliation(s)
- Yong Cheng
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Weisheng Wu
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Swathi Ashok Kumar
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Duonan Yu
- Division of Hematology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
| | - Wulan Deng
- Division of Hematology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
| | - Tamara Tripic
- Division of Hematology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
| | - David C. King
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Kuan-Bei Chen
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Ying Zhang
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Daniela Drautz
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Belinda Giardine
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Stephan C. Schuster
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Webb Miller
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Francesca Chiaromonte
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Yu Zhang
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Gerd A. Blobel
- Division of Hematology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
| | - Mitchell J. Weiss
- Division of Hematology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
| | - Ross C. Hardison
- Center for Comparative Genomics and Bioinformatics of the Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
26
|
Amigo JD, Ackermann GE, Cope JJ, Yu M, Cooney JD, Ma D, Langer NB, Shafizadeh E, Shaw GC, Horsely W, Trede NS, Davidson AJ, Barut BA, Zhou Y, Wojiski SA, Traver D, Moran TB, Kourkoulis G, Hsu K, Kanki JP, Shah DI, Lin HF, Handin RI, Cantor AB, Paw BH. The role and regulation of friend of GATA-1 (FOG-1) during blood development in the zebrafish. Blood 2009; 114:4654-63. [PMID: 19729519 PMCID: PMC2780302 DOI: 10.1182/blood-2008-12-189910] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2008] [Accepted: 08/06/2009] [Indexed: 01/24/2023] Open
Abstract
The nuclear protein FOG-1 binds transcription factor GATA-1 to facilitate erythroid and megakaryocytic maturation. However, little is known about the function of FOG-1 during myeloid and lymphoid development or how FOG-1 expression is regulated in any tissue. We used in situ hybridization, gain- and loss-of-function studies in zebrafish to address these problems. Zebrafish FOG-1 is expressed in early hematopoietic cells, as well as heart, viscera, and paraspinal neurons, suggesting that it has multifaceted functions in organogenesis. We found that FOG-1 is dispensable for endoderm specification but is required for endoderm patterning affecting the expression of late-stage T-cell markers, independent of GATA-1. The suppression of FOG-1, in the presence of normal GATA-1 levels, induces severe anemia and thrombocytopenia and expands myeloid-progenitor cells, indicating that FOG-1 is required during erythroid/myeloid commitment. To functionally interrogate whether GATA-1 regulates FOG-1 in vivo, we used bioinformatics combined with transgenic assays. Thus, we identified 2 cis-regulatory elements that control the tissue-specific gene expression of FOG-1. One of these enhancers contains functional GATA-binding sites, indicating the potential for a regulatory loop in which GATA factors control the expression of their partner protein FOG-1.
Collapse
Affiliation(s)
- Julio D Amigo
- Department of Medicine, Division of Hematology, Brigham & Women's Hospital, Boston, MA 02115, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Maeda T, Ito K, Merghoub T, Poliseno L, Hobbs RM, Wang G, Dong L, Maeda M, Dore LC, Zelent A, Luzzatto L, Teruya-Feldstein J, Weiss MJ, Pandolfi PP. LRF is an essential downstream target of GATA1 in erythroid development and regulates BIM-dependent apoptosis. Dev Cell 2009; 17:527-40. [PMID: 19853566 DOI: 10.1016/j.devcel.2009.09.005] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2009] [Revised: 06/11/2009] [Accepted: 09/18/2009] [Indexed: 12/15/2022]
Abstract
GATA-1-dependent transcription is essential for erythroid differentiation and maturation. Suppression of programmed cell death is also thought to be critical for this process; however, the link between these two features of erythropoiesis has remained elusive. Here, we show that the POZ-Krüppel family transcription factor, LRF (also known as Zbtb7a/Pokemon), is a direct target of GATA1 and plays an essential antiapoptotic role during terminal erythroid differentiation. We find that loss of Lrf leads to lethal anemia in embryos, due to increased apoptosis of late-stage erythroblasts. This programmed cell death is Arf and p53 independent and is instead mediated by upregulation of the proapoptotic factor Bim. We identify Lrf as a direct repressor of Bim transcription. In strong support of this mechanism, genetic Bim loss delays the lethality of Lrf-deficient embryos and rescues their anemia phenotype. Thus, our data define a key transcriptional cascade for effective erythropoiesis, whereby GATA-1 suppresses BIM-mediated apoptosis via LRF.
Collapse
Affiliation(s)
- Takahiro Maeda
- Cancer Biology and Genetics Program, Sloan-Kettering Institute, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Tomovic A, Stadler M, Oakeley EJ. Transcription factor site dependencies in human, mouse and rat genomes. BMC Bioinformatics 2009; 10:339. [PMID: 19835596 PMCID: PMC2770556 DOI: 10.1186/1471-2105-10-339] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2009] [Accepted: 10/16/2009] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND It is known that transcription factors frequently act together to regulate gene expression in eukaryotes. In this paper we describe a computational analysis of transcription factor site dependencies in human, mouse and rat genomes. RESULTS Our approach for quantifying tendencies of transcription factor binding sites to co-occur is based on a binding site scoring function which incorporates dependencies between positions, the use of information about the structural class of each transcription factor (major/minor groove binder), and also considered the possible implications of varying GC content of the sequences. Significant tendencies (dependencies) have been detected by non-parametric statistical methodology (permutation tests). Evaluation of obtained results has been performed in several ways: reports from literature (many of the significant dependencies between transcription factors have previously been confirmed experimentally); dependencies between transcription factors are not biased due to similarities in their DNA-binding sites; the number of dependent transcription factors that belong to the same functional and structural class is significantly higher than would be expected by chance; supporting evidence from GO clustering of targeting genes. Based on dependencies between two transcription factor binding sites (second-order dependencies), it is possible to construct higher-order dependencies (networks). Moreover results about transcription factor binding sites dependencies can be used for prediction of groups of dependent transcription factors on a given promoter sequence. Our results, as well as a scanning tool for predicting groups of dependent transcription factors binding sites are available on the Internet. CONCLUSION We show that the computational analysis of transcription factor site dependencies is a valuable complement to experimental approaches for discovering transcription regulatory interactions and networks. Scanning promoter sequences with dependent groups of transcription factor binding sites improve the quality of transcription factor predictions.
Collapse
Affiliation(s)
- Andrija Tomovic
- Friedrich Miescher Institute for Biomedical Research, Novartis Research Foundation, Basel, Switzerland.
| | | | | |
Collapse
|
29
|
Chromatin architecture and transcription factor binding regulate expression of erythrocyte membrane protein genes. Mol Cell Biol 2009; 29:5399-412. [PMID: 19687298 DOI: 10.1128/mcb.00777-09] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Erythrocyte membrane protein genes serve as excellent models of complex gene locus structure and function, but their study has been complicated by both their large size and their complexity. To begin to understand the intricate interplay of transcription, dynamic chromatin architecture, transcription factor binding, and genomic organization in regulation of erythrocyte membrane protein genes, we performed chromatin immunoprecipitation (ChIP) coupled with microarray analysis and ChIP coupled with massively parallel DNA sequencing in both erythroid and nonerythroid cells. Unexpectedly, most regions of GATA-1 and NF-E2 binding were remote from gene promoters and transcriptional start sites, located primarily in introns. Cooccupancy with FOG-1, SCL, and MTA-2 was found at all regions of GATA-1 binding, with cooccupancy of SCL and MTA-2 also found at regions of NF-E2 binding. Cooccupancy of GATA-1 and NF-E2 was found frequently. A common signature of histone H3 trimethylation at lysine 4, GATA-1, NF-E2, FOG-1, SCL, and MTA-2 binding and consensus GATA-1-E-box binding motifs located 34 to 90 bp away from NF-E2 binding motifs was found frequently in erythroid cell-expressed genes. These results provide insights into our understanding of membrane protein gene regulation in erythropoiesis and the regulation of complex genetic loci in erythroid and nonerythroid cells and identify numerous candidate regions for mutations associated with membrane-linked hemolytic anemia.
Collapse
|
30
|
Abstract
GATA-2 is an essential transcription factor that regulates multiple aspects of hematopoiesis. Dysregulation of GATA-2 is a hallmark of acute megakaryoblastic leukemia in children with Down syndrome, a malignancy that is defined by the combination of trisomy 21 and a GATA1 mutation. Here, we show that GATA-2 is required for normal megakaryocyte development as well as aberrant megakaryopoiesis in Gata1 mutant cells. Furthermore, we demonstrate that GATA-2 indirectly controls cell cycle progression in GATA-1-deficient megakaryocytes. Genome-wide microarray analysis and chromatin immunoprecipitation studies revealed that GATA-2 regulates a wide set of genes, including cell cycle regulators and megakaryocyte-specific genes. Surprisingly, GATA-2 also negatively regulates the expression of crucial myeloid transcription factors, such as Sfpi1 and Cebpa. In the absence of GATA-1, GATA-2 prevents induction of a latent myeloid gene expression program. Thus, GATA-2 contributes to cell cycle progression and the maintenance of megakaryocyte identity of GATA-1-deficient cells, including GATA-1s-expressing fetal megakaryocyte progenitors. Moreover, our data reveal that overexpression of GATA-2 facilitates aberrant megakaryopoiesis.
Collapse
|
31
|
It takes (LMO) 2 to tango. Blood 2009; 113:5693. [DOI: 10.1182/blood-2009-03-207258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
32
|
Graded repression of PU.1/Sfpi1 gene transcription by GATA factors regulates hematopoietic cell fate. Blood 2009; 114:983-94. [PMID: 19491391 DOI: 10.1182/blood-2009-03-207944] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
GATA-1 and PU.1 are essential hematopoietic transcription factors that control erythromegakaryocytic and myelolymphoid differentiation, respectively. These proteins antagonize each other through direct physical interaction to repress alternate lineage programs. We used immortalized Gata1(-) erythromegakaryocytic progenitor cells to study how PU.1/Sfpi1 expression is regulated by GATA-1 and GATA-2, a related factor that is normally expressed at earlier stages of hematopoiesis. Both GATA factors bind the PU.1/Sfpi1 gene at 2 highly conserved regions. In the absence of GATA-1, GATA-2 binding is associated with an undifferentiated state, intermediate level PU.1/Sfpi1 expression, and low-level expression of its downstream myeloid target genes. Restoration of GATA-1 function induces erythromegakaryocytic differentiation. Concomitantly, GATA-1 replaces GATA-2 at the PU.1/Sfpi1 locus and PU.1/Sfpi1 expression is extinguished. In contrast, when GATA-1 is not present, shRNA knockdown of GATA-2 increases PU.1/Sfpi1 expression by 3-fold and reprograms the cells to become macrophages. Our findings indicate that GATA factors act sequentially to regulate lineage determination during hematopoiesis, in part by exerting variable repressive effects at the PU.1/Sfpi1 locus.
Collapse
|
33
|
Tallack MR, Keys JR, Humbert PO, Perkins AC. EKLF/KLF1 controls cell cycle entry via direct regulation of E2f2. J Biol Chem 2009; 284:20966-74. [PMID: 19457859 DOI: 10.1074/jbc.m109.006346] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Differentiation of erythroid cells requires precise control over the cell cycle to regulate the balance between cell proliferation and differentiation. The zinc finger transcription factor, erythroid Krüppel-like factor (EKLF/KLF1), is essential for proper erythroid cell differentiation and regulates many erythroid genes. Here we show that loss of EKLF leads to aberrant entry into S-phase of the cell cycle during both primitive and definitive erythropoiesis. This cell cycle defect was associated with a significant reduction in the expression levels of E2f2 and E2f4, key factors necessary for the induction of S-phase gene expression and erythropoiesis. We found and validated novel intronic enhancers in both the E2f2 and E2f4 genes, which contain conserved CACC, GATA, and E-BOX elements. The E2f2 enhancer was occupied by EKLF in vivo. Furthermore, we were able to partially restore cell cycle dynamics in EKLF(-/-) fetal liver upon additional genetic depletion of Rb, establishing a genetic causal link between reduced E2f2 and the EKLF cell cycle defect. Finally, we propose direct regulation of the E2f2 enhancer is a generic mechanism by which many KLFs regulate proliferation and differentiation.
Collapse
Affiliation(s)
- Michael R Tallack
- Division of Molecular Genetics and Development, Institute for Molecular Bioscience, University of Queensland, Brisbane, 4072, Australia
| | | | | | | |
Collapse
|
34
|
Expression of the leukemia oncogene Lmo2 is controlled by an array of tissue-specific elements dispersed over 100 kb and bound by Tal1/Lmo2, Ets, and Gata factors. Blood 2009; 113:5783-92. [PMID: 19171877 DOI: 10.1182/blood-2008-11-187757] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The Lmo2 gene encodes a transcriptional cofactor critical for the development of hematopoietic stem cells. Ectopic LMO2 expression causes leukemia in T-cell acute lymphoblastic leukemia (T-ALL) patients and severe combined immunodeficiency patients undergoing retroviral gene therapy. Tightly controlled Lmo2 expression is therefore essential, yet no comprehensive analysis of Lmo2 regulation has been published so far. By comparative genomics, we identified 17 highly conserved noncoding elements, 9 of which revealed specific acetylation marks in chromatin-immunoprecipitation and microarray (ChIP-chip) assays performed across 250 kb of the Lmo2 locus in 11 cell types covering different stages of hematopoietic differentiation. All candidate regulatory regions were tested in transgenic mice. An extended LMO2 proximal promoter fragment displayed strong endothelial activity, while the distal promoter showed weak forebrain activity. Eight of the 15 distal candidate elements functioned as enhancers, which together recapitulated the full expression pattern of Lmo2, directing expression to endothelium, hematopoietic cells, tail, and forebrain. Interestingly, distinct combinations of specific distal regulatory elements were required to extend endothelial activity of the LMO2 promoter to yolk sac or fetal liver hematopoietic cells. Finally, Sfpi1/Pu.1, Fli1, Gata2, Tal1/Scl, and Lmo2 were shown to bind to and transactivate Lmo2 hematopoietic enhancers, thus identifying key upstream regulators and positioning Lmo2 within hematopoietic regulatory networks.
Collapse
|
35
|
Alon S, Eisenberg E, Jacob-Hirsch J, Rechavi G, Vatine G, Toyama R, Coon SL, Klein DC, Gothilf Y. A new cis-acting regulatory element driving gene expression in the zebrafish pineal gland. Bioinformatics 2009; 25:559-62. [PMID: 19147662 DOI: 10.1093/bioinformatics/btp031] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
MOTIVATION The identification of functional cis-acting DNA regulatory elements is a crucial step towards understanding gene regulation. Ab initio motif detection algorithms have been extensively used in search of regulatory elements. Yet, their success in providing experimentally validated regulatory elements in vertebrates has been limited. RESULTS Here we report in silico identification and in vivo validation of regulatory elements that determine enhanced gene expression in the pineal gland of zebrafish. Microarray data enabled detection of genes that exhibit high expression in the pineal gland. The promoter regions of these genes were computationally analyzed in order to identify overrepresented motifs. The highest ranking motif identified is a CRX/OTX binding site, known to govern expression in the pineal gland and retina. The second highest ranking motif was not reported before; we experimentally validated its function in vivo by mutational analysis. The methodology presented here may be applicable as a general scheme for finding regulatory elements that contribute to tissue-specific gene expression.
Collapse
Affiliation(s)
- Shahar Alon
- Department of Neurobiology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Attanasio C, Reymond A, Humbert R, Lyle R, Kuehn MS, Neph S, Sabo PJ, Goldy J, Weaver M, Haydock A, Lee K, Dorschner M, Dermitzakis ET, Antonarakis SE, Stamatoyannopoulos JA. Assaying the regulatory potential of mammalian conserved non-coding sequences in human cells. Genome Biol 2008; 9:R168. [PMID: 19055709 PMCID: PMC2646272 DOI: 10.1186/gb-2008-9-12-r168] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2008] [Revised: 09/24/2008] [Accepted: 12/02/2008] [Indexed: 01/26/2023] Open
Abstract
The fraction of experimentally active conserved non-coding sequences within any given cell type is low, so classical assays are unlikely to expose their potential. Background Conserved non-coding sequences in the human genome are approximately tenfold more abundant than known genes, and have been hypothesized to mark the locations of cis-regulatory elements. However, the global contribution of conserved non-coding sequences to the transcriptional regulation of human genes is currently unknown. Deeply conserved elements shared between humans and teleost fish predominantly flank genes active during morphogenesis and are enriched for positive transcriptional regulatory elements. However, such deeply conserved elements account for <1% of the conserved non-coding sequences in the human genome, which are predominantly mammalian. Results We explored the regulatory potential of a large sample of these 'common' conserved non-coding sequences using a variety of classic assays, including chromatin remodeling, and enhancer/repressor and promoter activity. When tested across diverse human model cell types, we find that the fraction of experimentally active conserved non-coding sequences within any given cell type is low (approximately 5%), and that this proportion increases only modestly when considered collectively across cell types. Conclusions The results suggest that classic assays of cis-regulatory potential are unlikely to expose the functional potential of the substantial majority of mammalian conserved non-coding sequences in the human genome.
Collapse
Affiliation(s)
- Catia Attanasio
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1 rue Michel Servet, 1211, Geneva 4, Switzerland.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Buck I, Morceau F, Cristofanon S, Heintz C, Chateauvieux S, Reuter S, Dicato M, Diederich M. Tumor necrosis factor α inhibits erythroid differentiation in human erythropoietin-dependent cells involving p38 MAPK pathway, GATA-1 and FOG-1 downregulation and GATA-2 upregulation. Biochem Pharmacol 2008; 76:1229-39. [DOI: 10.1016/j.bcp.2008.08.025] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2008] [Revised: 08/22/2008] [Accepted: 08/25/2008] [Indexed: 01/06/2023]
|
38
|
Transcriptional enhancement by GATA1-occupied DNA segments is strongly associated with evolutionary constraint on the binding site motif. Genome Res 2008; 18:1896-905. [PMID: 18818370 DOI: 10.1101/gr.083089.108] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Tissue development and function are exquisitely dependent on proper regulation of gene expression, but it remains controversial whether the genomic signals controlling this process are subject to strong selective constraint. While some studies show that highly constrained noncoding regions act to enhance transcription, other studies show that DNA segments with biochemical signatures of regulatory regions, such as occupancy by a transcription factor, are seemingly unconstrained across mammalian evolution. To test the possible correlation of selective constraint with enhancer activity, we used chromatin immunoprecipitation as an approach unbiased by either evolutionary constraint or prior knowledge of regulatory activity to identify DNA segments within a 66-Mb region of mouse chromosome 7 that are occupied by the erythroid transcription factor GATA1. DNA segments bound by GATA1 were identified by hybridization to high-density tiling arrays, validated by quantitative PCR, and tested for gene regulatory activity in erythroid cells. Whereas almost all of the occupied segments contain canonical WGATAR binding site motifs for GATA1, in only 45% of the cases is the motif deeply preserved (found at the orthologous position in placental mammals or more distant species). However, GATA1-bound segments with high enhancer activity tend to be the ones with an evolutionarily preserved WGATAR motif, and this relationship was confirmed by a loss-of-function assay. Thus, GATA1 binding sites that regulate gene expression during erythroid maturation are under strong selective constraint, while nonconstrained binding may have only a limited or indirect role in regulation.
Collapse
|
39
|
Bourque G, Leong B, Vega VB, Chen X, Lee YL, Srinivasan KG, Chew JL, Ruan Y, Wei CL, Ng HH, Liu ET. Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res 2008; 18:1752-62. [PMID: 18682548 DOI: 10.1101/gr.080663.108] [Citation(s) in RCA: 416] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Identification of lineage-specific innovations in genomic control elements is critical for understanding transcriptional regulatory networks and phenotypic heterogeneity. We analyzed, from an evolutionary perspective, the binding regions of seven mammalian transcription factors (ESR1, TP53, MYC, RELA, POU5F1, SOX2, and CTCF) identified on a genome-wide scale by different chromatin immunoprecipitation approaches and found that only a minority of sites appear to be conserved at the sequence level. Instead, we uncovered a pervasive association with genomic repeats by showing that a large fraction of the bona fide binding sites for five of the seven transcription factors (ESR1, TP53, POU5F1, SOX2, and CTCF) are embedded in distinctive families of transposable elements. Using the age of the repeats, we established that these repeat-associated binding sites (RABS) have been associated with significant regulatory expansions throughout the mammalian phylogeny. We validated the functional significance of these RABS by showing that they are over-represented in proximity of regulated genes and that the binding motifs within these repeats have undergone evolutionary selection. Our results demonstrate that transcriptional regulatory networks are highly dynamic in eukaryotic genomes and that transposable elements play an important role in expanding the repertoire of binding sites.
Collapse
Affiliation(s)
- Guillaume Bourque
- Computational and Mathematical Biology, Genome Institute of Singapore, Singapore 138672, Singapore.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Eisermann K, Tandon S, Bazarov A, Brett A, Fraizer G, Piontkivska H. Evolutionary conservation of zinc finger transcription factor binding sites in promoters of genes co-expressed with WT1 in prostate cancer. BMC Genomics 2008; 9:337. [PMID: 18631392 PMCID: PMC2515153 DOI: 10.1186/1471-2164-9-337] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2008] [Accepted: 07/16/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Gene expression analyses have led to a better understanding of growth control of prostate cancer cells. We and others have identified the presence of several zinc finger transcription factors in the neoplastic prostate, suggesting a potential role for these genes in the regulation of the prostate cancer transcriptome. One of the transcription factors (TFs) identified in the prostate cancer epithelial cells was the Wilms tumor gene (WT1). To rapidly identify coordinately expressed prostate cancer growth control genes that may be regulated by WT1, we used an in silico approach. RESULTS Evolutionary conserved transcription factor binding sites (TFBS) recognized by WT1, EGR1, SP1, SP2, AP2 and GATA1 were identified in the promoters of 24 differentially expressed prostate cancer genes from eight mammalian species. To test the relationship between sequence conservation and function, chromatin of LNCaP prostate cancer and kidney 293 cells were tested for TF binding using chromatin immunoprecipitation (ChIP). Multiple putative TFBS in gene promoters of placental mammals were found to be shared with those in human gene promoters and some were conserved between genomes that diverged about 170 million years ago (i.e., primates and marsupials), therefore implicating these sites as candidate binding sites. Among those genes coordinately expressed with WT1 was the kallikrein-related peptidase 3 (KLK3) gene commonly known as the prostate specific antigen (PSA) gene. This analysis located several potential WT1 TFBS in the PSA gene promoter and led to the rapid identification of a novel putative binding site confirmed in vivo by ChIP. Conversely for two prostate growth control genes, androgen receptor (AR) and vascular endothelial growth factor (VEGF), known to be transcriptionally regulated by WT1, regulatory sequence conservation was observed and TF binding in vivo was confirmed by ChIP. CONCLUSION Overall, this targeted approach rapidly identified important candidate WT1-binding elements in genes coordinately expressed with WT1 in prostate cancer cells, thus enabling a more focused functional analysis of the most likely target genes in prostate cancer progression. Identifying these genes will help to better understand how gene regulation is altered in these tumor cells.
Collapse
Affiliation(s)
- Kurtis Eisermann
- School of Biomedical Sciences, Kent State University, Kent, Ohio, USA.
| | | | | | | | | | | |
Collapse
|
41
|
Portales-Casamar E, Kirov S, Lim J, Lithwick S, Swanson MI, Ticoll A, Snoddy J, Wasserman WW. PAZAR: a framework for collection and dissemination of cis-regulatory sequence annotation. Genome Biol 2008; 8:R207. [PMID: 17916232 PMCID: PMC2246282 DOI: 10.1186/gb-2007-8-10-r207] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2007] [Revised: 09/05/2007] [Accepted: 09/28/2007] [Indexed: 01/29/2023] Open
Abstract
PAZAR is an open-access and open-source database of transcription factor and regulatory sequence annotation with associated web interface and programming tools for data submission and extraction. PAZAR is an open-access and open-source database of transcription factor and regulatory sequence annotation with associated web interface and programming tools for data submission and extraction. Curated boutique data collections can be maintained and disseminated through the unified schema of the mall-like PAZAR repository. The Pleiades Promoter Project collection of brain-linked regulatory sequences is introduced to demonstrate the depth of annotation possible within PAZAR. PAZAR, located at , is open for business.
Collapse
Affiliation(s)
- Elodie Portales-Casamar
- Centre for Molecular Medicine and Therapeutics, CFRI, University of British Columbia, Vancouver, BC, V5Z 4H4, Canada.
| | | | | | | | | | | | | | | |
Collapse
|
42
|
Jing H, Vakoc CR, Ying L, Mandat S, Wang H, Zheng X, Blobel GA. Exchange of GATA factors mediates transitions in looped chromatin organization at a developmentally regulated gene locus. Mol Cell 2008; 29:232-42. [PMID: 18243117 DOI: 10.1016/j.molcel.2007.11.020] [Citation(s) in RCA: 166] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2007] [Revised: 09/21/2007] [Accepted: 11/09/2007] [Indexed: 12/18/2022]
Abstract
Enhancers can regulate designate promoters over long distances by forming chromatin loops. Whether chromatin loops are lost or reconfigured during gene repression is largely unexplored. We examined the chromosome conformation of the Kit gene that is expressed during early erythropoiesis but is downregulated upon cell maturation. Kit expression is controlled by sequential occupancy of two GATA family transcription factors. In immature cells, a distal enhancer bound by GATA-2 is in physical proximity with the active Kit promoter. Upon cell maturation, GATA-1 displaces GATA-2 and triggers a loss of the enhancer/promoter interaction. Moreover, GATA-1 reciprocally increases the proximity in nuclear space among distinct downstream GATA elements. GATA-1-induced transitions in chromatin conformation are not simply the consequence of transcription inhibition and require the cofactor FOG-1. This work shows that a GATA factor exchange reconfigures higher-order chromatin organization, and suggests that de novo chromatin loop formation is employed by nuclear factors to specify repressive outcomes.
Collapse
Affiliation(s)
- Huie Jing
- Division of Hematology, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | | | | | | | | | | | | |
Collapse
|
43
|
A GATA-1-regulated microRNA locus essential for erythropoiesis. Proc Natl Acad Sci U S A 2008; 105:3333-8. [PMID: 18303114 DOI: 10.1073/pnas.0712312105] [Citation(s) in RCA: 265] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
MicroRNAs (miRNAs) control tissue development, but their mechanism of regulation is not well understood. We used a gene complementation strategy combined with microarray screening to identify miRNAs involved in the formation of erythroid (red blood) cells. Two conserved miRNAs, miR 144 and miR 451, emerged as direct targets of the critical hematopoietic transcription factor GATA-1. In vivo, GATA-1 binds a distal upstream regulatory element to activate RNA polymerase II-mediated transcription of a single common precursor RNA (pri-miRNA) encoding both mature miRNAs. Zebrafish embryos depleted of miR 451 by using antisense morpholinos form erythroid precursors, but their development into mature circulating red blood cells is strongly and specifically impaired. These results reveal a miRNA locus that is required for erythropoiesis and uncover a new regulatory axis through which GATA-1 controls this process.
Collapse
|
44
|
Sturm RA, Duffy DL, Zhao ZZ, Leite FP, Stark MS, Hayward N, Martin NG, Montgomery GW. A single SNP in an evolutionary conserved region within intron 86 of the HERC2 gene determines human blue-brown eye color. Am J Hum Genet 2008; 82:424-31. [PMID: 18252222 DOI: 10.1016/j.ajhg.2007.11.005] [Citation(s) in RCA: 262] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2007] [Revised: 11/19/2007] [Accepted: 11/19/2007] [Indexed: 01/22/2023] Open
Abstract
We have previously demonstrated that haplotypes of three single nucleotide polymorphisms (SNPs) within the first intron of the OCA2 gene are extremely strongly associated with variation in human eye color. In the present work, we describe additional fine association mapping of eye color SNPs in the intergenic region upstream of OCA2 and within the neighboring HERC2 (hect domain and RLD2) gene. We screened an additional 92 SNPs in 300-3000 European individuals and found that a single SNP in intron 86 of HERC2, rs12913832, predicted eye color significantly better (ordinal logistic regression R(2) = 0.68, association LOD = 444) than our previous best OCA2 haplotype. Comparison of sequence alignments of multiple species showed that this SNP lies in the center of a short highly conserved sequence and that the blue-eye-associated allele (frequency 78%) breaks up this conserved sequence, part of which forms a consensus binding site for the helicase-like transcription factor (HLTF). We were also able to demonstrate the OCA2 R419Q, rs1800407, coding SNP acts as a penetrance modifier of this new HERC2 SNP for eye color, and somewhat independently, of melanoma risk. We conclude that the conserved region around rs12913832 represents a regulatory region controlling constitutive expression of OCA2 and that the C allele at rs12913832 leads to decreased expression of OCA2, particularly within iris melanocytes, which we postulate to be the ultimate cause of blue eye color.
Collapse
|
45
|
Griffith OL, Montgomery SB, Bernier B, Chu B, Kasaian K, Aerts S, Mahony S, Sleumer MC, Bilenky M, Haeussler M, Griffith M, Gallo SM, Giardine B, Hooghe B, Van Loo P, Blanco E, Ticoll A, Lithwick S, Portales-Casamar E, Donaldson IJ, Robertson G, Wadelius C, De Bleser P, Vlieghe D, Halfon MS, Wasserman W, Hardison R, Bergman CM, Jones SJM. ORegAnno: an open-access community-driven resource for regulatory annotation. Nucleic Acids Res 2007; 36:D107-13. [PMID: 18006570 PMCID: PMC2239002 DOI: 10.1093/nar/gkm967] [Citation(s) in RCA: 199] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
ORegAnno is an open-source, open-access database and literature curation system for community-based annotation of experimentally identified DNA regulatory regions, transcription factor binding sites and regulatory variants. The current release comprises 30 145 records curated from 922 publications and describing regulatory sequences for over 3853 genes and 465 transcription factors from 19 species. A new feature called the 'publication queue' allows users to input relevant papers from scientific literature as targets for annotation. The queue contains 4438 gene regulation papers entered by experts and another 54 351 identified by text-mining methods. Users can enter or 'check out' papers from the queue for manual curation using a series of user-friendly annotation pages. A typical record entry consists of species, sequence type, sequence, target gene, binding factor, experimental outcome and one or more lines of experimental evidence. An evidence ontology was developed to describe and categorize these experiments. Records are cross-referenced to Ensembl or Entrez gene identifiers, PubMed and dbSNP and can be visualized in the Ensembl or UCSC genome browsers. All data are freely available through search pages, XML data dumps or web services at: http://www.oreganno.org.
Collapse
Affiliation(s)
- Obi L Griffith
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Nottingham WT, Jarratt A, Burgess M, Speck CL, Cheng JF, Prabhakar S, Rubin EM, Li PS, Sloane-Stanley J, Kong-A-San J, de Bruijn MFTR. Runx1-mediated hematopoietic stem-cell emergence is controlled by a Gata/Ets/SCL-regulated enhancer. Blood 2007; 110:4188-97. [PMID: 17823307 PMCID: PMC2234795 DOI: 10.1182/blood-2007-07-100883] [Citation(s) in RCA: 193] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The transcription factor Runx1/AML1 is an important regulator of hematopoiesis and is critically required for the generation of the first definitive hematopoietic stem cells (HSCs) in the major vasculature of the mouse embryo. As a pivotal factor in HSC ontogeny, its transcriptional regulation is of high interest but is largely undefined. In this study, we used a combination of comparative genomics and chromatin analysis to identify a highly conserved 531-bp enhancer located at position + 23.5 in the first intron of the 224-kb mouse Runx1 gene. We show that this enhancer contributes to the early hematopoietic expression of Runx1. Transcription factor binding in vivo and analysis of the mutated enhancer in transient transgenic mouse embryos implicate Gata2 and Ets proteins as critical factors for its function. We also show that the SCL/Lmo2/Ldb-1 complex is recruited to the enhancer in vivo. Importantly, transplantation experiments demonstrate that the intronic Runx1 enhancer targets all definitive HSCs in the mouse embryo, suggesting that it functions as a crucial cis-regulatory element that integrates the Gata, Ets, and SCL transcriptional networks to initiate HSC generation.
Collapse
Affiliation(s)
- Wade T Nottingham
- Medical Research Council (MRC) Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Oxford, UK
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Davies SR, Chang LW, Patra D, Xing X, Posey K, Hecht J, Stormo GD, Sandell LJ. Computational identification and functional validation of regulatory motifs in cartilage-expressed genes. Genome Res 2007; 17:1438-47. [PMID: 17785538 PMCID: PMC1987341 DOI: 10.1101/gr.6224007] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Chondrocyte gene regulation is important for the generation and maintenance of cartilage tissues. Several regulatory factors have been identified that play a role in chondrogenesis, including the positive transacting factors of the SOX family such as SOX9, SOX5, and SOX6, as well as negative transacting factors such as C/EBP and delta EF1. However, a complete understanding of the intricate regulatory network that governs the tissue-specific expression of cartilage genes is not yet available. We have taken a computational approach to identify cis-regulatory, transcription factor (TF) binding motifs in a set of cartilage characteristic genes to better define the transcriptional regulatory networks that regulate chondrogenesis. Our computational methods have identified several TFs, whose binding profiles are available in the TRANSFAC database, as important to chondrogenesis. In addition, a cartilage-specific SOX-binding profile was constructed and used to identify both known, and novel, functional paired SOX-binding motifs in chondrocyte genes. Using DNA pattern-recognition algorithms, we have also identified cis-regulatory elements for unknown TFs. We have validated our computational predictions through mutational analyses in cell transfection experiments. One novel regulatory motif, N1, found at high frequency in the COL2A1 promoter, was found to bind to chondrocyte nuclear proteins. Mutational analyses suggest that this motif binds a repressive factor that regulates basal levels of the COL2A1 promoter.
Collapse
Affiliation(s)
- Sherri R. Davies
- Department of Orthopaedic Surgery, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Li-Wei Chang
- Department of Biomedical Engineering, Washington University, St. Louis, Missouri 63130, USA
| | - Debabrata Patra
- Department of Orthopaedic Surgery, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Xiaoyun Xing
- Department of Orthopaedic Surgery, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Karen Posey
- Department of Pediatrics, University of Texas Medical School at Houston, Houston, Texas 77030, USA
| | - Jacqueline Hecht
- Department of Pediatrics, University of Texas Medical School at Houston, Houston, Texas 77030, USA
- Shriners Hospital for Children, Houston, Texas 77030, USA
| | - Gary D. Stormo
- Department of Biomedical Engineering, Washington University, St. Louis, Missouri 63130, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Linda J. Sandell
- Department of Orthopaedic Surgery, Washington University School of Medicine, St. Louis, Missouri 63110, USA
- Department of Cell Biology and Physiology, Washington University School of Medicine, St. Louis, Missouri 63110, USA
- Corresponding author.E-mail ; fax (314) 454-5900
| |
Collapse
|
48
|
Appanah R, Dickerson DR, Goyal P, Groudine M, Lorincz MC. An unmethylated 3' promoter-proximal region is required for efficient transcription initiation. PLoS Genet 2007; 3:e27. [PMID: 17305432 PMCID: PMC1797817 DOI: 10.1371/journal.pgen.0030027] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2006] [Accepted: 12/28/2006] [Indexed: 11/30/2022] Open
Abstract
The promoter regions of approximately 40% of genes in the human genome are embedded in CpG islands, CpG-rich regions that frequently extend on the order of one kb 3′ of the transcription start site (TSS) region. CpGs 3′ of the TSS of actively transcribed CpG island promoters typically remain methylation-free, indicating that maintaining promoter-proximal CpGs in an unmethylated state may be important for efficient transcription. Here we utilize recombinase-mediated cassette exchange to introduce a Moloney Murine Leukemia Virus (MoMuLV)-based reporter, in vitro methylated 1 kb downstream of the TSS, into a defined genomic site. In a subset of clones, methylation spreads to within ∼320 bp of the TSS, yielding a dramatic decrease in transcript level, even though the promoter/TSS region remains unmethylated. Chromatin immunoprecipitation analyses reveal that such promoter-proximal methylation results in loss of RNA polymerase II and TATA-box-binding protein (TBP) binding in the promoter region, suggesting that repression occurs at the level of transcription initiation. While DNA methylation-dependent trimethylation of H3 lysine (K)9 is confined to the intragenic methylated region, the promoter and downstream regions are hypo-acetylated on H3K9/K14. Furthermore, DNase I hypersensitivity and methylase-based single promoter analysis (M-SPA) experiments reveal that a nucleosome is positioned over the unmethylated TATA-box in these clones, indicating that dense DNA methylation downstream of the promoter region is sufficient to alter the chromatin structure of an unmethylated promoter. Based on these observations, we propose that a DNA methylation-free region extending several hundred bases downstream of the TSS may be a prerequisite for efficient transcription initiation. This model provides a biochemical explanation for the typical positioning of TSSs well upstream of the 3′ end of the CpG islands in which they are embedded. Genes, the functional units of heredity, are made up of DNA, which is packaged inside the nuclei of eukaryotic cells in association with a number of proteins in a structure called chromatin. In order for transcription, the process of transferring genetic information from DNA to RNA, to take place, chromatin must be decondensed to allow the transcription machinery to bind the genes that are to be transcribed. In mammals, promoters, the starting position of genes, are frequently embedded in “CpG islands,” regions with a relatively high density of the CpG dinucleotide. Paradoxically, while cytosines in the context of the CpG dinucleotide are generally methylated, CpGs flanking the start sites of genes typically remain methylation-free. As CpG methylation is associated with condensed chromatin, it is generally believed that promoter regions must remain free of methylation to allow for binding of the transcription machinery. Here, using a novel method for introducing methylated DNA into a defined genomic site, we demonstrate that DNA methylation in the promoter-proximal region of a gene is sufficient to block transcription via the generation of a chromatin structure that inhibits binding of the transcription machinery. Thus, methylation may inhibit transcription even when present outside the promoter region.
Collapse
Affiliation(s)
- Ruth Appanah
- Department of Medical Genetics, The University of British Columbia, Vancouver, British Columbia, Canada
| | - David R Dickerson
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Preeti Goyal
- Department of Medical Genetics, The University of British Columbia, Vancouver, British Columbia, Canada
| | - Mark Groudine
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Department of Radiation Oncology, University of Washington School of Medicine, Seattle, Washington, United States of America
| | - Matthew C Lorincz
- Department of Medical Genetics, The University of British Columbia, Vancouver, British Columbia, Canada
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
49
|
Taylor J, Tyekucheva S, King DC, Hardison RC, Miller W, Chiaromonte F. ESPERR: learning strong and weak signals in genomic sequence alignments to identify functional elements. Genome Res 2006; 16:1596-604. [PMID: 17053093 PMCID: PMC1665643 DOI: 10.1101/gr.4537706] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Genomic sequence signals - such as base composition, presence of particular motifs, or evolutionary constraint - have been used effectively to identify functional elements. However, approaches based only on specific signals known to correlate with function can be quite limiting. When training data are available, application of computational learning algorithms to multispecies alignments has the potential to capture broader and more informative sequence and evolutionary patterns that better characterize a class of elements. However, effective exploitation of patterns in multispecies alignments is impeded by the vast number of possible alignment columns and by a limited understanding of which particular strings of columns may characterize a given class. We have developed a computational method, called ESPERR (evolutionary and sequence pattern extraction through reduced representations), which uses training examples to learn encodings of multispecies alignments into reduced forms tailored for the prediction of chosen classes of functional elements. ESPERR produces a greatly improved Regulatory Potential score, which can discriminate regulatory regions from neutral sites with excellent accuracy ( approximately 94%). This score captures strong signals (GC content and conservation), as well as subtler signals (with small contributions from many different alignment patterns) that characterize the regulatory elements in our training set. ESPERR is also effective for predicting other classes of functional elements, as we show for DNaseI hypersensitive sites and highly conserved regions with developmental enhancer activity. Our software, training data, and genome-wide predictions are available from our Web site (http://www.bx.psu.edu/projects/esperr).
Collapse
Affiliation(s)
- James Taylor
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Corresponding authors.E-mail ; fax (814) 863-6699.E-mail ; fax (814) 863-6699
| | - Svitlana Tyekucheva
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - David C. King
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Ross C. Hardison
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Webb Miller
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Francesca Chiaromonte
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Corresponding authors.E-mail ; fax (814) 863-6699.E-mail ; fax (814) 863-6699
| |
Collapse
|