51
|
Fiore C, Cohen BA. Interactions between pluripotency factors specify cis-regulation in embryonic stem cells. Genome Res 2016; 26:778-86. [PMID: 27197208 PMCID: PMC4889965 DOI: 10.1101/gr.200733.115] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Accepted: 04/13/2016] [Indexed: 01/06/2023]
Abstract
We investigated how interactions between pluripotency transcription factors (TFs) affect cis-regulation. We created hundreds of synthetic cis-regulatory elements (CREs) comprised of combinations of binding sites for pluripotency TFs and measured their expression in mouse embryonic stem (ES) cells. A thermodynamic model that incorporates interactions between TFs explains a large portion (72%) of the variance in expression of these CREs. These interactions include three favorable heterotypic interactions between TFs. The model also predicts an unfavorable homotypic interaction between TFs, helping to explain the observation that homotypic chains of binding sites express at low levels. We further investigated the expression driven by CREs comprised of homotypic chains of KLF4 binding sites. Our results suggest that KLF homologs make unique contributions to regulation by these CREs. We conclude that a specific set of interactions between pluripotency TFs plays a large role in setting the levels of expression driven by CREs in ES cells.
Collapse
Affiliation(s)
- Chris Fiore
- Center for Genome Sciences and Systems Biology, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Barak A Cohen
- Center for Genome Sciences and Systems Biology, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| |
Collapse
|
52
|
Estrada J, Ruiz-Herrero T, Scholes C, Wunderlich Z, DePace AH. SiteOut: An Online Tool to Design Binding Site-Free DNA Sequences. PLoS One 2016; 11:e0151740. [PMID: 26987123 PMCID: PMC4795680 DOI: 10.1371/journal.pone.0151740] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2016] [Accepted: 03/03/2016] [Indexed: 11/18/2022] Open
Abstract
DNA-binding proteins control many fundamental biological processes such as transcription, recombination and replication. A major goal is to decipher the role that DNA sequence plays in orchestrating the binding and activity of such regulatory proteins. To address this goal, it is useful to rationally design DNA sequences with desired numbers, affinities and arrangements of protein binding sites. However, removing binding sites from DNA is computationally non-trivial since one risks creating new sites in the process of deleting or moving others. Here we present an online binding site removal tool, SiteOut, that enables users to design arbitrary DNA sequences that entirely lack binding sites for factors of interest. SiteOut can also be used to delete sites from a specific sequence, or to introduce site-free spacers between functional sequences without creating new sites at the junctions. In combination with commercial DNA synthesis services, SiteOut provides a powerful and flexible platform for synthetic projects that interrogate regulatory DNA. Here we describe the algorithm and illustrate the ways in which SiteOut can be used; it is publicly available at https://depace.med.harvard.edu/siteout/.
Collapse
Affiliation(s)
- Javier Estrada
- Department of Systems Biology, Harvard Medical School, Boston, MA, United States of America
| | - Teresa Ruiz-Herrero
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, United States of America
| | - Clarissa Scholes
- Department of Systems Biology, Harvard Medical School, Boston, MA, United States of America
| | - Zeba Wunderlich
- Department of Systems Biology, Harvard Medical School, Boston, MA, United States of America
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, Boston, MA, United States of America
- * E-mail:
| |
Collapse
|
53
|
Vincent BJ, Estrada J, DePace AH. The appeasement of Doug: a synthetic approach to enhancer biology. Integr Biol (Camb) 2016; 8:475-84. [DOI: 10.1039/c5ib00321k] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Ben J. Vincent
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| |
Collapse
|
54
|
Quantitatively predictable control of Drosophila transcriptional enhancers in vivo with engineered transcription factors. Nat Genet 2016; 48:292-8. [DOI: 10.1038/ng.3509] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2015] [Accepted: 01/15/2016] [Indexed: 12/13/2022]
|
55
|
Gurdziel K, Lorberbaum DS, Udager AM, Song JY, Richards N, Parker DS, Johnson LA, Allen BL, Barolo S, Gumucio DL. Identification and Validation of Novel Hedgehog-Responsive Enhancers Predicted by Computational Analysis of Ci/Gli Binding Site Density. PLoS One 2015; 10:e0145225. [PMID: 26710299 PMCID: PMC4692483 DOI: 10.1371/journal.pone.0145225] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2015] [Accepted: 12/01/2015] [Indexed: 01/20/2023] Open
Abstract
The Hedgehog (Hh) signaling pathway directs a multitude of cellular responses during embryogenesis and adult tissue homeostasis. Stimulation of the pathway results in activation of Hh target genes by the transcription factor Ci/Gli, which binds to specific motifs in genomic enhancers. In Drosophila, only a few enhancers (patched, decapentaplegic, wingless, stripe, knot, hairy, orthodenticle) have been shown by in vivo functional assays to depend on direct Ci/Gli regulation. All but one (orthodenticle) contain more than one Ci/Gli site, prompting us to directly test whether homotypic clustering of Ci/Gli binding sites is sufficient to define a Hh-regulated enhancer. We therefore developed a computational algorithm to identify Ci/Gli clusters that are enriched over random expectation, within a given region of the genome. Candidate genomic regions containing Ci/Gli clusters were functionally tested in chicken neural tube electroporation assays and in transgenic flies. Of the 22 Ci/Gli clusters tested, seven novel enhancers (and the previously known patched enhancer) were identified as Hh-responsive and Ci/Gli-dependent in one or both of these assays, including: Cuticular protein 100A (Cpr100A); invected (inv), which encodes an engrailed-related transcription factor expressed at the anterior/posterior wing disc boundary; roadkill (rdx), the fly homolog of vertebrate Spop; the segment polarity gene gooseberry (gsb); and two previously untested regions of the Hh receptor-encoding patched (ptc) gene. We conclude that homotypic Ci/Gli clustering is not sufficient information to ensure Hh-responsiveness; however, it can provide a clue for enhancer recognition within putative Hedgehog target gene loci.
Collapse
Affiliation(s)
- Katherine Gurdziel
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
- Department of Computational Medicine and Bioinformatics, The University of Michigan, Ann Arbor, MI 48109, United States of America
| | - David S. Lorberbaum
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
- Cellular and Molecular Biology Program, The University of Michigan, Ann Arbor, MI 48109, United States of America
| | - Aaron M. Udager
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
| | - Jane Y. Song
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
- Cellular and Molecular Biology Program, The University of Michigan, Ann Arbor, MI 48109, United States of America
| | - Neil Richards
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
| | - David S. Parker
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
| | - Lisa A. Johnson
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
| | - Benjamin L. Allen
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
- * E-mail: (DLG); (SB); (BLA)
| | - Scott Barolo
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
- * E-mail: (DLG); (SB); (BLA)
| | - Deborah L. Gumucio
- Department of Cell and Developmental Biology, The University of Michigan, Ann Arbor, MI 48109, United States of America
- * E-mail: (DLG); (SB); (BLA)
| |
Collapse
|
56
|
Handling Permutation in Sequence Comparison: Genome-Wide Enhancer Prediction in Vertebrates by a Novel Non-Linear Alignment Scoring Principle. PLoS One 2015; 10:e0141487. [PMID: 26505748 PMCID: PMC4624239 DOI: 10.1371/journal.pone.0141487] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2015] [Accepted: 10/08/2015] [Indexed: 01/01/2023] Open
Abstract
Enhancers have been described to evolve by permutation without changing function. This has posed the problem of how to predict enhancer elements that are hidden from alignment-based approaches due to the loss of co-linearity. Alignment-free algorithms have been proposed as one possible solution. However, this approach is hampered by several problems inherent to its underlying working principle. Here we present a new approach, which combines the power of alignment and alignment-free techniques into one algorithm. It allows the prediction of enhancers based on the query and target sequence only, no matter whether the regulatory logic is co-linear or reshuffled. To test our novel approach, we employ it for the prediction of enhancers across the evolutionary distance of ~450Myr between human and medaka. We demonstrate its efficacy by subsequent in vivo validation resulting in 82% (9/11) of the predicted medaka regions showing reporter activity. These include five candidates with partially co-linear and four with reshuffled motif patterns. Orthology in flanking genes and conservation of the detected co-linear motifs indicates that those candidates are likely functionally equivalent enhancers. In sum, our results demonstrate that the proposed principle successfully predicts mutated as well as permuted enhancer regions at an encouragingly high rate.
Collapse
|
57
|
Abstract
Transcriptional enhancers direct precise on-off patterns of gene expression during development. To explore the basis for this precision, we conducted a high-throughput analysis of the Otx-a enhancer, which mediates expression in the neural plate of Ciona embryos in response to fibroblast growth factor (FGF) signaling and a localized GATA determinant. We provide evidence that enhancer specificity depends on submaximal recognition motifs having reduced binding affinities ("suboptimization"). Native GATA and ETS (FGF) binding sites contain imperfect matches to consensus motifs. Perfect matches mediate robust but ectopic patterns of gene expression. The native sites are not arranged at optimal intervals, and subtle changes in their spacing alter enhancer activity. Multiple tiers of enhancer suboptimization produce specific, but weak, patterns of expression, and we suggest that clusters of weak enhancers, including certain "superenhancers," circumvent this trade-off in specificity and activity.
Collapse
Affiliation(s)
- Emma K Farley
- Department of Molecular and Cell Biology, Division of Genetics, Genomics and Development, Center for Integrative Genomics, University of California, Berkeley, CA 94720-3200, USA. Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA.
| | - Katrina M Olson
- Department of Molecular and Cell Biology, Division of Genetics, Genomics and Development, Center for Integrative Genomics, University of California, Berkeley, CA 94720-3200, USA. Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Wei Zhang
- Department of Medicine, University of California, San Diego, CA 92093-0688, USA
| | - Alexander J Brandt
- Department of Chemistry, University of California, Berkeley, CA 94720-3200, USA
| | - Daniel S Rokhsar
- Department of Molecular and Cell Biology, Division of Genetics, Genomics and Development, Center for Integrative Genomics, University of California, Berkeley, CA 94720-3200, USA
| | - Michael S Levine
- Department of Molecular and Cell Biology, Division of Genetics, Genomics and Development, Center for Integrative Genomics, University of California, Berkeley, CA 94720-3200, USA. Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA.
| |
Collapse
|
58
|
STARR-seq - principles and applications. Genomics 2015; 106:145-150. [PMID: 26072434 DOI: 10.1016/j.ygeno.2015.06.001] [Citation(s) in RCA: 68] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Revised: 05/19/2015] [Accepted: 06/08/2015] [Indexed: 12/21/2022]
Abstract
Differential gene expression is the basis for cell type diversity in multicellular organisms and the driving force of development and differentiation. It is achieved by cell type-specific transcriptional enhancers, which are genomic DNA sequences that activate the transcription of their target genes. Their identification and characterization is fundamental to our understanding of gene regulation. Features that are associated with enhancer activity, such as regulatory factor binding or histone modifications can predict the location of enhancers. Nonetheless, enhancer activity can only be assessed by transcriptional reporter assays. Over the past years massively parallel reporter assays have been developed for large scale testing of enhancers. In this review we focus on the principles and applications of STARR-seq, a functional assay that quantifies enhancer strengths in complex candidate libraries and thus allows activity-based enhancer identification in entire genomes. We explain how STARR-seq works, discuss current uses and give an outlook to future applications.
Collapse
|
59
|
Suryamohan K, Halfon MS. Identifying transcriptional cis-regulatory modules in animal genomes. WILEY INTERDISCIPLINARY REVIEWS. DEVELOPMENTAL BIOLOGY 2015; 4:59-84. [PMID: 25704908 PMCID: PMC4339228 DOI: 10.1002/wdev.168] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 11/04/2014] [Accepted: 11/16/2014] [Indexed: 11/08/2022]
Abstract
UNLABELLED Gene expression is regulated through the activity of transcription factors (TFs) and chromatin-modifying proteins acting on specific DNA sequences, referred to as cis-regulatory elements. These include promoters, located at the transcription initiation sites of genes, and a variety of distal cis-regulatory modules (CRMs), the most common of which are transcriptional enhancers. Because regulated gene expression is fundamental to cell differentiation and acquisition of new cell fates, identifying, characterizing, and understanding the mechanisms of action of CRMs is critical for understanding development. CRM discovery has historically been challenging, as CRMs can be located far from the genes they regulate, have few readily identifiable sequence characteristics, and for many years were not amenable to high-throughput discovery methods. However, the recent availability of complete genome sequences and the development of next-generation sequencing methods have led to an explosion of both computational and empirical methods for CRM discovery in model and nonmodel organisms alike. Experimentally, CRMs can be identified through chromatin immunoprecipitation directed against TFs or histone post-translational modifications, identification of nucleosome-depleted 'open' chromatin regions, or sequencing-based high-throughput functional screening. Computational methods include comparative genomics, clustering of known or predicted TF-binding sites, and supervised machine-learning approaches trained on known CRMs. All of these methods have proven effective for CRM discovery, but each has its own considerations and limitations, and each is subject to a greater or lesser number of false-positive identifications. Experimental confirmation of predictions is essential, although shortcomings in current methods suggest that additional means of validation need to be developed. For further resources related to this article, please visit the WIREs website. CONFLICT OF INTEREST The authors have declared no conflicts of interest for this article.
Collapse
Affiliation(s)
- Kushal Suryamohan
- Department of Biochemistry, University at Buffalo-State University of New York, Buffalo, NY 14203, USA
- NY State Center of Excellence in Bioinformatics and Life Sciences, Buffalo, NY 14203, USA
| | - Marc S. Halfon
- Department of Biochemistry, University at Buffalo-State University of New York, Buffalo, NY 14203, USA
- Department of Biological Sciences, University at Buffalo-State University of New York, Buffalo, NY 14203, USA
- Department of Biomedical Informatics, University at Buffalo-State University of New York, Buffalo, NY 14203, USA
- NY State Center of Excellence in Bioinformatics and Life Sciences, Buffalo, NY 14203, USA
- Molecular and Cellular Biology Department and Program in Cancer Genetics, Roswell Park Cancer Institute, Buffalo, NY 14263, USA
| |
Collapse
|
60
|
Slattery M, Ma L, Spokony RF, Arthur RK, Kheradpour P, Kundaje A, Nègre N, Crofts A, Ptashkin R, Zieba J, Ostapenko A, Suchy S, Victorsen A, Jameel N, Grundstad AJ, Gao W, Moran JR, Rehm EJ, Grossman RL, Kellis M, White KP. Diverse patterns of genomic targeting by transcriptional regulators in Drosophila melanogaster. Genome Res 2015; 24:1224-35. [PMID: 24985916 PMCID: PMC4079976 DOI: 10.1101/gr.168807.113] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Annotation of regulatory elements and identification of the transcription-related factors (TRFs) targeting these elements are key steps in understanding how cells interpret their genetic blueprint and their environment during development, and how that process goes awry in the case of disease. One goal of the modENCODE (model organism ENCyclopedia of DNA Elements) Project is to survey a diverse sampling of TRFs, both DNA-binding and non-DNA-binding factors, to provide a framework for the subsequent study of the mechanisms by which transcriptional regulators target the genome. Here we provide an updated map of the Drosophila melanogaster regulatory genome based on the location of 84 TRFs at various stages of development. This regulatory map reveals a variety of genomic targeting patterns, including factors with strong preferences toward proximal promoter binding, factors that target intergenic and intronic DNA, and factors with distinct chromatin state preferences. The data also highlight the stringency of the Polycomb regulatory network, and show association of the Trithorax-like (Trl) protein with hotspots of DNA binding throughout development. Furthermore, the data identify more than 5800 instances in which TRFs target DNA regions with demonstrated enhancer activity. Regions of high TRF co-occupancy are more likely to be associated with open enhancers used across cell types, while lower TRF occupancy regions are associated with complex enhancers that are also regulated at the epigenetic level. Together these data serve as a resource for the research community in the continued effort to dissect transcriptional regulatory mechanisms directing Drosophila development.
Collapse
Affiliation(s)
- Matthew Slattery
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Lijia Ma
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Rebecca F Spokony
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Robert K Arthur
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Pouya Kheradpour
- Computer Science and Artificial Intelligence Laboratory (CSAIL), Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
| | - Anshul Kundaje
- Computer Science and Artificial Intelligence Laboratory (CSAIL), Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA
| | - Nicolas Nègre
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA; Université de Montpellier II and INRA, UMR1333 DGIMI, F-34095 Montpellier, France
| | - Alex Crofts
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Ryan Ptashkin
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Jennifer Zieba
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Alexander Ostapenko
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Sarah Suchy
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Alec Victorsen
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Nader Jameel
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - A Jason Grundstad
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Wenxuan Gao
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Jennifer R Moran
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - E Jay Rehm
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Robert L Grossman
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Manolis Kellis
- Computer Science and Artificial Intelligence Laboratory (CSAIL), Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA; Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Kevin P White
- Institute for Genomics & Systems Biology, Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
61
|
Barsi JC, Li E, Davidson EH. Geometric control of ciliated band regulatory states in the sea urchin embryo. Development 2015; 142:953-61. [PMID: 25655703 DOI: 10.1242/dev.117986] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The trapezoidal ciliated band (CB) of the postgastrular sea urchin embryo surrounds the oral ectoderm, separating it from adjacent embryonic territories. Once differentiated, the CB is composed of densely arranged cells bearing long cilia that endow the larva with locomotion and feeding capability. The spatial pattern from which the CB will arise is first evidenced during pregastrular stages by expression of the pioneer gene onecut. Immediately after gastrulation, the CB consists of four separate regulatory state domains, each of which expresses a unique set of transcription factors: (1) the oral apical CB, located within the apical neurogenic field; (2) the animal lateral CB, which bilaterally separates the oral from aboral ectoderm; (3) the vegetal lateral CB, which bilaterally serves as signaling centers; and (4) the vegetal oral CB, which delineates the boundary with the underlying endoderm. Remarkably, almost all of the regulatory genes specifically expressed within these domains are downregulated by interference with SoxB1 expression, implying their common activation by this factor. Here, we show how the boundaries of the CB subdomains are established, and thus ascertain the design principle by which the geometry of this unique and complex regulatory state pattern is genomically controlled. Each of these boundaries, on either side of the CB, is defined by spatially confined transcriptional repressors, the products of regulatory genes operating across the border of each subdomain. In total this requires deployment of about ten different repressors, which we identify in this work, thus exemplifying the complexity of information required for spatial regulatory organization during embryogenesis.
Collapse
Affiliation(s)
- Julius C Barsi
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA 91125, USA
| | - Enhu Li
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA 91125, USA Warp Drive Bio, LLC, 400 Technology Square, Cambridge, MA 02139, USA
| | - Eric H Davidson
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA 91125, USA
| |
Collapse
|
62
|
Crocker J, Abe N, Rinaldi L, McGregor AP, Frankel N, Wang S, Alsawadi A, Valenti P, Plaza S, Payre F, Mann RS, Stern DL. Low affinity binding site clusters confer hox specificity and regulatory robustness. Cell 2014; 160:191-203. [PMID: 25557079 DOI: 10.1016/j.cell.2014.11.041] [Citation(s) in RCA: 245] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Revised: 09/11/2014] [Accepted: 11/13/2014] [Indexed: 11/26/2022]
Abstract
In animals, Hox transcription factors define regional identity in distinct anatomical domains. How Hox genes encode this specificity is a paradox, because different Hox proteins bind with high affinity in vitro to similar DNA sequences. Here, we demonstrate that the Hox protein Ultrabithorax (Ubx) in complex with its cofactor Extradenticle (Exd) bound specifically to clusters of very low affinity sites in enhancers of the shavenbaby gene of Drosophila. These low affinity sites conferred specificity for Ubx binding in vivo, but multiple clustered sites were required for robust expression when embryos developed in variable environments. Although most individual Ubx binding sites are not evolutionarily conserved, the overall enhancer architecture-clusters of low affinity binding sites-is maintained and required for enhancer function. Natural selection therefore works at the level of the enhancer, requiring a particular density of low affinity Ubx sites to confer both specific and robust expression.
Collapse
Affiliation(s)
- Justin Crocker
- Janelia Research Campus, Howard Hughes Medical Institute, 19700 Helix Drive, Ashburn, VA 20147, USA
| | - Namiko Abe
- Columbia University Medical Center, 701 West 168(th) Street, HHSC 1104, New York, NY 10032, USA
| | - Lucrezia Rinaldi
- Columbia University Medical Center, 701 West 168(th) Street, HHSC 1104, New York, NY 10032, USA
| | - Alistair P McGregor
- Department of Biological and Medical Sciences, Oxford Brookes University, Gipsy Lane, Oxford OX3 0BP, UK
| | - Nicolás Frankel
- Departamento de Ecología, Genética y Evolución, IEGEBA-CONICET, Facultad, de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad, Universitaria, Pabellón 2, 1428 Buenos Aires, Argentina
| | - Shu Wang
- New Jersey Neuroscience Institute, 65 James Street, Edison, NJ 08820, USA
| | - Ahmad Alsawadi
- Centre de Biologie du Développement, Université de Toulouse, UPS, 31062 Cedex 9, France; CNRS, UMR5547, Centre de Biologie du Développement, Toulouse, 31062 Cedex 9, France
| | - Philippe Valenti
- Centre de Biologie du Développement, Université de Toulouse, UPS, 31062 Cedex 9, France; CNRS, UMR5547, Centre de Biologie du Développement, Toulouse, 31062 Cedex 9, France
| | - Serge Plaza
- Centre de Biologie du Développement, Université de Toulouse, UPS, 31062 Cedex 9, France; CNRS, UMR5547, Centre de Biologie du Développement, Toulouse, 31062 Cedex 9, France
| | - François Payre
- Centre de Biologie du Développement, Université de Toulouse, UPS, 31062 Cedex 9, France; CNRS, UMR5547, Centre de Biologie du Développement, Toulouse, 31062 Cedex 9, France
| | - Richard S Mann
- Columbia University Medical Center, 701 West 168(th) Street, HHSC 1104, New York, NY 10032, USA.
| | - David L Stern
- Janelia Research Campus, Howard Hughes Medical Institute, 19700 Helix Drive, Ashburn, VA 20147, USA.
| |
Collapse
|
63
|
Zhang CU, Blauwkamp TA, Burby PE, Cadigan KM. Wnt-mediated repression via bipartite DNA recognition by TCF in the Drosophila hematopoietic system. PLoS Genet 2014; 10:e1004509. [PMID: 25144371 PMCID: PMC4140642 DOI: 10.1371/journal.pgen.1004509] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2013] [Accepted: 05/30/2014] [Indexed: 11/18/2022] Open
Abstract
The Wnt/β-catenin signaling pathway plays many important roles in animal development, tissue homeostasis and human disease. Transcription factors of the TCF family mediate many Wnt transcriptional responses, promoting signal-dependent activation or repression of target gene expression. The mechanism of this specificity is poorly understood. Previously, we demonstrated that for activated targets in Drosophila, TCF/Pangolin (the fly TCF) recognizes regulatory DNA through two DNA binding domains, with the High Mobility Group (HMG) domain binding HMG sites and the adjacent C-clamp domain binding Helper sites. Here, we report that TCF/Pangolin utilizes a similar bipartite mechanism to recognize and regulate several Wnt-repressed targets, but through HMG and Helper sites whose sequences are distinct from those found in activated targets. The type of HMG and Helper sites is sufficient to direct activation or repression of Wnt regulated cis-regulatory modules, and protease digestion studies suggest that TCF/Pangolin adopts distinct conformations when bound to either HMG-Helper site pair. This repressive mechanism occurs in the fly lymph gland, the larval hematopoietic organ, where Wnt/β-catenin signaling controls prohemocytic differentiation. Our study provides a paradigm for direct repression of target gene expression by Wnt/β-catenin signaling and allosteric regulation of a transcription factor by DNA. During development and in adult tissues, cells communicate with each other through biochemical cascades known as signaling pathways. In this report, we study the Wnt signaling pathway, using the fruit fly Drosophila as a model system. This pathway is known to activate gene expression in cells receiving the Wnt signal, working through a transcription factor known as TCF. But sometimes Wnt signaling also instructs TCF to repress target gene expression. What determines whether TCF will positively or negatively regulate Wnt targets? We demonstrate that activated and repressed targets have distinct DNA sequences that dock TCF on their regulatory DNA. The type of site determines the output, i.e., activation or repression. We find that TCF adopts different conformations when bound to either DNA sequence, which most likely influences its regulatory activity. In addition, we demonstrate that Wnt-dependent repression occurs robustly in the fly larval lymph gland, the tissue responsible for generating macrophage-like cells known as hemocytes.
Collapse
Affiliation(s)
- Chen U. Zhang
- Department of Molecular, Cellular and Developmental Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Timothy A. Blauwkamp
- Department of Molecular, Cellular and Developmental Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Peter E. Burby
- Department of Molecular, Cellular and Developmental Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Ken M. Cadigan
- Department of Molecular, Cellular and Developmental Biology, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| |
Collapse
|
64
|
Slattery M, Zhou T, Yang L, Dantas Machado AC, Gordân R, Rohs R. Absence of a simple code: how transcription factors read the genome. Trends Biochem Sci 2014; 39:381-99. [PMID: 25129887 DOI: 10.1016/j.tibs.2014.07.002] [Citation(s) in RCA: 337] [Impact Index Per Article: 33.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2014] [Revised: 07/11/2014] [Accepted: 07/15/2014] [Indexed: 12/21/2022]
Abstract
Transcription factors (TFs) influence cell fate by interpreting the regulatory DNA within a genome. TFs recognize DNA in a specific manner; the mechanisms underlying this specificity have been identified for many TFs based on 3D structures of protein-DNA complexes. More recently, structural views have been complemented with data from high-throughput in vitro and in vivo explorations of the DNA-binding preferences of many TFs. Together, these approaches have greatly expanded our understanding of TF-DNA interactions. However, the mechanisms by which TFs select in vivo binding sites and alter gene expression remain unclear. Recent work has highlighted the many variables that influence TF-DNA binding, while demonstrating that a biophysical understanding of these many factors will be central to understanding TF function.
Collapse
Affiliation(s)
- Matthew Slattery
- Department of Biomedical Sciences, University of Minnesota Medical School, Duluth, MN 55812, USA; Developmental Biology Center, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Tianyin Zhou
- Molecular and Computational Biology Program, Departments of Biological Sciences, Chemistry, Physics, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Lin Yang
- Molecular and Computational Biology Program, Departments of Biological Sciences, Chemistry, Physics, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Ana Carolina Dantas Machado
- Molecular and Computational Biology Program, Departments of Biological Sciences, Chemistry, Physics, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Raluca Gordân
- Center for Genomic and Computational Biology, Departments of Biostatistics and Bioinformatics, Computer Science, and Molecular Genetics and Microbiology, Duke University, Durham, NC 27708, USA.
| | - Remo Rohs
- Molecular and Computational Biology Program, Departments of Biological Sciences, Chemistry, Physics, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA.
| |
Collapse
|
65
|
Barrière A, Ruvinsky I. Pervasive divergence of transcriptional gene regulation in Caenorhabditis nematodes. PLoS Genet 2014; 10:e1004435. [PMID: 24968346 PMCID: PMC4072541 DOI: 10.1371/journal.pgen.1004435] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Accepted: 04/28/2014] [Indexed: 12/18/2022] Open
Abstract
Because there is considerable variation in gene expression even between closely related species, it is clear that gene regulatory mechanisms evolve relatively rapidly. Because primary sequence conservation is an unreliable proxy for functional conservation of cis-regulatory elements, their assessment must be carried out in vivo. We conducted a survey of cis-regulatory conservation between C. elegans and closely related species C. briggsae, C. remanei, C. brenneri, and C. japonica. We tested enhancers of eight genes from these species by introducing them into C. elegans and analyzing the expression patterns they drove. Our results support several notable conclusions. Most exogenous cis elements direct expression in the same cells as their C. elegans orthologs, confirming gross conservation of regulatory mechanisms. However, the majority of exogenous elements, when placed in C. elegans, also directed expression in cells outside endogenous patterns, suggesting functional divergence. Recurrent ectopic expression of different promoters in the same C. elegans cells may reflect biases in the directions in which expression patterns can evolve due to shared regulatory logic of coexpressed genes. The fact that, despite differences between individual genes, several patterns repeatedly emerged from our survey, encourages us to think that general rules governing regulatory evolution may exist and be discoverable.
Collapse
Affiliation(s)
- Antoine Barrière
- Department of Ecology and Evolution and Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (AB); (IR)
| | - Ilya Ruvinsky
- Department of Ecology and Evolution and Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- Department of Organismal Biology and Anatomy, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (AB); (IR)
| |
Collapse
|
66
|
Abstract
Instructions for when, where and to what level each gene should be expressed are encoded within regulatory sequences. The importance of motifs recognized by DNA-binding regulators has long been known, but their extensive characterization afforded by recent technologies only partly accounts for how regulatory instructions are encoded in the genome. Here, we review recent advances in our understanding of regulatory sequences that influence transcription and go beyond the description of motifs. We discuss how understanding different aspects of the sequence-encoded regulation can help to unravel the genotype-phenotype relationship, which would lead to a more accurate and mechanistic interpretation of personal genome sequences.
Collapse
Affiliation(s)
- Michal Levo
- Department of Molecular Cell Biology, and Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Eran Segal
- Department of Molecular Cell Biology, and Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
67
|
Schwarzer W, Spitz F. The architecture of gene expression: integrating dispersed cis-regulatory modules into coherent regulatory domains. Curr Opin Genet Dev 2014; 27:74-82. [PMID: 24907448 DOI: 10.1016/j.gde.2014.03.014] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2014] [Revised: 03/28/2014] [Accepted: 03/31/2014] [Indexed: 02/06/2023]
Abstract
Specificity and precision of expression are essential for the genes that regulate developmental processes. The specialized cis-acting modules, such as enhancers, that define gene expression patterns can be distributed across large regions, raising questions about the nature of the mechanisms that underline their action. Recent data has exposed the structural 3D context in which these long-range enhancers are operating. Here, we present how these studies shed new light on principles driving long-distance regulatory relationships. We discuss the molecular mechanisms that enable and accompany the action of long-range acting elements and the integration of multiple distributed regulatory inputs into the coherent and specific regulatory programs that are key to embryonic development.
Collapse
Affiliation(s)
- Wibke Schwarzer
- Developmental Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - François Spitz
- Developmental Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany.
| |
Collapse
|
68
|
Lettice LA, Williamson I, Devenney PS, Kilanowski F, Dorin J, Hill RE. Development of five digits is controlled by a bipartite long-range cis-regulator. Development 2014; 141:1715-25. [PMID: 24715461 PMCID: PMC3978833 DOI: 10.1242/dev.095430] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Conservation within intergenic DNA often highlights regulatory elements that control gene expression from a long range. How conservation within a single element relates to regulatory information and how internal composition relates to function is unknown. Here, we examine the structural features of the highly conserved ZRS (also called MFCS1) cis-regulator responsible for the spatiotemporal control of Shh in the limb bud. By systematically dissecting the ZRS, both in transgenic assays and within in the endogenous locus, we show that the ZRS is, in effect, composed of two distinct domains of activity: one domain directs spatiotemporal activity but functions predominantly from a short range, whereas a second domain is required to promote long-range activity. We show further that these two domains encode activities that are highly integrated and that the second domain is crucial in promoting the chromosomal conformational changes correlated with gene activity. During limb bud development, these activities encoded by the ZRS are interpreted differently by the fore limbs and the hind limbs; in the absence of the second domain there is no Shh activity in the fore limb, and in the hind limb low levels of Shh lead to a variant digit pattern ranging from two to four digits. Hence, in the embryo, the second domain stabilises the developmental programme providing a buffer for SHH morphogen activity and this ensures that five digits form in both sets of limbs.
Collapse
Affiliation(s)
- Laura A Lettice
- MRC-Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Rd, Edinburgh EH4 2XU, UK
| | | | | | | | | | | |
Collapse
|
69
|
Abstract
Transcription factor binding sites (TFBSs) on the DNA are generally accepted as the key nodes of gene control. However, the multitudes of TFBSs identified in genome-wide studies, some of them seemingly unconstrained in evolution, have prompted the view that in many cases TF binding may serve no biological function. Yet, insights from transcriptional biochemistry, population genetics and functional genomics suggest that rather than segregating into 'functional' or 'non-functional', TFBS inputs to their target genes may be generally cumulative, with varying degrees of potency and redundancy. As TFBS redundancy can be diminished by mutations and environmental stress, some of the apparently 'spurious' sites may turn out to be important for maintaining adequate transcriptional regulation under these conditions. This has significant implications for interpreting the phenotypic effects of TFBS mutations, particularly in the context of genome-wide association studies for complex traits.
Collapse
|
70
|
Dissection of thousands of cell type-specific enhancers identifies dinucleotide repeat motifs as general enhancer features. Genome Res 2014; 24:1147-56. [PMID: 24714811 PMCID: PMC4079970 DOI: 10.1101/gr.169243.113] [Citation(s) in RCA: 99] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Gene expression is determined by genomic elements called enhancers, which contain short motifs bound by different transcription factors (TFs). However, how enhancer sequences and TF motifs relate to enhancer activity is unknown, and general sequence requirements for enhancers or comprehensive sets of important enhancer sequence elements have remained elusive. Here, we computationally dissect thousands of functional enhancer sequences from three different Drosophila cell lines. We find that the enhancers display distinct cis-regulatory sequence signatures, which are predictive of the enhancers’ cell type-specific or broad activities. These signatures contain transcription factor motifs and a novel class of enhancer sequence elements, dinucleotide repeat motifs (DRMs). DRMs are highly enriched in enhancers, particularly in enhancers that are broadly active across different cell types. We experimentally validate the importance of the identified TF motifs and DRMs for enhancer function and show that they can be sufficient to create an active enhancer de novo from a nonfunctional sequence. The function of DRMs as a novel class of general enhancer features that are also enriched in human regulatory regions might explain their implication in several diseases and provides important insights into gene regulation.
Collapse
|
71
|
Rouault H, Santolini M, Schweisguth F, Hakim V. Imogene: identification of motifs and cis-regulatory modules underlying gene co-regulation. Nucleic Acids Res 2014; 42:6128-45. [PMID: 24682824 PMCID: PMC4041412 DOI: 10.1093/nar/gku209] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Cis-regulatory modules (CRMs) and motifs play a central role in tissue and condition-specific gene expression. Here we present Imogene, an ensemble of statistical tools that we have developed to facilitate their identification and implemented in a publicly available software. Starting from a small training set of mammalian or fly CRMs that drive similar gene expression profiles, Imogene determines de novocis-regulatory motifs that underlie this co-expression. It can then predict on a genome-wide scale other CRMs with a regulatory potential similar to the training set. Imogene bypasses the need of large datasets for statistical analyses by making central use of the information provided by the sequenced genomes of multiple species, based on the developed statistical tools and explicit models for transcription factor binding site evolution. We test Imogene on characterized tissue-specific mouse developmental CRMs. Its ability to identify CRMs with the same specificity based on its de novo created motifs is comparable to that of previously evaluated ‘motif-blind’ methods. We further show, both in flies and in mammals, that Imogene de novo generated motifs are sufficient to discriminate CRMs related to different developmental programs. Notably, purely relying on sequence data, Imogene performs as well in this discrimination task as a previously reported learning algorithm based on Chromatin Immunoprecipitation (ChIP) data for multiple transcription factors at multiple developmental stages.
Collapse
Affiliation(s)
- Hervé Rouault
- Developmental and Stem Cell Biology Department, Institut Pasteur, F-75015 Paris, France CNRS, URA2578, F-75015 Paris, France
| | - Marc Santolini
- Laboratoire de Physique Statistique, CNRS, École Normale Supérieure, Université P. et M. Curie, Université Paris-Diderot
| | - François Schweisguth
- Developmental and Stem Cell Biology Department, Institut Pasteur, F-75015 Paris, France CNRS, URA2578, F-75015 Paris, France
| | - Vincent Hakim
- Laboratoire de Physique Statistique, CNRS, École Normale Supérieure, Université P. et M. Curie, Université Paris-Diderot
| |
Collapse
|
72
|
Atkinson TJ, Halfon MS. Regulation of gene expression in the genomic context. Comput Struct Biotechnol J 2014; 9:e201401001. [PMID: 24688749 PMCID: PMC3962188 DOI: 10.5936/csbj.201401001] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Revised: 12/10/2013] [Accepted: 12/29/2013] [Indexed: 11/22/2022] Open
Abstract
Metazoan life is dependent on the proper temporal and spatial control of gene expression within the many cells-essentially all with the identical genome-that make up the organism. While much is understood about how individual gene regulatory elements function, many questions remain about how they interact to maintain correct regulation globally throughout the genome. In this review we summarize the basic features and functions of the crucial regulatory elements promoters, enhancers, and insulators and discuss some of the ways in which proper interactions between these elements is realized. We focus in particular on the role of core promoter sequences and propose explanations for some of the contradictory results seen in experiments aimed at understanding insulator function. We suggest that gene regulation depends on local genomic context and argue that more holistic in vivo investigations that take into account multiple local features will be necessary to understand how genome-wide gene regulation is maintained.
Collapse
Affiliation(s)
- Taylor J Atkinson
- Department of Biochemistry, University at Buffalo-State University of New York, Buffalo, NY 14203, USA
- NY State Center of Excellence in Bioinformatics and Life Sciences, Buffalo, NY 14203, USA
| | - Marc S Halfon
- Department of Biochemistry, University at Buffalo-State University of New York, Buffalo, NY 14203, USA
- Department of Biological Sciences, University at Buffalo-State University of New York, Buffalo, NY 14203, USA
- NY State Center of Excellence in Bioinformatics and Life Sciences, Buffalo, NY 14203, USA
- Molecular and Cellular Biology Department and Program in Cancer Genetics, Roswell Park Cancer Institute, Buffalo, NY 14263, USA
| |
Collapse
|
73
|
Erceg J, Saunders TE, Girardot C, Devos DP, Hufnagel L, Furlong EEM. Subtle changes in motif positioning cause tissue-specific effects on robustness of an enhancer's activity. PLoS Genet 2014; 10:e1004060. [PMID: 24391522 PMCID: PMC3879207 DOI: 10.1371/journal.pgen.1004060] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2013] [Accepted: 11/11/2013] [Indexed: 12/14/2022] Open
Abstract
Deciphering the specific contribution of individual motifs within cis-regulatory modules (CRMs) is crucial to understanding how gene expression is regulated and how this process is affected by sequence variation. But despite vast improvements in the ability to identify where transcription factors (TFs) bind throughout the genome, we are limited in our ability to relate information on motif occupancy to function from sequence alone. Here, we engineered 63 synthetic CRMs to systematically assess the relationship between variation in the content and spacing of motifs within CRMs to CRM activity during development using Drosophila transgenic embryos. In over half the cases, very simple elements containing only one or two types of TF binding motifs were capable of driving specific spatio-temporal patterns during development. Different motif organizations provide different degrees of robustness to enhancer activity, ranging from binary on-off responses to more subtle effects including embryo-to-embryo and within-embryo variation. By quantifying the effects of subtle changes in motif organization, we were able to model biophysical rules that explain CRM behavior and may contribute to the spatial positioning of CRM activity in vivo. For the same enhancer, the effects of small differences in motif positions varied in developmentally related tissues, suggesting that gene expression may be more susceptible to sequence variation in one tissue compared to another. This result has important implications for human eQTL studies in which many associated mutations are found in cis-regulatory regions, though the mechanism for how they affect tissue-specific gene expression is often not understood.
Collapse
Affiliation(s)
- Jelena Erceg
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Timothy E. Saunders
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Charles Girardot
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Damien P. Devos
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Lars Hufnagel
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Eileen E. M. Furlong
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
- * E-mail:
| |
Collapse
|
74
|
Glassford WJ, Rebeiz M. Assessing constraints on the path of regulatory sequence evolution. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130026. [PMID: 24218638 PMCID: PMC3826499 DOI: 10.1098/rstb.2013.0026] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Structural and functional constraints are known to play a major role in restricting the path of evolution of protein activities. However, constraints acting on evolving transcriptional regulatory sequences, e.g. enhancers, are largely unknown. Recently, we elucidated how a novel expression pattern of the Neprilysin-1 (Nep1) gene in the optic lobe of Drosophila santomea evolved via co-option of existing enhancer activities. Drosophila santomea, which has diverged from Drosophila yakuba by approximately 400 000 years has accumulated four fixed mutations that each contribute to the full activity of this enhancer. Recreating and testing the optic lobe enhancer of the ancestor of D. santomea and D. yakuba revealed that the strong D. santomea enhancer activity evolved from a weak ancestral activity. Because each mutation on the path from the D. yakuba/santomea ancestor to modern-day D. santomea contributes to the newly derived optic lobe enhancer activity, we sought here to use this system to study the path of evolution of enhancer sequences. We inferred likely paths of evolution of this enhancer by observing the transcriptional output of all possible intermediate steps between the ancestral D. yakuba/santomea enhancer and the modern D. santomea enhancer. Many possible paths had epistatic and cooperative effects. Furthermore, we found that several paths significantly increased ectopic transcriptional activity or affected existing enhancer activities from which the novel activity was co-opted. We suggest that these attributes highlight constraints that guide the path of evolution of enhancers.
Collapse
Affiliation(s)
| | - Mark Rebeiz
- Department of Biological Sciences, University of Pittsburgh, 4249 Fifth Avenue, Pittsburgh, PA 15260, USA
| |
Collapse
|
75
|
Ramos AI, Barolo S. Low-affinity transcription factor binding sites shape morphogen responses and enhancer evolution. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130018. [PMID: 24218631 DOI: 10.1098/rstb.2013.0018] [Citation(s) in RCA: 82] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
In the era of functional genomics, the role of transcription factor (TF)-DNA binding affinity is of increasing interest: for example, it has recently been proposed that low-affinity genomic binding events, though frequent, are functionally irrelevant. Here, we investigate the role of binding site affinity in the transcriptional interpretation of Hedgehog (Hh) morphogen gradients. We noted that enhancers of several Hh-responsive Drosophila genes have low predicted affinity for Ci, the Gli family TF that transduces Hh signalling in the fly. Contrary to our initial hypothesis, improving the affinity of Ci/Gli sites in enhancers of dpp, wingless and stripe, by transplanting optimal sites from the patched gene, did not result in ectopic responses to Hh signalling. Instead, we found that these enhancers require low-affinity binding sites for normal activation in regions of relatively low signalling. When Ci/Gli sites in these enhancers were altered to improve their binding affinity, we observed patterning defects in the transcriptional response that are consistent with a switch from Ci-mediated activation to Ci-mediated repression. Synthetic transgenic reporters containing isolated Ci/Gli sites confirmed this finding in imaginal discs. We propose that the requirement for gene activation by Ci in the regions of low-to-moderate Hh signalling results in evolutionary pressure favouring weak binding sites in enhancers of certain Hh target genes.
Collapse
Affiliation(s)
- Andrea I Ramos
- Department of Cell and Developmental Biology and Program in Cellular and Molecular Biology, University of Michigan Medical School, , Ann Arbor, MI 48109, USA
| | | |
Collapse
|
76
|
Domené S, Bumaschny VF, de Souza FSJ, Franchini LF, Nasif S, Low MJ, Rubinstein M. Enhancer turnover and conserved regulatory function in vertebrate evolution. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130027. [PMID: 24218639 DOI: 10.1098/rstb.2013.0027] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Mutations in regulatory regions including enhancers are an important source of variation and innovation during evolution. Enhancers can evolve by changes in the sequence, arrangement and repertoire of transcription factor binding sites, but whole enhancers can also be lost or gained in certain lineages in a process of turnover. The proopiomelanocortin gene (Pomc), which encodes a prohormone, is expressed in the pituitary and hypothalamus of all jawed vertebrates. We have previously described that hypothalamic Pomc expression in mammals is controlled by two enhancers-nPE1 and nPE2-that are derived from transposable elements and that presumably replaced the ancestral neuronal Pomc regulatory regions. Here, we show that nPE1 and nPE2, even though they are mammalian novelties with no homologous counterpart in other vertebrates, nevertheless can drive gene expression specifically to POMC neurons in the hypothalamus of larval and adult transgenic zebrafish. This indicates that when neuronal Pomc enhancers originated de novo during early mammalian evolution, the newly created cis- and trans-codes were similar to the ancestral ones. We also identify the neuronal regulatory region of zebrafish pomca and confirm that it is not homologous to the mammalian enhancers. Our work sheds light on the process of gene regulatory evolution by showing how a locus can undergo enhancer turnover and nevertheless maintain the ancestral transcriptional output.
Collapse
Affiliation(s)
- Sabina Domené
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, , C1428ADN Buenos Aires, Argentina
| | | | | | | | | | | | | |
Collapse
|
77
|
Rubinstein M, de Souza FSJ. Evolution of transcriptional enhancers and animal diversity. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130017. [PMID: 24218630 DOI: 10.1098/rstb.2013.0017] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Deciphering the genetic bases that drive animal diversity is one of the major challenges of modern biology. Although four decades ago it was proposed that animal evolution was mainly driven by changes in cis-regulatory DNA elements controlling gene expression rather than in protein-coding sequences, only now are powerful bioinformatics and experimental approaches available to accelerate studies into how the evolution of transcriptional enhancers contributes to novel forms and functions. In the introduction to this Theme Issue, we start by defining the general properties of transcriptional enhancers, such as modularity and the coexistence of tight sequence conservation with transcription factor-binding site shuffling as different mechanisms that maintain the enhancer grammar over evolutionary time. We discuss past and current methods used to identify cell-type-specific enhancers and provide examples of how enhancers originate de novo, change and are lost in particular lineages. We then focus in the central part of this Theme Issue on analysing examples of how the molecular evolution of enhancers may change form and function. Throughout this introduction, we present the main findings of the articles, reviews and perspectives contributed to this Theme Issue that together illustrate some of the great advances and current frontiers in the field.
Collapse
Affiliation(s)
- Marcelo Rubinstein
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, , C1428ADN Buenos Aires, Argentina
| | | |
Collapse
|
78
|
Menoret D, Santolini M, Fernandes I, Spokony R, Zanet J, Gonzalez I, Latapie Y, Ferrer P, Rouault H, White KP, Besse P, Hakim V, Aerts S, Payre F, Plaza S. Genome-wide analyses of Shavenbaby target genes reveals distinct features of enhancer organization. Genome Biol 2013; 14:R86. [PMID: 23972280 PMCID: PMC4053989 DOI: 10.1186/gb-2013-14-8-r86] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2013] [Accepted: 08/23/2013] [Indexed: 12/17/2022] Open
Abstract
Background Developmental programs are implemented by regulatory interactions between Transcription Factors (TFs) and their target genes, which remain poorly understood. While recent studies have focused on regulatory cascades of TFs that govern early development, little is known about how the ultimate effectors of cell differentiation are selected and controlled. We addressed this question during late Drosophila embryogenesis, when the finely tuned expression of the TF Ovo/Shavenbaby (Svb) triggers the morphological differentiation of epidermal trichomes. Results We defined a sizeable set of genes downstream of Svb and used in vivo assays to delineate 14 enhancers driving their specific expression in trichome cells. Coupling computational modeling to functional dissection, we investigated the regulatory logic of these enhancers. Extending the repertoire of epidermal effectors using genome-wide approaches showed that the regulatory models learned from this first sample are representative of the whole set of trichome enhancers. These enhancers harbor remarkable features with respect to their functional architectures, including a weak or non-existent clustering of Svb binding sites. The in vivo function of each site relies on its intimate context, notably the flanking nucleotides. Two additional cis-regulatory motifs, present in a broad diversity of composition and positioning among trichome enhancers, critically contribute to enhancer activity. Conclusions Our results show that Svb directly regulates a large set of terminal effectors of the remodeling of epidermal cells. Further, these data reveal that trichome formation is underpinned by unexpectedly diverse modes of regulation, providing fresh insights into the functional architecture of enhancers governing a terminal differentiation program.
Collapse
|
79
|
Smith RP, Riesenfeld SJ, Holloway AK, Li Q, Murphy KK, Feliciano NM, Orecchia L, Oksenberg N, Pollard KS, Ahituv N. A compact, in vivo screen of all 6-mers reveals drivers of tissue-specific expression and guides synthetic regulatory element design. Genome Biol 2013; 14:R72. [PMID: 23867016 PMCID: PMC4054837 DOI: 10.1186/gb-2013-14-7-r72] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2013] [Revised: 03/08/2013] [Accepted: 07/18/2013] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND Large-scale annotation efforts have improved our ability to coarsely predict regulatory elements throughout vertebrate genomes. However, it is unclear how complex spatiotemporal patterns of gene expression driven by these elements emerge from the activity of short, transcription factor binding sequences. RESULTS We describe a comprehensive promoter extension assay in which the regulatory potential of all 6 base-pair (bp) sequences was tested in the context of a minimal promoter. To enable this large-scale screen, we developed algorithms that use a reverse-complement aware decomposition of the de Bruijn graph to design a library of DNA oligomers incorporating every 6-bp sequence exactly once. Our library multiplexes all 4,096 unique 6-mers into 184 double-stranded 15-bp oligomers, which is sufficiently compact for in vivo testing. We injected each multiplexed construct into zebrafish embryos and scored GFP expression in 15 tissues at two developmental time points. Twenty-seven constructs produced consistent expression patterns, with the majority doing so in only one tissue. Functional sequences are enriched near biologically relevant genes, match motifs for developmental transcription factors, and are required for enhancer activity. By concatenating tissue-specific functional sequences, we generated completely synthetic enhancers for the notochord, epidermis, spinal cord, forebrain and otic lateral line, and show that short regulatory sequences do not always function modularly. CONCLUSIONS This work introduces a unique in vivo catalog of short, functional regulatory sequences and demonstrates several important principles of regulatory element organization. Furthermore, we provide resources for designing compact, reverse-complement aware k-mer libraries.
Collapse
Affiliation(s)
- Robin P Smith
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| | - Samantha J Riesenfeld
- Gladstone Institutes, University of California San Francisco, 1650 Owens St, San Francisco, CA 94158, USA
| | - Alisha K Holloway
- Gladstone Institutes, University of California San Francisco, 1650 Owens St, San Francisco, CA 94158, USA
| | - Qiang Li
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Current address: Institute for Pediatrics, Translational Research Center for Development and Disease, Children's Hospital of Fudan University, Shanghai, 201102, China
| | - Karl K Murphy
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| | - Natalie M Feliciano
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| | - Lorenzo Orecchia
- Division of Biostatistics, University of California San Francisco, 1650 Owens St, CA 94158, USA
| | - Nir Oksenberg
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| | - Katherine S Pollard
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Gladstone Institutes, University of California San Francisco, 1650 Owens St, San Francisco, CA 94158, USA
- Department of Mathematics, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, 1550 4th St, San Francisco, CA 94158, USA
| |
Collapse
|
80
|
Harmston N, Lenhard B. Chromatin and epigenetic features of long-range gene regulation. Nucleic Acids Res 2013; 41:7185-99. [PMID: 23766291 PMCID: PMC3753629 DOI: 10.1093/nar/gkt499] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The precise regulation of gene transcription during metazoan development is controlled by a complex system of interactions between transcription factors, histone modifications and modifying enzymes and chromatin conformation. Developments in chromosome conformation capture technologies have revealed that interactions between regions of chromatin are pervasive and highly cell-type specific. The movement of enhancers and promoters in and out of higher-order chromatin structures within the nucleus are associated with changes in expression and histone modifications. However, the factors responsible for mediating these changes and determining enhancer:promoter specificity are still not completely known. In this review, we summarize what is known about the patterns of epigenetic and chromatin features characteristic of elements involved in long-range interactions. In addition, we review the insights into both local and global patterns of chromatin interactions that have been revealed by the latest experimental and computational methods.
Collapse
Affiliation(s)
- Nathan Harmston
- MRC Clinical Sciences Centre, Faculty of Medicine, Imperial College, London W12 0NN, UK, Institute of Clinical Sciences, Faculty of Medicine, Imperial College, London W12 0NN, UK and Department of Informatics, University of Bergen, Thromøhlensgate 55, N-5008 Bergen, Norway
| | | |
Collapse
|
81
|
Wang C, Zhang MQ, Zhang Z. Computational identification of active enhancers in model organisms. GENOMICS, PROTEOMICS & BIOINFORMATICS 2013; 11:142-50. [PMID: 23685394 PMCID: PMC4357786 DOI: 10.1016/j.gpb.2013.04.002] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2012] [Revised: 04/01/2013] [Accepted: 04/20/2013] [Indexed: 12/11/2022]
Abstract
As a class of cis-regulatory elements, enhancers were first identified as the genomic regions that are able to markedly increase the transcription of genes nearly 30years ago. Enhancers can regulate gene expression in a cell-type specific and developmental stage specific manner. Although experimental technologies have been developed to identify enhancers genome-wide, the design principle of the regulatory elements and the way they rewire the transcriptional regulatory network tempo-spatially are far from clear. At present, developing predictive methods for enhancers, particularly for the cell-type specific activity of enhancers, is central to computational biology. In this review, we survey the current computational approaches for active enhancer prediction and discuss future directions.
Collapse
Affiliation(s)
- Chengqi Wang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Michael Q. Zhang
- Department of Molecular Cell Biology, Center for Systems Biology, University of Texas at Dallas, Richardson, TX 75080, USA
- Bioinformatics Division, Center for Synthetic and Systems Biology, TNLIST, Tsinghua University, Beijing 100084, China
| | - Zhihua Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
82
|
Dickel DE, Visel A, Pennacchio LA. Functional anatomy of distant-acting mammalian enhancers. Philos Trans R Soc Lond B Biol Sci 2013; 368:20120359. [PMID: 23650633 DOI: 10.1098/rstb.2012.0359] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Transcriptional enhancers are a major class of functional element embedded in the vast non-coding portion of the human genome. Acting over large genomic distances, enhancers play critical roles in the tissue and cell type-specific regulation of genes, and there is mounting evidence that they contribute to the aetiology of many human diseases. Methods for genome-wide mapping of enhancer regions are now available, but the functional architecture contained within human enhancer elements remains unclear. Here, we review recent approaches aimed at understanding the functional anatomy of individual enhancer elements, using systematic qualitative and quantitative assessments of mammalian enhancer variants in cultured cells and in vivo. These studies provide direct insight into common architectural characteristics of enhancers including the presence of multiple transcription factor-binding sites and the mixture of both transcriptionally activating and repressing domains within the same enhancer. Despite such progress in understanding the functional composition of enhancers, the inherent complexities of enhancer anatomy continue to limit our ability to predict the impact of sequence changes on in vivo enhancer function. While providing an initial glimpse into the mutability of mammalian enhancers, these observations highlight the continued need for experimental enhancer assessment as genome sequencing becomes routine in the clinic.
Collapse
Affiliation(s)
- D E Dickel
- Genomics Division, MS 84-171, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | | |
Collapse
|
83
|
Slattery M, Nègre N, White KP. Interpreting the regulatory genome: the genomics of transcription factor function in Drosophila melanogaster. Brief Funct Genomics 2013; 11:336-46. [PMID: 23023663 DOI: 10.1093/bfgp/els034] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Researchers have now had access to the fully sequenced Drosophila melanogaster genome for over a decade, and the sequenced genomes of 11 additional Drosophila species have been available for almost 5 years, with more species' genomes becoming available every year [Adams MD, Celniker SE, Holt RA, et al. The genome sequence of Drosophila melanogaster. Science 2000;287:2185-95; Clark AG, Eisen MB, Smith DR, et al. Evolution of genes and genomes on the Drosophila phylogeny. Nature 2007;450:203-18]. Although the best studied of the D. melanogaster transcription factors (TFs) were cloned before sequencing of the genome, the availability of sequence data promised to transform our understanding of TFs and gene regulatory networks. Sequenced genomes have allowed researchers to generate tools for high-throughput characterization of gene expression levels, genome-wide TF localization and analyses of evolutionary constraints on DNA elements across multiple species. With an estimated 700 DNA-binding proteins in the Drosophila genome, it will be many years before each potential sequence-specific TF is studied in detail, yet the last decade of functional genomics research has already impacted our view of gene regulatory networks and TF DNA recognition.
Collapse
Affiliation(s)
- Matthew Slattery
- Institute for Genomics & Systems Biology, Chicago, IL 60637, USA
| | | | | |
Collapse
|
84
|
Quadrana L, Almeida J, Otaiza SN, Duffy T, Corrêa da Silva JV, de Godoy F, Asís R, Bermúdez L, Fernie AR, Carrari F, Rossi M. Transcriptional regulation of tocopherol biosynthesis in tomato. PLANT MOLECULAR BIOLOGY 2013; 81:309-25. [PMID: 23247837 DOI: 10.1007/s11103-012-0001-4] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2012] [Accepted: 12/10/2012] [Indexed: 05/21/2023]
Abstract
Tocopherols, compounds with vitamin E (VTE) activity, are potent lipid-soluble antioxidants synthesized only by photosynthetic organisms. Their biosynthesis requires the condensation of phytyl-diphosphate and homogentisate, derived from the methylerythritol phosphate (MEP) and shikimate pathways (SK), respectively. These metabolic pathways are central in plant chloroplast metabolism and are involved in the biosynthesis of important molecules such as chlorophyll, carotenoids, aromatic amino-acids and prenylquinones. In the last decade, few studies have provided insights into the regulation of VTE biosynthesis and its accumulation. However, the pathway regulatory mechanism/s at mRNA level remains unclear. We have recently identified a collection of tomato genes involved in tocopherol biosynthesis. In this work, by a dedicated qPCR array platform, the transcript levels of 47 genes, including paralogs, were determined in leaves and across fruit development. Expression data were analyzed for correlation with tocopherol profiles by coregulation network and neural clustering approaches. The results showed that tocopherol biosynthesis is controlled both temporally and spatially however total tocopherol content remains constant. These analyses exposed 18 key genes from MEP, SK, phytol recycling and VTE-core pathways highly associated with VTE content in leaves and fruits. Moreover, genomic analyses of promoter regions suggested that the expression of the tocopherol-core pathway genes is trancriptionally coregulated with specific genes of the upstream pathways. Whilst the transcriptional profiles of the precursor pathway genes would suggest an increase in VTE content across fruit development, the data indicate that in the M82 cultivar phytyl diphosphate supply limits tocopherol biosynthesis in later fruit stages. This is in part due to the decreasing transcript levels of geranylgeranyl reductase (GGDR) which restricts the isoprenoid precursor availability. As a proof of concept, by analyzing a collection of Andean landrace tomato genotypes, the role of the pinpointed genes in determining fruit tocopherol content was confirmed. The results uncovered a finely tuned regulation able to shift the precursor pathways controlling substrate influx for VTE biosynthesis and overcoming endogenous competition for intermediates. The whole set of data allowed to propose that 1-deoxy-D-xylulose-5-phosphate synthase and GGDR encoding genes, which determine phytyl-diphosphate availability, together with enzyme encoding genes involved in chlorophyll-derived phytol metabolism appear as the most plausible targets to be engineered aiming to improve tomato fruit nutritional value.
Collapse
Affiliation(s)
- Leandro Quadrana
- Instituto de Biotecnología, Instituto Nacional de Tecnología Agropecuaria and Consejo Nacional de Investigaciones Científicas y Técnicas, B1712WAA, Castelar, Argentina.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
85
|
Wunderlich Z, Bragdon MD, Eckenrode KB, Lydiard-Martin T, Pearl-Waserman S, DePace AH. Dissecting sources of quantitative gene expression pattern divergence between Drosophila species. Mol Syst Biol 2013; 8:604. [PMID: 22893002 PMCID: PMC3435502 DOI: 10.1038/msb.2012.35] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2012] [Accepted: 07/12/2012] [Indexed: 12/21/2022] Open
Abstract
Gene expression patterns can diverge between species due to changes in a gene's regulatory DNA or changes in the proteins, e.g., transcription factors (TFs), that regulate the gene. We developed a modeling framework to uncover the sources of expression differences in blastoderm embryos of three Drosophila species, focusing on the regulatory circuit controlling expression of the hunchback (hb) posterior stripe. Using this framework and cellular-resolution expression measurements of hb and its regulating TFs, we found that changes in the expression patterns of hb's TFs account for much of the expression divergence. We confirmed our predictions using transgenic D. melanogaster lines, which demonstrate that this set of orthologous cis-regulatory elements (CREs) direct similar, but not identical, expression patterns. We related expression pattern differences to sequence changes in the CRE using a calculation of the CRE's TF binding site content. By applying this calculation in both the transgenic and endogenous contexts, we found that changes in binding site content affect sensitivity to regulating TFs and that compensatory evolution may occur in circuit components other than the CRE.
Collapse
Affiliation(s)
- Zeba Wunderlich
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | | | | | | | | | | |
Collapse
|
86
|
Understanding the Dynamics of Gene Regulatory Systems; Characterisation and Clinical Relevance of cis-Regulatory Polymorphisms. BIOLOGY 2013; 2:64-84. [PMID: 24832652 PMCID: PMC4009875 DOI: 10.3390/biology2010064] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2012] [Revised: 12/21/2012] [Accepted: 01/04/2013] [Indexed: 12/02/2022]
Abstract
Modern genetic analysis has shown that most polymorphisms associated with human disease are non-coding. Much of the functional information contained in the non-coding genome consists of cis-regulatory sequences (CRSs) that are required to respond to signal transduction cues that direct cell specific gene expression. It has been hypothesised that many diseases may be due to polymorphisms within CRSs that alter their responses to signal transduction cues. However, identification of CRSs, and the effects of allelic variation on their ability to respond to signal transduction cues, is still at an early stage. In the current review we describe the use of comparative genomics and experimental techniques that allow for the identification of CRSs building on recent advances by the ENCODE consortium. In addition we describe techniques that allow for the analysis of the effects of allelic variation and epigenetic modification on CRS responses to signal transduction cues. Using specific examples we show that the interactions driving these elements are highly complex and the effects of disease associated polymorphisms often subtle. It is clear that gaining an understanding of the functions of CRSs, and how they are affected by SNPs and epigenetic modification, is essential to understanding the genetic basis of human disease and stratification whilst providing novel directions for the development of personalised medicine.
Collapse
|
87
|
Marsman J, Horsfield JA. Long distance relationships: enhancer-promoter communication and dynamic gene transcription. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2012; 1819:1217-27. [PMID: 23124110 DOI: 10.1016/j.bbagrm.2012.10.008] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2012] [Revised: 10/18/2012] [Accepted: 10/22/2012] [Indexed: 11/27/2022]
Abstract
The three-dimensional regulation of gene transcription involves loop formation between enhancer and promoter elements, controlling spatiotemporal gene expression in multicellular organisms. Enhancers are usually located in non-coding DNA and can activate gene transcription by recruiting transcription factors, chromatin remodeling factors and RNA Polymerase II. Research over the last few years has revealed that enhancers have tell-tale characteristics that facilitate their detection by several approaches, although the hallmarks of enhancers are not always uniform. Enhancers likely play an important role in the activation of genes by functioning as a primary point of contact for transcriptional activators, and by making physical contact with gene promoters often by means of a chromatin loop. Although numerous transcriptional regulators participate in the formation of chromatin loops that bring enhancers into proximity with promoters, the mechanism(s) of enhancer-promoter connectivity remain enigmatic. Here we discuss enhancer function, review some of the many proteins shown to be involved in establishing enhancer-promoter loops, and describe the dynamics of enhancer-promoter contacts during development, differentiation and in specific cell types.
Collapse
Affiliation(s)
- Judith Marsman
- Department of Pathology, The University of Otago, Dunedin, New Zealand
| | | |
Collapse
|
88
|
Deciphering the transcriptional cis-regulatory code. Trends Genet 2012; 29:11-22. [PMID: 23102583 DOI: 10.1016/j.tig.2012.09.007] [Citation(s) in RCA: 85] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2012] [Revised: 09/24/2012] [Accepted: 09/25/2012] [Indexed: 02/07/2023]
Abstract
Information about developmental gene expression resides in defined regulatory elements, called enhancers, in the non-coding part of the genome. Although cells reliably utilize enhancers to orchestrate gene expression, a cis-regulatory code that would allow their interpretation has remained one of the greatest challenges of modern biology. In this review, we summarize studies from the past three decades that describe progress towards revealing the properties of enhancers and discuss how recent approaches are providing unprecedented insights into regulatory elements in animal genomes. Over the next years, we believe that the functional characterization of regulatory sequences in entire genomes, combined with recent computational methods, will provide a comprehensive view of genomic regulatory elements and their building blocks and will enable researchers to begin to understand the sequence basis of the cis-regulatory code.
Collapse
|
89
|
Frankel N. Multiple layers of complexity incis-regulatory regions of developmental genes. Dev Dyn 2012; 241:1857-66. [DOI: 10.1002/dvdy.23871] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/06/2012] [Indexed: 12/19/2022] Open
|
90
|
Coevolution within and between regulatory loci can preserve promoter function despite evolutionary rate acceleration. PLoS Genet 2012; 8:e1002961. [PMID: 23028368 PMCID: PMC3447958 DOI: 10.1371/journal.pgen.1002961] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2012] [Accepted: 08/06/2012] [Indexed: 11/19/2022] Open
Abstract
Phenotypes that appear to be conserved could be maintained not only by strong purifying selection on the underlying genetic systems, but also by stabilizing selection acting via compensatory mutations with balanced effects. Such coevolution has been invoked to explain experimental results, but has rarely been the focus of study. Conserved expression driven by the unc-47 promoters of Caenorhabditis elegans and C. briggsae persists despite divergence within a cis-regulatory element and between this element and the trans-regulatory environment. Compensatory changes in cis and trans are revealed when these promoters are used to drive expression in the other species. Functional changes in the C. briggsae promoter, which has experienced accelerated sequence evolution, did not lead to alteration of gene expression in its endogenous environment. Coevolution among promoter elements suggests that complex epistatic interactions within cis-regulatory elements may facilitate their divergence. Our results offer a detailed picture of regulatory evolution in which subtle, lineage-specific, and compensatory modifications of interacting cis and trans regulators together maintain conserved gene expression patterns. Some phenotypes, including gene expression patterns, are conserved between distantly related species. However, the molecular bases of those phenotypes are not necessarily conserved. Instead, regulatory DNA sequences and the proteins with which they interact can change over time with balanced effects, preserving expression patterns and concealing regulatory divergence. Coevolution between interacting molecules makes gene regulation highly species-specific, and it can be detected when the cis-regulatory DNA of one species is used to drive expression in another species. In this way, we identified regions of the C. elegans and C. briggsae unc-47 promoters that have coevolved with the lineage-specific trans-regulatory environments of these organisms. The C. briggsae promoter experienced accelerated sequence change relative to related species. All of this evolution occurred without changing the expression pattern driven by the promoter in its endogenous environment.
Collapse
|
91
|
Lelli KM, Slattery M, Mann RS. Disentangling the many layers of eukaryotic transcriptional regulation. Annu Rev Genet 2012; 46:43-68. [PMID: 22934649 DOI: 10.1146/annurev-genet-110711-155437] [Citation(s) in RCA: 159] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Regulation of gene expression in eukaryotes is an extremely complex process. In this review, we break down several critical steps, emphasizing new data and techniques that have expanded current gene regulatory models. We begin at the level of DNA sequence where cis-regulatory modules (CRMs) provide important regulatory information in the form of transcription factor (TF) binding sites. In this respect, CRMs function as instructional platforms for the assembly of gene regulatory complexes. We discuss multiple mechanisms controlling complex assembly, including cooperative DNA binding, combinatorial codes, and CRM architecture. The second section of this review places CRM assembly in the context of nucleosomes and condensed chromatin. We discuss how DNA accessibility and histone modifications contribute to TF function. Lastly, new advances in chromosomal mapping techniques have provided increased understanding of intra- and interchromosomal interactions. We discuss how these topological maps influence gene regulatory models.
Collapse
Affiliation(s)
- Katherine M Lelli
- Department of Genetics and Development, College of Physicians and Surgeons, Columbia University, New York, NY 10032, USA
| | | | | |
Collapse
|
92
|
Spitz F, Furlong EEM. Transcription factors: from enhancer binding to developmental control. Nat Rev Genet 2012; 13:613-26. [PMID: 22868264 DOI: 10.1038/nrg3207] [Citation(s) in RCA: 1389] [Impact Index Per Article: 115.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Developmental progression is driven by specific spatiotemporal domains of gene expression, which give rise to stereotypically patterned embryos even in the presence of environmental and genetic variation. Views of how transcription factors regulate gene expression are changing owing to recent genome-wide studies of transcription factor binding and RNA expression. Such studies reveal patterns that, at first glance, seem to contrast with the robustness of the developmental processes they encode. Here, we review our current knowledge of transcription factor function from genomic and genetic studies and discuss how different strategies, including extensive cooperative regulation (both direct and indirect), progressive priming of regulatory elements, and the integration of activities from multiple enhancers, confer specificity and robustness to transcriptional regulation during development.
Collapse
Affiliation(s)
- François Spitz
- Developmental Biology Unit, European Molecular Biology Laboratory, D-69117 Heidelberg, Germany.
| | | |
Collapse
|
93
|
Role of architecture in the function and specificity of two Notch-regulated transcriptional enhancer modules. PLoS Genet 2012; 8:e1002796. [PMID: 22792075 PMCID: PMC3390367 DOI: 10.1371/journal.pgen.1002796] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Accepted: 05/15/2012] [Indexed: 11/19/2022] Open
Abstract
In Drosophila melanogaster, cis-regulatory modules that are activated by the Notch cell-cell signaling pathway all contain two types of transcription factor binding sites: those for the pathway's transducing factor Suppressor of Hairless [Su(H)] and those for one or more tissue- or cell type-specific factors called "local activators." The use of different "Su(H) plus local activator" motif combinations, or codes, is critical to ensure that only the correct subset of the broadly utilized Notch pathway's target genes are activated in each developmental context. However, much less is known about the role of enhancer "architecture"--the number, order, spacing, and orientation of its component transcription factor binding motifs--in determining the module's specificity. Here we investigate the relationship between architecture and function for two Notch-regulated enhancers with spatially distinct activities, each of which includes five high-affinity Su(H) sites. We find that the first, which is active specifically in the socket cells of external sensory organs, is largely resistant to perturbations of its architecture. By contrast, the second enhancer, active in the "non-SOP" cells of the proneural clusters from which neural precursors arise, is sensitive to even simple rearrangements of its transcription factor binding sites, responding with both loss of normal specificity and striking ectopic activity. Thus, diverse cryptic specificities can be inherent in an enhancer's particular combination of transcription factor binding motifs. We propose that for certain types of enhancer, architecture plays an essential role in determining specificity, not only by permitting factor-factor synergies necessary to generate the desired activity, but also by preventing other activator synergies that would otherwise lead to unwanted specificities.
Collapse
|
94
|
Abstract
Since the discovery of a single white-eyed male in a population of red eyed flies over 100 years ago (Morgan, 1910), the compound eye of the fruit fly, Drosophila melanogaster, has been a favorite experimental system for identifying genes that regulate various aspects of development. For example, a fair amount of what we know today about enzymatic pathways and vesicular transport is due to the discovery and subsequent characterization of eye color mutants such as white. Likewise, our present day understanding of organogenesis has been aided considerably by studies of mutations, such as eyeless, that either reduce or eliminate the compound eyes. But by far the phenotype that has provided levers into the greatest number of experimental fields has been the humble "rough" eye. The fly eye is composed of several hundred unit-eyes that are also called ommatidia. These unit eyes are packed into a hexagonal array of remarkable precision. The structure of the eye is so precise that it has been compared with that of a crystal (Ready et al., 1976). Even the slightest perturbations to the structure of the ommatidium can be visually detected by light or electron microscopy. The cause for this is two-fold: (1) any defect that affects the hexagonal geometry of a single ommatidium can and will disrupt the positioning of surrounding unit eyes thereby propagating structural flaws and (2) disruptions in genes that govern the development of even a single cell within an ommatidium will affect all unit eyes. In both cases, the effect is the visual magnification of even the smallest imperfection. Studies of rough eye mutants have provided key insights into the areas of cell fate specification, lateral inhibition, signal transduction, transcription factor networks, planar cell polarity, cell proliferation, and programmed cell death just to name a few. This review will attempt to summarize the key steps that are required to assemble each ommatidium.
Collapse
Affiliation(s)
- Justin P Kumar
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA.
| |
Collapse
|
95
|
Abstract
Transcription of eukaryotic genes is an exceedingly sophisticated and complicated process, orchestrated by layers of control mechanisms involving a myriad of transcription factors and DNA control sequences, with both groups subject to multiple modifications. The availability of various recent genomic approaches has provided previously unforeseen opportunities to examine the cis-regulatory landscape of the entire genome, resulting in the identification of a potentially overwhelming number of enhancers and novel enhancer functions. In this review, we focus on the activities of enhancers in metazoans and discuss how they serve to regulate gene expression during early development.
Collapse
Affiliation(s)
- Ken W Y Cho
- Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA.
| |
Collapse
|
96
|
Busser BW, Taher L, Kim Y, Tansey T, Bloom MJ, Ovcharenko I, Michelson AM. A machine learning approach for identifying novel cell type-specific transcriptional regulators of myogenesis. PLoS Genet 2012; 8:e1002531. [PMID: 22412381 PMCID: PMC3297574 DOI: 10.1371/journal.pgen.1002531] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2011] [Accepted: 12/23/2011] [Indexed: 12/22/2022] Open
Abstract
Transcriptional enhancers integrate the contributions of multiple classes of transcription factors (TFs) to orchestrate the myriad spatio-temporal gene expression programs that occur during development. A molecular understanding of enhancers with similar activities requires the identification of both their unique and their shared sequence features. To address this problem, we combined phylogenetic profiling with a DNA-based enhancer sequence classifier that analyzes the TF binding sites (TFBSs) governing the transcription of a co-expressed gene set. We first assembled a small number of enhancers that are active in Drosophila melanogaster muscle founder cells (FCs) and other mesodermal cell types. Using phylogenetic profiling, we increased the number of enhancers by incorporating orthologous but divergent sequences from other Drosophila species. Functional assays revealed that the diverged enhancer orthologs were active in largely similar patterns as their D. melanogaster counterparts, although there was extensive evolutionary shuffling of known TFBSs. We then built and trained a classifier using this enhancer set and identified additional related enhancers based on the presence or absence of known and putative TFBSs. Predicted FC enhancers were over-represented in proximity to known FC genes; and many of the TFBSs learned by the classifier were found to be critical for enhancer activity, including POU homeodomain, Myb, Ets, Forkhead, and T-box motifs. Empirical testing also revealed that the T-box TF encoded by org-1 is a previously uncharacterized regulator of muscle cell identity. Finally, we found extensive diversity in the composition of TFBSs within known FC enhancers, suggesting that motif combinatorics plays an essential role in the cellular specificity exhibited by such enhancers. In summary, machine learning combined with evolutionary sequence analysis is useful for recognizing novel TFBSs and for facilitating the identification of cognate TFs that coordinate cell type-specific developmental gene expression patterns.
Collapse
Affiliation(s)
- Brian W. Busser
- Laboratory of Developmental Systems Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Leila Taher
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Yongsok Kim
- Laboratory of Developmental Systems Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Terese Tansey
- Laboratory of Developmental Systems Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Molly J. Bloom
- Laboratory of Developmental Systems Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Ivan Ovcharenko
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail: (IO); (AMM)
| | - Alan M. Michelson
- Laboratory of Developmental Systems Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail: (IO); (AMM)
| |
Collapse
|
97
|
Busser BW, Shokri L, Jaeger SA, Gisselbrecht SS, Singhania A, Berger MF, Zhou B, Bulyk ML, Michelson AM. Molecular mechanism underlying the regulatory specificity of a Drosophila homeodomain protein that specifies myoblast identity. Development 2012; 139:1164-74. [PMID: 22296846 PMCID: PMC3283125 DOI: 10.1242/dev.077362] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
A subfamily of Drosophila homeodomain (HD) transcription factors (TFs) controls the identities of individual muscle founder cells (FCs). However, the molecular mechanisms by which these TFs generate unique FC genetic programs remain unknown. To investigate this problem, we first applied genome-wide mRNA expression profiling to identify genes that are activated or repressed by the muscle HD TFs Slouch (Slou) and Muscle segment homeobox (Msh). Next, we used protein-binding microarrays to define the sequences that are bound by Slou, Msh and other HD TFs that have mesodermal expression. These studies revealed that a large class of HDs, including Slou and Msh, predominantly recognize TAAT core sequences but that each HD also binds to unique sites that deviate from this canonical motif. To understand better the regulatory specificity of an individual FC identity HD, we evaluated the functions of atypical binding sites that are preferentially bound by Slou relative to other HDs within muscle enhancers that are either activated or repressed by this TF. These studies showed that Slou regulates the activities of particular myoblast enhancers through Slou-preferred sequences, whereas swapping these sequences for sites that are capable of binding to multiple HD family members does not support the normal regulatory functions of Slou. Moreover, atypical Slou-binding sites are overrepresented in putative enhancers associated with additional Slou-responsive FC genes. Collectively, these studies provide new insights into the roles of individual HD TFs in determining cellular identity, and suggest that the diversity of HD binding preferences can confer regulatory specificity.
Collapse
Affiliation(s)
- Brian W Busser
- Laboratory of Developmental Systems Biology, Genetics and Developmental Biology Center, Division of Intramural Research, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
98
|
Junion G, Spivakov M, Girardot C, Braun M, Gustafson E, Birney E, Furlong E. A Transcription Factor Collective Defines Cardiac Cell Fate and Reflects Lineage History. Cell 2012; 148:473-86. [DOI: 10.1016/j.cell.2012.01.030] [Citation(s) in RCA: 222] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2010] [Revised: 08/16/2011] [Accepted: 01/17/2012] [Indexed: 11/28/2022]
|
99
|
Barolo S. Shadow enhancers: frequently asked questions about distributed cis-regulatory information and enhancer redundancy. Bioessays 2012; 34:135-41. [PMID: 22083793 PMCID: PMC3517143 DOI: 10.1002/bies.201100121] [Citation(s) in RCA: 115] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
This paper, in the form of a frequently asked questions page (FAQ), addresses outstanding questions about "shadow enhancers", quasi-redundant cis-regulatory elements, and their proposed roles in transcriptional control. Questions include: What exactly are shadow enhancers? How many genes have shadow/redundant/distributed enhancers? How redundant are these elements? What is the function of distributed enhancers? How modular are enhancers? Is it useful to study a single enhancer in isolation? In addition, a revised definition of "shadow enhancers" is proposed, and possible mechanisms of shadow enhancer function and evolution are discussed.
Collapse
Affiliation(s)
- Scott Barolo
- Department of Cell and Developmental Biology, University of Michigan Medical School, Ann Arbor, MI, USA.
| |
Collapse
|
100
|
Abstract
Perennial questions of evolutionary biology can be applied to gene regulatory systems using the abundance of experimental data addressing gene regulation in a comparative context. What is the tempo (frequency, rate) and mode (way, mechanism) of transcriptional regulatory evolution? Here we synthesize the results of 230 experiments performed on insects and nematodes in which regulatory DNA from one species was used to drive gene expression in another species. General principles of regulatory evolution emerge. Gene regulatory evolution is widespread and accumulates with genetic divergence in both insects and nematodes. Divergence in cis is more common than divergence in trans. Coevolution between cis and trans shows a particular increase over greater evolutionary timespans, especially in sex-specific gene regulation. Despite these generalities, the evolution of gene regulation is gene- and taxon-specific. The congruence of these conclusions with evidence from other types of experiments suggests that general principles are discoverable, and a unified view of the tempo and mode of regulatory evolution may be achievable.
Collapse
|