1
|
Harden TT, Vincent BJ, DePace AH. Transcriptional activators in the early Drosophila embryo perform different kinetic roles. Cell Syst 2023; 14:258-272.e4. [PMID: 37080162 PMCID: PMC10473017 DOI: 10.1016/j.cels.2023.03.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 06/26/2022] [Accepted: 03/21/2023] [Indexed: 04/22/2023]
Abstract
Combinatorial regulation of gene expression by transcription factors (TFs) may in part arise from kinetic synergy-wherein TFs regulate different steps in the transcription cycle. Kinetic synergy requires that TFs play distinguishable kinetic roles. Here, we used live imaging to determine the kinetic roles of three TFs that activate transcription in the Drosophila embryo-Zelda, Bicoid, and Stat92E-by introducing their binding sites into the even-skipped stripe 2 enhancer. These TFs influence different sets of kinetic parameters, and their influence can change over time. All three TFs increased the fraction of transcriptionally active nuclei; Zelda also shortened the first-passage time into transcription and regulated the interval between transcription events. Stat92E also increased the lifetimes of active transcription. Different TFs can therefore play distinct kinetic roles in activating the transcription. This has consequences for understanding the composition and flexibility of regulatory DNA sequences and the biochemical function of TFs. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Timothy T Harden
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Ben J Vincent
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Angela H DePace
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
2
|
Lee K, O’Neill KM, Ku J, Shvartsman SY, Kim Y. Patterning potential of the terminal system in the Drosophila embryo. KOREAN J CHEM ENG 2023. [DOI: 10.1007/s11814-022-1298-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
3
|
Skok Gibbs C, Jackson CA, Saldi GA, Tjärnberg A, Shah A, Watters A, De Veaux N, Tchourine K, Yi R, Hamamsy T, Castro DM, Carriero N, Gorissen BL, Gresham D, Miraldi ER, Bonneau R. High-performance single-cell gene regulatory network inference at scale: the Inferelator 3.0. Bioinformatics 2022; 38:2519-2528. [PMID: 35188184 PMCID: PMC9048651 DOI: 10.1093/bioinformatics/btac117] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 12/08/2021] [Accepted: 02/17/2022] [Indexed: 12/04/2022] Open
Abstract
MOTIVATION Gene regulatory networks define regulatory relationships between transcription factors and target genes within a biological system, and reconstructing them is essential for understanding cellular growth and function. Methods for inferring and reconstructing networks from genomics data have evolved rapidly over the last decade in response to advances in sequencing technology and machine learning. The scale of data collection has increased dramatically; the largest genome-wide gene expression datasets have grown from thousands of measurements to millions of single cells, and new technologies are on the horizon to increase to tens of millions of cells and above. RESULTS In this work, we present the Inferelator 3.0, which has been significantly updated to integrate data from distinct cell types to learn context-specific regulatory networks and aggregate them into a shared regulatory network, while retaining the functionality of the previous versions. The Inferelator is able to integrate the largest single-cell datasets and learn cell-type-specific gene regulatory networks. Compared to other network inference methods, the Inferelator learns new and informative Saccharomyces cerevisiae networks from single-cell gene expression data, measured by recovery of a known gold standard. We demonstrate its scaling capabilities by learning networks for multiple distinct neuronal and glial cell types in the developing Mus musculus brain at E18 from a large (1.3 million) single-cell gene expression dataset with paired single-cell chromatin accessibility data. AVAILABILITY AND IMPLEMENTATION The inferelator software is available on GitHub (https://github.com/flatironinstitute/inferelator) under the MIT license and has been released as python packages with associated documentation (https://inferelator.readthedocs.io/). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Claudia Skok Gibbs
- Flatiron Institute, Center for Computational Biology, Simons Foundation, New York, NY 10010, USA
- Center for Data Science, New York University, New York, NY 10003, USA
| | - Christopher A Jackson
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
| | - Giuseppe-Antonio Saldi
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
| | - Andreas Tjärnberg
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
| | - Aashna Shah
- Flatiron Institute, Center for Computational Biology, Simons Foundation, New York, NY 10010, USA
| | - Aaron Watters
- Flatiron Institute, Center for Computational Biology, Simons Foundation, New York, NY 10010, USA
| | - Nicholas De Veaux
- Flatiron Institute, Center for Computational Biology, Simons Foundation, New York, NY 10010, USA
| | | | - Ren Yi
- Computer Science Department, Courant Institute of Mathematical Sciences, New York University, New York, NY 10012, USA
| | - Tymor Hamamsy
- Center for Data Science, New York University, New York, NY 10003, USA
| | - Dayanne M Castro
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
| | - Nicholas Carriero
- Flatiron Institute, Scientific Computing Core, Simons Foundation, New York, NY 10010, USA
| | - Bram L Gorissen
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - David Gresham
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
| | - Emily R Miraldi
- Divisions of Immunobiology and Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | - Richard Bonneau
- Flatiron Institute, Center for Computational Biology, Simons Foundation, New York, NY 10010, USA
- Center for Data Science, New York University, New York, NY 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
- Computer Science Department, Courant Institute of Mathematical Sciences, New York University, New York, NY 10012, USA
| |
Collapse
|
4
|
Pérez E, Venkatanarayan A, Lundell MJ. Hunchback prevents notch-induced apoptosis in the serotonergic lineage of Drosophila Melanogaster. Dev Biol 2022; 486:109-120. [PMID: 35381219 DOI: 10.1016/j.ydbio.2022.03.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 03/16/2022] [Accepted: 03/30/2022] [Indexed: 11/19/2022]
Abstract
The serotonergic lineage (NB7-3) in the Drosophila ventral nerve cord produces six cells during neurogenesis. Four of the cells differentiate into neurons: EW1, EW2, EW3 and GW. The other two cells undergo apoptosis. This simple lineage provides an opportunity to examine genes that are required to induce or repress apoptosis during cell specification. Previous studies have shown that Notch signaling induces apoptosis within the NB7-3 lineage. The three EW neurons are protected from Notch-induced apoptosis by asymmetric distribution of Numb protein, an inhibitor of Notch signaling. In a numb1 mutant EW2 and EW3 undergo apoptosis. The EW1 and GW neurons survive even in a numb1 mutant background suggesting that these cells are protected from Notch-induced apoptosis by some factor other than Numb. The EW1 and GW neurons are mitotic sister cells, and uniquely express the transcription factor Hunchback. We present evidence that Hunchback prevents apoptosis in the NB7-3 lineage during normal CNS development and can rescue the two apoptotic cells in the lineage when it is ectopically expressed. We show that hunchback overexpression produces ectopic cells that express markers similar to the EW2 neuron and changes the expression pattern of the EW3 neuron to a EW2 neuron In addition we show that hunchback overexpression can override apoptosis that is genetically induced by the pro-apoptotic genes grim and hid.
Collapse
Affiliation(s)
- Ernesto Pérez
- Department of Biology, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX, 78249, USA
| | | | - Martha J Lundell
- Department of Biology, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX, 78249, USA.
| |
Collapse
|
5
|
Dibaeinia P, Sinha S. Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks. Nucleic Acids Res 2021; 49:10309-10327. [PMID: 34508359 PMCID: PMC8501998 DOI: 10.1093/nar/gkab765] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 08/18/2021] [Accepted: 08/25/2021] [Indexed: 11/18/2022] Open
Abstract
Deciphering the sequence-function relationship encoded in enhancers holds the key to interpreting non-coding variants and understanding mechanisms of transcriptomic variation. Several quantitative models exist for predicting enhancer function and underlying mechanisms; however, there has been no systematic comparison of these models characterizing their relative strengths and shortcomings. Here, we interrogated a rich data set of neuroectodermal enhancers in Drosophila, representing cis- and trans- sources of expression variation, with a suite of biophysical and machine learning models. We performed rigorous comparisons of thermodynamics-based models implementing different mechanisms of activation, repression and cooperativity. Moreover, we developed a convolutional neural network (CNN) model, called CoNSEPT, that learns enhancer 'grammar' in an unbiased manner. CoNSEPT is the first general-purpose CNN tool for predicting enhancer function in varying conditions, such as different cell types and experimental conditions, and we show that such complex models can suggest interpretable mechanisms. We found model-based evidence for mechanisms previously established for the studied system, including cooperative activation and short-range repression. The data also favored one hypothesized activation mechanism over another and suggested an intriguing role for a direct, distance-independent repression mechanism. Our modeling shows that while fundamentally different models can yield similar fits to data, they vary in their utility for mechanistic inference. CoNSEPT is freely available at: https://github.com/PayamDiba/CoNSEPT.
Collapse
Affiliation(s)
- Payam Dibaeinia
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
6
|
Waymack R, Gad M, Wunderlich Z. Molecular competition can shape enhancer activity in the Drosophila embryo. iScience 2021; 24:103034. [PMID: 34568782 PMCID: PMC8449247 DOI: 10.1016/j.isci.2021.103034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 07/27/2021] [Accepted: 08/20/2021] [Indexed: 01/12/2023] Open
Abstract
Transgenic reporters allow the measurement of regulatory DNA activity in vivo and consequently have long been useful tools for studying enhancers. Despite their utility, few studies have investigated the effects these reporters may have on the expression of other genes. Understanding these effects is required to accurately interpret reporter data and characterize gene regulatory mechanisms. By measuring the expression of Kruppel (Kr) enhancer reporters in live Drosophila embryos, we find reporters inhibit one another's expression and that of a nearby endogenous gene. Using synthetic transcription factor (TF) binding site arrays, we present evidence that competition for TFs is partially responsible for the observed transcriptional inhibition. We develop a simple thermodynamic model that predicts competition of the measured magnitude specifically when TF binding is restricted to distinct nuclear subregions. Our findings underline an unexpected role of the non-homogenous nature of the nucleus in regulating gene expression.
Collapse
Affiliation(s)
- Rachel Waymack
- Department of Developmental and Cell Biology, University of California, Irvine, CA 92697, USA
| | - Mario Gad
- Department of Developmental and Cell Biology, University of California, Irvine, CA 92697, USA
| | - Zeba Wunderlich
- Department of Developmental and Cell Biology, University of California, Irvine, CA 92697, USA
- Department of Biology, Boston University, 610 Commonwealth Ave., Boston, MA 02215, USA
- Biological Design Center, Boston University, 610 Commonwealth Avenue, Boston, MA 02215, USA
| |
Collapse
|
7
|
Abstract
Determining whether and how a gene is transcribed are two of the central processes of life. The conceptual basis for understanding such gene regulation arose from pioneering biophysical studies in eubacteria. However, eukaryotic genomes exhibit vastly greater complexity, which raises questions not addressed by this bacterial paradigm. First, how is information integrated from many widely separated binding sites to determine how a gene is transcribed? Second, does the presence of multiple energy-expending mechanisms, which are absent from eubacterial genomes, indicate that eukaryotes are capable of improved forms of genetic information processing? An updated biophysical foundation is needed to answer such questions. We describe the linear framework, a graph-based approach to Markov processes, and show that it can accommodate many previous studies in the field. Under the assumption of thermodynamic equilibrium, we introduce a language of higher-order cooperativities and show how it can rigorously quantify gene regulatory properties suggested by experiment. We point out that fundamental limits to information processing arise at thermodynamic equilibrium and can only be bypassed through energy expenditure. Finally, we outline some of the mathematical challenges that must be overcome to construct an improved biophysical understanding of gene regulation.
Collapse
Affiliation(s)
- Felix Wong
- Institute for Medical Engineering & Science, Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.,Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Jeremy Gunawardena
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115, USA;
| |
Collapse
|
8
|
Myasnikova E, Spirov A. Gene regulatory networks in Drosophila early embryonic development as a model for the study of the temporal identity of neuroblasts. Biosystems 2020; 197:104192. [PMID: 32619531 DOI: 10.1016/j.biosystems.2020.104192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 04/30/2020] [Accepted: 06/21/2020] [Indexed: 11/27/2022]
Abstract
Genes belonging to the "gap" and "gap-like" family constitute the best-studied gene regulatory networks (GRNs) in Drosophila embryogenesis. Gap genes are a core of two subnetworks controlling embryonic segmentation: (hunchback, hb; Krüppel, Kr; giant, gt; and knirps, kni) and (hb; Kr; pou-domain, pdm; and, probably, castor, cas). Of particular interest is that (hb, Kr, pdm, cas) also specifies the temporal identity of stem cells, neuroblasts, in Drosophila neurogenesis. This GRN controls the sequential differentiation of neuroblasts during the asymmetric cell division. In the last decades, modeling of the patterning of gene ensemble (hb, Kr, gt, kni) in segmentation was in the center of attention. We show that our previously published and extensively studied model at a certain level of external factors is able to reproduce temporal patterns of (hb, Kr, pdm, cas) in neurogenesis with minor evolutionary explicable modifications. This result testifies in favor of a hypothesis that the similarity of two gene ensembles active in segmentation and neurogenesis is a result of co-option of the network architecture in evolution from the common ancestral form. By means of the model dynamical analysis, it is shown that the establishment of the robust patterns in both systems could be explained in terms of the action of attractors in the gap gene dynamical system. We formulate the common principles underlying the robustness of both GRNs in segmentation and neurogenesis due to the similar functional organization of the gene ensembles as having the same evolutionary origin.
Collapse
Affiliation(s)
- Ekaterina Myasnikova
- Peter the Great Saint-Petersburg Polytechnical University, 29 Politekhnicheskaya str, St. Petersburg, 195251, Russia.
| | - Alexander Spirov
- I. M. Sechenov Institute of Evolutionary Physiology and Biochemistry Russian Academy of Sciences, 44 Thorez Pr, St.Petersburg, 194223, Russia; Computer Science and CEWIT, SUNY Stony Brook, Stony Brook, 1500 Stony Brook Road, Stony Brook, 11794, NY, USA
| |
Collapse
|
9
|
Zubair A, Rosen IG, Nuzhdin SV, Marjoram P. Bayesian model selection for the Drosophila gap gene network. BMC Bioinformatics 2019; 20:327. [PMID: 31195954 PMCID: PMC6567646 DOI: 10.1186/s12859-019-2888-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Accepted: 05/09/2019] [Indexed: 11/10/2022] Open
Abstract
Background The gap gene system controls the early cascade of the segmentation pathway in Drosophila melanogaster as well as other insects. Owing to its tractability and key role in embryo patterning, this system has been the focus for both computational modelers and experimentalists. The gap gene expression dynamics can be considered strictly as a one-dimensional process and modeled as a system of reaction-diffusion equations. While substantial progress has been made in modeling this phenomenon, there still remains a deficit of approaches to evaluate competing hypotheses. Most of the model development has happened in isolation and there has been little attempt to compare candidate models. Results The Bayesian framework offers a means of doing formal model evaluation. Here, we demonstrate how this framework can be used to compare different models of gene expression. We focus on the Papatsenko-Levine formalism, which exploits a fractional occupancy based approach to incorporate activation of the gap genes by the maternal genes and cross-regulation by the gap genes themselves. The Bayesian approach provides insight about relationship between system parameters. In the regulatory pathway of segmentation, the parameters for number of binding sites and binding affinity have a negative correlation. The model selection analysis supports a stronger binding affinity for Bicoid compared to other regulatory edges, as shown by a larger posterior mean. The procedure doesn’t show support for activation of Kruppel by Bicoid. Conclusions We provide an efficient solver for the general representation of the Papatsenko-Levine model. We also demonstrate the utility of Bayes factor for evaluating candidate models for spatial pattering models. In addition, by using the parallel tempering sampler, the convergence of Markov chains can be remarkably improved and robust estimates of Bayes factors obtained. Electronic supplementary material The online version of this article (10.1186/s12859-019-2888-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Asif Zubair
- Molecular and Computational Biology, USC, 1050 Childs Way, Los Angeles, CA 90089-2532, US.
| | - I Gary Rosen
- Department of Mathematics, USC, 3620 S. Vermont Ave., Los Angeles, CA 90089-2532, US
| | - Sergey V Nuzhdin
- Molecular and Computational Biology, USC, 1050 Childs Way, Los Angeles, CA 90089-2532, US
| | - Paul Marjoram
- Molecular and Computational Biology, USC, 1050 Childs Way, Los Angeles, CA 90089-2532, US
| |
Collapse
|
10
|
Vincent BJ, Staller MV, Lopez-Rivera F, Bragdon MDJ, Pym ECG, Biette KM, Wunderlich Z, Harden TT, Estrada J, DePace AH. Hunchback is counter-repressed to regulate even-skipped stripe 2 expression in Drosophila embryos. PLoS Genet 2018; 14:e1007644. [PMID: 30192762 PMCID: PMC6145585 DOI: 10.1371/journal.pgen.1007644] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 09/19/2018] [Accepted: 08/17/2018] [Indexed: 01/18/2023] Open
Abstract
Hunchback is a bifunctional transcription factor that can activate and repress gene expression in Drosophila development. We investigated the regulatory DNA sequence features that control Hunchback function by perturbing enhancers for one of its target genes, even-skipped (eve). While Hunchback directly represses the eve stripe 3+7 enhancer, we found that in the eve stripe 2+7 enhancer, Hunchback repression is prevented by nearby sequences-this phenomenon is called counter-repression. We also found evidence that Caudal binding sites are responsible for counter-repression, and that this interaction may be a conserved feature of eve stripe 2 enhancers. Our results alter the textbook view of eve stripe 2 regulation wherein Hb is described as a direct activator. Instead, to generate stripe 2, Hunchback repression must be counteracted. We discuss how counter-repression may influence eve stripe 2 regulation and evolution.
Collapse
Affiliation(s)
- Ben J. Vincent
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Max V. Staller
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Francheska Lopez-Rivera
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Meghan D. J. Bragdon
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Edward C. G. Pym
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Kelly M. Biette
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Zeba Wunderlich
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Timothy T. Harden
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
11
|
Myasnikova E, Spirov A. Robustness of expression pattern formation due to dynamic equilibrium in gap gene system of an early Drosophila embryo. Biosystems 2018; 166:50-60. [DOI: 10.1016/j.biosystems.2018.02.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2017] [Revised: 01/08/2018] [Accepted: 02/01/2018] [Indexed: 11/24/2022]
|
12
|
Papatsenko D, Darr H, Kulakovskiy IV, Waghray A, Makeev VJ, MacArthur BD, Lemischka IR. Single-Cell Analyses of ESCs Reveal Alternative Pluripotent Cell States and Molecular Mechanisms that Control Self-Renewal. Stem Cell Reports 2016; 5:207-20. [PMID: 26267829 PMCID: PMC4618835 DOI: 10.1016/j.stemcr.2015.07.004] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2015] [Revised: 07/14/2015] [Accepted: 07/14/2015] [Indexed: 12/22/2022] Open
Abstract
Analyses of gene expression in single mouse embryonic stem cells (mESCs) cultured in serum and LIF revealed the presence of two distinct cell subpopulations with individual gene expression signatures. Comparisons with published data revealed that cells in the first subpopulation are phenotypically similar to cells isolated from the inner cell mass (ICM). In contrast, cells in the second subpopulation appear to be more mature. Pluripotency Gene Regulatory Network (PGRN) reconstruction based on single-cell data and published data suggested antagonistic roles for Oct4 and Nanog in the maintenance of pluripotency states. Integrated analyses of published genomic binding (ChIP) data strongly supported this observation. Certain target genes alternatively regulated by OCT4 and NANOG, such as Sall4 and Zscan10, feed back into the top hierarchical regulator Oct4. Analyses of such incoherent feedforward loops with feedback (iFFL-FB) suggest a dynamic model for the maintenance of mESC pluripotency and self-renewal. Mouse embryonic stem cells grown on serum and LIF contain two subpopulations of cells Oct4 and Nanog alternatively regulate a class of pluripotency genes We demonstrate stabilization of Oct4 concentration and pluripotency via feedback control The “state exchange” model explains self-renewal
Collapse
Affiliation(s)
- Dmitri Papatsenko
- Department of Regenerative and Developmental Biology, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA; Black Family Stem Cell Institute, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA.
| | - Henia Darr
- Department of Regenerative and Developmental Biology, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA; Black Family Stem Cell Institute, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
| | - Ivan V Kulakovskiy
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilova Strasse 32, Moscow 119991, Russia; Department of Computational Systems Biology, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina Strasse 3, Moscow 119991, Russia
| | - Avinash Waghray
- Department of Regenerative and Developmental Biology, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA; Black Family Stem Cell Institute, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
| | - Vsevolod J Makeev
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilova Strasse 32, Moscow 119991, Russia; Department of Computational Systems Biology, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina Strasse 3, Moscow 119991, Russia
| | - Ben D MacArthur
- Centre for Human Development, Stem Cells, and Regeneration, Institute of Developmental Sciences, University of Southampton, Southampton SO17 1BJ, UK
| | - Ihor R Lemischka
- Department of Regenerative and Developmental Biology, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA; Black Family Stem Cell Institute, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA; Department of Pharmacology and System Therapeutics, Icahn School of Medicine at Mount Sinai, Systems Biology Center New York, One Gustave L. Levy Place, New York, NY 10029, USA.
| |
Collapse
|
13
|
Spirov AV, Myasnikova EM, Holloway DM. Sequential construction of a model for modular gene expression control, applied to spatial patterning of theDrosophilagenehunchback. J Bioinform Comput Biol 2016; 14:1641005. [DOI: 10.1142/s0219720016410055] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Gene network simulations are increasingly used to quantify mutual gene regulation in biological tissues. These are generally based on linear interactions between single-entity regulatory and target genes. Biological genes, by contrast, commonly have multiple, partially independent, cis-regulatory modules (CRMs) for regulator binding, and can produce variant transcription and translation products. We present a modeling framework to address some of the gene regulatory dynamics implied by this biological complexity. Spatial patterning of the hunchback (hb) gene in Drosophila development involves control by three CRMs producing two distinct mRNA transcripts. We use this example to develop a differential equations model for transcription which takes into account the cis-regulatory architecture of the gene. Potential regulatory interactions are screened by a genetic algorithms (GAs) approach and compared to biological expression data.
Collapse
Affiliation(s)
- Alexander V. Spirov
- Computer Science and CEWIT, SUNY Stony Brook, 1500 Stony Brook Road, Stony Brook, NY 11794, USA
- Lab Modeling of Evolution, I. M. Sechenov Institute of Evolutionary Physiology and Biochemistry, Russian Academy of Sciences, pr. Torez 44, St. Petersburg 194223, Russia
| | - Ekaterina M. Myasnikova
- Center for Advanced Studies, Peter the Great St. Petersburg Polytechnical University, 29 Polytechnicheskaya St. Petersburg 195251, Russia
- Department of Bioinformatics, Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow 141700, Russia
| | - David M. Holloway
- Mathematics Department, British Columbia Institute of Technology, 3700 Willingdon Avenue, Burnaby, BC, Canada V5G 3H2, Canada
- Department of Biology, University of Victoria, Victoria, BC, Canada V8W 2Y2, Canada
| |
Collapse
|
14
|
Vincent BJ, Estrada J, DePace AH. The appeasement of Doug: a synthetic approach to enhancer biology. Integr Biol (Camb) 2016; 8:475-84. [DOI: 10.1039/c5ib00321k] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Ben J. Vincent
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| |
Collapse
|
15
|
Peng PC, Hassan Samee MA, Sinha S. Incorporating chromatin accessibility data into sequence-to-expression modeling. Biophys J 2016; 108:1257-67. [PMID: 25762337 DOI: 10.1016/j.bpj.2014.12.037] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2014] [Revised: 12/01/2014] [Accepted: 12/11/2014] [Indexed: 01/30/2023] Open
Abstract
Prediction of gene expression levels from regulatory sequences is one of the major challenges of genomic biology today. A particularly promising approach to this problem is that taken by thermodynamics-based models that interpret an enhancer sequence in a given cellular context specified by transcription factor concentration levels and predict precise expression levels driven by that enhancer. Such models have so far not accounted for the effect of chromatin accessibility on interactions between transcription factor and DNA and consequently on gene-expression levels. Here, we extend a thermodynamics-based model of gene expression, called GEMSTAT (Gene Expression Modeling Based on Statistical Thermodynamics), to incorporate chromatin accessibility data and quantify its effect on accuracy of expression prediction. In the new model, called GEMSTAT-A, accessibility at a binding site is assumed to affect the transcription factor's binding strength at the site, whereas all other aspects are identical to the GEMSTAT model. We show that this modification results in significantly better fits in a data set of over 30 enhancers regulating spatial expression patterns in the blastoderm-stage Drosophila embryo. It is important to note that the improved fits result not from an overall elevated accessibility in active enhancers but from the variation of accessibility levels within an enhancer. With whole-genome DNA accessibility measurements becoming increasingly popular, our work demonstrates how such data may be useful for sequence-to-expression models. It also calls for future advances in modeling accessibility levels from sequence and the transregulatory context, so as to predict accurately the effect of cis and trans perturbations on gene expression.
Collapse
Affiliation(s)
- Pei-Chen Peng
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Md Abul Hassan Samee
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois; Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois.
| |
Collapse
|
16
|
Samee MAH, Lim B, Samper N, Lu H, Rushlow CA, Jiménez G, Shvartsman SY, Sinha S. A Systematic Ensemble Approach to Thermodynamic Modeling of Gene Expression from Sequence Data. Cell Syst 2015; 1:396-407. [PMID: 27136354 DOI: 10.1016/j.cels.2015.12.002] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2015] [Revised: 10/19/2015] [Accepted: 12/02/2015] [Indexed: 11/17/2022]
Abstract
To understand the relationship between an enhancer DNA sequence and quantitative gene expression, thermodynamics-driven mathematical models of transcription are often employed. These "sequence-to-expression" models can describe an incomplete or even incorrect set of regulatory relationships if the parameter space is not searched systematically. Here, we focus on an enhancer of the Drosophila gene ind and demonstrate how a systematic search of parameter space can reveal a more comprehensive picture of a gene's regulatory mechanisms, resolve outstanding ambiguities, and suggest testable hypotheses. We describe an approach that generates an ensemble of ind models; all of these models are technically acceptable solutions to the sequence-to-expression problem in light of wild-type data, and some represent mechanistically distinct hypotheses about the regulation of ind. This ensemble can be restricted to biologically plausible models using requirements gleaned from in vivo perturbation experiments. Biologically plausible models make unique predictions about how specific ind enhancer sequences affect ind expression; we validate these predictions in vivo through site mutagenesis in transgenic Drosophila embryos.
Collapse
Affiliation(s)
- Md Abul Hassan Samee
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Bomyi Lim
- Department of Chemical and Biological Engineering and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Núria Samper
- Department of Developmental Biology, Instituto de Biología Molecular de Barcelona, Consejo Superior de Investigaciones Científicas (CSIC), Barcelona 08208, Spain
| | - Hang Lu
- School of Chemical and Biomolecular Engineering and Parker H. Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | | | - Gerardo Jiménez
- Department of Developmental Biology, Instituto de Biología Molecular de Barcelona, Consejo Superior de Investigaciones Científicas (CSIC), Barcelona 08208, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona 08010, Spain
| | - Stanislav Y Shvartsman
- Department of Chemical and Biological Engineering and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
| |
Collapse
|
17
|
Kozlov K, Gursky VV, Kulakovskiy IV, Dymova A, Samsonova M. Analysis of functional importance of binding sites in the Drosophila gap gene network model. BMC Genomics 2015; 16 Suppl 13:S7. [PMID: 26694511 PMCID: PMC4686791 DOI: 10.1186/1471-2164-16-s13-s7] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
BACKGROUND The statistical thermodynamics based approach provides a promising framework for construction of the genotype-phenotype map in many biological systems. Among important aspects of a good model connecting the DNA sequence information with that of a molecular phenotype (gene expression) is the selection of regulatory interactions and relevant transcription factor bindings sites. As the model may predict different levels of the functional importance of specific binding sites in different genomic and regulatory contexts, it is essential to formulate and study such models under different modeling assumptions. RESULTS We elaborate a two-layer model for the Drosophila gap gene network and include in the model a combined set of transcription factor binding sites and concentration dependent regulatory interaction between gap genes hunchback and Kruppel. We show that the new variants of the model are more consistent in terms of gene expression predictions for various genetic constructs in comparison to previous work. We quantify the functional importance of binding sites by calculating their impact on gene expression in the model and calculate how these impacts correlate across all sites under different modeling assumptions. CONCLUSIONS The assumption about the dual interaction between hb and Kr leads to the most consistent modeling results, but, on the other hand, may obscure existence of indirect interactions between binding sites in regulatory regions of distinct genes. The analysis confirms the previously formulated regulation concept of many weak binding sites working in concert. The model predicts a more or less uniform distribution of functionally important binding sites over the sets of experimentally characterized regulatory modules and other open chromatin domains.
Collapse
Affiliation(s)
- Konstantin Kozlov
- Peter the Great St. Petersburg Polytechnic University, 29 Polytechnicheskaya, 195251 St.Petersburg, Russia
| | - Vitaly V Gursky
- Peter the Great St. Petersburg Polytechnic University, 29 Polytechnicheskaya, 195251 St.Petersburg, Russia
- Ioffe Institute, 26 Polytechnicheskaya, 194021 St.Petersburg, Russia
| | - Ivan V Kulakovskiy
- Engelhardt Institute of Molecular Biology, 32 Vavilova, 119991 Moscow, Russia
| | - Arina Dymova
- Peter the Great St. Petersburg Polytechnic University, 29 Polytechnicheskaya, 195251 St.Petersburg, Russia
| | - Maria Samsonova
- Peter the Great St. Petersburg Polytechnic University, 29 Polytechnicheskaya, 195251 St.Petersburg, Russia
| |
Collapse
|
18
|
Temporal and spatial dynamics of scaling-specific features of a gene regulatory network in Drosophila. Nat Commun 2015; 6:10031. [PMID: 26644070 PMCID: PMC4686680 DOI: 10.1038/ncomms10031] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2015] [Accepted: 10/28/2015] [Indexed: 01/19/2023] Open
Abstract
A widely appreciated aspect of developmental robustness is pattern formation in proportion to size. But how such scaling features emerge dynamically remains poorly understood. Here we generate a data set of the expression profiles of six gap genes in Drosophila melanogaster embryos that differ significantly in size. Expression patterns exhibit size-dependent dynamics both spatially and temporally. We uncover a dynamic emergence of under-scaling in the posterior, accompanied by reduced expression levels of gap genes near the middle of large embryos. Simulation results show that a size-dependent Bicoid gradient input can lead to reduced Krüppel expression that can have long-range and dynamic effects on gap gene expression in the posterior. Thus, for emergence of scaled patterns, the entire embryo may be viewed as a single unified dynamic system where maternally derived size-dependent information interpreted locally can be propagated in space and time as governed by the dynamics of a gene regulatory network. How pattern formation is regulated relative to the size of an organism is unclear. Here, Wu et al. take data from gap gene expression in flies of different sizes together with simulations, identifying how scaling emerges dynamically and that local patterning influences global gene regulatory networks.
Collapse
|
19
|
Duque T, Sinha S. What does it take to evolve an enhancer? A simulation-based study of factors influencing the emergence of combinatorial regulation. Genome Biol Evol 2015; 7:1415-31. [PMID: 25956793 PMCID: PMC4494070 DOI: 10.1093/gbe/evv080] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
There is widespread interest today in understanding enhancers, which are regulatory elements typically harboring several transcription factor binding sites and mediating the combinatorial effect of transcription factors on gene expression. The evolution of enhancers poses interesting unanswered questions, for example, the evolutionary time taken for a typical enhancer to emerge or the factors shaping its evolution. Existing approaches to cis-regulatory evolution have often ignored the combinatorial nature and varied biochemical mechanisms of gene regulation encoded in enhancers. We report on our investigation of enhancer evolution through the use of PEBCRES, a framework for evolutionary simulation of enhancers that employs a mechanistic and well-supported sequence-to-expression model to assign fitness to the evolving enhancer genotype. We estimated the time necessary to evolve, from genomic background, enhancers capable of driving complex gene expression patterns similar to those involved in early development in Drosophila. We found the time-to-evolve to range between 0.5 and 10 Myr, and to vary greatly with the target expression pattern, complexity of the real enhancer known to encode that pattern, and the strength of input from specific transcription factors. To our knowledge, this is the first estimate of waiting times for realistic enhancers to evolve. The in silico evolved enhancers had, with a few interesting exceptions, site compositions similar to those seen in real enhancers for the same patterns. Our simulations also revealed that certain features of an enhancer might evolve not due to their biological function but as aids to the evolutionary process itself.
Collapse
Affiliation(s)
- Thyago Duque
- Department of Computer Science, University of Illinois at Urbana-Champaign
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign Institute for Genomic Biology, University of Illinois at Urbana-Champaign
| |
Collapse
|
20
|
Holloway DM, Spirov AV. Mid-embryo patterning and precision in Drosophila segmentation: Krüppel dual regulation of hunchback. PLoS One 2015; 10:e0118450. [PMID: 25793381 PMCID: PMC4368514 DOI: 10.1371/journal.pone.0118450] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Accepted: 12/15/2014] [Indexed: 12/26/2022] Open
Abstract
In early development, genes are expressed in spatial patterns which later define cellular identities and tissue locations. The mechanisms of such pattern formation have been studied extensively in early Drosophila (fruit fly) embryos. The gap gene hunchback (hb) is one of the earliest genes to be expressed in anterior-posterior (AP) body segmentation. As a transcriptional regulator for a number of downstream genes, the spatial precision of hb expression can have significant effects in the development of the body plan. To investigate the factors contributing to hb precision, we used fine spatial and temporal resolution data to develop a quantitative model for the regulation of hb expression in the mid-embryo. In particular, modelling hb pattern refinement in mid nuclear cleavage cycle 14 (NC14) reveals some of the regulatory contributions of simultaneously-expressed gap genes. Matching the model to recent data from wild-type (WT) embryos and mutants of the gap gene Krüppel (Kr) indicates that a mid-embryo Hb concentration peak important in thoracic development (at parasegment 4, PS4) is regulated in a dual manner by Kr, with low Kr concentration activating hb and high Kr concentration repressing hb. The processes of gene expression (transcription, translation, transport) are intrinsically random. We used stochastic simulations to characterize the noise generated in hb expression. We find that Kr regulation can limit the positional variability of the Hb mid-embryo border. This has been recently corroborated in experimental comparisons of WT and Kr- mutant embryos. Further, Kr regulation can decrease uncertainty in mid-embryo hb expression (i.e. contribute to a smooth Hb boundary) and decrease between-copy transcriptional variability within nuclei. Since many tissue boundaries are first established by interactions between neighbouring gene expression domains, these properties of Hb-Kr dynamics to diminish the effects of intrinsic expression noise may represent a general mechanism contributing to robustness in early development.
Collapse
Affiliation(s)
- David M. Holloway
- Mathematics Department, British Columbia Institute of Technology, Burnaby, B.C., V5G 3H2, Canada
- * E-mail:
| | - Alexander V. Spirov
- Computer Science, and Center of Excellence in Wireless and Information Technology, State University of New York, Stony Brook, Stony Brook, New York, United States of America
- The Sechenov Institute of Evolutionary Physiology and Biochemistry, St. Petersburg, Russia
| |
Collapse
|
21
|
Staller MV, Fowlkes CC, Bragdon MDJ, Wunderlich Z, Estrada J, DePace AH. A gene expression atlas of a bicoid-depleted Drosophila embryo reveals early canalization of cell fate. Development 2015; 142:587-96. [PMID: 25605785 PMCID: PMC4302997 DOI: 10.1242/dev.117796] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2014] [Accepted: 12/01/2014] [Indexed: 01/31/2023]
Abstract
In developing embryos, gene regulatory networks drive cells towards discrete terminal fates, a process called canalization. We studied the behavior of the anterior-posterior segmentation network in Drosophila melanogaster embryos by depleting a key maternal input, bicoid (bcd), and measuring gene expression patterns of the network at cellular resolution. This method results in a gene expression atlas containing the levels of mRNA or protein expression of 13 core patterning genes over six time points for every cell of the blastoderm embryo. This is the first cellular resolution dataset of a genetically perturbed Drosophila embryo that captures all cells in 3D. We describe the technical developments required to build this atlas and how the method can be employed and extended by others. We also analyze this novel dataset to characterize the degree and timing of cell fate canalization in the segmentation network. We find that in two layers of this gene regulatory network, following depletion of bcd, individual cells rapidly canalize towards normal cell fates. This result supports the hypothesis that the segmentation network directly canalizes cell fate, rather than an alternative hypothesis whereby cells are initially mis-specified and later eliminated by apoptosis. Our gene expression atlas provides a high resolution picture of a classic perturbation and will enable further computational modeling of canalization and gene regulation in this transcriptional network.
Collapse
Affiliation(s)
- Max V Staller
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Charless C Fowlkes
- Department of Computer Science, University of California Irvine, Irvine, CA 92697, USA
| | - Meghan D J Bragdon
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Zeba Wunderlich
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Angela H DePace
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
22
|
Shadow enhancers enable Hunchback bifunctionality in the Drosophila embryo. Proc Natl Acad Sci U S A 2015; 112:785-90. [PMID: 25564665 DOI: 10.1073/pnas.1413877112] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Hunchback (Hb) is a bifunctional transcription factor that activates and represses distinct enhancers. Here, we investigate the hypothesis that Hb can activate and repress the same enhancer. Computational models predicted that Hb bifunctionally regulates the even-skipped (eve) stripe 3+7 enhancer (eve3+7) in Drosophila blastoderm embryos. We measured and modeled eve expression at cellular resolution under multiple genetic perturbations and found that the eve3+7 enhancer could not explain endogenous eve stripe 7 behavior. Instead, we found that eve stripe 7 is controlled by two enhancers: the canonical eve3+7 and a sequence encompassing the minimal eve stripe 2 enhancer (eve2+7). Hb bifunctionally regulates eve stripe 7, but it executes these two activities on different pieces of regulatory DNA--it activates the eve2+7 enhancer and represses the eve3+7 enhancer. These two "shadow enhancers" use different regulatory logic to create the same pattern.
Collapse
|
23
|
Moris-Sanz M, Estacio-Gómez A, Álvarez-Rivero J, Díaz-Benjumea FJ. Specification of neuronal subtypes by different levels of Hunchback. Development 2014; 141:4366-74. [DOI: 10.1242/dev.113381] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
During the development of the central nervous system, neural progenitors generate an enormous number of distinct types of neuron and glial cells by asymmetric division. Intrinsic genetic programs define the combinations of transcription factors that determine the fate of each cell, but the precise mechanisms by which all these factors are integrated at the level of individual cells are poorly understood. Here, we analyzed the specification of the neurons in the ventral nerve cord of Drosophila that express Crustacean cardioactive peptide (CCAP). There are two types of CCAP neurons: interneurons and efferent neurons. We found that both are specified during the Hunchback temporal window of neuroblast 3-5, but are not sibling cells. Further, this temporal window generates two ganglion mother cells that give rise to four neurons, which can be identified by the expression of empty spiracles. We show that the expression of Hunchback in the neuroblast increases over time and provide evidence that the absolute levels of Hunchback expression specify the two different CCAP neuronal fates.
Collapse
Affiliation(s)
- Marta Moris-Sanz
- Centro de Biología Molecular-Severo Ochoa (CSIC-UAM), c./Nicolás Cabrera 1, Universidad Autónoma, Madrid 28049, Spain
| | - Alicia Estacio-Gómez
- Centro de Biología Molecular-Severo Ochoa (CSIC-UAM), c./Nicolás Cabrera 1, Universidad Autónoma, Madrid 28049, Spain
| | - Javier Álvarez-Rivero
- Centro de Biología Molecular-Severo Ochoa (CSIC-UAM), c./Nicolás Cabrera 1, Universidad Autónoma, Madrid 28049, Spain
| | - Fernando J. Díaz-Benjumea
- Centro de Biología Molecular-Severo Ochoa (CSIC-UAM), c./Nicolás Cabrera 1, Universidad Autónoma, Madrid 28049, Spain
| |
Collapse
|
24
|
Drewell RA, Nevarez MJ, Kurata JS, Winkler LN, Li L, Dresch JM. Deciphering the combinatorial architecture of a Drosophila homeotic gene enhancer. Mech Dev 2014; 131:68-77. [PMID: 24514265 DOI: 10.1016/j.mod.2013.10.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2013] [Revised: 10/07/2013] [Accepted: 10/08/2013] [Indexed: 01/20/2023]
Abstract
In Drosophila, the 330 kb bithorax complex regulates cellular differentiation along the anterior–posterior axis during development in the thorax and abdomen and is comprised of three homeotic genes: Ultrabithorax, abdominal-A, and Abdominal-B. The expression of each of these genes is in turn controlled through interactions between transcription factors and a number of cis-regulatory modules in the neighboring intergenic regions. In this study, we examine how the sequence architecture of transcription factor binding sites mediates the functional activity of one of these cis-regulatory modules. Using computational, mathematical modeling and experimental molecular genetic approaches we investigate the IAB7b enhancer, which regulates Abdominal-B expression specifically in the presumptive seventh and ninth abdominal segments of the early embryo. A cross-species comparison of the IAB7b enhancer reveals an evolutionarily conserved signature motif containing two FUSHI-TARAZU activator transcription factor binding sites. We find that the transcriptional repressors KNIRPS, KRUPPEL and GIANT are able to restrict reporter gene expression to the posterior abdominal segments, using different molecular mechanisms including short-range repression and competitive binding. Additionally, we show the functional importance of the spacing between the two FUSHI-TARAZU binding sites and discuss the potential importance of cooperativity for transcriptional activation. Our results demonstrate that the transcriptional output of the IAB7b cis-regulatory module relies on a complex set of combinatorial inputs mediated by specific transcription factor binding and that the sequence architecture at this enhancer is critical to maintain robust regulatory function.
Collapse
|
25
|
Naturally occurring deletions of hunchback binding sites in the even-skipped stripe 3+7 enhancer. PLoS One 2014; 9:e91924. [PMID: 24786295 PMCID: PMC4006794 DOI: 10.1371/journal.pone.0091924] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 02/18/2014] [Indexed: 11/23/2022] Open
Abstract
Changes in regulatory DNA contribute to phenotypic differences within and between taxa. Comparative studies show that many transcription factor binding sites (TFBS) are conserved between species whereas functional studies reveal that some mutations segregating within species alter TFBS function. Consistently, in this analysis of 13 regulatory elements in Drosophila melanogaster populations, single base and insertion/deletion polymorphism are rare in characterized regulatory elements. Experimentally defined TFBS are nearly devoid of segregating mutations and, as has been shown before, are quite conserved. For instance 8 of 11 Hunchback binding sites in the stripe 3+7 enhancer of even-skipped are conserved between D. melanogaster and Drosophila virilis. Oddly, we found a 72 bp deletion that removes one of these binding sites (Hb8), segregating within D. melanogaster. Furthermore, a 45 bp deletion polymorphism in the spacer between the stripe 3+7 and stripe 2 enhancers, removes another predicted Hunchback site. These two deletions are separated by ∼250 bp, sit on distinct haplotypes, and segregate at appreciable frequency. The Hb8Δ is at 5 to 35% frequency in the new world, but also shows cosmopolitan distribution. There is depletion of sequence variation on the Hb8Δ-carrying haplotype. Quantitative genetic tests indicate that Hb8Δ affects developmental time, but not viability of offspring. The Eve expression pattern differs between inbred lines, but the stripe 3 and 7 boundaries seem unaffected by Hb8Δ. The data reveal segregating variation in regulatory elements, which may reflect evolutionary turnover of characterized TFBS due to drift or co-evolution.
Collapse
|
26
|
Zagrijchuk EA, Sabirov MA, Holloway DM, Spirov AV. In silico evolution of the hunchback gene indicates redundancy in cis-regulatory organization and spatial gene expression. J Bioinform Comput Biol 2014; 12:1441009. [PMID: 24712536 DOI: 10.1142/s0219720014410091] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Biological development depends on the coordinated expression of genes in time and space. Developmental genes have extensive cis-regulatory regions which control their expression. These regions are organized in a modular manner, with different modules controlling expression at different times and locations. Both how modularity evolved and what function it serves are open questions. We present a computational model for the cis-regulation of the hunchback (hb) gene in the fruit fly (Drosophila). We simulate evolution (using an evolutionary computation approach from computer science) to find the optimal cis-regulatory arrangements for fitting experimental hb expression patterns. We find that the cis-regulatory region tends to readily evolve modularity. These cis-regulatory modules (CRMs) do not tend to control single spatial domains, but show a multi-CRM/multi-domain correspondence. We find that the CRM-domain correspondence seen in Drosophila evolves with a high probability in our model, supporting the biological relevance of the approach. The partial redundancy resulting from multi-CRM control may confer some biological robustness against corruption of regulatory sequences. The technique developed on hb could readily be applied to other multi-CRM developmental genes.
Collapse
Affiliation(s)
- Elizaveta A Zagrijchuk
- Lab Modeling of Evolution, I.M. Sechenov Institute of Evolutionary Physiology & Biochemistry, Russian Academy of Sciences, Thorez Pr. 44, St.-Petersburg, 2194223, Russia
| | | | | | | |
Collapse
|
27
|
Ilsley GR, Fisher J, Apweiler R, DePace AH, Luscombe NM. Cellular resolution models for even skipped regulation in the entire Drosophila embryo. eLife 2013; 2:e00522. [PMID: 23930223 PMCID: PMC3736529 DOI: 10.7554/elife.00522] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2013] [Accepted: 06/17/2013] [Indexed: 12/14/2022] Open
Abstract
Transcriptional control ensures genes are expressed in the right amounts at the correct times and locations. Understanding quantitatively how regulatory systems convert input signals to appropriate outputs remains a challenge. For the first time, we successfully model even skipped (eve) stripes 2 and 3+7 across the entire fly embryo at cellular resolution. A straightforward statistical relationship explains how transcription factor (TF) concentrations define eve's complex spatial expression, without the need for pairwise interactions or cross-regulatory dynamics. Simulating thousands of TF combinations, we recover known regulators and suggest new candidates. Finally, we accurately predict the intricate effects of perturbations including TF mutations and misexpression. Our approach imposes minimal assumptions about regulatory function; instead we infer underlying mechanisms from models that best fit the data, like the lack of TF-specific thresholds and the positional value of homotypic interactions. Our study provides a general and quantitative method for elucidating the regulation of diverse biological systems. DOI:http://dx.doi.org/10.7554/eLife.00522.001.
Collapse
Affiliation(s)
- Garth R Ilsley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom
- Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| | - Jasmin Fisher
- Microsoft Research Cambridge, Cambridge, United Kingdom
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Rolf Apweiler
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom
| | - Angela H DePace
- Department of Systems Biology, Harvard Medical School, Boston, United States
| | - Nicholas M Luscombe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom
- Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
- UCL Genetics Institute, Department of Genetics, Evolution, and Environment, University College London, London, United Kingdom
- London Research Institute, Cancer Research UK, London, United Kingdom
| |
Collapse
|
28
|
Kim AR, Martinez C, Ionides J, Ramos AF, Ludwig MZ, Ogawa N, Sharp DH, Reinitz J. Rearrangements of 2.5 kilobases of noncoding DNA from the Drosophila even-skipped locus define predictive rules of genomic cis-regulatory logic. PLoS Genet 2013; 9:e1003243. [PMID: 23468638 PMCID: PMC3585115 DOI: 10.1371/journal.pgen.1003243] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2012] [Accepted: 11/30/2012] [Indexed: 01/19/2023] Open
Abstract
Rearrangements of about 2.5 kilobases of regulatory DNA located 5' of the transcription start site of the Drosophila even-skipped locus generate large-scale changes in the expression of even-skipped stripes 2, 3, and 7. The most radical effects are generated by juxtaposing the minimal stripe enhancers MSE2 and MSE3 for stripes 2 and 3 with and without small "spacer" segments less than 360 bp in length. We placed these fusion constructs in a targeted transformation site and obtained quantitative expression data for these transformants together with their controlling transcription factors at cellular resolution. These data demonstrated that the rearrangements can alter expression levels in stripe 2 and the 2-3 interstripe by a factor of more than 10. We reasoned that this behavior would place tight constraints on possible rules of genomic cis-regulatory logic. To find these constraints, we confronted our new expression data together with previously obtained data on other constructs with a computational model. The model contained representations of thermodynamic protein-DNA interactions including steric interference and cooperative binding, short-range repression, direct repression, activation, and coactivation. The model was highly constrained by the training data, which it described within the limits of experimental error. The model, so constrained, was able to correctly predict expression patterns driven by enhancers for other Drosophila genes; even-skipped enhancers not included in the training set; stripe 2, 3, and 7 enhancers from various Drosophilid and Sepsid species; and long segments of even-skipped regulatory DNA that contain multiple enhancers. The model further demonstrated that elevated expression driven by a fusion of MSE2 and MSE3 was a consequence of the recruitment of a portion of MSE3 to become a functional component of MSE2, demonstrating that cis-regulatory "elements" are not elementary objects.
Collapse
Affiliation(s)
- Ah-Ram Kim
- Department of Ecology and Evolution, Chicago Center for Systems Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, New York, United States of America
| | - Carlos Martinez
- Department of Ecology and Evolution, Chicago Center for Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| | - John Ionides
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Alexandre F. Ramos
- Escola de Artes, Ciências e Humanidades, Universidade de São Paulo, São Paulo, Brazil
| | - Michael Z. Ludwig
- Department of Ecology and Evolution, Chicago Center for Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| | - Nobuo Ogawa
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | - David H. Sharp
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - John Reinitz
- Department of Ecology and Evolution, Chicago Center for Systems Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Statistics, Department of Molecular Genetics and Cell Biology, and Institute of Genomics and Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
29
|
Lopes FJP, Spirov AV, Bisch PM. The role of Bicoid cooperative binding in the patterning of sharp borders in Drosophila melanogaster. Dev Biol 2012; 370:165-72. [PMID: 22841642 DOI: 10.1016/j.ydbio.2012.07.020] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2012] [Revised: 07/06/2012] [Accepted: 07/16/2012] [Indexed: 10/28/2022]
Abstract
In Drosophila embryonic development, the Bicoid (Bcd) protein establishes positional information of downstream developmental genes like hunchback (hb), which has a strong anterior expression and a sharp on-off boundary in the mid-embryo. The role of Bcd cooperative binding in the positioning of the Hb pattern has been previously demonstrated. However, there are discrepancies in the reported results about the role of this mechanism in the sharp Hb border. Here, we determined the Hill coefficient (nH) required for Bcd to generate the sharp border of Hb in wild-type (WT) embryos. We found that an n(H) of approximately 6.3 (s.d. 1.4) and 10.8 (s.d. 4.0) is required to account for Hb sharpness at early and late cycle 14A, respectively. Additional mechanisms are possibly required because the high nH is likely unachievable for Bcd binding to the hb promoter. To test this idea, we determined the nH required to pattern the Hb profile of 15 embryos expressing an hb14F allele that is defective in self-activation and found nH to be 3.0 (s.d. 1.0). This result indicates that in WT embryos, the hb self-activation is important for Hb sharpness. Corroborating our results, we also found a progressive increase in the required value of n(H) spanning from 4.0 to 9.2 by determining this coefficient from averaged profiles of eight temporal classes at cycle 14A (T1 to T8). Our results indicate that there is a transition in the mechanisms responsible for the sharp Hb border during cycle 14A: in early stages of this cycle, Bcd cooperative binding is primarily responsible for Hb sharpness; in late cycle 14A, hb self-activation becomes the dominant mechanism.
Collapse
Affiliation(s)
- Francisco J P Lopes
- Laboratório de Física-Biológica, Instituto de Biofúsica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
| | | | | |
Collapse
|
30
|
Roth S. Mathematics and biology: a Kantian view on the history of pattern formation theory. Dev Genes Evol 2011; 221:255-79. [PMID: 22086125 PMCID: PMC3234355 DOI: 10.1007/s00427-011-0378-0] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2011] [Accepted: 10/19/2011] [Indexed: 12/20/2022]
Abstract
Driesch's statement, made around 1900, that the physics and chemistry of his day were unable to explain self-regulation during embryogenesis was correct and could be extended until the year 1972. The emergence of theories of self-organisation required progress in several areas including chemistry, physics, computing and cybernetics. Two parallel lines of development can be distinguished which both culminated in the early 1970s. Firstly, physicochemical theories of self-organisation arose from theoretical (Lotka 1910-1920) and experimental work (Bray 1920; Belousov 1951) on chemical oscillations. However, this research area gained broader acceptance only after thermodynamics was extended to systems far from equilibrium (1922-1967) and the mechanism of the prime example for a chemical oscillator, the Belousov-Zhabotinski reaction, was deciphered in the early 1970s. Secondly, biological theories of self-organisation were rooted in the intellectual environment of artificial intelligence and cybernetics. Turing wrote his The chemical basis of morphogenesis (1952) after working on the construction of one of the first electronic computers. Likewise, Gierer and Meinhardt's theory of local activation and lateral inhibition (1972) was influenced by ideas from cybernetics. The Gierer-Meinhardt theory provided an explanation for the first time of both spontaneous formation of spatial order and of self-regulation that proved to be extremely successful in elucidating a wide range of patterning processes. With the advent of developmental genetics in the 1980s, detailed molecular and functional data became available for complex developmental processes, allowing a new generation of data-driven theoretical approaches. Three examples of such approaches will be discussed. The successes and limitations of mathematical pattern formation theory throughout its history suggest a picture of the organism, which has structural similarity to views of the organic world held by the philosopher Immanuel Kant at the end of the eighteenth century.
Collapse
Affiliation(s)
- Siegfried Roth
- Institute of Developmental Biology, University of Cologne, Biowissenschaftliches Zentrum, Zülpicher Strasse 47b, 50674 Cologne, Germany.
| |
Collapse
|
31
|
Whole-embryo modeling of early segmentation in Drosophila identifies robust and fragile expression domains. Biophys J 2011; 101:287-96. [PMID: 21767480 DOI: 10.1016/j.bpj.2011.05.060] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2011] [Revised: 05/03/2011] [Accepted: 05/19/2011] [Indexed: 11/24/2022] Open
Abstract
Segmentation of the Drosophila melanogaster embryo results from the dynamic establishment of spatial mRNA and protein patterns. Here, we exploit recent temporal mRNA and protein expression measurements on the full surface of the blastoderm to calibrate a dynamical model of the gap gene network on the entire embryo cortex. We model the early mRNA and protein dynamics of the gap genes hunchback, Kruppel, giant, and knirps, taking as regulatory inputs the maternal Bicoid and Caudal gradients, plus the zygotic Tailless and Huckebein proteins. The model captures the expression patterns faithfully, and its predictions are assessed from gap gene mutants. The inferred network shows an architecture based on reciprocal repression between gap genes that can stably pattern the embryo on a realistic geometry but requires complex regulations such as those involving the Hunchback monomer and dimers. Sensitivity analysis identifies the posterior domain of giant as among the most fragile features of an otherwise robust network, and hints at redundant regulations by Bicoid and Hunchback, possibly reflecting recent evolutionary changes in the gap-gene network in insects.
Collapse
|
32
|
Struffi P, Corado M, Kaplan L, Yu D, Rushlow C, Small S. Combinatorial activation and concentration-dependent repression of the Drosophila even skipped stripe 3+7 enhancer. Development 2011; 138:4291-9. [PMID: 21865322 DOI: 10.1242/dev.065987] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Despite years of study, the precise mechanisms that control position-specific gene expression during development are not understood. Here, we analyze an enhancer element from the even skipped (eve) gene, which activates and positions two stripes of expression (stripes 3 and 7) in blastoderm stage Drosophila embryos. Previous genetic studies showed that the JAK-STAT pathway is required for full activation of the enhancer, whereas the gap genes hunchback (hb) and knirps (kni) are required for placement of the boundaries of both stripes. We show that the maternal zinc-finger protein Zelda (Zld) is absolutely required for activation, and present evidence that Zld binds to multiple non-canonical sites. We also use a combination of in vitro binding experiments and bioinformatics analysis to redefine the Kni-binding motif, and mutational analysis and in vivo tests to show that Kni and Hb are dedicated repressors that function by direct DNA binding. These experiments significantly extend our understanding of how the eve enhancer integrates positive and negative transcriptional activities to generate sharp boundaries in the early embryo.
Collapse
Affiliation(s)
- Paolo Struffi
- Department of Biology, New York University, New York, NY 10003, USA
| | | | | | | | | | | |
Collapse
|
33
|
Papatsenko D, Levine M. The Drosophila gap gene network is composed of two parallel toggle switches. PLoS One 2011; 6:e21145. [PMID: 21747931 PMCID: PMC3128594 DOI: 10.1371/journal.pone.0021145] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2011] [Accepted: 05/20/2011] [Indexed: 11/30/2022] Open
Abstract
Drosophila “gap” genes provide the first response to maternal gradients in the early fly embryo. Gap genes are expressed in a series of broad bands across the embryo during first hours of development. The gene network controlling the gap gene expression patterns includes inputs from maternal gradients and mutual repression between the gap genes themselves. In this study we propose a modular design for the gap gene network, involving two relatively independent network domains. The core of each network domain includes a toggle switch corresponding to a pair of mutually repressive gap genes, operated in space by maternal inputs. The toggle switches present in the gap network are evocative of the phage lambda switch, but they are operated positionally (in space) by the maternal gradients, so the synthesis rates for the competing components change along the embryo anterior-posterior axis. Dynamic model, constructed based on the proposed principle, with elements of fractional site occupancy, required 5–7 parameters to fit quantitative spatial expression data for gap gradients. The identified model solutions (parameter combinations) reproduced major dynamic features of the gap gradient system and explained gap expression in a variety of segmentation mutants.
Collapse
Affiliation(s)
- Dmitri Papatsenko
- Department of Gene and Cell Medicine, Mount Sinai School of Medicine, Black Family Stem Cell Institute, New York, New York, United States of America.
| | | |
Collapse
|
34
|
Abstract
Gap genes are involved in segment determination during the early development of the fruit fly Drosophila melanogaster as well as in other insects. This review attempts to synthesize the current knowledge of the gap gene network through a comprehensive survey of the experimental literature. I focus on genetic and molecular evidence, which provides us with an almost-complete picture of the regulatory interactions responsible for trunk gap gene expression. I discuss the regulatory mechanisms involved, and highlight the remaining ambiguities and gaps in the evidence. This is followed by a brief discussion of molecular regulatory mechanisms for transcriptional regulation, as well as precision and size-regulation provided by the system. Finally, I discuss evidence on the evolution of gap gene expression from species other than Drosophila. My survey concludes that studies of the gap gene system continue to reveal interesting and important new insights into the role of gene regulatory networks in development and evolution.
Collapse
Affiliation(s)
- Johannes Jaeger
- Centre de Regulació Genòmica, Universtitat Pompeu Fabra, Barcelona, Spain.
| |
Collapse
|
35
|
The developmental timing regulator HBL-1 modulates the dauer formation decision in Caenorhabditis elegans. Genetics 2010; 187:345-53. [PMID: 20980238 DOI: 10.1534/genetics.110.123992] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Animals developing in the wild encounter a range of environmental conditions, and so developmental mechanisms have evolved that can accommodate different environmental contingencies. Harsh environmental conditions cause Caenorhabditis elegans larvae to arrest as stress-resistant "dauer" larvae after the second larval stage (L2), thereby indefinitely postponing L3 cell fates. HBL-1 is a key transcriptional regulator of L2 vs. L3 cell fate. Through the analysis of genetic interactions between mutations of hbl-1 and of genes encoding regulators of dauer larva formation, we find that hbl-1 can also modulate the dauer formation decision in a complex manner. We propose that dynamic interactions between genes that regulate stage-specific cell fate decisions and those that regulate dauer formation promote the robustness of developmental outcomes to changing environmental conditions.
Collapse
|
36
|
Dual-functioning transcription factors in the developmental gene network of Drosophila melanogaster. BMC Bioinformatics 2010; 11:366. [PMID: 20594356 PMCID: PMC2912886 DOI: 10.1186/1471-2105-11-366] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2009] [Accepted: 07/02/2010] [Indexed: 01/10/2023] Open
Abstract
Background Quantitative models for transcriptional regulation have shown great promise for advancing our understanding of the biological mechanisms underlying gene regulation. However, all of the models to date assume a transcription factor (TF) to have either activating or repressing function towards all the genes it is regulating. Results In this paper we demonstrate, on the example of the developmental gene network in D. melanogaster, that the data-fit can be improved by up to 40% if the model is allowing certain TFs to have dual function, that is, acting as activator for some genes and as repressor for others. We demonstrate that the improvement is not due to additional flexibility in the model but rather derived from the data itself. We also found no evidence for the involvement of other known site-specific TFs in regulating this network. Finally, we propose SUMOylation as a candidate biological mechanism allowing TFs to switch their role when a small ubiquitin-like modifier (SUMO) is covalently attached to the TF. We strengthen this hypothesis by demonstrating that the TFs predicted to have dual function also contain the known SUMO consensus motif, while TFs predicted to have only one role lack this motif. Conclusions We argue that a SUMOylation-dependent mechanism allowing TFs to have dual function represents a promising area for further research and might be another step towards uncovering the biological mechanisms underlying transcriptional regulation.
Collapse
|
37
|
Tran KD, Miller MR, Doe CQ. Recombineering Hunchback identifies two conserved domains required to maintain neuroblast competence and specify early-born neuronal identity. Development 2010; 137:1421-30. [PMID: 20335359 DOI: 10.1242/dev.048678] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The Hunchback/Ikaros family of zinc-finger transcription factors is essential for specifying the anterior/posterior body axis in insects, the fate of early-born pioneer neurons in Drosophila, and for retinal and immune development in mammals. Hunchback/Ikaros proteins can directly activate or repress target gene transcription during early insect development, but their mode of action during neural development is unknown. Here, we use recombineering to generate a series of Hunchback domain deletion variants and assay their function during neurogenesis in the absence of endogenous Hunchback. Previous studies have shown that Hunchback can specify early-born neuronal identity and maintain 'young' neural progenitor (neuroblast) competence. We identify two conserved domains required for Hunchback-mediated transcriptional repression, and show that transcriptional repression is necessary and sufficient to induce early-born neuronal identity and maintain neuroblast competence. We identify pdm2 as a direct target gene that must be repressed to maintain competence, but show that additional genes must also be repressed. We propose that Hunchback maintains early neuroblast competence by silencing a suite of late-expressed genes.
Collapse
Affiliation(s)
- Khoa D Tran
- Institute of Neuroscience, Institute of Molecular Biology, Howard Hughes Medical Institute, University of Oregon, Eugene, OR 97403, USA
| | | | | |
Collapse
|
38
|
Papatsenko D. Stripe formation in the early fly embryo: principles, models, and networks. Bioessays 2009; 31:1172-80. [DOI: 10.1002/bies.200900096] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
39
|
Abstract
I provide a historical overview on the use of mathematical models to gain insight into pattern formation during early development of the fruit fly Drosophila melanogaster. It is my intention to illustrate how the aims and methodology of modelling have changed from the early beginnings of a theoretical developmental biology in the 1960s to modern-day systems biology. I show that even early modelling attempts addressed interesting and relevant questions, which were not tractable by experimental approaches. Unfortunately, their validation was severely hampered by a lack of specificity and appropriate experimental evidence. There is a simple lesson to be learned from this: we cannot deduce general rules for pattern formation from first principles or spurious reproduction of developmental phenomena. Instead, we must infer such rules (if any) from detailed and accurate studies of specific developmental systems. To achieve this, mathematical modelling must be closely integrated with experimental approaches. I report on progress that has been made in this direction in the past few years and illustrate the kind of novel insights that can be gained from such combined approaches. These insights demonstrate the great potential (and some pitfalls) of an integrative, systems-level investigation of pattern formation.
Collapse
Affiliation(s)
- Johannes Jaeger
- EMBL/CRG Research Unit in Systems Biology, CRG-Centre de Regulació Genòmica, Universitat Pompeu Fabra, Dr. Aiguader 88, 08003 Barcelona, Spain.
| |
Collapse
|
40
|
Dean D, Himes CM, Behrman E, Savage RM. Hunchback-like protein is expressed in cleavage blastomeres, gastrula epithelium, and ciliary structures in gastropods. THE BIOLOGICAL BULLETIN 2009; 217:189-201. [PMID: 19875823 DOI: 10.1086/bblv217n2p189] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
We report the expression of Hunchback (Hb)-like protein during embryonic and larval development in two caenogastropods, Crepidula fornicata and Ilyanassa obsoleta. During the cleavage stages of these species, Hb-like protein is uniformly expressed in micromere and macromere nuclei. At gastrulation, gastropod Hb-like protein is expressed in the surface epithelium that undergoes epiboly. During organogenesis, gastropod Hb-like protein is expressed in the developing ciliated structures associated with feeding and locomotion. We find no detectable gradient or regionalization of Hb-like protein in gastropod embryos or larvae that resembles the graded Hb pattern of expression observed in dipteran insect embryos. Rather we found that the spatiotemporal expression profile of gastropod Hb-like protein is nearly identical to the Hb-like patterns obtained from the polychaete Capitella sp. I and is highly similar to those reported for clitellate annelids. Based upon the comparative data collected from both ecdysozoans and lophotrochozoan lineages, our results support the hypothesis that the role of Hb in anteroposterior patterning is a derived trait specific to arthropods, and that the ancestral function of lophotrochozoan Hb-like protein played a role in the formation of the cleavage-stage blastomeres and the gastrula epithelium and in structures associated with larval feeding and locomotion.
Collapse
Affiliation(s)
- Derek Dean
- Williams College, Biology Department, Williamstown, Massachusetts 01267, USA
| | | | | | | |
Collapse
|
41
|
Papatsenko D, Goltsev Y, Levine M. Organization of developmental enhancers in the Drosophila embryo. Nucleic Acids Res 2009; 37:5665-77. [PMID: 19651877 PMCID: PMC2761283 DOI: 10.1093/nar/gkp619] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Most cell-specific enhancers are thought to lack an inherent organization, with critical binding sites distributed in a more or less random fashion. However, there are examples of fixed arrangements of binding sites, such as helical phasing, that promote the formation of higher-order protein complexes on the enhancer DNA template. Here, we investigate the regulatory ‘grammar’ of nearly 100 characterized enhancers for developmental control genes active in the early Drosophila embryo. The conservation of grammar is examined in seven divergent Drosophila genomes. Linked binding sites are observed for particular combinations of binding motifs, including Bicoid–Bicoid, Hunchback–Hunchback, Bicoid–Dorsal, Bicoid–Caudal and Dorsal–Twist. Direct evidence is presented for the importance of Bicoid–Dorsal linkage in the integration of the anterior–posterior and dorsal–ventral patterning systems. Hunchback–Hunchback interactions help explain unresolved aspects of segmentation, including the differential regulation of the eve stripe 3 + 7 and stripe 4 + 6 enhancers. We also present evidence that there is an under-representation of nucleosome positioning sequences in many enhancers, raising the possibility for a subtle higher-order structure extending across certain enhancers. We conclude that grammar of gene control regions is pervasively used in the patterning of the Drosophila embryo.
Collapse
Affiliation(s)
- Dmitri Papatsenko
- Department of Molecular Cell Biology, Division of Genetics, Genomics & Development, Center for Integrative Genomics, University of California, Berkeley, CA 94720-200, USA.
| | | | | |
Collapse
|
42
|
Marco A, Konikoff C, Karr TL, Kumar S. Relationship between gene co-expression and sharing of transcription factor binding sites in Drosophila melanogaster. Bioinformatics 2009; 25:2473-7. [PMID: 19633094 DOI: 10.1093/bioinformatics/btp462] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
MOTIVATION In functional genomics, it is frequently useful to correlate expression levels of genes to identify transcription factor binding sites (TFBS) via the presence of common sequence motifs. The underlying assumption is that co-expressed genes are more likely to contain shared TFBS and, thus, TFBS can be identified computationally. Indeed, gene pairs with a very high expression correlation show a significant excess of shared binding sites in yeast. We have tested this assumption in a more complex organism, Drosophila melanogaster, by using experimentally determined TFBS and microarray expression data. We have also examined the reverse relationship between the expression correlation and the extent of TFBS sharing. RESULTS Pairs of genes with shared TFBS show, on average, a higher degree of co-expression than those with no common TFBS in Drosophila. However, the reverse does not hold true: gene pairs with high expression correlations do not share significantly larger numbers of TFBS. Exception to this observation exists when comparing expression of genes from the earliest stages of embryonic development. Interestingly, semantic similarity between gene annotations (Biological Process) is much better associated with TFBS sharing, as compared to the expression correlation. We discuss these results in light of reverse engineering approaches to computationally predict regulatory sequences by using comparative genomics.
Collapse
Affiliation(s)
- Antonio Marco
- Center for Evolutionary Functional Genomics, The Biodesign Institute, Arizona State University, Tempe, AZ 85287-5301, USA.
| | | | | | | |
Collapse
|
43
|
Cooper MB, Loose M, Brookfield JFY. The evolutionary influence of binding site organisation on gene regulatory networks. Biosystems 2009; 96:185-93. [PMID: 19428984 DOI: 10.1016/j.biosystems.2009.02.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2008] [Revised: 01/23/2009] [Accepted: 02/01/2009] [Indexed: 12/30/2022]
Abstract
Gene regulatory networks are shaped by selection for advantageous gene expression patterns. Can we use this fact to predict and explain the structure and properties of gene regulatory networks? Here we address this question with evolutionary simulations of small (two to four genes) transcriptional regulatory networks. Each modeled network is tested for the frequency with which it evolves to produce a bimodal spatial expression pattern of a target gene (the output), in response to a linear trigger gradient (the input). By including network features such as the organisation of binding sites that do not evolve in the model, we can compare the relative chances of evolutionary success between networks differing only in these features. Specifically, we show that networks with competitive binding sites (where binding of one transcription factor excludes another) are more likely to evolve bimodal patterns of gene repression than are networks with independent binding sites (where binding of multiple transcription factors can occur simultaneously). These predictions have implications for the likely structure of gene regulatory networks carrying out bimodal (including bistable) gene expression functions in vivo. The capacity to predict the evolution of structure-function relationships in gene regulatory networks is constrained by gaps in current understanding such as the unknown prior probabilities of the network features, and the quantitative nature of the molecular interactions involved in gene expression. Methods for the circumvention of these constraints, and the potential of the evolutionary modeling approach, are discussed.
Collapse
Affiliation(s)
- Max B Cooper
- Institute of Genetics, School of Biology, University of Nottingham, Queens Medical Centre, Nottingham, NG7 2UH, United Kingdom.
| | | | | |
Collapse
|
44
|
Kim J, He X, Sinha S. Evolution of regulatory sequences in 12 Drosophila species. PLoS Genet 2009; 5:e1000330. [PMID: 19132088 PMCID: PMC2607023 DOI: 10.1371/journal.pgen.1000330] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2008] [Accepted: 12/05/2008] [Indexed: 01/07/2023] Open
Abstract
Characterization of the evolutionary constraints acting on cis-regulatory sequences is crucial to comparative genomics and provides key insights on the evolution of organismal diversity. We study the relationships among orthologous cis-regulatory modules (CRMs) in 12 Drosophila species, especially with respect to the evolution of transcription factor binding sites, and report statistical evidence in favor of key evolutionary hypotheses. Binding sites are found to have position-specific substitution rates. However, the selective forces at different positions of a site do not act independently, and the evidence suggests that constraints on sites are often based on their exact binding affinities. Binding site loss is seen to conform to a molecular clock hypothesis. The rate of site loss is transcription factor–specific and depends on the strength of binding and, in some cases, the presence of other binding sites in close proximity. Our analysis is based on a novel computational method for aligning orthologous CRMs on a tree, which rigorously accounts for alignment uncertainties and exploits binding site predictions through a unified probabilistic framework. Finally, we report weak purifying selection on short deletions, providing important clues about overall spatial constraints on CRMs. Our results present a complex picture of regulatory sequence evolution, with substantial plasticity that depends on a number of factors. The insights gained in this study will help us to understand the combinatorial control of gene regulation and how it evolves. They will pave the way for theoretical models that are cognizant of the important determinants of regulatory sequence evolution and will be critical in genome-wide identification of non-coding sequences under purifying or positive selection. The spatial–temporal expression pattern of a gene, which is crucial to its function, is controlled by cis-regulatory DNA sequences. Forming the basic units of regulatory sequences are transcription factor binding sites, often organized into larger modules that determine gene expression in response to combinatorial environmental signals. Understanding the conservation and change of regulatory sequences is critical to our knowledge of the unity as well as diversity of animal development and phenotypes. In this paper, we study the evolution of sequences involved in the regulation of body patterning in the Drosophila embryo. We find that mutations of nucleotides within a binding site are constrained by evolutionary forces to preserve the site's binding affinity to the cognate transcription factor. Functional binding sites are frequently destroyed during evolution and the rate of loss across evolutionary spans is roughly constant. We also find that the evolutionary fate of a site strongly depends on its context; a pair of interacting sites are more likely to survive mutational forces than isolated sites. Together, these findings provide new insights and pose new challenges to our understanding of cis-regulatory sequences and their evolution.
Collapse
Affiliation(s)
- Jaebum Kim
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Xin He
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- * E-mail:
| |
Collapse
|