1
|
Dias C, Dilão R. Modelling and calibration of pair-rule protein patterns in Drosophila embryo: From Even-skipped and Fushi-tarazu to Wingless expression networks. Dev Biol 2024; 517:178-190. [PMID: 39369936 DOI: 10.1016/j.ydbio.2024.09.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 09/16/2024] [Accepted: 09/30/2024] [Indexed: 10/08/2024]
Abstract
We modelled and calibrated the distributions of the seven-stripe patterns of Even-skipped (Eve) and Fushi-tarazu (Ftz) pair-rule proteins along the anteroposterior axis of the Drosphila embryo, established during early development. We have identified the putative repressive combinations for five Eve enhancers, and we have explored the relationship between Eve and Ftz for complementary patterns. The regulators of Eve and Ftz are stripe-specific DNA enhancers with embryo position-dependent activation rates and are regulated by the gap family of proteins. We achieved remarkable data matching of the Eve stripe pattern, and the calibrated model reproduces gap gene mutation experiments. Extended work inferring the Wingless (Wg) fourteen stripe pattern from Eve and Ftz enhancers have been proposed, clarifying the hierarchical structure of Drosphila's genetic expression network during early development.
Collapse
Affiliation(s)
- Catarina Dias
- University of Lisbon, IST, Dep. of Physics, Nonlinear Dynamics Group, Av. Rovisco Pais, 1049-001, Lisbon, Portugal.
| | - Rui Dilão
- University of Lisbon, IST, Dep. of Physics, Nonlinear Dynamics Group, Av. Rovisco Pais, 1049-001, Lisbon, Portugal.
| |
Collapse
|
2
|
Masuda LHP, Sabino AU, Reinitz J, Ramos AF, Machado-Lima A, Andrioli LP. Global repression by tailless during segmentation. Dev Biol 2024; 505:11-23. [PMID: 37879494 PMCID: PMC10949167 DOI: 10.1016/j.ydbio.2023.09.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 09/26/2023] [Accepted: 09/28/2023] [Indexed: 10/27/2023]
Abstract
The orphan nuclear receptor Tailless (Tll) exhibits conserved roles in brain formation and maintenance that are shared, for example, with vertebrate orthologous forms (Tlx). However, the early expression of tll in two gap domains in the segmentation cascade of Drosophila is unusual even for most other insects. Here we investigate tll regulation on pair-rule stripes. With ectopic misexpression of tll we detected unexpected repression of almost all pair-rule stripes of hairy (h), even-skipped (eve), runt (run), and fushi-tarazu (ftz). Examining Tll embryonic ChIP-chip data with regions mapped as Cis-Regulatory Modules (CRMs) of pair-rule stripes we verified Tll interactions to these regions. With the ChIP-chip data we also verified Tll interactions to the CRMs of gap domains and in the misexpression assay, Tll-mediated repression on Kruppel (Kr), kni (kni) and giant (gt) according to their differential sensitivity to Tll. These results with gap genes confirmed previous data from the literature and argue against indirect repression roles of Tll in the striped pattern. Moreover, the prediction of Tll binding sites in the CRMs of eve stripes and the mathematical modeling of their removal using an experimentally validated theoretical framework shows effects on eve stripes compatible with the absence of a repressor binding to the CRMs. In addition, modeling increased tll levels in the embryo results in the differential repression of eve stripes, agreeing well with the results of the misexpression assay. In genetic assays we investigated eve 5, that is strongly repressed by the ectopic domain and representative of more central stripes not previously implied to be under direct regulation of tll. While this stripe is little affected in tll-, its posterior border is expanded in gt- but detected with even greater expansion in gt-;tll-. We end up by discussing tll with key roles in combinatorial repression mechanisms to contain the expression of medial patterns of the segmentation cascade in the extremities of the embryo.
Collapse
Affiliation(s)
| | - Alan Utsuni Sabino
- Departamento de Radiologia e Oncologia, Instituto do Câncer do Estado de São Paulo, Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
| | - John Reinitz
- Departments of Statistics, Ecology and Evolution, Molecular Genetics & Cell Biology, Institute of Genomics and Systems Biology, University of Chicago, Chicago, IL, USA
| | | | - Ariane Machado-Lima
- Escola de Artes, Ciências e Humanidades da Universidade de São Paulo, São Paulo, Brazil
| | - Luiz Paulo Andrioli
- Escola de Artes, Ciências e Humanidades da Universidade de São Paulo, São Paulo, Brazil.
| |
Collapse
|
3
|
Shen J, Liu F, Tu Y, Tang C. Finding gene network topologies for given biological function with recurrent neural network. Nat Commun 2021; 12:3125. [PMID: 34035278 PMCID: PMC8149884 DOI: 10.1038/s41467-021-23420-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 04/28/2021] [Indexed: 11/12/2022] Open
Abstract
Searching for possible biochemical networks that perform a certain function is a challenge in systems biology. For simple functions and small networks, this can be achieved through an exhaustive search of the network topology space. However, it is difficult to scale this approach up to larger networks and more complex functions. Here we tackle this problem by training a recurrent neural network (RNN) to perform the desired function. By developing a systematic perturbative method to interrogate the successfully trained RNNs, we are able to distill the underlying regulatory network among the biological elements (genes, proteins, etc.). Furthermore, we show several cases where the regulation networks found by RNN can achieve the desired biological function when its edges are expressed by more realistic response functions, such as the Hill-function. This method can be used to link topology and function by helping uncover the regulation logic and network topology for complex tasks. Networks are useful ways to describe interactions between molecules in a cell, but predicting the real topology of large networks can be challenging. Here, the authors use deep learning to predict the topology of networks that perform biologically-plausible functions.
Collapse
Affiliation(s)
- Jingxiang Shen
- Center for Quantitative Biology, Peking University, Beijing, China.,School of Physics, Peking University, Beijing, China
| | - Feng Liu
- Center for Quantitative Biology, Peking University, Beijing, China.,School of Physics, Peking University, Beijing, China
| | - Yuhai Tu
- IBM T. J. Watson Research Center, Yorktown Heights, New York, USA
| | - Chao Tang
- Center for Quantitative Biology, Peking University, Beijing, China. .,School of Physics, Peking University, Beijing, China. .,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China.
| |
Collapse
|
4
|
Makashov AA, Myasnikova EM, Spirov AV. Fuzzy Linguistic Modeling of the Regulation of Drosophila Segmentation Genes. Biophysics (Nagoya-shi) 2021. [DOI: 10.1134/s0006350921010073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
5
|
Myasnikova E, Spirov A. Gene regulatory networks in Drosophila early embryonic development as a model for the study of the temporal identity of neuroblasts. Biosystems 2020; 197:104192. [PMID: 32619531 DOI: 10.1016/j.biosystems.2020.104192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 04/30/2020] [Accepted: 06/21/2020] [Indexed: 11/27/2022]
Abstract
Genes belonging to the "gap" and "gap-like" family constitute the best-studied gene regulatory networks (GRNs) in Drosophila embryogenesis. Gap genes are a core of two subnetworks controlling embryonic segmentation: (hunchback, hb; Krüppel, Kr; giant, gt; and knirps, kni) and (hb; Kr; pou-domain, pdm; and, probably, castor, cas). Of particular interest is that (hb, Kr, pdm, cas) also specifies the temporal identity of stem cells, neuroblasts, in Drosophila neurogenesis. This GRN controls the sequential differentiation of neuroblasts during the asymmetric cell division. In the last decades, modeling of the patterning of gene ensemble (hb, Kr, gt, kni) in segmentation was in the center of attention. We show that our previously published and extensively studied model at a certain level of external factors is able to reproduce temporal patterns of (hb, Kr, pdm, cas) in neurogenesis with minor evolutionary explicable modifications. This result testifies in favor of a hypothesis that the similarity of two gene ensembles active in segmentation and neurogenesis is a result of co-option of the network architecture in evolution from the common ancestral form. By means of the model dynamical analysis, it is shown that the establishment of the robust patterns in both systems could be explained in terms of the action of attractors in the gap gene dynamical system. We formulate the common principles underlying the robustness of both GRNs in segmentation and neurogenesis due to the similar functional organization of the gene ensembles as having the same evolutionary origin.
Collapse
Affiliation(s)
- Ekaterina Myasnikova
- Peter the Great Saint-Petersburg Polytechnical University, 29 Politekhnicheskaya str, St. Petersburg, 195251, Russia.
| | - Alexander Spirov
- I. M. Sechenov Institute of Evolutionary Physiology and Biochemistry Russian Academy of Sciences, 44 Thorez Pr, St.Petersburg, 194223, Russia; Computer Science and CEWIT, SUNY Stony Brook, Stony Brook, 1500 Stony Brook Road, Stony Brook, 11794, NY, USA
| |
Collapse
|
6
|
An Atlas of Transcription Factors Expressed in Male Pupal Terminalia of Drosophila melanogaster. G3-GENES GENOMES GENETICS 2019; 9:3961-3972. [PMID: 31619460 PMCID: PMC6893207 DOI: 10.1534/g3.119.400788] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
During development, transcription factors and signaling molecules govern gene regulatory networks to direct the formation of unique morphologies. As changes in gene regulatory networks are often implicated in morphological evolution, mapping transcription factor landscapes is important, especially in tissues that undergo rapid evolutionary change. The terminalia (genital and anal structures) of Drosophila melanogaster and its close relatives exhibit dramatic changes in morphology between species. While previous studies have identified network components important for patterning the larval genital disc, the networks governing adult structures during pupal development have remained uncharted. Here, we performed RNA-seq in whole Drosophila melanogaster male terminalia followed by in situ hybridization for 100 highly expressed transcription factors during pupal development. We find that the male terminalia are highly patterned during pupal stages and that specific transcription factors mark separate structures and substructures. Our results are housed online in a searchable database (https://flyterminalia.pitt.edu/) as a resource for the community. This work lays a foundation for future investigations into the gene regulatory networks governing the development and evolution of Drosophila terminalia.
Collapse
|
7
|
Wunderlich Z, Fowlkes CC, Eckenrode KB, Bragdon MDJ, Abiri A, DePace AH. Quantitative Comparison of the Anterior-Posterior Patterning System in the Embryos of Five Drosophila Species. G3 (BETHESDA, MD.) 2019; 9:2171-2182. [PMID: 31048401 PMCID: PMC6643877 DOI: 10.1534/g3.118.200953] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Accepted: 05/01/2019] [Indexed: 11/18/2022]
Abstract
Complex spatiotemporal gene expression patterns direct the development of the fertilized egg into an adult animal. Comparisons across species show that, in spite of changes in the underlying regulatory DNA sequence, developmental programs can be maintained across millions of years of evolution. Reciprocally, changes in gene expression can be used to generate morphological novelty. Distinguishing between changes in regulatory DNA that lead to changes in gene expression and those that do not is therefore a central goal of evolutionary developmental biology. Quantitative, spatially-resolved measurements of developmental gene expression patterns play a crucial role in this goal, enabling the detection of subtle phenotypic differences between species and the development of computations models that link the sequence of regulatory DNA to expression patterns. Here we report the generation of two atlases of cellular resolution gene expression measurements for the primary anterior-posterior patterning genes in Drosophila simulans and Drosophila virilis By combining these data sets with existing atlases for three other Drosophila species, we detect subtle differences in the gene expression patterns and dynamics driving the highly conserved axis patterning system and delineate inter-species differences in the embryonic morphology. These data sets will be a resource for future modeling studies of the evolution of developmental gene regulatory networks.
Collapse
Affiliation(s)
- Zeba Wunderlich
- Department of Developmental and Cell Biology, University of California, Irvine, CA, 92697
| | - Charless C Fowlkes
- Department of Computer Science, University of California, Irvine, CA, 92697
| | - Kelly B Eckenrode
- Department of Systems Biology, Harvard Medical School, Boston, MA, 20115
| | - Meghan D J Bragdon
- Department of Systems Biology, Harvard Medical School, Boston, MA, 20115
| | - Arash Abiri
- Department of Developmental and Cell Biology, University of California, Irvine, CA, 92697
| | - Angela H DePace
- Department of Systems Biology, Harvard Medical School, Boston, MA, 20115
| |
Collapse
|
8
|
Park J, Estrada J, Johnson G, Vincent BJ, Ricci-Tam C, Bragdon MDJ, Shulgina Y, Cha A, Wunderlich Z, Gunawardena J, DePace AH. Dissecting the sharp response of a canonical developmental enhancer reveals multiple sources of cooperativity. eLife 2019; 8:e41266. [PMID: 31223115 PMCID: PMC6588347 DOI: 10.7554/elife.41266] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 03/04/2019] [Indexed: 12/19/2022] Open
Abstract
Developmental enhancers integrate graded concentrations of transcription factors (TFs) to create sharp gene expression boundaries. Here we examine the hunchback P2 (HbP2) enhancer which drives a sharp expression pattern in the Drosophila blastoderm embryo in response to the transcriptional activator Bicoid (Bcd). We systematically interrogate cis and trans factors that influence the shape and position of expression driven by HbP2, and find that the prevailing model, based on pairwise cooperative binding of Bcd to HbP2 is not adequate. We demonstrate that other proteins, such as pioneer factors, Mediator and histone modifiers influence the shape and position of the HbP2 expression pattern. Comparing our results to theory reveals how higher-order cooperativity and energy expenditure impact boundary location and sharpness. Our results emphasize that the bacterial view of transcription regulation, where pairwise interactions between regulatory proteins dominate, must be reexamined in animals, where multiple molecular mechanisms collaborate to shape the gene regulatory function.
Collapse
Affiliation(s)
- Jeehae Park
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Javier Estrada
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Gemma Johnson
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Ben J Vincent
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Chiara Ricci-Tam
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Meghan DJ Bragdon
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | | | - Anna Cha
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Zeba Wunderlich
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | | | - Angela H DePace
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| |
Collapse
|
9
|
Zubair A, Rosen IG, Nuzhdin SV, Marjoram P. Bayesian model selection for the Drosophila gap gene network. BMC Bioinformatics 2019; 20:327. [PMID: 31195954 PMCID: PMC6567646 DOI: 10.1186/s12859-019-2888-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Accepted: 05/09/2019] [Indexed: 11/10/2022] Open
Abstract
Background The gap gene system controls the early cascade of the segmentation pathway in Drosophila melanogaster as well as other insects. Owing to its tractability and key role in embryo patterning, this system has been the focus for both computational modelers and experimentalists. The gap gene expression dynamics can be considered strictly as a one-dimensional process and modeled as a system of reaction-diffusion equations. While substantial progress has been made in modeling this phenomenon, there still remains a deficit of approaches to evaluate competing hypotheses. Most of the model development has happened in isolation and there has been little attempt to compare candidate models. Results The Bayesian framework offers a means of doing formal model evaluation. Here, we demonstrate how this framework can be used to compare different models of gene expression. We focus on the Papatsenko-Levine formalism, which exploits a fractional occupancy based approach to incorporate activation of the gap genes by the maternal genes and cross-regulation by the gap genes themselves. The Bayesian approach provides insight about relationship between system parameters. In the regulatory pathway of segmentation, the parameters for number of binding sites and binding affinity have a negative correlation. The model selection analysis supports a stronger binding affinity for Bicoid compared to other regulatory edges, as shown by a larger posterior mean. The procedure doesn’t show support for activation of Kruppel by Bicoid. Conclusions We provide an efficient solver for the general representation of the Papatsenko-Levine model. We also demonstrate the utility of Bayes factor for evaluating candidate models for spatial pattering models. In addition, by using the parallel tempering sampler, the convergence of Markov chains can be remarkably improved and robust estimates of Bayes factors obtained. Electronic supplementary material The online version of this article (10.1186/s12859-019-2888-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Asif Zubair
- Molecular and Computational Biology, USC, 1050 Childs Way, Los Angeles, CA 90089-2532, US.
| | - I Gary Rosen
- Department of Mathematics, USC, 3620 S. Vermont Ave., Los Angeles, CA 90089-2532, US
| | - Sergey V Nuzhdin
- Molecular and Computational Biology, USC, 1050 Childs Way, Los Angeles, CA 90089-2532, US
| | - Paul Marjoram
- Molecular and Computational Biology, USC, 1050 Childs Way, Los Angeles, CA 90089-2532, US
| |
Collapse
|
10
|
Verd B, Monk NAM, Jaeger J. Modularity, criticality, and evolvability of a developmental gene regulatory network. eLife 2019; 8:e42832. [PMID: 31169494 PMCID: PMC6645726 DOI: 10.7554/elife.42832] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2019] [Accepted: 06/05/2019] [Indexed: 01/16/2023] Open
Abstract
The existence of discrete phenotypic traits suggests that the complex regulatory processes which produce them are functionally modular. These processes are usually represented by networks. Only modular networks can be partitioned into intelligible subcircuits able to evolve relatively independently. Traditionally, functional modularity is approximated by detection of modularity in network structure. However, the correlation between structure and function is loose. Many regulatory networks exhibit modular behaviour without structural modularity. Here we partition an experimentally tractable regulatory network-the gap gene system of dipteran insects-using an alternative approach. We show that this system, although not structurally modular, is composed of dynamical modules driving different aspects of whole-network behaviour. All these subcircuits share the same regulatory structure, but differ in components and sensitivity to regulatory interactions. Some subcircuits are in a state of criticality, while others are not, which explains the observed differential evolvability of the various expression features in the system.
Collapse
Affiliation(s)
- Berta Verd
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
- Konrad Lorenz Institute for Evolution and Cognition Research (KLI)KlosterneuburgAustria
- Department of GeneticsUniversity of CambridgeCambridgeUnited Kingdom
| | - Nicholas AM Monk
- School of Mathematics and StatisticsUniversity of SheffieldSheffieldUnited States
| | - Johannes Jaeger
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
- Konrad Lorenz Institute for Evolution and Cognition Research (KLI)KlosterneuburgAustria
- School of Mathematics and StatisticsUniversity of SheffieldSheffieldUnited States
- Wissenschaftskolleg zu BerlinBerlinGermany
- Center for Systems Biology Dresden (CSBD)DresdenGermany
- Complexity Science Hub (CSH)ViennaAustria
- Centre de Recherches Interdisciplinaires (CRI)ParisFrance
| |
Collapse
|
11
|
Zhornikova P, Golyandina N, Spirov A. Noise model estimation with application to gene expression. J Bioinform Comput Biol 2019; 17:1950009. [PMID: 31057070 DOI: 10.1142/s0219720019500094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Algorithms for the estimation of noise level and the detection of noise model are proposed. They are applied to gene expression data for Drosophila embryos. The 2D data on gene expression and the extracted 1D profiles are considered. Since the 1D data contain processing errors, an algorithm for separation of these processing errors is constructed to estimate the biological noise level. An approach to discrimination between the additive and multiplicative models is suggested for the 1D and 2D cases. Singular spectrum analysis and its 2D extension are exploited for the pattern extraction. The algorithms are tested on artificial data similar to the real data. Comparison of the results, which are obtained by the 1D and 2D methods, is performed for Krüppel and giant genes.
Collapse
Affiliation(s)
- Polina Zhornikova
- * Faculty of Mathematics and Mechanics, St. Petersburg State University, Universitetskaya Nab. 7/9, 199034 St. Petersburg, Russia
| | - Nina Golyandina
- * Faculty of Mathematics and Mechanics, St. Petersburg State University, Universitetskaya Nab. 7/9, 199034 St. Petersburg, Russia
| | - Alexander Spirov
- † The Sechenov Institute of Evolutionary Physiology and Biochemistry Russian Academy of Sciences, Torez Pr. 44, 194223 St. Petersburg, Russia
| |
Collapse
|
12
|
Spatial gradient of bicoid is well explained by Birnbaum-Saunders distribution. Med Hypotheses 2018; 122:73-81. [PMID: 30593428 DOI: 10.1016/j.mehy.2018.10.023] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2018] [Revised: 10/08/2018] [Accepted: 10/21/2018] [Indexed: 01/17/2023]
Abstract
bicoid is a maternally transcribed gene which plays a pivotal role during the early developmental stage of Drosophila melanogaster by acting as an essential input to the segmentation network. Therefore, fundamental insights into gene cross-regulations of segmentation network expect to be unveiled by presenting an accurate mathematical model for bicoid gene expression profile. In this paper, an extended version of Birnbaum-Saunders with four parameters is introduced and evaluated to describe the spatial gradient of this gene. Theoretical aspects of four-parameter Birnbaum-Saunders and the estimated parameters are presented and thoroughly assessed for different embryos. The reliability and validity of the results are evaluated via both simulation studies and real data sets and thereby adding more confidence and value to the findings of this research.
Collapse
|
13
|
Surkova S, Golubkova E, Mamon L, Samsonova M. Dynamic maternal gradients and morphogenetic networks in Drosophila early embryo. Biosystems 2018; 173:207-213. [DOI: 10.1016/j.biosystems.2018.10.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Revised: 10/06/2018] [Accepted: 10/08/2018] [Indexed: 11/30/2022]
|
14
|
|
15
|
Vincent BJ, Staller MV, Lopez-Rivera F, Bragdon MDJ, Pym ECG, Biette KM, Wunderlich Z, Harden TT, Estrada J, DePace AH. Hunchback is counter-repressed to regulate even-skipped stripe 2 expression in Drosophila embryos. PLoS Genet 2018; 14:e1007644. [PMID: 30192762 PMCID: PMC6145585 DOI: 10.1371/journal.pgen.1007644] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 09/19/2018] [Accepted: 08/17/2018] [Indexed: 01/18/2023] Open
Abstract
Hunchback is a bifunctional transcription factor that can activate and repress gene expression in Drosophila development. We investigated the regulatory DNA sequence features that control Hunchback function by perturbing enhancers for one of its target genes, even-skipped (eve). While Hunchback directly represses the eve stripe 3+7 enhancer, we found that in the eve stripe 2+7 enhancer, Hunchback repression is prevented by nearby sequences-this phenomenon is called counter-repression. We also found evidence that Caudal binding sites are responsible for counter-repression, and that this interaction may be a conserved feature of eve stripe 2 enhancers. Our results alter the textbook view of eve stripe 2 regulation wherein Hb is described as a direct activator. Instead, to generate stripe 2, Hunchback repression must be counteracted. We discuss how counter-repression may influence eve stripe 2 regulation and evolution.
Collapse
Affiliation(s)
- Ben J. Vincent
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Max V. Staller
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Francheska Lopez-Rivera
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Meghan D. J. Bragdon
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Edward C. G. Pym
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Kelly M. Biette
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Zeba Wunderlich
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Timothy T. Harden
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
16
|
Ghodsi Z, Hassani H, Kalantari M, Silva ES. Estimation of protein diffusion parameters. Stat (Int Stat Inst) 2018. [DOI: 10.1002/sta4.192] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Zara Ghodsi
- Statistical Research Centre Bournemouth University Poole BH12 5BB
| | - Hossein Hassani
- Research Institute for Energy Management and Planning University of Tehran No. 13, Qods St Tehran 1417466191 Iran
| | - Mahdi Kalantari
- Department of Statistics Payame Noor University Tehran 19395‐4697 Iran
| | - Emmanuel Sirimal Silva
- Fashion Business School, London College of Fashion University of the Arts London London UK
| |
Collapse
|
17
|
Movahedifar M, Yarmohammadi M, Hassani H. Bicoid signal extraction: Another powerful approach. Math Biosci 2018; 303:52-61. [PMID: 29981354 DOI: 10.1016/j.mbs.2018.06.002] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2018] [Accepted: 06/14/2018] [Indexed: 11/15/2022]
Abstract
This belief has been widely accepted that Bicoid (Bcd) protein distributes in a concentration gradient that organizes the anterior/posterior axis of the Drosophila embryo. Segment polarity protein provides positional cues for the development of head and thoracic segments. Therefore As a result of its essential role, modeling the Bicoid gradient has been welcomed by many researchers in various scientific fields. In this paper, We present investigation of gene expression profiles by means of Singular Spectrum Decomposition (SSD), an optimizing version of Singular Spectrum Analysis for filtering and extracting the bicoid gene expression signal. The results with strong evidence suggest that the proposed technique is capable of eliminating noise and can be considered as an acceptable method for filtering gene expression.
Collapse
Affiliation(s)
- Maryam Movahedifar
- Department of Statistics, Payame Noor University, 19395-4697, Tehran, Iran
| | | | - Hossein Hassani
- Research Institute for Energy Management and Planning, University of Tehran, No. 9 Qods St, Tehran, Iran.
| |
Collapse
|
18
|
Bothma JP, Norstad MR, Alamos S, Garcia HG. LlamaTags: A Versatile Tool to Image Transcription Factor Dynamics in Live Embryos. Cell 2018; 173:1810-1822.e16. [PMID: 29754814 DOI: 10.1016/j.cell.2018.03.069] [Citation(s) in RCA: 85] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2017] [Revised: 02/28/2018] [Accepted: 03/27/2018] [Indexed: 11/18/2022]
Abstract
Embryonic cell fates are defined by transcription factors that are rapidly deployed, yet attempts to visualize these factors in vivo often fail because of slow fluorescent protein maturation. Here, we pioneer a protein tag, LlamaTag, which circumvents this maturation limit by binding mature fluorescent proteins, making it possible to visualize transcription factor concentration dynamics in live embryos. Implementing this approach in the fruit fly Drosophila melanogaster, we discovered stochastic bursts in the concentration of transcription factors that are correlated with bursts in transcription. We further used LlamaTags to show that the concentration of protein in a given nucleus heavily depends on transcription of that gene in neighboring nuclei; we speculate that this inter-nuclear signaling is an important mechanism for coordinating gene expression to delineate straight and sharp boundaries of gene expression. Thus, LlamaTags now make it possible to visualize the flow of information along the central dogma in live embryos.
Collapse
Affiliation(s)
- Jacques P Bothma
- Department of Molecular & Cell Biology, UC Berkeley, Berkeley, CA 94720, USA
| | - Matthew R Norstad
- Department of Molecular & Cell Biology, UC Berkeley, Berkeley, CA 94720, USA
| | - Simon Alamos
- Department of Plant and Microbial Biology, UC Berkeley, Berkeley, CA 94720, USA
| | - Hernan G Garcia
- Department of Molecular & Cell Biology, UC Berkeley, Berkeley, CA 94720, USA; Department of Physics, UC Berkeley, Berkeley, CA 94720, USA; Biophysics Graduate Group, UC Berkeley, Berkeley, CA 94720, USA; Institute for Quantitative Biosciences-QB3, University of California, Berkeley, CA 94720, USA.
| |
Collapse
|
19
|
A damped oscillator imposes temporal order on posterior gap gene expression in Drosophila. PLoS Biol 2018; 16:e2003174. [PMID: 29451884 PMCID: PMC5832388 DOI: 10.1371/journal.pbio.2003174] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2017] [Revised: 03/01/2018] [Accepted: 01/31/2018] [Indexed: 12/21/2022] Open
Abstract
Insects determine their body segments in two different ways. Short-germband insects, such as the flour beetle Tribolium castaneum, use a molecular clock to establish segments sequentially. In contrast, long-germband insects, such as the vinegar fly Drosophila melanogaster, determine all segments simultaneously through a hierarchical cascade of gene regulation. Gap genes constitute the first layer of the Drosophila segmentation gene hierarchy, downstream of maternal gradients such as that of Caudal (Cad). We use data-driven mathematical modelling and phase space analysis to show that shifting gap domains in the posterior half of the Drosophila embryo are an emergent property of a robust damped oscillator mechanism, suggesting that the regulatory dynamics underlying long- and short-germband segmentation are much more similar than previously thought. In Tribolium, Cad has been proposed to modulate the frequency of the segmentation oscillator. Surprisingly, our simulations and experiments show that the shift rate of posterior gap domains is independent of maternal Cad levels in Drosophila. Our results suggest a novel evolutionary scenario for the short- to long-germband transition and help explain why this transition occurred convergently multiple times during the radiation of the holometabolan insects. Different insect species exhibit one of two distinct modes of determining their body segments (known as segmentation) during development: they either use a molecular oscillator to position segments sequentially, or they generate segments simultaneously through a hierarchical gene-regulatory cascade. The sequential mode is ancestral, while the simultaneous mode has been derived from it independently several times during evolution. In this paper, we present evidence suggesting that simultaneous segmentation also involves an oscillator in the posterior end of the embryo of the vinegar fly, Drosophila melanogaster. This surprising result indicates that both modes of segment determination are much more similar than previously thought. Such similarity provides an important step towards our understanding of the frequent evolutionary transitions observed between sequential and simultaneous segmentation.
Collapse
|
20
|
|
21
|
Barr KA, Martinez C, Moran JR, Kim AR, Ramos AF, Reinitz J. Synthetic enhancer design by in silico compensatory evolution reveals flexibility and constraint in cis-regulation. BMC SYSTEMS BIOLOGY 2017; 11:116. [PMID: 29187214 PMCID: PMC5708098 DOI: 10.1186/s12918-017-0485-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Accepted: 11/09/2017] [Indexed: 11/12/2022]
Abstract
BACKGROUND Models that incorporate specific chemical mechanisms have been successful in describing the activity of Drosophila developmental enhancers as a function of underlying transcription factor binding motifs. Despite this, the minimum set of mechanisms required to reconstruct an enhancer from its constituent parts is not known. Synthetic biology offers the potential to test the sufficiency of known mechanisms to describe the activity of enhancers, as well as to uncover constraints on the number, order, and spacing of motifs. RESULTS Using a functional model and in silico compensatory evolution, we generated putative synthetic even-skipped stripe 2 enhancers with varying degrees of similarity to the natural enhancer. These elements represent the evolutionary trajectories of the natural stripe 2 enhancer towards two synthetic enhancers designed ab initio. In the first trajectory, spatially regulated expression was maintained, even after more than a third of binding sites were lost. In the second, sequences with high similarity to the natural element did not drive expression, but a highly diverged sequence about half the length of the minimal stripe 2 enhancer drove ten times greater expression. Additionally, homotypic clusters of Zelda or Stat92E motifs, but not Bicoid, drove expression in developing embryos. CONCLUSIONS Here, we present a functional model of gene regulation to test the degree to which the known transcription factors and their interactions explain the activity of the Drosophila even-skipped stripe 2 enhancer. Initial success in the first trajectory showed that the gene regulation model explains much of the function of the stripe 2 enhancer. Cases where expression deviated from prediction indicates that undescribed factors likely act to modulate expression. We also showed that activation driven Bicoid and Hunchback is highly sensitive to spatial organization of binding motifs. In contrast, Zelda and Stat92E drive expression from simple homotypic clusters, suggesting that activation driven by these factors is less constrained. Collectively, the 40 sequences generated in this work provides a powerful training set for building future models of gene regulation.
Collapse
Affiliation(s)
- Kenneth A Barr
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Zoology 111, 1101 E 57th St, Chicago, 60637, Illinois, USA.
- Department of Ecology and Evolution, The University of Chicago, Chicago, 60637, Illinois, USA.
| | - Carlos Martinez
- Department Biochemistry and Molecular Genetics, Northwestern University, Chicago, 60611, Illinois, USA
| | - Jennifer R Moran
- Department Human Genetics, The University of Chicago, Chicago, 60637, Illinois, USA
- Institute for Genomics & Systems Biology, The University of Chicago, Chicago, 60637, Illinois, USA
| | - Ah-Ram Kim
- School of Life Science, Handong Global University, Pohang, 37554, Gyeongbuk, South Korea
| | - Alexandre F Ramos
- Departamento de Radiologia - Faculdade de Medicina, Universidade de São Paulo & Instituto do Câncer do Estado de São Paulo, São Paulo, SP CEP, 05403-911, Brazil
- Escola de Artes, Ciências e Humanidades & Núcleo de Estudos Interdisciplinares em Sistemas Complexos, Universidade de São Paulo, Av. Arlindo Béttio, São Paulo, 1000 CEP 03828-000, SP, Brazil
| | - John Reinitz
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Zoology 111, 1101 E 57th St, Chicago, 60637, Illinois, USA
- Department of Ecology and Evolution, The University of Chicago, Chicago, 60637, Illinois, USA
- Institute for Genomics & Systems Biology, The University of Chicago, Chicago, 60637, Illinois, USA
- Department Statistics, The University of Chicago, 5747 S. Ellis Avenue Jones 312, Chicago, 60637, IL, USA
| |
Collapse
|
22
|
Abdol AM, Bedard A, Lánský I, Kaandorp JA. High-throughput method for extracting and visualizing the spatial gene expressions from in situ hybridization images: A case study of the early development of the sea anemone Nematostella vectensis. Gene Expr Patterns 2017; 27:36-45. [PMID: 29122675 DOI: 10.1016/j.gep.2017.10.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Revised: 10/12/2017] [Accepted: 10/17/2017] [Indexed: 01/29/2023]
Abstract
Studying the spatial gene expression profiles from in situ hybridization images of the embryo is one of the first steps toward the comprehensive understanding of gene interactions in an organism. In the case of N. vectensis, extracting and collecting these data is a challenging task due to the difficulty of detecting the cell layer through the transparent body plan and changing morphology during the blastula and gastrula stages. Here, first, we introduce a method to algorithmically identify and track the cell layer in N. vectensis embryo from the late blastula to the late gastrula stage. With this, we will be able to extract spatial expression profiles of genes alongside the cell layer and consequently reconstructing the 1D representation of gene expression profiles. Furthermore, we use the morphological configurations of the embryo extracted from confocal images, to model the dynamics of embryos morphology during the gastrulation process in 2D. Ultimately, we provide a visualization tool for studying and comparing the extracted spatial gene expression profiles over the simulated embryo. We anticipate that our method of extraction and visualization to be a starting point for quantifying and collecting more in situ images from various sources, which can potentially accelerate our understanding of gene interactions in the early development of N. vectensis. The method allows researchers to visualize and compare the different gene expressions from different in situ images or different experiments. As an example, we were able to show the complementary expression of NvFoxA-NvSnailA and NvBra-NvErg in the central domain and central/external rings during the development which suggests the possible repression effects between each pair; as it has been discovered by functional analysis.
Collapse
Affiliation(s)
- A M Abdol
- University of Amsterdam, Science Park 904, 1098XH Amsterdam, The Netherlands.
| | - Andrew Bedard
- University of Amsterdam, Science Park 904, 1098XH Amsterdam, The Netherlands.
| | - Imke Lánský
- University of Amsterdam, Science Park 904, 1098XH Amsterdam, The Netherlands.
| | - J A Kaandorp
- University of Amsterdam, Science Park 904, 1098XH Amsterdam, The Netherlands.
| |
Collapse
|
23
|
Howard J, Garzon-Coral C. Physical Limits on the Precision of Mitotic Spindle Positioning by Microtubule Pushing forces: Mechanics of mitotic spindle positioning. Bioessays 2017; 39:10.1002/bies.201700122. [PMID: 28960439 PMCID: PMC5698852 DOI: 10.1002/bies.201700122] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2017] [Revised: 08/13/2017] [Indexed: 01/07/2023]
Abstract
Tissues are shaped and patterned by mechanical and chemical processes. A key mechanical process is the positioning of the mitotic spindle, which determines the size and location of the daughter cells within the tissue. Recent force and position-fluctuation measurements indicate that pushing forces, mediated by the polymerization of astral microtubules against- the cell cortex, maintain the mitotic spindle at the cell center in Caenorhabditis elegans embryos. The magnitude of the centering forces suggests that the physical limit on the accuracy and precision of this centering mechanism is determined by the number of pushing microtubules rather than by thermally driven fluctuations. In cells that divide asymmetrically, anti-centering, pulling forces generated by cortically located dyneins, in conjunction with microtubule depolymerization, oppose the pushing forces to drive spindle displacements away from the center. Thus, a balance of centering pushing forces and anti-centering pulling forces localize the mitotic spindles within dividing C. elegans cells.
Collapse
Affiliation(s)
- Jonathon Howard
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven, CT 06511, USA
| | - Carlos Garzon-Coral
- Shriram Center for Chemical Engineering & Bioengineering, Stanford University, CA 94305, USA
| |
Collapse
|
24
|
Hassani H, Silva ES, Ghodsi Z. Optimizing bicoid signal extraction. Math Biosci 2017; 294:46-56. [PMID: 29030151 DOI: 10.1016/j.mbs.2017.09.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2017] [Revised: 09/01/2017] [Accepted: 09/27/2017] [Indexed: 11/28/2022]
Abstract
Signal extraction and analysis is of great importance, not only in fields such as economics and meteorology, but also in genetics and even biomedicine. There exists a range of parametric and nonparametric techniques which can perform signal extractions. However, the aim of this paper is to define a new approach for optimising signal extraction from bicoid gene expression profile. Having studied both parametric and nonparametric signal extraction techniques, we identified the lack of specific criteria enabling users to select the optimal signal extraction parameters. Exploiting the expression profile of bicoid gene, which is a maternal segmentation coordinate gene found in Drosophila melanogaster, we introduce a new approach for optimising the signal extraction using a nonparametric technique. The underlying criteria are based on the distribution of the residual, more specifically its skewness.
Collapse
Affiliation(s)
- Hossein Hassani
- Research Institute of Energy Management and Planning, University of Tehran, No. 13, Ghods St., Enghelab Ave., Tehran, Iran.
| | - Emmanuel Sirimal Silva
- Fashion Business School, London College of Fashion, University of the Arts London, 272 High Holborn, London, WC1V 7EY, UK.
| | - Zara Ghodsi
- Translational Genetics Group, Bournemouth University, Fern Barrow, Poole, BH125BB, UK.
| |
Collapse
|
25
|
Gursky VV, Kozlov KN, Kulakovskiy IV, Zubair A, Marjoram P, Lawrie DS, Nuzhdin SV, Samsonova MG. Translating natural genetic variation to gene expression in a computational model of the Drosophila gap gene regulatory network. PLoS One 2017; 12:e0184657. [PMID: 28898266 PMCID: PMC5595321 DOI: 10.1371/journal.pone.0184657] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Accepted: 08/28/2017] [Indexed: 11/18/2022] Open
Abstract
Annotating the genotype-phenotype relationship, and developing a proper quantitative description of the relationship, requires understanding the impact of natural genomic variation on gene expression. We apply a sequence-level model of gap gene expression in the early development of Drosophila to analyze single nucleotide polymorphisms (SNPs) in a panel of natural sequenced D. melanogaster lines. Using a thermodynamic modeling framework, we provide both analytical and computational descriptions of how single-nucleotide variants affect gene expression. The analysis reveals that the sequence variants increase (decrease) gene expression if located within binding sites of repressors (activators). We show that the sign of SNP influence (activation or repression) may change in time and space and elucidate the origin of this change in specific examples. The thermodynamic modeling approach predicts non-local and non-linear effects arising from SNPs, and combinations of SNPs, in individual fly genotypes. Simulation of individual fly genotypes using our model reveals that this non-linearity reduces to almost additive inputs from multiple SNPs. Further, we see signatures of the action of purifying selection in the gap gene regulatory regions. To infer the specific targets of purifying selection, we analyze the patterns of polymorphism in the data at two phenotypic levels: the strengths of binding and expression. We find that combinations of SNPs show evidence of being under selective pressure, while individual SNPs do not. The model predicts that SNPs appear to accumulate in the genotypes of the natural population in a way biased towards small increases in activating action on the expression pattern. Taken together, these results provide a systems-level view of how genetic variation translates to the level of gene regulatory networks via combinatorial SNP effects.
Collapse
Affiliation(s)
- Vitaly V. Gursky
- Theoretical Department, Ioffe Institute, Saint Petersburg, Russia
- Systems Biology and Bioinformatics Laboratory, Peter the Great Saint Petersburg Polytechnic University, Saint Petersburg, Russia
- * E-mail:
| | - Konstantin N. Kozlov
- Systems Biology and Bioinformatics Laboratory, Peter the Great Saint Petersburg Polytechnic University, Saint Petersburg, Russia
| | - Ivan V. Kulakovskiy
- Engelhardt Institute of Molecular Biology, Moscow, Russia
- Vavilov Institute of General Genetics, Moscow, Russia
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Asif Zubair
- Molecular and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| | - Paul Marjoram
- Molecular and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| | - David S. Lawrie
- Molecular and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| | - Sergey V. Nuzhdin
- Molecular and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| | - Maria G. Samsonova
- Systems Biology and Bioinformatics Laboratory, Peter the Great Saint Petersburg Polytechnic University, Saint Petersburg, Russia
| |
Collapse
|
26
|
Barr KA, Reinitz J. A sequence level model of an intact locus predicts the location and function of nonadditive enhancers. PLoS One 2017; 12:e0180861. [PMID: 28715438 PMCID: PMC5513433 DOI: 10.1371/journal.pone.0180861] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Accepted: 06/22/2017] [Indexed: 01/24/2023] Open
Abstract
Metazoan gene expression is controlled through the action of long stretches of noncoding DNA that contain enhancers-shorter sequences responsible for controlling a single aspect of a gene's expression pattern. Models built on thermodynamics have shown how enhancers interpret protein concentration in order to determine specific levels of gene expression, but the emergent regulatory logic of a complete regulatory locus shows qualitative and quantitative differences from isolated enhancers. Such differences may arise from steric competition limiting the quantity of DNA that can simultaneously influence the transcription machinery. We incorporated this competition into a mechanistic model of gene regulation, generated efficient algorithms for this computation, and applied it to the regulation of Drosophila even-skipped (eve). This model finds the location of enhancers and identifies which factors control the boundaries of eve expression. This model predicts a new enhancer that, when assayed in vivo, drives expression in a non-eve pattern. Incorporation of chromatin accessibility eliminates this inconsistency.
Collapse
Affiliation(s)
- Kenneth A. Barr
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| | - John Reinitz
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Statistics, University of Chicago, Chicago, Illinois, United States of America
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, United States of America
- Department of Molecular Genetics and Cell Biology, University of Chicago, Chicago, Illinois, United States of America
- Institute for Genomics and Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
27
|
Holloway DM, Spirov AV. Transcriptional bursting in Drosophila development: Stochastic dynamics of eve stripe 2 expression. PLoS One 2017; 12:e0176228. [PMID: 28437444 PMCID: PMC5402966 DOI: 10.1371/journal.pone.0176228] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 04/08/2017] [Indexed: 01/17/2023] Open
Abstract
Anterior-posterior (AP) body segmentation of the fruit fly (Drosophila) is first seen in the 7-stripe spatial expression patterns of the pair-rule genes, which regulate downstream genes determining specific segment identities. Regulation of pair-rule expression has been extensively studied for the even-skipped (eve) gene. Recent live imaging, of a reporter for the 2ndeve stripe, has demonstrated the stochastic nature of this process, with ‘bursts’ in the number of RNA transcripts being made over time. We developed a stochastic model of the spatial and temporal expression of eve stripe 2 (binding by transcriptional activators (Bicoid and Hunchback proteins) and repressors (Giant and Krüppel proteins), transcriptional initiation and termination; with all rate parameters constrained by features of the experimental data) in order to analyze the noisy experimental time series and test hypotheses for how eve transcription is regulated. These include whether eve transcription is simply OFF or ON, with a single ON rate, or whether it proceeds by a more complex mechanism, with multiple ON rates. We find that both mechanisms can produce long (multi-minute) RNA bursts, but that the short-time (minute-to-minute) statistics of the data is indicative of eve being transcribed with at least two distinct ON rates, consistent with data on the joint activation of eve by Bicoid and Hunchback. We also predict distinct statistical signatures for cases in which eve is repressed (e.g. along the edges of the stripe) vs. cases in which activation is reduced (e.g. by mutagenesis of transcription factor binding sites). Fundamental developmental processes such as gene transcription are intrinsically noisy; our approach presents a new way to quantify and analyze time series data during developmental patterning in order to understand regulatory mechanisms and how they propagate noise and impact embryonic robustness.
Collapse
Affiliation(s)
- David M. Holloway
- Mathematics Department, British Columbia Institute of Technology, Burnaby, B.C., Canada
- Biology Department, University of Victoria, Victoria, B.C., Canada
- * E-mail:
| | - Alexander V. Spirov
- Computer Science, and Center of Excellence in Wireless and Information Technology, State University of New York, Stony Brook, New York, United States of America
- Sechenov Institute of Evolutionary Physiology and Biochemistry, St. Petersburg, Russia
| |
Collapse
|
28
|
Ghodsi Z, Huang X, Hassani H. Causality analysis detects the regulatory role of maternal effect genes in the early Drosophila embryo. GENOMICS DATA 2017; 11:20-38. [PMID: 27924281 PMCID: PMC5129166 DOI: 10.1016/j.gdata.2016.11.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/01/2016] [Revised: 10/28/2016] [Accepted: 11/10/2016] [Indexed: 11/28/2022]
Abstract
In developmental studies, inferring regulatory interactions of segmentation genetic network play a vital role in unveiling the mechanism of pattern formation. As such, there exists an opportune demand for theoretical developments and new mathematical models which can result in a more accurate illustration of this genetic network. Accordingly, this paper seeks to extract the meaningful regulatory role of the maternal effect genes using a variety of causality detection techniques and to explore whether these methods can suggest a new analytical view to the gene regulatory networks. We evaluate the use of three different powerful and widely-used models representing time and frequency domain Granger causality and convergent cross mapping technique with the results being thoroughly evaluated for statistical significance. Our findings show that the regulatory role of maternal effect genes is detectable in different time classes and thereby the method is applicable to infer the possible regulatory interactions present among the other genes of this network.
Collapse
Affiliation(s)
- Zara Ghodsi
- Statistical Research Centre, Bournemouth University, 89 Holdenhurst Road, Bournemouth BH8 8EB, UK; Translational Genetics Group, Bournemouth University, Fern Barrow, Poole BH125BB, UK
| | - Xu Huang
- Statistical Research Centre, Bournemouth University, 89 Holdenhurst Road, Bournemouth BH8 8EB, UK
| | - Hossein Hassani
- Institute for International Energy Studies (IIES), Tehran 1967743 711, Iran
| |
Collapse
|
29
|
Chertkova AA, Schiffman JS, Nuzhdin SV, Kozlov KN, Samsonova MG, Gursky VV. In silico evolution of the Drosophila gap gene regulatory sequence under elevated mutational pressure. BMC Evol Biol 2017; 17:4. [PMID: 28251865 PMCID: PMC5333172 DOI: 10.1186/s12862-016-0866-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Cis-regulatory sequences are often composed of many low-affinity transcription factor binding sites (TFBSs). Determining the evolutionary and functional importance of regulatory sequence composition is impeded without a detailed knowledge of the genotype-phenotype map. RESULTS We simulate the evolution of regulatory sequences involved in Drosophila melanogaster embryo segmentation during early development. Natural selection evaluates gene expression dynamics produced by a computational model of the developmental network. We observe a dramatic decrease in the total number of transcription factor binding sites through the course of evolution. Despite a decrease in average sequence binding energies through time, the regulatory sequences tend towards organisations containing increased high affinity transcription factor binding sites. Additionally, the binding energies of separate sequence segments demonstrate ubiquitous mutual correlations through time. Fewer than 10% of initial TFBSs are maintained throughout the entire simulation, deemed 'core' sites. These sites have increased functional importance as assessed under wild-type conditions and their binding energy distributions are highly conserved. Furthermore, TFBSs within close proximity of core sites exhibit increased longevity, reflecting functional regulatory interactions with core sites. CONCLUSION In response to elevated mutational pressure, evolution tends to sample regulatory sequence organisations with fewer, albeit on average, stronger functional transcription factor binding sites. These organisations are also shaped by the regulatory interactions among core binding sites with sites in their local vicinity.
Collapse
Affiliation(s)
- Aleksandra A. Chertkova
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
| | - Joshua S. Schiffman
- Molecular and Computational Biology, University of Southern California, Los Angeles, 90089 CA USA
| | - Sergey V. Nuzhdin
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
- Molecular and Computational Biology, University of Southern California, Los Angeles, 90089 CA USA
| | - Konstantin N. Kozlov
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
| | - Maria G. Samsonova
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
| | - Vitaly V. Gursky
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
- Theoretical Department, Ioffe Institute, Polytechnicheskaya, 26, St. Petersburg, 194021 Russia
| |
Collapse
|
30
|
Verd B, Crombach A, Jaeger J. Dynamic Maternal Gradients Control Timing and Shift-Rates for Drosophila Gap Gene Expression. PLoS Comput Biol 2017; 13:e1005285. [PMID: 28158178 PMCID: PMC5291410 DOI: 10.1371/journal.pcbi.1005285] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Accepted: 12/06/2016] [Indexed: 11/18/2022] Open
Abstract
Pattern formation during development is a highly dynamic process. In spite of this, few experimental and modelling approaches take into account the explicit time-dependence of the rules governing regulatory systems. We address this problem by studying dynamic morphogen interpretation by the gap gene network in Drosophila melanogaster. Gap genes are involved in segment determination during early embryogenesis. They are activated by maternal morphogen gradients encoded by bicoid (bcd) and caudal (cad). These gradients decay at the same time-scale as the establishment of the antero-posterior gap gene pattern. We use a reverse-engineering approach, based on data-driven regulatory models called gene circuits, to isolate and characterise the explicitly time-dependent effects of changing morphogen concentrations on gap gene regulation. To achieve this, we simulate the system in the presence and absence of dynamic gradient decay. Comparison between these simulations reveals that maternal morphogen decay controls the timing and limits the rate of gap gene expression. In the anterior of the embyro, it affects peak expression and leads to the establishment of smooth spatial boundaries between gap domains. In the posterior of the embryo, it causes a progressive slow-down in the rate of gap domain shifts, which is necessary to correctly position domain boundaries and to stabilise the spatial gap gene expression pattern. We use a newly developed method for the analysis of transient dynamics in non-autonomous (time-variable) systems to understand the regulatory causes of these effects. By providing a rigorous mechanistic explanation for the role of maternal gradient decay in gap gene regulation, our study demonstrates that such analyses are feasible and reveal important aspects of dynamic gene regulation which would have been missed by a traditional steady-state approach. More generally, it highlights the importance of transient dynamics for understanding complex regulatory processes in development. Animal development is a highly dynamic process. Biochemical or environmental signals can cause the rules that shape it to change over time. We know little about the effects of such changes. For the sake of simplicity, we usually leave them out of our models and experimental assays. Here, we do exactly the opposite. We characterise precisely those aspects of pattern formation caused by changing signalling inputs to a gene regulatory network, the gap gene system of Drosophila melanogaster. Gap genes are involved in determining the body segments of flies and other insects during early development. Gradients of maternal morphogens activate the expression of the gap genes. These gradients are highly dynamic themselves, as they decay while being read out. We show that this decay controls the peak concentration of gap gene products, produces smooth boundaries of gene expression, and slows down the observed positional shifts of gap domains in the posterior of the embryo, thereby stabilising the spatial pattern. Our analysis demonstrates that the dynamics of gene regulation not only affect the timing, but also the positioning of gene expression. This suggests that we must pay closer attention to transient dynamic aspects of development than is currently the case.
Collapse
Affiliation(s)
- Berta Verd
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- KLI Klosterneuburg, Klosterneuburg, Austria
- * E-mail: (BV); (JJ)
| | - Anton Crombach
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, PSL Research University, Paris, France
| | - Johannes Jaeger
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- KLI Klosterneuburg, Klosterneuburg, Austria
- Wissenschaftskolleg zu Berlin, Berlin, Germany
- * E-mail: (BV); (JJ)
| |
Collapse
|
31
|
Abstract
This chapter aims at discussing the content of multi-agent based simulation (MABS) applied to computational biology i.e., to modelling and simulating biological systems by means of computational models, methodologies, and frameworks. In particular, the adoption of agent-based modelling (ABM) in the field of multicellular systems biology is explored, focussing on the challenging scenarios of developmental biology. After motivating why agent-based abstractions are critical in representing multicellular systems behaviour, MABS is discussed as the source of the most natural and appropriate mechanism for analysing the self-organising behaviour of systems of cells. As a case study, an application of MABS to the development of Drosophila Melanogaster is finally presented, which exploits the ALCHEMIST platform for agent-based simulation.
Collapse
|
32
|
Clark E, Akam M. Odd-paired controls frequency doubling in Drosophila segmentation by altering the pair-rule gene regulatory network. eLife 2016; 5:e18215. [PMID: 27525481 PMCID: PMC5035143 DOI: 10.7554/elife.18215] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2016] [Accepted: 08/14/2016] [Indexed: 01/08/2023] Open
Abstract
The Drosophila embryo transiently exhibits a double-segment periodicity, defined by the expression of seven 'pair-rule' genes, each in a pattern of seven stripes. At gastrulation, interactions between the pair-rule genes lead to frequency doubling and the patterning of 14 parasegment boundaries. In contrast to earlier stages of Drosophila anteroposterior patterning, this transition is not well understood. By carefully analysing the spatiotemporal dynamics of pair-rule gene expression, we demonstrate that frequency-doubling is precipitated by multiple coordinated changes to the network of regulatory interactions between the pair-rule genes. We identify the broadly expressed but temporally patterned transcription factor, Odd-paired (Opa/Zic), as the cause of these changes, and show that the patterning of the even-numbered parasegment boundaries relies on Opa-dependent regulatory interactions. Our findings indicate that the pair-rule gene regulatory network has a temporally modulated topology, permitting the pair-rule genes to play stage-specific patterning roles.
Collapse
Affiliation(s)
- Erik Clark
- Laboratory for Development and Evolution, Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | - Michael Akam
- Laboratory for Development and Evolution, Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
33
|
Peng PC, Sinha S. Quantitative modeling of gene expression using DNA shape features of binding sites. Nucleic Acids Res 2016; 44:e120. [PMID: 27257066 PMCID: PMC5291265 DOI: 10.1093/nar/gkw446] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Revised: 05/06/2016] [Accepted: 05/09/2016] [Indexed: 12/11/2022] Open
Abstract
Prediction of gene expression levels driven by regulatory sequences is pivotal in genomic biology. A major focus in transcriptional regulation is sequence-to-expression modeling, which interprets the enhancer sequence based on transcription factor concentrations and DNA binding specificities and predicts precise gene expression levels in varying cellular contexts. Such models largely rely on the position weight matrix (PWM) model for DNA binding, and the effect of alternative models based on DNA shape remains unexplored. Here, we propose a statistical thermodynamics model of gene expression using DNA shape features of binding sites. We used rigorous methods to evaluate the fits of expression readouts of 37 enhancers regulating spatial gene expression patterns in Drosophila embryo, and show that DNA shape-based models perform arguably better than PWM-based models. We also observed DNA shape captures information complimentary to the PWM, in a way that is useful for expression modeling. Furthermore, we tested if combining shape and PWM-based features provides better predictions than using either binding model alone. Our work demonstrates that the increasingly popular DNA-binding models based on local DNA shape can be useful in sequence-to-expression modeling. It also provides a framework for future studies to predict gene expression better than with PWM models alone.
Collapse
Affiliation(s)
- Pei-Chen Peng
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
34
|
Surkova SY, Golubkova EV, Mamon LA, Samsonova MG. Morphogenetic networks which determine the spatial expression of zygotic genes in early Drosophila embryo. Russ J Dev Biol 2016. [DOI: 10.1134/s1062360416040093] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
35
|
Cox process representation and inference for stochastic reaction-diffusion processes. Nat Commun 2016; 7:11729. [PMID: 27222432 PMCID: PMC4894951 DOI: 10.1038/ncomms11729] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2015] [Accepted: 04/26/2016] [Indexed: 01/30/2023] Open
Abstract
Complex behaviour in many systems arises from the stochastic interactions of spatially distributed particles or agents. Stochastic reaction–diffusion processes are widely used to model such behaviour in disciplines ranging from biology to the social sciences, yet they are notoriously difficult to simulate and calibrate to observational data. Here we use ideas from statistical physics and machine learning to provide a solution to the inverse problem of learning a stochastic reaction–diffusion process from data. Our solution relies on a non-trivial connection between stochastic reaction–diffusion processes and spatio-temporal Cox processes, a well-studied class of models from computational statistics. This connection leads to an efficient and flexible algorithm for parameter inference and model selection. Our approach shows excellent accuracy on numeric and real data examples from systems biology and epidemiology. Our work provides both insights into spatio-temporal stochastic systems, and a practical solution to a long-standing problem in computational modelling. Stochastic reaction-diffusion systems are used for modelling spatial dynamics in many disciplines, but parameter inference and model selection remain challenging. Here the authors offer a solution enabled by a connection between reaction-diffusion and the well-studied spatio-temporal Cox processes.
Collapse
|
36
|
Prata GN, Hornos JEM, Ramos AF. Stochastic model for gene transcription on Drosophila melanogaster embryos. Phys Rev E 2016; 93:022403. [PMID: 26986358 DOI: 10.1103/physreve.93.022403] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Indexed: 02/04/2023]
Abstract
We examine immunostaining experimental data for the formation of stripe 2 of even-skipped (eve) transcripts on D. melanogaster embryos. An estimate of the factor converting immunofluorescence intensity units into molecular numbers is given. The analysis of the eve dynamics at the region of stripe 2 suggests that the promoter site of the gene has two distinct regimes: an earlier phase when it is predominantly activated until a critical time when it becomes mainly repressed. That suggests proposing a stochastic binary model for gene transcription on D. melanogaster embryos. Our model has two random variables: the transcripts number and the state of the source of mRNAs given as active or repressed. We are able to reproduce available experimental data for the average number of transcripts. An analysis of the random fluctuations on the number of eves and their consequences on the spatial precision of stripe 2 is presented. We show that the position of the anterior or posterior borders fluctuate around their average position by ∼1% of the embryo length, which is similar to what is found experimentally. The fitting of data by such a simple model suggests that it can be useful to understand the functions of randomness during developmental processes.
Collapse
Affiliation(s)
- Guilherme N Prata
- Escola de Artes, Ciências e Humanidades, Universidade de São Paulo, Avenida Arlindo Béttio, 1000, Ermelino Matarazzo, São Paulo, SP CEP 03828-000, Brazil
| | - José Eduardo M Hornos
- Instituto de Física de São Carlos, Universidade de São Paulo, Av. Trabalhador São-Carlense, 400, São Carlos, SP CEP 13566-590, Brazil
| | - Alexandre F Ramos
- Escola de Artes, Ciências e Humanidades, Universidade de São Paulo, Avenida Arlindo Béttio, 1000, Ermelino Matarazzo, São Paulo, SP CEP 03828-000, Brazil.,Departamento de Radiologia - Faculdade de Medicina, Universidade de São Paulo, São Carlos, SP CEP 13566-590, Brazil.,Núcleo de Estudos Interdisciplinares em Sistemas Complexos, Universidade de São Paulo, São Carlos, SP CEP 13566-590, Brazil
| |
Collapse
|
37
|
Peng PC, Hassan Samee MA, Sinha S. Incorporating chromatin accessibility data into sequence-to-expression modeling. Biophys J 2016; 108:1257-67. [PMID: 25762337 DOI: 10.1016/j.bpj.2014.12.037] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2014] [Revised: 12/01/2014] [Accepted: 12/11/2014] [Indexed: 01/30/2023] Open
Abstract
Prediction of gene expression levels from regulatory sequences is one of the major challenges of genomic biology today. A particularly promising approach to this problem is that taken by thermodynamics-based models that interpret an enhancer sequence in a given cellular context specified by transcription factor concentration levels and predict precise expression levels driven by that enhancer. Such models have so far not accounted for the effect of chromatin accessibility on interactions between transcription factor and DNA and consequently on gene-expression levels. Here, we extend a thermodynamics-based model of gene expression, called GEMSTAT (Gene Expression Modeling Based on Statistical Thermodynamics), to incorporate chromatin accessibility data and quantify its effect on accuracy of expression prediction. In the new model, called GEMSTAT-A, accessibility at a binding site is assumed to affect the transcription factor's binding strength at the site, whereas all other aspects are identical to the GEMSTAT model. We show that this modification results in significantly better fits in a data set of over 30 enhancers regulating spatial expression patterns in the blastoderm-stage Drosophila embryo. It is important to note that the improved fits result not from an overall elevated accessibility in active enhancers but from the variation of accessibility levels within an enhancer. With whole-genome DNA accessibility measurements becoming increasingly popular, our work demonstrates how such data may be useful for sequence-to-expression models. It also calls for future advances in modeling accessibility levels from sequence and the transregulatory context, so as to predict accurately the effect of cis and trans perturbations on gene expression.
Collapse
Affiliation(s)
- Pei-Chen Peng
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Md Abul Hassan Samee
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois; Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois.
| |
Collapse
|
38
|
|
39
|
Kozlov K, Gursky VV, Kulakovskiy IV, Dymova A, Samsonova M. Analysis of functional importance of binding sites in the Drosophila gap gene network model. BMC Genomics 2015; 16 Suppl 13:S7. [PMID: 26694511 PMCID: PMC4686791 DOI: 10.1186/1471-2164-16-s13-s7] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
BACKGROUND The statistical thermodynamics based approach provides a promising framework for construction of the genotype-phenotype map in many biological systems. Among important aspects of a good model connecting the DNA sequence information with that of a molecular phenotype (gene expression) is the selection of regulatory interactions and relevant transcription factor bindings sites. As the model may predict different levels of the functional importance of specific binding sites in different genomic and regulatory contexts, it is essential to formulate and study such models under different modeling assumptions. RESULTS We elaborate a two-layer model for the Drosophila gap gene network and include in the model a combined set of transcription factor binding sites and concentration dependent regulatory interaction between gap genes hunchback and Kruppel. We show that the new variants of the model are more consistent in terms of gene expression predictions for various genetic constructs in comparison to previous work. We quantify the functional importance of binding sites by calculating their impact on gene expression in the model and calculate how these impacts correlate across all sites under different modeling assumptions. CONCLUSIONS The assumption about the dual interaction between hb and Kr leads to the most consistent modeling results, but, on the other hand, may obscure existence of indirect interactions between binding sites in regulatory regions of distinct genes. The analysis confirms the previously formulated regulation concept of many weak binding sites working in concert. The model predicts a more or less uniform distribution of functionally important binding sites over the sets of experimentally characterized regulatory modules and other open chromatin domains.
Collapse
Affiliation(s)
- Konstantin Kozlov
- Peter the Great St. Petersburg Polytechnic University, 29 Polytechnicheskaya, 195251 St.Petersburg, Russia
| | - Vitaly V Gursky
- Peter the Great St. Petersburg Polytechnic University, 29 Polytechnicheskaya, 195251 St.Petersburg, Russia
- Ioffe Institute, 26 Polytechnicheskaya, 194021 St.Petersburg, Russia
| | - Ivan V Kulakovskiy
- Engelhardt Institute of Molecular Biology, 32 Vavilova, 119991 Moscow, Russia
| | - Arina Dymova
- Peter the Great St. Petersburg Polytechnic University, 29 Polytechnicheskaya, 195251 St.Petersburg, Russia
| | - Maria Samsonova
- Peter the Great St. Petersburg Polytechnic University, 29 Polytechnicheskaya, 195251 St.Petersburg, Russia
| |
Collapse
|
40
|
Temporal and spatial dynamics of scaling-specific features of a gene regulatory network in Drosophila. Nat Commun 2015; 6:10031. [PMID: 26644070 PMCID: PMC4686680 DOI: 10.1038/ncomms10031] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2015] [Accepted: 10/28/2015] [Indexed: 01/19/2023] Open
Abstract
A widely appreciated aspect of developmental robustness is pattern formation in proportion to size. But how such scaling features emerge dynamically remains poorly understood. Here we generate a data set of the expression profiles of six gap genes in Drosophila melanogaster embryos that differ significantly in size. Expression patterns exhibit size-dependent dynamics both spatially and temporally. We uncover a dynamic emergence of under-scaling in the posterior, accompanied by reduced expression levels of gap genes near the middle of large embryos. Simulation results show that a size-dependent Bicoid gradient input can lead to reduced Krüppel expression that can have long-range and dynamic effects on gap gene expression in the posterior. Thus, for emergence of scaled patterns, the entire embryo may be viewed as a single unified dynamic system where maternally derived size-dependent information interpreted locally can be propagated in space and time as governed by the dynamics of a gene regulatory network. How pattern formation is regulated relative to the size of an organism is unclear. Here, Wu et al. take data from gap gene expression in flies of different sizes together with simulations, identifying how scaling emerges dynamically and that local patterning influences global gene regulatory networks.
Collapse
|
41
|
Abstract
The Drosophila blastoderm and the vertebrate neural tube are archetypal examples of morphogen-patterned tissues that create precise spatial patterns of different cell types. In both tissues, pattern formation is dependent on molecular gradients that emanate from opposite poles. Despite distinct evolutionary origins and differences in time scales, cell biology and molecular players, both tissues exhibit striking similarities in the regulatory systems that establish gene expression patterns that foreshadow the arrangement of cell types. First, signaling gradients establish initial conditions that polarize the tissue, but there is no strict correspondence between specific morphogen thresholds and boundary positions. Second, gradients initiate transcriptional networks that integrate broadly distributed activators and localized repressors to generate patterns of gene expression. Third, the correct positioning of boundaries depends on the temporal and spatial dynamics of the transcriptional networks. These similarities reveal design principles that are likely to be broadly applicable to morphogen-patterned tissues.
Collapse
Affiliation(s)
- James Briscoe
- The Francis Crick Institute, Mill Hill Laboratory, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | - Stephen Small
- Department of Biology, New York University, 100 Washington Square East, New York, NY 10003, USA
| |
Collapse
|
42
|
Ghodsi Z, Silva ES, Hassani H. Bicoid signal extraction with a selection of parametric and nonparametric signal processing techniques. GENOMICS, PROTEOMICS & BIOINFORMATICS 2015; 13:183-91. [PMID: 26197438 PMCID: PMC4563350 DOI: 10.1016/j.gpb.2015.02.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Revised: 01/22/2015] [Accepted: 02/12/2015] [Indexed: 11/27/2022]
Abstract
The maternal segmentation coordinate gene bicoid plays a significant role during Drosophila embryogenesis. The gradient of Bicoid, the protein encoded by this gene, determines most aspects of head and thorax development. This paper seeks to explore the applicability of a variety of signal processing techniques at extracting bicoid expression signal, and whether these methods can outperform the current model. We evaluate the use of six different powerful and widely-used models representing both parametric and nonparametric signal processing techniques to determine the most efficient method for signal extraction in bicoid. The results are evaluated using both real and simulated data. Our findings show that the Singular Spectrum Analysis technique proposed in this paper outperforms the synthesis diffusion degradation model for filtering the noisy protein profile of bicoid whilst the exponential smoothing technique was found to be the next best alternative followed by the autoregressive integrated moving average.
Collapse
Affiliation(s)
- Zara Ghodsi
- The Statistical Research Centre, Bournemouth University, Bournemouth BH8 8EB, UK
| | | | - Hossein Hassani
- The Statistical Research Centre, Bournemouth University, Bournemouth BH8 8EB, UK; Institute for International Energy Studies (IIES), Tehran 1967743 711, Iran.
| |
Collapse
|
43
|
Shaped singular spectrum analysis for quantifying gene expression, with application to the early Drosophila embryo. BIOMED RESEARCH INTERNATIONAL 2015; 2015:689745. [PMID: 25945341 PMCID: PMC4402483 DOI: 10.1155/2015/689745] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/04/2014] [Revised: 09/10/2014] [Accepted: 09/10/2014] [Indexed: 11/17/2022]
Abstract
In recent years, with the development of automated microscopy technologies, the volume and complexity of image data on gene expression have increased tremendously. The only way to analyze quantitatively and comprehensively such biological data is by developing and applying new sophisticated mathematical approaches. Here, we present extensions of 2D singular spectrum analysis (2D-SSA) for application to 2D and 3D datasets of embryo images. These extensions, circular and shaped 2D-SSA, are applied to gene expression in the nuclear layer just under the surface of the Drosophila (fruit fly) embryo. We consider the commonly used cylindrical projection of the ellipsoidal Drosophila embryo. We demonstrate how circular and shaped versions of 2D-SSA help to decompose expression data into identifiable components (such as trend and noise), as well as separating signals from different genes. Detection and improvement of under- and overcorrection in multichannel imaging is addressed, as well as the extraction and analysis of 3D features in 3D gene expression patterns.
Collapse
|
44
|
Staller MV, Fowlkes CC, Bragdon MDJ, Wunderlich Z, Estrada J, DePace AH. A gene expression atlas of a bicoid-depleted Drosophila embryo reveals early canalization of cell fate. Development 2015; 142:587-96. [PMID: 25605785 PMCID: PMC4302997 DOI: 10.1242/dev.117796] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2014] [Accepted: 12/01/2014] [Indexed: 01/31/2023]
Abstract
In developing embryos, gene regulatory networks drive cells towards discrete terminal fates, a process called canalization. We studied the behavior of the anterior-posterior segmentation network in Drosophila melanogaster embryos by depleting a key maternal input, bicoid (bcd), and measuring gene expression patterns of the network at cellular resolution. This method results in a gene expression atlas containing the levels of mRNA or protein expression of 13 core patterning genes over six time points for every cell of the blastoderm embryo. This is the first cellular resolution dataset of a genetically perturbed Drosophila embryo that captures all cells in 3D. We describe the technical developments required to build this atlas and how the method can be employed and extended by others. We also analyze this novel dataset to characterize the degree and timing of cell fate canalization in the segmentation network. We find that in two layers of this gene regulatory network, following depletion of bcd, individual cells rapidly canalize towards normal cell fates. This result supports the hypothesis that the segmentation network directly canalizes cell fate, rather than an alternative hypothesis whereby cells are initially mis-specified and later eliminated by apoptosis. Our gene expression atlas provides a high resolution picture of a classic perturbation and will enable further computational modeling of canalization and gene regulation in this transcriptional network.
Collapse
Affiliation(s)
- Max V Staller
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Charless C Fowlkes
- Department of Computer Science, University of California Irvine, Irvine, CA 92697, USA
| | - Meghan D J Bragdon
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Zeba Wunderlich
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Angela H DePace
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
45
|
Vasquez Jaramillo JD, Álvarez MA, Orozco AA. Latent force models for describing transcriptional regulation processes in the embryo development problem for the Drosophila melanogaster. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2015; 2014:338-41. [PMID: 25569966 DOI: 10.1109/embc.2014.6943598] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
In the embryo development problem for the Drosophila melanogaster, a set of molecules known as mor-phogens are responsible for the embryo segmentation. These morphogens are encoded by different genes, including the GAP genes, maternal coordination genes and pair-rule genes. One of the maternal coordination genes encodes the Bicoid morphogen, which is the responsible for the development of the Drosophila embryo at the anterior part and for the control and regulation of the GAP genes in segmentation of the early development of the Drosophila melanogaster. The work presented in this document, reports a methodology that tends to integrate mechanistic and data driven based models, aiming at making inference over the mRNA Bicoid from gene expression data at the protein level for the Bicoid morphogen. The fundamental contribution of this work is the description of the concentration gradient of the Bicoid morphogen in the continuous spatio-temporal domain as well as the output regression (gene expression at protein level) using a Gaussian process described by a mechanistically inspired covariance function. Regression results and metrics computed for the Bicoid protein expression both in the temporal and spatial domains, showed outstanding performance with respect to reported experiments from previous studies. In this paper, a correlation coefficient of r = 0.9758 against a correlation coefficient of r = 0.9086 is being reported, as well as a SMSE of 0.0303±0.1512 against a SMSE of 0.1106±0.5090 and finally reporting a MSLL of -1.7036 ± 1.3472 against -1.0151±1.7669.
Collapse
|
46
|
Shadow enhancers enable Hunchback bifunctionality in the Drosophila embryo. Proc Natl Acad Sci U S A 2015; 112:785-90. [PMID: 25564665 DOI: 10.1073/pnas.1413877112] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Hunchback (Hb) is a bifunctional transcription factor that activates and represses distinct enhancers. Here, we investigate the hypothesis that Hb can activate and repress the same enhancer. Computational models predicted that Hb bifunctionally regulates the even-skipped (eve) stripe 3+7 enhancer (eve3+7) in Drosophila blastoderm embryos. We measured and modeled eve expression at cellular resolution under multiple genetic perturbations and found that the eve3+7 enhancer could not explain endogenous eve stripe 7 behavior. Instead, we found that eve stripe 7 is controlled by two enhancers: the canonical eve3+7 and a sequence encompassing the minimal eve stripe 2 enhancer (eve2+7). Hb bifunctionally regulates eve stripe 7, but it executes these two activities on different pieces of regulatory DNA--it activates the eve2+7 enhancer and represses the eve3+7 enhancer. These two "shadow enhancers" use different regulatory logic to create the same pattern.
Collapse
|
47
|
Zabet NR, Adryan B. Estimating binding properties of transcription factors from genome-wide binding profiles. Nucleic Acids Res 2015; 43:84-94. [PMID: 25432957 PMCID: PMC4288167 DOI: 10.1093/nar/gku1269] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2014] [Revised: 10/22/2014] [Accepted: 11/19/2014] [Indexed: 12/20/2022] Open
Abstract
The binding of transcription factors (TFs) is essential for gene expression. One important characteristic is the actual occupancy of a putative binding site in the genome. In this study, we propose an analytical model to predict genomic occupancy that incorporates the preferred target sequence of a TF in the form of a position weight matrix (PWM), DNA accessibility data (in the case of eukaryotes), the number of TF molecules expected to be bound specifically to the DNA and a parameter that modulates the specificity of the TF. Given actual occupancy data in the form of ChIP-seq profiles, we backwards inferred copy number and specificity for five Drosophila TFs during early embryonic development: Bicoid, Caudal, Giant, Hunchback and Kruppel. Our results suggest that these TFs display thousands of molecules that are specifically bound to the DNA and that whilst Bicoid and Caudal display a higher specificity, the other three TFs (Giant, Hunchback and Kruppel) display lower specificity in their binding (despite having PWMs with higher information content). This study gives further weight to earlier investigations into TF copy numbers that suggest a significant proportion of molecules are not bound specifically to the DNA.
Collapse
Affiliation(s)
- Nicolae Radu Zabet
- Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge CB2 1QR, UK Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK
| | - Boris Adryan
- Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge CB2 1QR, UK Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK
| |
Collapse
|
48
|
Abstract
BACKGROUND The detailed analysis of transcriptional regulation is crucially important for understanding biological processes. The gap gene network in Drosophila attracts large interest among researches studying mechanisms of transcriptional regulation. It implements the most upstream regulatory layer of the segmentation gene network. The knowledge of molecular mechanisms involved in gap gene regulation is far less complete than that of genetics of the system. Mathematical modeling goes beyond insights gained by genetics and molecular approaches. It allows us to reconstruct wild-type gene expression patterns in silico, infer underlying regulatory mechanism and prove its sufficiency. RESULTS We developed a new model that provides a dynamical description of gap gene regulatory systems, using detailed DNA-based information, as well as spatial transcription factor concentration data at varying time points. We showed that this model correctly reproduces gap gene expression patterns in wild type embryos and is able to predict gap expression patterns in Kr mutants and four reporter constructs. We used four-fold cross validation test and fitting to random dataset to validate the model and proof its sufficiency in data description. The identifiability analysis showed that most model parameters are well identifiable. We reconstructed the gap gene network topology and studied the impact of individual transcription factor binding sites on the model output. We measured this impact by calculating the site regulatory weight as a normalized difference between the residual sum of squares error for the set of all annotated sites and for the set with the site of interest excluded. CONCLUSIONS The reconstructed topology of the gap gene network is in agreement with previous modeling results and data from literature. We showed that 1) the regulatory weights of transcription factor binding sites show very weak correlation with their PWM score; 2) sites with low regulatory weight are important for the model output; 3) functional important sites are not exclusively located in cis-regulatory elements, but are rather dispersed through regulatory region. It is of importance that some of the sites with high functional impact in hb, Kr and kni regulatory regions coincide with strong sites annotated and verified in Dnase I footprint assays.
Collapse
Affiliation(s)
- Konstantin Kozlov
- St.Petersburg State Polytechnical University, Polytekhnicheskaya 29, 195251 St.Petersburg, Russia
| | - Vitaly Gursky
- Ioffe Physical-Technical Institute, RAS, Polytekhnicheskaya 26, 194021 St.Petersburg, Russia
| | - Ivan Kulakovskiy
- Engelhardt Institute of Molecular Biology, RAS, Vavilov 32, 119991 Moscow, Russia
| | - Maria Samsonova
- St.Petersburg State Polytechnical University, Polytekhnicheskaya 29, 195251 St.Petersburg, Russia
| |
Collapse
|
49
|
Dilão R. Bicoid mRNA diffusion as a mechanism of morphogenesis in Drosophila early development. C R Biol 2014; 337:679-82. [PMID: 25433559 DOI: 10.1016/j.crvi.2014.09.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2013] [Revised: 09/22/2014] [Accepted: 09/22/2014] [Indexed: 11/28/2022]
Abstract
We show that mRNA diffusion is the main morphogenesis mechanism that consistently explains the establishment of Bicoid protein gradients in the embryo of Drosophila, contradicting the current view of protein diffusion. Moreover, we show that if diffusion for both bicoid mRNA and Bicoid protein were assumed, a steady distribution of Bicoid protein with a constant concentration along the embryo would result, contradicting observations.
Collapse
Affiliation(s)
- Rui Dilão
- Non-Linear Dynamics Group, IST, Department of Physics, avenue Rovisco Pais, 1049-001 Lisbon, Portugal; Institut des hautes études scientifiques, 35, route de Chartres, 91440 Bures-sur-Yvette, France.
| |
Collapse
|
50
|
Cicin-Sain D, Pulido AH, Crombach A, Wotton KR, Jiménez-Guri E, Taly JF, Roma G, Jaeger J. SuperFly: a comparative database for quantified spatio-temporal gene expression patterns in early dipteran embryos. Nucleic Acids Res 2014; 43:D751-5. [PMID: 25404137 PMCID: PMC4383950 DOI: 10.1093/nar/gku1142] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
We present SuperFly (http://superfly.crg.eu), a relational database for quantified spatio-temporal expression data of segmentation genes during early development in different species of dipteran insects (flies, midges and mosquitoes). SuperFly has a special focus on emerging non-drosophilid model systems. The database currently includes data of high spatio-temporal resolution for three species: the vinegar fly Drosophila melanogaster, the scuttle fly Megaselia abdita and the moth midge Clogmia albipunctata. At this point, SuperFly covers up to 9 genes and 16 time points per species, with a total of 1823 individual embryos. It provides an intuitive web interface, enabling the user to query and access original embryo images, quantified expression profiles, extracted positions of expression boundaries and integrated datasets, plus metadata and intermediate processing steps. SuperFly is a valuable new resource for the quantitative comparative study of gene expression patterns across dipteran species. Moreover, it provides an interesting test set for systems biologists interested in fitting mathematical gene network models to data. Both of these aspects are essential ingredients for progress toward a more quantitative and mechanistic understanding of developmental evolution.
Collapse
Affiliation(s)
- Damjan Cicin-Sain
- EMBL/CRG Research Unit in Systems Biology, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain Bioinformatics Core Facility, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | - Antonio Hermoso Pulido
- Universitat Pompeu Fabra (UPF), 08002 Barcelona, Spain Bioinformatics Core Facility, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | - Anton Crombach
- EMBL/CRG Research Unit in Systems Biology, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain Bioinformatics Core Facility, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | - Karl R Wotton
- EMBL/CRG Research Unit in Systems Biology, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain Bioinformatics Core Facility, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | - Eva Jiménez-Guri
- EMBL/CRG Research Unit in Systems Biology, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain Bioinformatics Core Facility, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | - Jean-François Taly
- Universitat Pompeu Fabra (UPF), 08002 Barcelona, Spain Bioinformatics Core Facility, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | - Guglielmo Roma
- Universitat Pompeu Fabra (UPF), 08002 Barcelona, Spain Bioinformatics Core Facility, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | - Johannes Jaeger
- EMBL/CRG Research Unit in Systems Biology, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain Bioinformatics Core Facility, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| |
Collapse
|