1
|
Tabe-Bordbar S, Song YJ, Lunt BJ, Alavi Z, Prasanth KV, Sinha S. Mechanistic analysis of enhancer sequences in the estrogen receptor transcriptional program. Commun Biol 2024; 7:719. [PMID: 38862711 PMCID: PMC11167054 DOI: 10.1038/s42003-024-06400-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Accepted: 05/30/2024] [Indexed: 06/13/2024] Open
Abstract
Estrogen Receptor α (ERα) is a major lineage determining transcription factor (TF) in mammary gland development. Dysregulation of ERα-mediated transcriptional program results in cancer. Transcriptomic and epigenomic profiling of breast cancer cell lines has revealed large numbers of enhancers involved in this regulatory program, but how these enhancers encode function in their sequence remains poorly understood. A subset of ERα-bound enhancers are transcribed into short bidirectional RNA (enhancer RNA or eRNA), and this property is believed to be a reliable marker of active enhancers. We therefore analyze thousands of ERα-bound enhancers and build quantitative, mechanism-aware models to discriminate eRNAs from non-transcribing enhancers based on their sequence. Our thermodynamics-based models provide insights into the roles of specific TFs in ERα-mediated transcriptional program, many of which are supported by the literature. We use in silico perturbations to predict TF-enhancer regulatory relationships and integrate these findings with experimentally determined enhancer-promoter interactions to construct a gene regulatory network. We also demonstrate that the model can prioritize breast cancer-related sequence variants while providing mechanistic explanations for their function. Finally, we experimentally validate the model-proposed mechanisms underlying three such variants.
Collapse
Affiliation(s)
- Shayan Tabe-Bordbar
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - You Jin Song
- Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Bryan J Lunt
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Zahra Alavi
- Department of Physics, Loyola Marymount University, Los Angeles, CA, USA
| | - Kannanganattu V Prasanth
- Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Saurabh Sinha
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA.
| |
Collapse
|
2
|
Lin S, Lim B. Multifaceted effects on even-skipped transcriptional dynamics upon Krüppel dosage changes. Development 2024; 151:dev202132. [PMID: 38345298 PMCID: PMC10948998 DOI: 10.1242/dev.202132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 02/08/2024] [Indexed: 03/05/2024]
Abstract
Although fluctuations in transcription factor (TF) dosage are often well tolerated, TF dosage modulation can change the target gene expression dynamics and result in significant non-lethal developmental phenotypes. Using MS2/MCP-mediated quantitative live imaging in early Drosophila embryos, we analyzed how changing levels of the gap gene Krüppel (Kr) affects transcriptional dynamics of the pair-rule gene even-skipped (eve). Halving the Kr dosage leads to a transient posterior expansion of the eve stripe 2 and an anterior shift of stripe 5. Surprisingly, the most significant changes are observed in eve stripes 3 and 4, the enhancers of which do not contain Kr-binding sites. In Kr heterozygous embryos, both stripes 3 and 4 display narrower widths, anteriorly shifted boundaries and reduced mRNA production levels. We show that Kr dosage indirectly affects stripe 3 and 4 dynamics by modulating other gap gene dynamics. We quantitatively correlate moderate body segment phenotypes of Kr heterozygotes with spatiotemporal changes in eve expression. Our results indicate that nonlinear relationships between TF dosage and phenotypes underlie direct TF-DNA and indirect TF-TF interactions.
Collapse
Affiliation(s)
- Shufan Lin
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Bomyi Lim
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
3
|
Sabarís G, Ortíz DM, Laiker I, Mayansky I, Naik S, Cavalli G, Stern DL, Preger-Ben Noon E, Frankel N. The Density of Regulatory Information Is a Major Determinant of Evolutionary Constraint on Noncoding DNA in Drosophila. Mol Biol Evol 2024; 41:msae004. [PMID: 38364113 PMCID: PMC10871701 DOI: 10.1093/molbev/msae004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 11/26/2023] [Accepted: 01/05/2024] [Indexed: 02/18/2024] Open
Abstract
Evolutionary analyses have estimated that ∼60% of nucleotides in intergenic regions of the Drosophila melanogaster genome are functionally relevant, suggesting that regulatory information may be encoded more densely in intergenic regions than has been revealed by most functional dissections of regulatory DNA. Here, we approached this issue through a functional dissection of the regulatory region of the gene shavenbaby (svb). Most of the ∼90 kb of this large regulatory region is highly conserved in the genus Drosophila, though characterized enhancers occupy a small fraction of this region. By analyzing the regulation of svb in different contexts of Drosophila development, we found that the regulatory information that drives svb expression in the abdominal pupal epidermis is organized in a different way than the elements that drive svb expression in the embryonic epidermis. While in the embryonic epidermis svb is activated by compact enhancers separated by large inactive DNA regions, svb expression in the pupal epidermis is driven by regulatory information distributed over broader regions of svb cis-regulatory DNA. In the same vein, we observed that other developmental genes also display a dense distribution of putative regulatory elements in their regulatory regions. Furthermore, we found that a large percentage of conserved noncoding DNA of the Drosophila genome is contained within regions of open chromatin. These results suggest that part of the evolutionary constraint on noncoding DNA of Drosophila is explained by the density of regulatory information, which may be greater than previously appreciated.
Collapse
Affiliation(s)
- Gonzalo Sabarís
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
- Institute of Human Genetics, UMR 9002 CNRS-Université de Montpellier, Montpellier, France
| | - Daniela M Ortíz
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
| | - Ian Laiker
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
| | - Ignacio Mayansky
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
| | - Sujay Naik
- Department of Genetics and Developmental Biology, The Rappaport Faculty of Medicine and Research Institute, Technion—Israel Institute of Technology, Haifa 3109601, Israel
| | - Giacomo Cavalli
- Institute of Human Genetics, UMR 9002 CNRS-Université de Montpellier, Montpellier, France
| | - David L Stern
- Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20147, USA
| | - Ella Preger-Ben Noon
- Department of Genetics and Developmental Biology, The Rappaport Faculty of Medicine and Research Institute, Technion—Israel Institute of Technology, Haifa 3109601, Israel
| | - Nicolás Frankel
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
- Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales (FCEN), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
| |
Collapse
|
4
|
Bishop TR, Onal P, Xu Z, Zheng M, Gunasinghe H, Nien CY, Small S, Datta RR. Multi-level regulation of even-skipped stripes by the ubiquitous factor Zelda. Development 2023; 150:dev201860. [PMID: 37934130 PMCID: PMC10730019 DOI: 10.1242/dev.201860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Accepted: 10/26/2023] [Indexed: 11/08/2023]
Abstract
The zinc-finger protein Zelda (Zld) is a key activator of zygotic transcription in early Drosophila embryos. Here, we study Zld-dependent regulation of the seven-striped pattern of the pair-rule gene even-skipped (eve). Individual stripes are regulated by discrete enhancers that respond to broadly distributed activators; stripe boundaries are formed by localized repressors encoded by the gap genes. The strongest effects of Zld are on stripes 2, 3 and 7, which are regulated by two enhancers in a 3.8 kb genomic fragment that includes the eve basal promoter. We show that Zld facilitates binding of the activator Bicoid and the gap repressors to this fragment, consistent with its proposed role as a pioneer protein. To test whether the effects of Zld are direct, we mutated all canonical Zld sites in the 3.8 kb fragment, which reduced expression but failed to phenocopy the abolishment of stripes caused by removing Zld in trans. We show that Zld also indirectly regulates the eve stripes by establishing specific gap gene expression boundaries, which provides the embryonic spacing required for proper stripe activation.
Collapse
Affiliation(s)
- Timothy R. Bishop
- Department of Biology, New York University, 100 Washington Square East, New York, NY 10003, USA
| | - Pinar Onal
- Department of Molecular Biology and Genetics, Ihsan Dogramaci Bilkent University, Universiteler Mahallesi, 06800 Ankara, Turkey
| | - Zhe Xu
- Department of Biology, New York University, 100 Washington Square East, New York, NY 10003, USA
| | - Michael Zheng
- Department of Biology, New York University, 100 Washington Square East, New York, NY 10003, USA
| | - Himari Gunasinghe
- Department of Biology, New York University, 100 Washington Square East, New York, NY 10003, USA
| | - Chung-Yi Nien
- Department of Life Sciences, National Central University, Taoyuan 32001, Taiwan
| | - Stephen Small
- Department of Biology, New York University, 100 Washington Square East, New York, NY 10003, USA
| | - Rhea R. Datta
- Department of Biology, Hamilton College, 198 College Hill Rd., Clinton, NY 13323, USA
| |
Collapse
|
5
|
Bhogale S, Seward C, Stubbs L, Sinha S. SEAMoD: A fully interpretable neural network for cis-regulatory analysis of differentially expressed genes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.09.565900. [PMID: 38014229 PMCID: PMC10680628 DOI: 10.1101/2023.11.09.565900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
A common way to investigate gene regulatory mechanisms is to identify differentially expressed genes using transcriptomics, find their candidate enhancers using epigenomics, and search for over-represented transcription factor (TF) motifs in these enhancers using bioinformatics tools. A related follow-up task is to model gene expression as a function of enhancer sequences and rank TF motifs by their contribution to such models, thus prioritizing among regulators. We present a new computational tool called SEAMoD that performs the above tasks of motif finding and sequence-to-expression modeling simultaneously. It trains a convolutional neural network model to relate enhancer sequences to differential expression in one or more biological conditions. The model uses TF motifs to interpret the sequences, learning these motifs and their relative importance to each biological condition from data. It also utilizes epigenomic information in the form of activity scores of putative enhancers and automatically searches for the most promising enhancer for each gene. Compared to existing neural network models of non-coding sequences, SEAMoD uses far fewer parameters, requires far less training data, and emphasizes biological interpretability. We used SEAMoD to understand regulatory mechanisms underlying the differentiation of neural stem cell (NSC) derived from mouse forebrain. We profiled gene expression and histone modifications in NSC and three differentiated cell types and used SEAMoD to model differential expression of nearly 12,000 genes with an accuracy of 81%, in the process identifying the Olig2, E2f family TFs, Foxo3, and Tcf4 as key transcriptional regulators of the differentiation process.
Collapse
|
6
|
Nooti S, Naylor M, Long T, Groll B, Manu. LucFlow: A method to measure Luciferase reporter expression in single cells. PLoS One 2023; 18:e0292317. [PMID: 37792708 PMCID: PMC10550117 DOI: 10.1371/journal.pone.0292317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 09/18/2023] [Indexed: 10/06/2023] Open
Abstract
Reporter assays, in which the expression of an inert protein is driven by gene regulatory elements such as promoters and enhancers, are a workhorse for investigating gene regulation. Techniques for measuring reporter gene expression vary from single-cell or single-molecule approaches having low throughput to bulk Luciferase assays that have high throughput. We developed a Luciferase Reporter Assay using Flow-Cytometry (LucFlow), which measures reporter expression in single cells immunostained for Luciferase. We optimized and tested LucFlow with a murine cell line that can be differentiated into neutrophils, into which promoter-reporter and enhancer-promoter-reporter constructs have been integrated in a site-specific manner. The single-cell measurements are comparable to bulk ones but we found that dead cells have no detectable Luciferase protein, so that bulk assays underestimate reporter expression. LucFlow is able to achieve a higher accuracy than bulk methods by excluding dead cells during flow cytometry. Prior to fixation and staining, the samples are spiked with stained cells that can be discriminated during flow cytometry and control for tube-to-tube variation in experimental conditions. Computing fold change relative to control cells allows LucFlow to achieve a high level of precision. LucFlow, therefore, enables the accurate and precise measurement of reporter expression in a high throughput manner.
Collapse
Affiliation(s)
- Sunil Nooti
- Department of Biology, University of North Dakota, Grand Forks, ND, United States of America
| | - Madison Naylor
- Department of Biology, University of North Dakota, Grand Forks, ND, United States of America
| | - Trevor Long
- Department of Biology, University of North Dakota, Grand Forks, ND, United States of America
| | - Brayden Groll
- Department of Biology, University of North Dakota, Grand Forks, ND, United States of America
| | - Manu
- Department of Biology, University of North Dakota, Grand Forks, ND, United States of America
| |
Collapse
|
7
|
Harden TT, Vincent BJ, DePace AH. Transcriptional activators in the early Drosophila embryo perform different kinetic roles. Cell Syst 2023; 14:258-272.e4. [PMID: 37080162 PMCID: PMC10473017 DOI: 10.1016/j.cels.2023.03.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 06/26/2022] [Accepted: 03/21/2023] [Indexed: 04/22/2023]
Abstract
Combinatorial regulation of gene expression by transcription factors (TFs) may in part arise from kinetic synergy-wherein TFs regulate different steps in the transcription cycle. Kinetic synergy requires that TFs play distinguishable kinetic roles. Here, we used live imaging to determine the kinetic roles of three TFs that activate transcription in the Drosophila embryo-Zelda, Bicoid, and Stat92E-by introducing their binding sites into the even-skipped stripe 2 enhancer. These TFs influence different sets of kinetic parameters, and their influence can change over time. All three TFs increased the fraction of transcriptionally active nuclei; Zelda also shortened the first-passage time into transcription and regulated the interval between transcription events. Stat92E also increased the lifetimes of active transcription. Different TFs can therefore play distinct kinetic roles in activating the transcription. This has consequences for understanding the composition and flexibility of regulatory DNA sequences and the biochemical function of TFs. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Timothy T Harden
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Ben J Vincent
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Angela H DePace
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
8
|
Thermodynamic Modelling of Transcriptional Control: A Sensitivity Analysis. MATHEMATICS 2022. [DOI: 10.3390/math10132169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Modelling is a tool used to decipher the biochemical mechanisms involved in transcriptional control. Experimental evidence in genetics is usually supported by theoretical models in order to evaluate the effects of all the possible interactions that can occur in these complicated processes. Models derived from the thermodynamic method are critical in this labour because they are able to take into account multiple mechanisms operating simultaneously at the molecular micro-scale and relate them to transcriptional initiation at the tissular macro-scale. This work is devoted to adapting computational techniques to this context in order to theoretically evaluate the role played by several biochemical mechanisms. The interest of this theoretical analysis relies on the fact that it can be contrasted against those biological experiments where the response to perturbations in the transcriptional machinery environment is evaluated in terms of genetically activated/repressed regions. The theoretical reproduction of these experiments leads to a sensitivity analysis whose results are expressed in terms of the elasticity of a threshold function determining those activated/repressed regions. The study of this elasticity function in thermodynamic models already proposed in the literature reveals that certain modelling approaches can alter the balance between the biochemical mechanisms considered, and this can cause false/misleading outcomes. The reevaluation of classical thermodynamic models gives us a more accurate and complete picture of the interactions involved in gene regulation and transcriptional control, which enables more specific predictions. This sensitivity approach provides a definite advantage in the interpretation of a wide range of genetic experimental results.
Collapse
|
9
|
Duk MA, Gursky VV, Samsonova MG, Surkova SY. Application of Domain- and Genotype-Specific Models to Infer Post-Transcriptional Regulation of Segmentation Gene Expression in Drosophila. Life (Basel) 2021; 11:life11111232. [PMID: 34833107 PMCID: PMC8618293 DOI: 10.3390/life11111232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Revised: 11/05/2021] [Accepted: 11/10/2021] [Indexed: 11/16/2022] Open
Abstract
Unlike transcriptional regulation, the post-transcriptional mechanisms underlying zygotic segmentation gene expression in early Drosophila embryo have been insufficiently investigated. Condition-specific post-transcriptional regulation plays an important role in the development of many organisms. Our recent study revealed the domain- and genotype-specific differences between mRNA and the protein expression of Drosophila hb, gt, and eve genes in cleavage cycle 14A. Here, we use this dataset and the dynamic mathematical model to recapitulate protein expression from the corresponding mRNA patterns. The condition-specific nonuniformity in parameter values is further interpreted in terms of possible post-transcriptional modifications. For hb expression in wild-type embryos, our results predict the position-specific differences in protein production. The protein synthesis rate parameter is significantly higher in hb anterior domain compared to the posterior domain. The parameter sets describing Gt protein dynamics in wild-type embryos and Kr mutants are genotype-specific. The spatial discrepancy between gt mRNA and protein posterior expression in Kr mutants is well reproduced by the whole axis model, thus rejecting the involvement of post-transcriptional mechanisms. Our models fail to describe the full dynamics of eve expression, presumably due to its complex shape and the variable time delays between mRNA and protein patterns, which likely require a more complex model. Overall, our modeling approach enables the prediction of regulatory scenarios underlying the condition-specific differences between mRNA and protein expression in early embryo.
Collapse
Affiliation(s)
- Maria A. Duk
- Mathematical Biology and Bioinformatics Laboratory, Peter the Great Saint Petersburg Polytechnic University, 195251 St. Petersburg, Russia; (M.A.D.); (M.G.S.)
- Theoretical Department, Ioffe Institute, 194021 St. Petersburg, Russia;
| | - Vitaly V. Gursky
- Theoretical Department, Ioffe Institute, 194021 St. Petersburg, Russia;
| | - Maria G. Samsonova
- Mathematical Biology and Bioinformatics Laboratory, Peter the Great Saint Petersburg Polytechnic University, 195251 St. Petersburg, Russia; (M.A.D.); (M.G.S.)
| | - Svetlana Yu. Surkova
- Mathematical Biology and Bioinformatics Laboratory, Peter the Great Saint Petersburg Polytechnic University, 195251 St. Petersburg, Russia; (M.A.D.); (M.G.S.)
- Correspondence:
| |
Collapse
|
10
|
Gaiewski MJ, Drewell RA, Dresch JM. Fitting thermodynamic-based models: Incorporating parameter sensitivity improves the performance of an evolutionary algorithm. Math Biosci 2021; 342:108716. [PMID: 34687735 DOI: 10.1016/j.mbs.2021.108716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 09/10/2021] [Accepted: 09/17/2021] [Indexed: 11/30/2022]
Abstract
A detailed comprehension of transcriptional regulation is critical to understanding the genetic control of development and disease across many different organisms. To more fully investigate the complex molecular interactions controlling the precise expression of genes, many groups have constructed mathematical models to complement their experimental approaches. A critical step in such studies is choosing the most appropriate parameter estimation algorithm to enable detailed analysis of the parameters that contribute to the models. In this study, we develop a novel set of evolutionary algorithms that use a pseudo-random Sobol Set to construct the initial population and incorporate parameter sensitivities into the adaptation of mutation rates, using local, global, and hybrid strategies. Comparison of the performance of these new algorithms to a number of current state-of-the-art global parameter estimation algorithms on a range of continuous test functions, as well as synthetic biological data representing models of gene regulatory systems, reveals improved performance of the new algorithms in terms of runtime, error and reproducibility. In addition, by analyzing the ability of these algorithms to fit datasets of varying quality, we provide the experimentalist with a guide to how the algorithms perform across a range of noisy data. These results demonstrate the improved performance of the new set of parameter estimation algorithms and facilitate meaningful integration of model parameters and predictions in our understanding of the molecular mechanisms of gene regulation.
Collapse
Affiliation(s)
- Michael J Gaiewski
- Department of Mathematics and Computer Science, Clark University, Worcester, MA, USA; Department of Mathematics, University of Connecticut, Storrs, CT, USA.
| | | | | |
Collapse
|
11
|
Dibaeinia P, Sinha S. Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks. Nucleic Acids Res 2021; 49:10309-10327. [PMID: 34508359 PMCID: PMC8501998 DOI: 10.1093/nar/gkab765] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 08/18/2021] [Accepted: 08/25/2021] [Indexed: 11/18/2022] Open
Abstract
Deciphering the sequence-function relationship encoded in enhancers holds the key to interpreting non-coding variants and understanding mechanisms of transcriptomic variation. Several quantitative models exist for predicting enhancer function and underlying mechanisms; however, there has been no systematic comparison of these models characterizing their relative strengths and shortcomings. Here, we interrogated a rich data set of neuroectodermal enhancers in Drosophila, representing cis- and trans- sources of expression variation, with a suite of biophysical and machine learning models. We performed rigorous comparisons of thermodynamics-based models implementing different mechanisms of activation, repression and cooperativity. Moreover, we developed a convolutional neural network (CNN) model, called CoNSEPT, that learns enhancer 'grammar' in an unbiased manner. CoNSEPT is the first general-purpose CNN tool for predicting enhancer function in varying conditions, such as different cell types and experimental conditions, and we show that such complex models can suggest interpretable mechanisms. We found model-based evidence for mechanisms previously established for the studied system, including cooperative activation and short-range repression. The data also favored one hypothesized activation mechanism over another and suggested an intriguing role for a direct, distance-independent repression mechanism. Our modeling shows that while fundamentally different models can yield similar fits to data, they vary in their utility for mechanistic inference. CoNSEPT is freely available at: https://github.com/PayamDiba/CoNSEPT.
Collapse
Affiliation(s)
- Payam Dibaeinia
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
12
|
Garbuzov FE, Gursky VV. Nonequilibrium model of short-range repression in gene transcription regulation. Phys Rev E 2021; 104:014407. [PMID: 34412298 DOI: 10.1103/physreve.104.014407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Accepted: 06/24/2021] [Indexed: 11/07/2022]
Abstract
Transcription factors are proteins that regulate gene activity by activating or repressing gene transcription. A special class of transcriptional repressors operates via a short-range mechanism, making local DNA regions inaccessible to binding by activators, and thus providing an indirect repressive action on the target gene. This mechanism is commonly modeled assuming that repressors interact with DNA under thermodynamic equilibrium and neglecting some configurations of the gene regulatory region. We elaborate on a more general nonequilibrium model of short-range repression using the graph formalism for transitions between gene states, and we apply analytical calculations to compare it with the equilibrium model in terms of the repression strength and expression noise. In contrast to the equilibrium approach, the new model allows us to separate two basic mechanisms of short-range repression. The first mechanism is associated with the recruiting of factors that mediate chromatin condensation, and the second one concerns the blocking of factors that mediate chromatin loosening. The nonequilibrium model demonstrates better performance on previously published gene expression data obtained for transcription factors controlling Drosophila development, and furthermore it predicts that the first repression mechanism is the most favorable in this system. The presented approach can be scaled to larger gene networks and can be used to infer specific modes and parameters of transcriptional regulation from gene expression data.
Collapse
Affiliation(s)
- F E Garbuzov
- Ioffe Institute, 26 Polytekhnicheskaya, St. Petersburg 194021, Russia
| | - V V Gursky
- Ioffe Institute, 26 Polytekhnicheskaya, St. Petersburg 194021, Russia
| |
Collapse
|
13
|
Cai X, Rondeel I, Baumgartner S. Modulating the bicoid gradient in space and time. Hereditas 2021; 158:29. [PMID: 34404481 PMCID: PMC8371787 DOI: 10.1186/s41065-021-00192-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 07/19/2021] [Indexed: 11/15/2022] Open
Abstract
Background The formation of the Bicoid (Bcd) gradient in the early Drosophila is one of the most fascinating observations in biology and serves as a paradigm for gradient formation, yet its mechanism is still not fully understood. Two distinct models were proposed in the past, the SDD and the ARTS model. Results We define novel cis- and trans-acting factors that are indispensable for gradient formation. The first one is the poly A tail length of the bcd mRNA where we demonstrate that it changes not only in time, but also in space. We show that posterior bcd mRNAs possess a longer poly tail than anterior ones and this elongation is likely mediated by wispy (wisp), a poly A polymerase. Consequently, modulating the activity of Wisp results in changes of the Bcd gradient, in controlling downstream targets such as the gap and pair-rule genes, and also in influencing the cuticular pattern. Attempts to modulate the Bcd gradient by subjecting the egg to an extra nuclear cycle, i.e. a 15th nuclear cycle by means of the maternal haploid (mh) mutation showed no effect, neither on the appearance of the gradient nor on the control of downstream target. This suggests that the segmental anlagen are determined during the first 14 nuclear cycles. Finally, we identify the Cyclin B (CycB) gene as a trans-acting factor that modulates the movement of Bcd such that Bcd movement is allowed to move through the interior of the egg. Conclusions Our analysis demonstrates that Bcd gradient formation is far more complex than previously thought requiring a revision of the models of how the gradient is formed.
Collapse
Affiliation(s)
- Xiaoli Cai
- Departmentof Experimental Medical Sciences, Lund University, BMC D10, 22184, Lund, Sweden
| | - Inge Rondeel
- Departmentof Experimental Medical Sciences, Lund University, BMC D10, 22184, Lund, Sweden.,Present address: Hubrecht Institute, 3584 CT, Utrecht, The Netherlands
| | - Stefan Baumgartner
- Departmentof Experimental Medical Sciences, Lund University, BMC D10, 22184, Lund, Sweden. .,Department of Biology, University of Konstanz, 78457, Konstanz, Germany.
| |
Collapse
|
14
|
Abstract
Motivation The universal expressibility assumption of Deep Neural Networks (DNNs) is the key motivation behind recent worksin the systems biology community to employDNNs to solve important problems in functional genomics and moleculargenetics. Typically, such investigations have taken a ‘black box’ approach in which the internal structure of themodel used is set purely by machine learning considerations with little consideration of representing the internalstructure of the biological system by the mathematical structure of the DNN. DNNs have not yet been applied to thedetailed modeling of transcriptional control in which mRNA production is controlled by the binding of specific transcriptionfactors to DNA, in part because such models are in part formulated in terms of specific chemical equationsthat appear different in form from those used in neural networks. Results In this paper, we give an example of a DNN whichcan model the detailed control of transcription in a precise and predictive manner. Its internal structure is fully interpretableand is faithful to underlying chemistry of transcription factor binding to DNA. We derive our DNN from asystems biology model that was not previously recognized as having a DNN structure. Although we apply our DNNto data from the early embryo of the fruit fly Drosophila, this system serves as a test bed for analysis of much larger datasets obtained by systems biology studies on a genomic scale. . Availability and implementation The implementation and data for the models used in this paper are in a zip file in the supplementary material. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yi Liu
- Department of Statistics, Ecology and Evolution, Molecular Genetics & Cell Biology, Institute of Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
| | - Kenneth Barr
- Department of Human Genetics, Ecology and Evolution, Molecular Genetics & Cell Biology, Institute of Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
| | - John Reinitz
- Departments of Statistics, Ecology and Evolution, Molecular Genetics & Cell Biology, Institute of Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
| |
Collapse
|
15
|
Makashov AA, Myasnikova EM, Spirov AV. Fuzzy Linguistic Modeling of the Regulation of Drosophila Segmentation Genes. Biophysics (Nagoya-shi) 2021. [DOI: 10.1134/s0006350921010073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
16
|
Abstract
Determining whether and how a gene is transcribed are two of the central processes of life. The conceptual basis for understanding such gene regulation arose from pioneering biophysical studies in eubacteria. However, eukaryotic genomes exhibit vastly greater complexity, which raises questions not addressed by this bacterial paradigm. First, how is information integrated from many widely separated binding sites to determine how a gene is transcribed? Second, does the presence of multiple energy-expending mechanisms, which are absent from eubacterial genomes, indicate that eukaryotes are capable of improved forms of genetic information processing? An updated biophysical foundation is needed to answer such questions. We describe the linear framework, a graph-based approach to Markov processes, and show that it can accommodate many previous studies in the field. Under the assumption of thermodynamic equilibrium, we introduce a language of higher-order cooperativities and show how it can rigorously quantify gene regulatory properties suggested by experiment. We point out that fundamental limits to information processing arise at thermodynamic equilibrium and can only be bypassed through energy expenditure. Finally, we outline some of the mathematical challenges that must be overcome to construct an improved biophysical understanding of gene regulation.
Collapse
Affiliation(s)
- Felix Wong
- Institute for Medical Engineering & Science, Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.,Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Jeremy Gunawardena
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115, USA;
| |
Collapse
|
17
|
Berrocal A, Lammers NC, Garcia HG, Eisen MB. Kinetic sculpting of the seven stripes of the Drosophila even-skipped gene. eLife 2020; 9:61635. [PMID: 33300492 PMCID: PMC7864633 DOI: 10.7554/elife.61635] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 10/09/2020] [Indexed: 12/14/2022] Open
Abstract
We used live imaging to visualize the transcriptional dynamics of the Drosophila melanogaster even-skipped gene at single-cell and high-temporal resolution as its seven stripe expression pattern forms, and developed tools to characterize and visualize how transcriptional bursting varies over time and space. We find that despite being created by the independent activity of five enhancers, even-skipped stripes are sculpted by the same kinetic phenomena: a coupled increase of burst frequency and amplitude. By tracking the position and activity of individual nuclei, we show that stripe movement is driven by the exchange of bursting nuclei from the posterior to anterior stripe flanks. Our work provides a conceptual, theoretical and computational framework for dissecting pattern formation in space and time, and reveals how the coordinated transcriptional activity of individual nuclei shapes complex developmental patterns.
Collapse
Affiliation(s)
- Augusto Berrocal
- Department of Molecular & Cell Biology, University of California at Berkeley, Berkeley, United States
| | - Nicholas C Lammers
- Biophysics Graduate Group, University of California at Berkeley, Berkeley, United States
| | - Hernan G Garcia
- Department of Molecular & Cell Biology, University of California at Berkeley, Berkeley, United States.,Biophysics Graduate Group, University of California at Berkeley, Berkeley, United States.,Department of Physics, University of California at Berkeley, Berkeley, United States.,Institute for Quantitative Biosciences-QB3, University of California at Berkeley, Berkeley, United States
| | - Michael B Eisen
- Department of Molecular & Cell Biology, University of California at Berkeley, Berkeley, United States.,Biophysics Graduate Group, University of California at Berkeley, Berkeley, United States.,Institute for Quantitative Biosciences-QB3, University of California at Berkeley, Berkeley, United States.,Department of Integrative Biology, University of California at Berkeley, Berkeley, United States.,Howard Hughes Medical Institute, University of California at Berkeley, Berkeley, United States
| |
Collapse
|
18
|
Rosenzweig S, Scholand N, Holme HCM, Uecker M. Cardiac and Respiratory Self-Gating in Radial MRI Using an Adapted Singular Spectrum Analysis (SSA-FARY). IEEE TRANSACTIONS ON MEDICAL IMAGING 2020; 39:3029-3041. [PMID: 32275585 DOI: 10.1109/tmi.2020.2985994] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Cardiac Magnetic Resonance Imaging (MRI) is time-consuming and error-prone. To ease the patient's burden and to increase the efficiency and robustness of cardiac exams, interest in methods based on continuous steady-state acquisition and self-gating has been growing in recent years. Self-gating methods extract the cardiac and respiratory signals from the measurement data and then retrospectively sort the data into cardiac and respiratory phases. Repeated breathholds and synchronization with the heart beat using some external device as required in conventional MRI are then not necessary. In this work, we introduce a novel self-gating method for radially acquired data based on a dimensionality reduction technique for time-series analysis (SSA-FARY). Building on Singular Spectrum Analysis, a zero-padded, time-delayed embedding of the auto-calibration data is analyzed using Principle Component Analysis. We demonstrate the basic functionality of SSA-FARY using numerical simulations and apply it to in-vivo cardiac radial single-slice bSSFP and Simultaneous Multi-Slice radiofrequency-spoiled gradient-echo measurements, as well as to Stack-of-Stars bSSFP measurements. SSA-FARY reliably detects the cardiac and respiratory motion and separates it from noise. We utilize the generated signals for high-dimensional image reconstruction using parallel imaging and compressed sensing with in-plane wavelet and (spatio-)temporal total-variation regularization.
Collapse
|
19
|
Abstract
Key discoveries in Drosophila have shaped our understanding of cellular "enhancers." With a special focus on the fly, this chapter surveys properties of these adaptable cis-regulatory elements, whose actions are critical for the complex spatial/temporal transcriptional regulation of gene expression in metazoa. The powerful combination of genetics, molecular biology, and genomics available in Drosophila has provided an arena in which the developmental role of enhancers can be explored. Enhancers are characterized by diverse low- or high-throughput assays, which are challenging to interpret, as not all of these methods of identifying enhancers produce concordant results. As a model metazoan, the fly offers important advantages to comprehensive analysis of the central functions that enhancers play in gene expression, and their critical role in mediating the production of phenotypes from genotype and environmental inputs. A major challenge moving forward will be obtaining a quantitative understanding of how these cis-regulatory elements operate in development and disease.
Collapse
Affiliation(s)
- Stephen Small
- Department of Biology, Developmental Systems Training Program, New York University, 10003 and
| | - David N Arnosti
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824
| |
Collapse
|
20
|
Myasnikova E, Spirov A. Gene regulatory networks in Drosophila early embryonic development as a model for the study of the temporal identity of neuroblasts. Biosystems 2020; 197:104192. [PMID: 32619531 DOI: 10.1016/j.biosystems.2020.104192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 04/30/2020] [Accepted: 06/21/2020] [Indexed: 11/27/2022]
Abstract
Genes belonging to the "gap" and "gap-like" family constitute the best-studied gene regulatory networks (GRNs) in Drosophila embryogenesis. Gap genes are a core of two subnetworks controlling embryonic segmentation: (hunchback, hb; Krüppel, Kr; giant, gt; and knirps, kni) and (hb; Kr; pou-domain, pdm; and, probably, castor, cas). Of particular interest is that (hb, Kr, pdm, cas) also specifies the temporal identity of stem cells, neuroblasts, in Drosophila neurogenesis. This GRN controls the sequential differentiation of neuroblasts during the asymmetric cell division. In the last decades, modeling of the patterning of gene ensemble (hb, Kr, gt, kni) in segmentation was in the center of attention. We show that our previously published and extensively studied model at a certain level of external factors is able to reproduce temporal patterns of (hb, Kr, pdm, cas) in neurogenesis with minor evolutionary explicable modifications. This result testifies in favor of a hypothesis that the similarity of two gene ensembles active in segmentation and neurogenesis is a result of co-option of the network architecture in evolution from the common ancestral form. By means of the model dynamical analysis, it is shown that the establishment of the robust patterns in both systems could be explained in terms of the action of attractors in the gap gene dynamical system. We formulate the common principles underlying the robustness of both GRNs in segmentation and neurogenesis due to the similar functional organization of the gene ensembles as having the same evolutionary origin.
Collapse
Affiliation(s)
- Ekaterina Myasnikova
- Peter the Great Saint-Petersburg Polytechnical University, 29 Politekhnicheskaya str, St. Petersburg, 195251, Russia.
| | - Alexander Spirov
- I. M. Sechenov Institute of Evolutionary Physiology and Biochemistry Russian Academy of Sciences, 44 Thorez Pr, St.Petersburg, 194223, Russia; Computer Science and CEWIT, SUNY Stony Brook, Stony Brook, 1500 Stony Brook Road, Stony Brook, 11794, NY, USA
| |
Collapse
|
21
|
On the specificity of gene regulatory networks: How does network co-option affect subsequent evolution? Curr Top Dev Biol 2020; 139:375-405. [PMID: 32450967 DOI: 10.1016/bs.ctdb.2020.03.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The process of multicellular organismal development hinges upon the specificity of developmental programs: for different parts of the organism to form unique features, processes must exist to specify each part. This specificity is thought to be hardwired into gene regulatory networks, which activate cohorts of genes in particular tissues at particular times during development. However, the evolution of gene regulatory networks sometimes occurs by mechanisms that sacrifice specificity. One such mechanism is network co-option, in which existing gene networks are redeployed in new developmental contexts. While network co-option may offer an efficient mechanism for generating novel phenotypes, losses of tissue specificity at redeployed network genes could restrict the ability of the affected traits to evolve independently. At present, there has not been a detailed discussion regarding how tissue specificity of network genes might be altered due to gene network co-option at its initiation, as well as how trait independence can be retained or restored after network co-option. A lack of clarity about network co-option makes it more difficult to speculate on the long-term evolutionary implications of this mechanism. In this review, we will discuss the possible initial outcomes of network co-option, outline the mechanisms by which networks may retain or subsequently regain specificity after network co-option, and comment on some of the possible evolutionary consequences of network co-option. We place special emphasis on the need to consider selectively-neutral outcomes of network co-option to improve our understanding of the role of this mechanism in trait evolution.
Collapse
|
22
|
Binary Expression Enhances Reliability of Messaging in Gene Networks. ENTROPY 2020; 22:e22040479. [PMID: 33286254 PMCID: PMC7516962 DOI: 10.3390/e22040479] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 04/20/2020] [Accepted: 04/20/2020] [Indexed: 01/31/2023]
Abstract
The promoter state of a gene and its expression levels are modulated by the amounts of transcription factors interacting with its regulatory regions. Hence, one may interpret a gene network as a communicating system in which the state of the promoter of a gene (the source) is communicated by the amounts of transcription factors that it expresses (the message) to modulate the state of the promoter and expression levels of another gene (the receptor). The reliability of the gene network dynamics can be quantified by Shannon's entropy of the message and the mutual information between the message and the promoter state. Here we consider a stochastic model for a binary gene and use its exact steady state solutions to calculate the entropy and mutual information. We show that a slow switching promoter with long and equally standing ON and OFF states maximizes the mutual information and reduces entropy. That is a binary gene expression regime generating a high variance message governed by a bimodal probability distribution with peaks of the same height. Our results indicate that Shannon's theory can be a powerful framework for understanding how bursty gene expression conciliates with the striking spatio-temporal precision exhibited in pattern formation of developing organisms.
Collapse
|
23
|
Bell K, Skier K, Chen KH, Gergen JP. Two pair-rule responsive enhancers regulate wingless transcription in the Drosophila blastoderm embryo. Dev Dyn 2019; 249:556-572. [PMID: 31837063 DOI: 10.1002/dvdy.142] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Revised: 11/25/2019] [Accepted: 11/26/2019] [Indexed: 11/07/2022] Open
Abstract
BACKGROUND While many developmentally relevant enhancers act in a modular fashion, there is growing evidence for nonadditive interactions between distinct cis-regulatory enhancers. We investigated if nonautonomous enhancer interactions underlie transcription regulation of the Drosophila segment polarity gene, wingless. RESULTS We identified two wg enhancers active at the blastoderm stage: wg 3613u, located from -3.6 to -1.3 kb upstream of the wg transcription start site (TSS) and 3046d, located in intron two of the wg gene, from 3.0 to 4.6 kb downstream of the TSS. Genetic experiments confirm that Even Skipped (Eve), Fushi-tarazu (Ftz), Runt, Odd-paired (Opa), Odd-skipped (Odd), and Paired (Prd) contribute to spatially regulated wg expression. Interestingly, there are enhancer specific differences in response to the gain or loss of function of pair-rule gene activity. Although each element recapitulates aspects of wg expression, a composite reporter containing both enhancers more faithfully recapitulates wg regulation than would be predicted from the sum of their individual responses. CONCLUSION These results suggest that the regulation of wg by pair-rule genes involves nonadditive interactions between distinct cis-regulatory enhancers.
Collapse
Affiliation(s)
- Kimberly Bell
- Department of Biochemistry and Cell Biology and the Center for Developmental Genetics, Stony Brook University, Stony Brook, New York
- Center for Excellence in Learning & Teaching, Stony Brook University, Stony Brook, New York
| | - Kevin Skier
- Department of Biochemistry and Cell Biology and the Center for Developmental Genetics, Stony Brook University, Stony Brook, New York
- University of Massachusetts Medical School, Worcester, Massachusetts
| | - Kevin H Chen
- Department of Biochemistry and Cell Biology and the Center for Developmental Genetics, Stony Brook University, Stony Brook, New York
- Boston University School of Medicine, Boston, Massachusetts
| | - John Peter Gergen
- Department of Biochemistry and Cell Biology and the Center for Developmental Genetics, Stony Brook University, Stony Brook, New York
| |
Collapse
|
24
|
Barr K, Reinitz J, Radulescu O. An in silico analysis of robust but fragile gene regulation links enhancer length to robustness. PLoS Comput Biol 2019; 15:e1007497. [PMID: 31730659 PMCID: PMC6881076 DOI: 10.1371/journal.pcbi.1007497] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 11/27/2019] [Accepted: 10/22/2019] [Indexed: 12/31/2022] Open
Abstract
Organisms must ensure that expression of genes is directed to the appropriate tissues at the correct times, while simultaneously ensuring that these gene regulatory systems are robust to perturbation. This idea is captured by a mathematical concept called r-robustness, which says that a system is robust to a perturbation in up to r - 1 randomly chosen parameters. r-robustness implies that the biological system has a small number of sensitive parameters and that this number can be used as a robustness measure. In this work we use this idea to investigate the robustness of gene regulation using a sequence level model of the Drosophila melanogaster gene even-skipped. We consider robustness with respect to mutations of the enhancer sequence and with respect to changes of the transcription factor concentrations. We find that gene regulation is r-robust with respect to mutations in the enhancer sequence and identify a number of sensitive nucleotides. In both natural and in silico predicted enhancers, the number of nucleotides that are sensitive to mutation correlates negatively with the length of the sequence, meaning that longer sequences are more robust. The exact degree of robustness obtained is dependent not only on DNA sequence, but also on the local concentration of regulatory factors. We find that gene regulation can be remarkably sensitive to changes in transcription factor concentrations at the boundaries of expression features, while it is robust to perturbation elsewhere.
Collapse
Affiliation(s)
- Kenneth Barr
- Department of Genetic Medicine, University of Chicago, Chicago, Illinois, United States of America
| | - John Reinitz
- Departments of Statistics, Ecology & Evolution, Molecular Genetics & Cell Biology, University of Chicago, Chicago, Illinois, United States of America
| | - Ovidiu Radulescu
- LPHI UMR CNRS 5235, University of Montpellier, Montpellier, France
| |
Collapse
|
25
|
Wunderlich Z, Fowlkes CC, Eckenrode KB, Bragdon MDJ, Abiri A, DePace AH. Quantitative Comparison of the Anterior-Posterior Patterning System in the Embryos of Five Drosophila Species. G3 (BETHESDA, MD.) 2019; 9:2171-2182. [PMID: 31048401 PMCID: PMC6643877 DOI: 10.1534/g3.118.200953] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Accepted: 05/01/2019] [Indexed: 11/18/2022]
Abstract
Complex spatiotemporal gene expression patterns direct the development of the fertilized egg into an adult animal. Comparisons across species show that, in spite of changes in the underlying regulatory DNA sequence, developmental programs can be maintained across millions of years of evolution. Reciprocally, changes in gene expression can be used to generate morphological novelty. Distinguishing between changes in regulatory DNA that lead to changes in gene expression and those that do not is therefore a central goal of evolutionary developmental biology. Quantitative, spatially-resolved measurements of developmental gene expression patterns play a crucial role in this goal, enabling the detection of subtle phenotypic differences between species and the development of computations models that link the sequence of regulatory DNA to expression patterns. Here we report the generation of two atlases of cellular resolution gene expression measurements for the primary anterior-posterior patterning genes in Drosophila simulans and Drosophila virilis By combining these data sets with existing atlases for three other Drosophila species, we detect subtle differences in the gene expression patterns and dynamics driving the highly conserved axis patterning system and delineate inter-species differences in the embryonic morphology. These data sets will be a resource for future modeling studies of the evolution of developmental gene regulatory networks.
Collapse
Affiliation(s)
- Zeba Wunderlich
- Department of Developmental and Cell Biology, University of California, Irvine, CA, 92697
| | - Charless C Fowlkes
- Department of Computer Science, University of California, Irvine, CA, 92697
| | - Kelly B Eckenrode
- Department of Systems Biology, Harvard Medical School, Boston, MA, 20115
| | - Meghan D J Bragdon
- Department of Systems Biology, Harvard Medical School, Boston, MA, 20115
| | - Arash Abiri
- Department of Developmental and Cell Biology, University of California, Irvine, CA, 92697
| | - Angela H DePace
- Department of Systems Biology, Harvard Medical School, Boston, MA, 20115
| |
Collapse
|
26
|
Park J, Estrada J, Johnson G, Vincent BJ, Ricci-Tam C, Bragdon MDJ, Shulgina Y, Cha A, Wunderlich Z, Gunawardena J, DePace AH. Dissecting the sharp response of a canonical developmental enhancer reveals multiple sources of cooperativity. eLife 2019; 8:e41266. [PMID: 31223115 PMCID: PMC6588347 DOI: 10.7554/elife.41266] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 03/04/2019] [Indexed: 12/19/2022] Open
Abstract
Developmental enhancers integrate graded concentrations of transcription factors (TFs) to create sharp gene expression boundaries. Here we examine the hunchback P2 (HbP2) enhancer which drives a sharp expression pattern in the Drosophila blastoderm embryo in response to the transcriptional activator Bicoid (Bcd). We systematically interrogate cis and trans factors that influence the shape and position of expression driven by HbP2, and find that the prevailing model, based on pairwise cooperative binding of Bcd to HbP2 is not adequate. We demonstrate that other proteins, such as pioneer factors, Mediator and histone modifiers influence the shape and position of the HbP2 expression pattern. Comparing our results to theory reveals how higher-order cooperativity and energy expenditure impact boundary location and sharpness. Our results emphasize that the bacterial view of transcription regulation, where pairwise interactions between regulatory proteins dominate, must be reexamined in animals, where multiple molecular mechanisms collaborate to shape the gene regulatory function.
Collapse
Affiliation(s)
- Jeehae Park
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Javier Estrada
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Gemma Johnson
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Ben J Vincent
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Chiara Ricci-Tam
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Meghan DJ Bragdon
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | | | - Anna Cha
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Zeba Wunderlich
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | | | - Angela H DePace
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| |
Collapse
|
27
|
Zubair A, Rosen IG, Nuzhdin SV, Marjoram P. Bayesian model selection for the Drosophila gap gene network. BMC Bioinformatics 2019; 20:327. [PMID: 31195954 PMCID: PMC6567646 DOI: 10.1186/s12859-019-2888-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Accepted: 05/09/2019] [Indexed: 11/10/2022] Open
Abstract
Background The gap gene system controls the early cascade of the segmentation pathway in Drosophila melanogaster as well as other insects. Owing to its tractability and key role in embryo patterning, this system has been the focus for both computational modelers and experimentalists. The gap gene expression dynamics can be considered strictly as a one-dimensional process and modeled as a system of reaction-diffusion equations. While substantial progress has been made in modeling this phenomenon, there still remains a deficit of approaches to evaluate competing hypotheses. Most of the model development has happened in isolation and there has been little attempt to compare candidate models. Results The Bayesian framework offers a means of doing formal model evaluation. Here, we demonstrate how this framework can be used to compare different models of gene expression. We focus on the Papatsenko-Levine formalism, which exploits a fractional occupancy based approach to incorporate activation of the gap genes by the maternal genes and cross-regulation by the gap genes themselves. The Bayesian approach provides insight about relationship between system parameters. In the regulatory pathway of segmentation, the parameters for number of binding sites and binding affinity have a negative correlation. The model selection analysis supports a stronger binding affinity for Bicoid compared to other regulatory edges, as shown by a larger posterior mean. The procedure doesn’t show support for activation of Kruppel by Bicoid. Conclusions We provide an efficient solver for the general representation of the Papatsenko-Levine model. We also demonstrate the utility of Bayes factor for evaluating candidate models for spatial pattering models. In addition, by using the parallel tempering sampler, the convergence of Markov chains can be remarkably improved and robust estimates of Bayes factors obtained. Electronic supplementary material The online version of this article (10.1186/s12859-019-2888-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Asif Zubair
- Molecular and Computational Biology, USC, 1050 Childs Way, Los Angeles, CA 90089-2532, US.
| | - I Gary Rosen
- Department of Mathematics, USC, 3620 S. Vermont Ave., Los Angeles, CA 90089-2532, US
| | - Sergey V Nuzhdin
- Molecular and Computational Biology, USC, 1050 Childs Way, Los Angeles, CA 90089-2532, US
| | - Paul Marjoram
- Molecular and Computational Biology, USC, 1050 Childs Way, Los Angeles, CA 90089-2532, US
| |
Collapse
|
28
|
Repele A, Krueger S, Bhattacharyya T, Tuineau MY. The regulatory control of Cebpa enhancers and silencers in the myeloid and red-blood cell lineages. PLoS One 2019; 14:e0217580. [PMID: 31181110 PMCID: PMC6557489 DOI: 10.1371/journal.pone.0217580] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Accepted: 05/14/2019] [Indexed: 12/31/2022] Open
Abstract
Cebpa encodes a transcription factor (TF) that plays an instructive role in the development of multiple myeloid lineages. The expression of Cebpa itself is finely modulated, as Cebpa is expressed at high and intermediate levels in neutrophils and macrophages respectively and downregulated in non-myeloid lineages. The cis-regulatory logic underlying the lineage-specific modulation of Cebpa's expression level is yet to be fully characterized. Previously, we had identified 6 new cis-regulatory modules (CRMs) in a 78kb region surrounding Cebpa. We had also inferred the TFs that regulate each CRM by fitting a sequence-based thermodynamic model to a comprehensive reporter activity dataset. Here, we report the cis-regulatory logic of Cebpa CRMs at the resolution of individual binding sites. We tested the binding sites and functional roles of inferred TFs by designing and constructing mutated CRMs and comparing theoretical predictions of their activity against empirical measurements in a myeloid cell line. The enhancers were confirmed to be activated by combinations of PU.1, C/EBP family TFs, Egr1, and Gfi1 as predicted by the model. We show that silencers repress the activity of the proximal promoter in a dominant manner in G1ME cells, which are derived from the red-blood cell lineage. Dominant repression in G1ME cells can be traced to binding sites for GATA and Myb, a motif shared by all of the silencers. Finally, we demonstrate that GATA and Myb act redundantly to silence the proximal promoter. These results indicate that dominant repression is a novel mechanism for resolving hematopoietic lineages. Furthermore, Cebpa has a fail-safe cis-regulatory architecture, featuring several functionally similar CRMs, each of which contains redundant binding sites for multiple TFs. Lastly, by experimentally demonstrating the predictive ability of our sequence-based thermodynamic model, this work highlights the utility of this computational approach for understanding mammalian gene regulation.
Collapse
Affiliation(s)
- Andrea Repele
- Department of Biology, University of North Dakota, Grand Forks, ND, United States of America
| | - Shawn Krueger
- Department of Biology, University of North Dakota, Grand Forks, ND, United States of America
| | - Tapas Bhattacharyya
- Department of Biology, University of North Dakota, Grand Forks, ND, United States of America
| | - Michelle Y Tuineau
- Department of Biology, University of North Dakota, Grand Forks, ND, United States of America
| |
Collapse
|
29
|
Yildirim N, Aktas ME, Ozcan SN, Akbas E, Ay A. Differential transcriptional regulation by alternatively designed mechanisms: A mathematical modeling approach. In Silico Biol 2019; 12:95-127. [PMID: 27497472 DOI: 10.3233/isb-160467] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Cells maintain cellular homeostasis employing different regulatory mechanisms to respond external stimuli. We study two groups of signal-dependent transcriptional regulatory mechanisms. In the first group, we assume that repressor and activator proteins compete for binding to the same regulatory site on DNA (competitive mechanisms). In the second group, they can bind to different regulatory regions in a noncompetitive fashion (noncompetitive mechanisms). For both competitive and noncompetitive mechanisms, we studied the gene expression dynamics by increasing the repressor or decreasing the activator abundance (inhibition mechanisms), or by decreasing the repressor or increasing the activator abundance (activation mechanisms). We employed delay differential equation models. Our simulation results show that the competitive and noncompetitive inhibition mechanisms exhibit comparable repression effectiveness. However, response time is fastest in the noncompetitive inhibition mechanism due to increased repressor abundance, and slowest in the competitive inhibition mechanism by increased repressor level. The competitive and noncompetitive inhibition mechanisms through decreased activator abundance show comparable and moderate response times, while the competitive and noncompetitive activation mechanisms by increased activator protein level display more effective and faster response. Our study exemplifies the importance of mathematical modeling and computer simulation in the analysis of gene expression dynamics.
Collapse
Affiliation(s)
- Necmettin Yildirim
- Division of Natural Sciences, New College of Florida, Bayshore Road, Sarasota, FL, USA
| | - Mehmet Emin Aktas
- Department of Mathematics, Florida State University, W College Ave, Tallahassee, FL, USA
| | - Seyma Nur Ozcan
- Department of Mathematics, North Carolina State University, Raleigh, NC, USA
| | - Esra Akbas
- Department of Computer Science, Florida State University, W College Ave, Tallahassee, FL, USA
| | - Ahmet Ay
- Departments of Biology and Mathematics, Colgate University, Oak Drive, Hamilton, NY, USA
| |
Collapse
|
30
|
Surkova S, Sokolkova A, Kozlov K, Nuzhdin SV, Samsonova M. Quantitative analysis reveals genotype- and domain- specific differences between mRNA and protein expression of segmentation genes in Drosophila. Dev Biol 2019; 448:48-58. [PMID: 30629954 DOI: 10.1016/j.ydbio.2019.01.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2018] [Revised: 12/12/2018] [Accepted: 01/04/2019] [Indexed: 10/27/2022]
Abstract
In many biological systems gene expression at mRNA and protein levels is not identical. Rigorous comparison of such differences on a spatio-temporal scale is still not feasible by high-throughput transcriptomic and proteomic analyses of early embryo development. Here, we characterize differences between mRNA and protein expression of Drosophila segmentation genes at the level of individual gene expression domains. We obtained quantitative imaging data on expression of gap genes gt and hb and pair-rule gene eve for Drosophila wild type embryos, Kr null mutants and Kr+/Kr- heterozygotes. To compare mRNA and protein expression we use several criteria including difference in amplitude and positions of expression domains, pattern shape and positional variability. For a number of gene expression domains we show examples where protein expression does not repeat mRNA expression even after a temporal delay. We calculated time delays between eve pattern formation at the level of mRNA and protein for wild type embryos, Kr mutants and Kr+/Kr- heterozygotes. We detect that in wild type embryos, the amplitudes of eve stripes 3 and 7 do not differ significantly at the level of mRNA, however, stripe 3 is higher than stripe 7 at the protein level. We further show that hb mRNA and protein expression in both anterior and posterior domains significantly differs at specific time points. The formation of hb PS4 stripe at the mRNA level proceeds five times faster than at the level of protein. With regard to spatial expression, we show that the offset between posterior gt mRNA and protein domains is much larger in Kr mutants than in wild type embryos and heterozygotes. Finally, we analyze differences in positional variability of eve stripe 7 expression in Kr mutants and Kr+/Kr- heterozygotes at the level of mRNA and protein. These results enable further perspectives to uncover mechanisms underlying discrepancies between mRNA and protein expression in early embryo.
Collapse
Affiliation(s)
- Svetlana Surkova
- Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg 195251, Russia.
| | - Alena Sokolkova
- Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg 195251, Russia
| | - Konstantin Kozlov
- Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg 195251, Russia
| | - Sergey V Nuzhdin
- Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg 195251, Russia; Section of Molecular and Computational Biology, University of Southern California, Los Angeles 90089, CA, USA
| | - Maria Samsonova
- Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg 195251, Russia.
| |
Collapse
|
31
|
Halfon MS. Studying Transcriptional Enhancers: The Founder Fallacy, Validation Creep, and Other Biases. Trends Genet 2018; 35:93-103. [PMID: 30553552 DOI: 10.1016/j.tig.2018.11.004] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 11/15/2018] [Accepted: 11/21/2018] [Indexed: 12/21/2022]
Abstract
Transcriptional enhancers play a major role in regulating metazoan gene expression. Recent developments in genomics and next-generation sequencing have accelerated and revitalized the study of this important class of sequence elements. Increased interest and attention, however, has also led to troubling trends in the enhancer literature. In this Opinion, I describe some of these issues and show how they arise from shifting and nonuniform enhancer definitions, and genome-era biases. I discuss how they can lead to interpretative errors and an unduly narrow focus on certain aspects of enhancer biology to the potential exclusion of others.
Collapse
Affiliation(s)
- Marc S Halfon
- Department of Biochemistry, University at Buffalo-State University of New York, Buffalo, NY, USA; NY State Center of Excellence in Bioinformatics and Life Sciences, Buffalo, NY, USA; Department of Biological Sciences, University at Buffalo-State University of New York, Buffalo, NY, USA; Department of Biomedical Informatics, University at Buffalo-State University of New York, Buffalo, NY, USA; Department of Molecular and Cellular Biology and Program in Cancer Genetics, Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA.
| |
Collapse
|
32
|
An information theoretic treatment of sequence-to-expression modeling. PLoS Comput Biol 2018; 14:e1006459. [PMID: 30256780 PMCID: PMC6175532 DOI: 10.1371/journal.pcbi.1006459] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Revised: 10/08/2018] [Accepted: 08/24/2018] [Indexed: 11/23/2022] Open
Abstract
Studying a gene’s regulatory mechanisms is a tedious process that involves identification of candidate regulators by transcription factor (TF) knockout or over-expression experiments, delineation of enhancers by reporter assays, and demonstration of direct TF influence by site mutagenesis, among other approaches. Such experiments are often chosen based on the biologist’s intuition, from several testable hypotheses. We pursue the goal of making this process systematic by using ideas from information theory to reason about experiments in gene regulation, in the hope of ultimately enabling rigorous experiment design strategies. For this, we make use of a state-of-the-art mathematical model of gene expression, which provides a way to formalize our current knowledge of cis- as well as trans- regulatory mechanisms of a gene. Ambiguities in such knowledge can be expressed as uncertainties in the model, which we capture formally by building an ensemble of plausible models that fit the existing data and defining a probability distribution over the ensemble. We then characterize the impact of a new experiment on our understanding of the gene’s regulation based on how the ensemble of plausible models and its probability distribution changes when challenged with results from that experiment. This allows us to assess the ‘value’ of the experiment retroactively as the reduction in entropy of the distribution (information gain) resulting from the experiment’s results. We fully formalize this novel approach to reasoning about gene regulation experiments and use it to evaluate a variety of perturbation experiments on two developmental genes of D. melanogaster. We also provide objective and ‘biologist-friendly’ descriptions of the information gained from each such experiment. The rigorously defined information theoretic approaches presented here can be used in the future to formulate systematic strategies for experiment design pertaining to studies of gene regulatory mechanisms. In-depth studies of gene regulatory mechanisms employ a variety of experimental approaches such as identifying a gene’s enhancer(s) and testing its variants through reporter assays, followed by transcription factor mis-expression or knockouts, site mutagenesis, etc. The biologist is often faced with the challenging problem of selecting the ideal next experiment to perform so that its results provide novel mechanistic insights, and has to rely on their intuition about what is currently known on the topic and which experiments may add to that knowledge. We seek to make this intuition-based process more systematic, by borrowing ideas from the mature statistical field of experiment design. Towards this goal, we use the language of mathematical models to formally describe what is known about a gene’s regulatory mechanisms, and how an experiment’s results enhance that knowledge. We use information theoretic ideas to assign a ‘value’ to an experiment as well as explain objectively what is learned from that experiment. We demonstrate use of this novel approach on two extensively studied developmental genes in fruitfly. We expect our work to lead to systematic strategies for selecting the most informative experiments in a study of gene regulation.
Collapse
|
33
|
Sabino AU, Vasconcelos MFS, Sittoni MY, Lautenschlager WW, Queiroga AS, Morais MCC, Ramos AF. Lessons and perspectives for applications of stochastic models in biological and cancer research. Clinics (Sao Paulo) 2018; 73:e536s. [PMID: 30281699 PMCID: PMC6131223 DOI: 10.6061/clinics/2018/e536s] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/11/2018] [Accepted: 06/14/2018] [Indexed: 11/18/2022] Open
Abstract
The effects of randomness, an unavoidable feature of intracellular environments, are observed at higher hierarchical levels of living matter organization, such as cells, tissues, and organisms. Additionally, the many compounds interacting as a well-orchestrated network of reactions increase the difficulties of assessing these systems using only experiments. This limitation indicates that elucidation of the dynamics of biological systems is a complex task that will benefit from the establishment of principles to help describe, categorize, and predict the behavior of these systems. The theoretical machinery already available, or ones to be discovered to help solve biological problems, might play an important role in these processes. Here, we demonstrate the application of theoretical tools by discussing some biological problems that we have approached mathematically: fluctuations in gene expression and cell proliferation in the context of loss of contact inhibition. We discuss the methods that have been employed to provide the reader with a biologically motivated phenomenological perspective of the use of theoretical methods. Finally, we end this review with a discussion of new research perspectives motivated by our results.
Collapse
Affiliation(s)
- Alan U Sabino
- Escola de Artes Ciências e Humanidades (EACH), Universidade de Sao Paulo, Sao Paulo, SP, BR
| | - Miguel FS Vasconcelos
- Escola de Artes Ciências e Humanidades (EACH), Universidade de Sao Paulo, Sao Paulo, SP, BR
| | - Misaki Yamada Sittoni
- Escola de Artes Ciências e Humanidades (EACH), Universidade de Sao Paulo, Sao Paulo, SP, BR
- Departamento de Radiologia e Oncologia, Instituto do Cancer do Estado de Sao Paulo (ICESP), Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Sao Paulo, SP, BR
| | | | - Alexandre S Queiroga
- Escola de Artes Ciências e Humanidades (EACH), Universidade de Sao Paulo, Sao Paulo, SP, BR
- Departamento de Radiologia e Oncologia, Instituto do Cancer do Estado de Sao Paulo (ICESP), Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Sao Paulo, SP, BR
| | - Mauro CC Morais
- Escola de Artes Ciências e Humanidades (EACH), Universidade de Sao Paulo, Sao Paulo, SP, BR
- Departamento de Radiologia e Oncologia, Instituto do Cancer do Estado de Sao Paulo (ICESP), Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Sao Paulo, SP, BR
| | - Alexandre F Ramos
- Escola de Artes Ciências e Humanidades (EACH), Universidade de Sao Paulo, Sao Paulo, SP, BR
- Departamento de Radiologia e Oncologia, Instituto do Cancer do Estado de Sao Paulo (ICESP), Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Sao Paulo, SP, BR
- *Corresponding author. E-mail:
| |
Collapse
|
34
|
Vincent BJ, Staller MV, Lopez-Rivera F, Bragdon MDJ, Pym ECG, Biette KM, Wunderlich Z, Harden TT, Estrada J, DePace AH. Hunchback is counter-repressed to regulate even-skipped stripe 2 expression in Drosophila embryos. PLoS Genet 2018; 14:e1007644. [PMID: 30192762 PMCID: PMC6145585 DOI: 10.1371/journal.pgen.1007644] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 09/19/2018] [Accepted: 08/17/2018] [Indexed: 01/18/2023] Open
Abstract
Hunchback is a bifunctional transcription factor that can activate and repress gene expression in Drosophila development. We investigated the regulatory DNA sequence features that control Hunchback function by perturbing enhancers for one of its target genes, even-skipped (eve). While Hunchback directly represses the eve stripe 3+7 enhancer, we found that in the eve stripe 2+7 enhancer, Hunchback repression is prevented by nearby sequences-this phenomenon is called counter-repression. We also found evidence that Caudal binding sites are responsible for counter-repression, and that this interaction may be a conserved feature of eve stripe 2 enhancers. Our results alter the textbook view of eve stripe 2 regulation wherein Hb is described as a direct activator. Instead, to generate stripe 2, Hunchback repression must be counteracted. We discuss how counter-repression may influence eve stripe 2 regulation and evolution.
Collapse
Affiliation(s)
- Ben J. Vincent
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Max V. Staller
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Francheska Lopez-Rivera
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Meghan D. J. Bragdon
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Edward C. G. Pym
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Kelly M. Biette
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Zeba Wunderlich
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Timothy T. Harden
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
35
|
Myasnikova E, Spirov A. Relative sensitivity analysis of the predictive properties of sloppy models. J Bioinform Comput Biol 2018; 16:1840008. [DOI: 10.1142/s0219720018400085] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Commonly among the model parameters characterizing complex biological systems are those that do not significantly influence the quality of the fit to experimental data, so-called “sloppy” parameters. The sloppiness can be mathematically expressed through saturating response functions (Hill’s, sigmoid) thereby embodying biological mechanisms responsible for the system robustness to external perturbations. However, if a sloppy model is used for the prediction of the system behavior at the altered input (e.g. knock out mutations, natural expression variability), it may demonstrate the poor predictive power due to the ambiguity in the parameter estimates. We introduce a method of the predictive power evaluation under the parameter estimation uncertainty, Relative Sensitivity Analysis. The prediction problem is addressed in the context of gene circuit models describing the dynamics of segmentation gene expression in Drosophila embryo. Gene regulation in these models is introduced by a saturating sigmoid function of the concentrations of the regulatory gene products. We show how our approach can be applied to characterize the essential difference between the sensitivity properties of robust and non-robust solutions and select among the existing solutions those providing the correct system behavior at any reasonable input. In general, the method allows to uncover the sources of incorrect predictions and proposes the way to overcome the estimation uncertainties.
Collapse
Affiliation(s)
- Ekaterina Myasnikova
- Center for Advanced Studies, St. Petersburg State Polytechnical University 29, Polytekhnicheskaya, St. Petersburg 195251, Russia
| | - Alexander Spirov
- Computer Science and CEWIT, SUNY Stony Brook 1500 Stony Brook Road, Stony Brook, NY 11794, USA
- Lab Modeling of Evolution I. M. Sechenov Institute of Evolutionary Physiology and Biochemistry Russian Academy of Sciences, Pr. Torez 44, St. Petersburg 194223, Russia
| |
Collapse
|
36
|
Ahsendorf T, Müller FJ, Topkar V, Gunawardena J, Eils R. Transcription factors, coregulators, and epigenetic marks are linearly correlated and highly redundant. PLoS One 2017; 12:e0186324. [PMID: 29216191 PMCID: PMC5720766 DOI: 10.1371/journal.pone.0186324] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Accepted: 08/28/2017] [Indexed: 11/30/2022] Open
Abstract
The DNA microstates that regulate transcription include sequence-specific transcription factors (TFs), coregulatory complexes, nucleosomes, histone modifications, DNA methylation, and parts of the three-dimensional architecture of genomes, which could create an enormous combinatorial complexity across the genome. However, many proteins and epigenetic marks are known to colocalize, suggesting that the information content encoded in these marks can be compressed. It has so far proved difficult to understand this compression in a systematic and quantitative manner. Here, we show that simple linear models can reliably predict the data generated by the ENCODE and Roadmap Epigenomics consortia. Further, we demonstrate that a small number of marks can predict all other marks with high average correlation across the genome, systematically revealing the substantial information compression that is present in different cell lines. We find that the linear models for activating marks are typically cell line-independent, while those for silencing marks are predominantly cell line-specific. Of particular note, a nuclear receptor corepressor, transducin beta-like 1 X-linked receptor 1 (TBLR1), was highly predictive of other marks in two hematopoietic cell lines. The methodology presented here shows how the potentially vast complexity of TFs, coregulators, and epigenetic marks at eukaryotic genes is highly redundant and that the information present can be compressed onto a much smaller subset of marks. These findings could be used to efficiently characterize cell lines and tissues based on a small number of diagnostic marks and suggest how the DNA microstates, which regulate the expression of individual genes, can be specified.
Collapse
Affiliation(s)
- Tobias Ahsendorf
- Division of Theoretical Bioinformatics, German Cancer Research Center, Heidelberg, Baden-Württemberg, Germany
- Institute of Pharmacy and Molecular Biotechnology, Bioquant, University of Heidelberg, Heidelberg, Baden-Württemberg, Germany
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | | | - Ved Topkar
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
- Harvard College, Boston, Massachusetts, United States of America
| | - Jeremy Gunawardena
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Roland Eils
- Division of Theoretical Bioinformatics, German Cancer Research Center, Heidelberg, Baden-Württemberg, Germany
- Institute of Pharmacy and Molecular Biotechnology, Bioquant, University of Heidelberg, Heidelberg, Baden-Württemberg, Germany
- * E-mail:
| |
Collapse
|
37
|
Crocker J, Ilsley GR. Using synthetic biology to study gene regulatory evolution. Curr Opin Genet Dev 2017; 47:91-101. [DOI: 10.1016/j.gde.2017.09.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2017] [Revised: 09/06/2017] [Accepted: 09/11/2017] [Indexed: 12/21/2022]
|
38
|
Barr KA, Martinez C, Moran JR, Kim AR, Ramos AF, Reinitz J. Synthetic enhancer design by in silico compensatory evolution reveals flexibility and constraint in cis-regulation. BMC SYSTEMS BIOLOGY 2017; 11:116. [PMID: 29187214 PMCID: PMC5708098 DOI: 10.1186/s12918-017-0485-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Accepted: 11/09/2017] [Indexed: 11/12/2022]
Abstract
BACKGROUND Models that incorporate specific chemical mechanisms have been successful in describing the activity of Drosophila developmental enhancers as a function of underlying transcription factor binding motifs. Despite this, the minimum set of mechanisms required to reconstruct an enhancer from its constituent parts is not known. Synthetic biology offers the potential to test the sufficiency of known mechanisms to describe the activity of enhancers, as well as to uncover constraints on the number, order, and spacing of motifs. RESULTS Using a functional model and in silico compensatory evolution, we generated putative synthetic even-skipped stripe 2 enhancers with varying degrees of similarity to the natural enhancer. These elements represent the evolutionary trajectories of the natural stripe 2 enhancer towards two synthetic enhancers designed ab initio. In the first trajectory, spatially regulated expression was maintained, even after more than a third of binding sites were lost. In the second, sequences with high similarity to the natural element did not drive expression, but a highly diverged sequence about half the length of the minimal stripe 2 enhancer drove ten times greater expression. Additionally, homotypic clusters of Zelda or Stat92E motifs, but not Bicoid, drove expression in developing embryos. CONCLUSIONS Here, we present a functional model of gene regulation to test the degree to which the known transcription factors and their interactions explain the activity of the Drosophila even-skipped stripe 2 enhancer. Initial success in the first trajectory showed that the gene regulation model explains much of the function of the stripe 2 enhancer. Cases where expression deviated from prediction indicates that undescribed factors likely act to modulate expression. We also showed that activation driven Bicoid and Hunchback is highly sensitive to spatial organization of binding motifs. In contrast, Zelda and Stat92E drive expression from simple homotypic clusters, suggesting that activation driven by these factors is less constrained. Collectively, the 40 sequences generated in this work provides a powerful training set for building future models of gene regulation.
Collapse
Affiliation(s)
- Kenneth A Barr
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Zoology 111, 1101 E 57th St, Chicago, 60637, Illinois, USA.
- Department of Ecology and Evolution, The University of Chicago, Chicago, 60637, Illinois, USA.
| | - Carlos Martinez
- Department Biochemistry and Molecular Genetics, Northwestern University, Chicago, 60611, Illinois, USA
| | - Jennifer R Moran
- Department Human Genetics, The University of Chicago, Chicago, 60637, Illinois, USA
- Institute for Genomics & Systems Biology, The University of Chicago, Chicago, 60637, Illinois, USA
| | - Ah-Ram Kim
- School of Life Science, Handong Global University, Pohang, 37554, Gyeongbuk, South Korea
| | - Alexandre F Ramos
- Departamento de Radiologia - Faculdade de Medicina, Universidade de São Paulo & Instituto do Câncer do Estado de São Paulo, São Paulo, SP CEP, 05403-911, Brazil
- Escola de Artes, Ciências e Humanidades & Núcleo de Estudos Interdisciplinares em Sistemas Complexos, Universidade de São Paulo, Av. Arlindo Béttio, São Paulo, 1000 CEP 03828-000, SP, Brazil
| | - John Reinitz
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Zoology 111, 1101 E 57th St, Chicago, 60637, Illinois, USA
- Department of Ecology and Evolution, The University of Chicago, Chicago, 60637, Illinois, USA
- Institute for Genomics & Systems Biology, The University of Chicago, Chicago, 60637, Illinois, USA
- Department Statistics, The University of Chicago, 5747 S. Ellis Avenue Jones 312, Chicago, 60637, IL, USA
| |
Collapse
|
39
|
Bentovim L, Harden TT, DePace AH. Transcriptional precision and accuracy in development: from measurements to models and mechanisms. Development 2017; 144:3855-3866. [PMID: 29089359 PMCID: PMC5702068 DOI: 10.1242/dev.146563] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
During development, genes are transcribed at specific times, locations and levels. In recent years, the emergence of quantitative tools has significantly advanced our ability to measure transcription with high spatiotemporal resolution in vivo. Here, we highlight recent studies that have used these tools to characterize transcription during development, and discuss the mechanisms that contribute to the precision and accuracy of the timing, location and level of transcription. We attempt to disentangle the discrepancies in how physicists and biologists use the term ‘precision' to facilitate interactions using a common language. We also highlight selected examples in which the coupling of mathematical modeling with experimental approaches has provided important mechanistic insights, and call for a more expansive use of mathematical modeling to exploit the wealth of quantitative data and advance our understanding of animal transcription. Summary: This Review highlights how high-resolution quantitative tools and theoretical models have formed our current view of the mechanisms determining precision and accuracy in the timing, location and level of transcription in the Drosophila embryo.
Collapse
Affiliation(s)
- Lital Bentovim
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Timothy T Harden
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Angela H DePace
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
40
|
Myasnikova EM, Spirov AV. A Method for Estimating the Predictive Power in a Model of a Biological System with Low Sensitivity to Parameters. Biophysics (Nagoya-shi) 2017. [DOI: 10.1134/s0006350917060185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
41
|
Barr KA, Reinitz J. A sequence level model of an intact locus predicts the location and function of nonadditive enhancers. PLoS One 2017; 12:e0180861. [PMID: 28715438 PMCID: PMC5513433 DOI: 10.1371/journal.pone.0180861] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Accepted: 06/22/2017] [Indexed: 01/24/2023] Open
Abstract
Metazoan gene expression is controlled through the action of long stretches of noncoding DNA that contain enhancers-shorter sequences responsible for controlling a single aspect of a gene's expression pattern. Models built on thermodynamics have shown how enhancers interpret protein concentration in order to determine specific levels of gene expression, but the emergent regulatory logic of a complete regulatory locus shows qualitative and quantitative differences from isolated enhancers. Such differences may arise from steric competition limiting the quantity of DNA that can simultaneously influence the transcription machinery. We incorporated this competition into a mechanistic model of gene regulation, generated efficient algorithms for this computation, and applied it to the regulation of Drosophila even-skipped (eve). This model finds the location of enhancers and identifies which factors control the boundaries of eve expression. This model predicts a new enhancer that, when assayed in vivo, drives expression in a non-eve pattern. Incorporation of chromatin accessibility eliminates this inconsistency.
Collapse
Affiliation(s)
- Kenneth A. Barr
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| | - John Reinitz
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Statistics, University of Chicago, Chicago, Illinois, United States of America
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, United States of America
- Department of Molecular Genetics and Cell Biology, University of Chicago, Chicago, Illinois, United States of America
- Institute for Genomics and Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
42
|
Holloway DM, Spirov AV. Transcriptional bursting in Drosophila development: Stochastic dynamics of eve stripe 2 expression. PLoS One 2017; 12:e0176228. [PMID: 28437444 PMCID: PMC5402966 DOI: 10.1371/journal.pone.0176228] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 04/08/2017] [Indexed: 01/17/2023] Open
Abstract
Anterior-posterior (AP) body segmentation of the fruit fly (Drosophila) is first seen in the 7-stripe spatial expression patterns of the pair-rule genes, which regulate downstream genes determining specific segment identities. Regulation of pair-rule expression has been extensively studied for the even-skipped (eve) gene. Recent live imaging, of a reporter for the 2ndeve stripe, has demonstrated the stochastic nature of this process, with ‘bursts’ in the number of RNA transcripts being made over time. We developed a stochastic model of the spatial and temporal expression of eve stripe 2 (binding by transcriptional activators (Bicoid and Hunchback proteins) and repressors (Giant and Krüppel proteins), transcriptional initiation and termination; with all rate parameters constrained by features of the experimental data) in order to analyze the noisy experimental time series and test hypotheses for how eve transcription is regulated. These include whether eve transcription is simply OFF or ON, with a single ON rate, or whether it proceeds by a more complex mechanism, with multiple ON rates. We find that both mechanisms can produce long (multi-minute) RNA bursts, but that the short-time (minute-to-minute) statistics of the data is indicative of eve being transcribed with at least two distinct ON rates, consistent with data on the joint activation of eve by Bicoid and Hunchback. We also predict distinct statistical signatures for cases in which eve is repressed (e.g. along the edges of the stripe) vs. cases in which activation is reduced (e.g. by mutagenesis of transcription factor binding sites). Fundamental developmental processes such as gene transcription are intrinsically noisy; our approach presents a new way to quantify and analyze time series data during developmental patterning in order to understand regulatory mechanisms and how they propagate noise and impact embryonic robustness.
Collapse
Affiliation(s)
- David M. Holloway
- Mathematics Department, British Columbia Institute of Technology, Burnaby, B.C., Canada
- Biology Department, University of Victoria, Victoria, B.C., Canada
- * E-mail:
| | - Alexander V. Spirov
- Computer Science, and Center of Excellence in Wireless and Information Technology, State University of New York, Stony Brook, New York, United States of America
- Sechenov Institute of Evolutionary Physiology and Biochemistry, St. Petersburg, Russia
| |
Collapse
|
43
|
A Looping-Based Model for Quenching Repression. PLoS Comput Biol 2017; 13:e1005337. [PMID: 28085884 PMCID: PMC5279812 DOI: 10.1371/journal.pcbi.1005337] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Revised: 01/30/2017] [Accepted: 12/29/2016] [Indexed: 12/18/2022] Open
Abstract
We model the regulatory role of proteins bound to looped DNA using a simulation in which dsDNA is represented as a self-avoiding chain, and proteins as spherical protrusions. We simulate long self-avoiding chains using a sequential importance sampling Monte-Carlo algorithm, and compute the probabilities for chain looping with and without a protrusion. We find that a protrusion near one of the chain’s termini reduces the probability of looping, even for chains much longer than the protrusion–chain-terminus distance. This effect increases with protrusion size, and decreases with protrusion-terminus distance. The reduced probability of looping can be explained via an eclipse-like model, which provides a novel inhibitory mechanism. We test the eclipse model on two possible transcription-factor occupancy states of the D. melanogastereve 3/7 enhancer, and show that it provides a possible explanation for the experimentally-observed eve stripe 3 and 7 expression patterns. Biological regulation-at-a-distance, whereby a transcription factor (TF) is able to generate susbstantial regulatory effects on gene expression even though it may be bound a large distance away from its target (500 bp–1 Mbp), is only partially understood. Using a biophysical model and a computer simulation that take dsDNA and TF volumes into account, we identify a downregulatory mechanism which functions at large distances, whereby a TF bound within ∼ 150 bp from an activator decreases the probability of looping-based interaction between the activator and the distant core promoter. This “eclipse” mechanism provides insight into the question of how enhancer architecture dictates gene expression.
Collapse
|
44
|
Peng PC, Sinha S. Quantitative modeling of gene expression using DNA shape features of binding sites. Nucleic Acids Res 2016; 44:e120. [PMID: 27257066 PMCID: PMC5291265 DOI: 10.1093/nar/gkw446] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Revised: 05/06/2016] [Accepted: 05/09/2016] [Indexed: 12/11/2022] Open
Abstract
Prediction of gene expression levels driven by regulatory sequences is pivotal in genomic biology. A major focus in transcriptional regulation is sequence-to-expression modeling, which interprets the enhancer sequence based on transcription factor concentrations and DNA binding specificities and predicts precise gene expression levels in varying cellular contexts. Such models largely rely on the position weight matrix (PWM) model for DNA binding, and the effect of alternative models based on DNA shape remains unexplored. Here, we propose a statistical thermodynamics model of gene expression using DNA shape features of binding sites. We used rigorous methods to evaluate the fits of expression readouts of 37 enhancers regulating spatial gene expression patterns in Drosophila embryo, and show that DNA shape-based models perform arguably better than PWM-based models. We also observed DNA shape captures information complimentary to the PWM, in a way that is useful for expression modeling. Furthermore, we tested if combining shape and PWM-based features provides better predictions than using either binding model alone. Our work demonstrates that the increasingly popular DNA-binding models based on local DNA shape can be useful in sequence-to-expression modeling. It also provides a framework for future studies to predict gene expression better than with PWM models alone.
Collapse
Affiliation(s)
- Pei-Chen Peng
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
45
|
Spirov AV, Myasnikova EM, Holloway DM. Sequential construction of a model for modular gene expression control, applied to spatial patterning of theDrosophilagenehunchback. J Bioinform Comput Biol 2016; 14:1641005. [DOI: 10.1142/s0219720016410055] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Gene network simulations are increasingly used to quantify mutual gene regulation in biological tissues. These are generally based on linear interactions between single-entity regulatory and target genes. Biological genes, by contrast, commonly have multiple, partially independent, cis-regulatory modules (CRMs) for regulator binding, and can produce variant transcription and translation products. We present a modeling framework to address some of the gene regulatory dynamics implied by this biological complexity. Spatial patterning of the hunchback (hb) gene in Drosophila development involves control by three CRMs producing two distinct mRNA transcripts. We use this example to develop a differential equations model for transcription which takes into account the cis-regulatory architecture of the gene. Potential regulatory interactions are screened by a genetic algorithms (GAs) approach and compared to biological expression data.
Collapse
Affiliation(s)
- Alexander V. Spirov
- Computer Science and CEWIT, SUNY Stony Brook, 1500 Stony Brook Road, Stony Brook, NY 11794, USA
- Lab Modeling of Evolution, I. M. Sechenov Institute of Evolutionary Physiology and Biochemistry, Russian Academy of Sciences, pr. Torez 44, St. Petersburg 194223, Russia
| | - Ekaterina M. Myasnikova
- Center for Advanced Studies, Peter the Great St. Petersburg Polytechnical University, 29 Polytechnicheskaya St. Petersburg 195251, Russia
- Department of Bioinformatics, Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow 141700, Russia
| | - David M. Holloway
- Mathematics Department, British Columbia Institute of Technology, 3700 Willingdon Avenue, Burnaby, BC, Canada V5G 3H2, Canada
- Department of Biology, University of Victoria, Victoria, BC, Canada V8W 2Y2, Canada
| |
Collapse
|
46
|
Prata GN, Hornos JEM, Ramos AF. Stochastic model for gene transcription on Drosophila melanogaster embryos. Phys Rev E 2016; 93:022403. [PMID: 26986358 DOI: 10.1103/physreve.93.022403] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Indexed: 02/04/2023]
Abstract
We examine immunostaining experimental data for the formation of stripe 2 of even-skipped (eve) transcripts on D. melanogaster embryos. An estimate of the factor converting immunofluorescence intensity units into molecular numbers is given. The analysis of the eve dynamics at the region of stripe 2 suggests that the promoter site of the gene has two distinct regimes: an earlier phase when it is predominantly activated until a critical time when it becomes mainly repressed. That suggests proposing a stochastic binary model for gene transcription on D. melanogaster embryos. Our model has two random variables: the transcripts number and the state of the source of mRNAs given as active or repressed. We are able to reproduce available experimental data for the average number of transcripts. An analysis of the random fluctuations on the number of eves and their consequences on the spatial precision of stripe 2 is presented. We show that the position of the anterior or posterior borders fluctuate around their average position by ∼1% of the embryo length, which is similar to what is found experimentally. The fitting of data by such a simple model suggests that it can be useful to understand the functions of randomness during developmental processes.
Collapse
Affiliation(s)
- Guilherme N Prata
- Escola de Artes, Ciências e Humanidades, Universidade de São Paulo, Avenida Arlindo Béttio, 1000, Ermelino Matarazzo, São Paulo, SP CEP 03828-000, Brazil
| | - José Eduardo M Hornos
- Instituto de Física de São Carlos, Universidade de São Paulo, Av. Trabalhador São-Carlense, 400, São Carlos, SP CEP 13566-590, Brazil
| | - Alexandre F Ramos
- Escola de Artes, Ciências e Humanidades, Universidade de São Paulo, Avenida Arlindo Béttio, 1000, Ermelino Matarazzo, São Paulo, SP CEP 03828-000, Brazil.,Departamento de Radiologia - Faculdade de Medicina, Universidade de São Paulo, São Carlos, SP CEP 13566-590, Brazil.,Núcleo de Estudos Interdisciplinares em Sistemas Complexos, Universidade de São Paulo, São Carlos, SP CEP 13566-590, Brazil
| |
Collapse
|
47
|
Vincent BJ, Estrada J, DePace AH. The appeasement of Doug: a synthetic approach to enhancer biology. Integr Biol (Camb) 2016; 8:475-84. [DOI: 10.1039/c5ib00321k] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Ben J. Vincent
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| |
Collapse
|
48
|
Bertolino E, Reinitz J, Manu. The analysis of novel distal Cebpa enhancers and silencers using a transcriptional model reveals the complex regulatory logic of hematopoietic lineage specification. Dev Biol 2016; 413:128-44. [PMID: 26945717 DOI: 10.1016/j.ydbio.2016.02.030] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Revised: 01/13/2016] [Accepted: 02/15/2016] [Indexed: 11/25/2022]
Abstract
C/EBPα plays an instructive role in the macrophage-neutrophil cell-fate decision and its expression is necessary for neutrophil development. How Cebpa itself is regulated in the myeloid lineage is not known. We decoded the cis-regulatory logic of Cebpa, and two other myeloid transcription factors, Egr1 and Egr2, using a combined experimental-computational approach. With a reporter design capable of detecting both distal enhancers and silencers, we analyzed 46 putative cis-regulatory modules (CRMs) in cells representing myeloid progenitors, and derived early macrophages or neutrophils. In addition to novel enhancers, this analysis revealed a surprisingly large number of silencers. We determined the regulatory roles of 15 potential transcriptional regulators by testing 32,768 alternative sequence-based transcriptional models against CRM activity data. This comprehensive analysis allowed us to infer the cis-regulatory logic for most of the CRMs. Silencer-mediated repression of Cebpa was found to be effected mainly by TFs expressed in non-myeloid lineages, highlighting a previously unappreciated contribution of long-distance silencing to hematopoietic lineage resolution. The repression of Cebpa by multiple factors expressed in alternative lineages suggests that hematopoietic genes are organized into densely interconnected repressive networks instead of hierarchies of mutually repressive pairs of pivotal TFs. More generally, our results demonstrate that de novo cis-regulatory dissection is feasible on a large scale with the aid of transcriptional modeling. Current address: Department of Biology, University of North Dakota, 10 Cornell Street, Stop 9019, Grand Forks, ND 58202-9019, USA.
Collapse
Affiliation(s)
- Eric Bertolino
- Department of Molecular Genetics and Cell Biology, The University of Chicago, Chicago, IL 60637, USA.
| | - John Reinitz
- Department of Molecular Genetics and Cell Biology, The University of Chicago, Chicago, IL 60637, USA; Department of Statistics, The University of Chicago, Chicago, IL 60637, USA; Department of Ecology and Evolution and Institute of Genomics and Systems Biology, The University of Chicago, Chicago, IL 60637, USA
| | - Manu
- Department of Ecology and Evolution and Institute of Genomics and Systems Biology, The University of Chicago, Chicago, IL 60637, USA; Department of Biology, University of North Dakota, 10 Cornell Street, Stop 9019, Grand Forks, ND 58202-9019, USA.
| |
Collapse
|
49
|
Hoermann A, Cicin-Sain D, Jaeger J. A quantitative validated model reveals two phases of transcriptional regulation for the gap gene giant in Drosophila. Dev Biol 2016; 411:325-338. [DOI: 10.1016/j.ydbio.2016.01.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2015] [Revised: 12/22/2015] [Accepted: 01/08/2016] [Indexed: 01/05/2023]
|
50
|
Quantitatively predictable control of Drosophila transcriptional enhancers in vivo with engineered transcription factors. Nat Genet 2016; 48:292-8. [DOI: 10.1038/ng.3509] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2015] [Accepted: 01/15/2016] [Indexed: 12/13/2022]
|