151
|
Kelley DR, Reshef YA, Bileschi M, Belanger D, McLean CY, Snoek J. Sequential regulatory activity prediction across chromosomes with convolutional neural networks. Genome Res 2018; 28:739-750. [PMID: 29588361 PMCID: PMC5932613 DOI: 10.1101/gr.227819.117] [Citation(s) in RCA: 280] [Impact Index Per Article: 40.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Accepted: 03/23/2018] [Indexed: 01/10/2023]
Abstract
Models for predicting phenotypic outcomes from genotypes have important applications to understanding genomic function and improving human health. Here, we develop a machine-learning system to predict cell-type-specific epigenetic and transcriptional profiles in large mammalian genomes from DNA sequence alone. By use of convolutional neural networks, this system identifies promoters and distal regulatory elements and synthesizes their content to make effective gene expression predictions. We show that model predictions for the influence of genomic variants on gene expression align well to causal variants underlying eQTLs in human populations and can be useful for generating mechanistic hypotheses to enable fine mapping of disease loci.
Collapse
Affiliation(s)
| | - Yakir A Reshef
- Department of Computer Science, Harvard University, Cambridge, Massachusetts 02138, USA
| | | | | | | | - Jasper Snoek
- Google Brain, Cambridge, Massachusetts 02142, USA
| |
Collapse
|
152
|
Mitra S, Biswas A, Narlikar L. DIVERSITY in binding, regulation, and evolution revealed from high-throughput ChIP. PLoS Comput Biol 2018; 14:e1006090. [PMID: 29684008 PMCID: PMC5933800 DOI: 10.1371/journal.pcbi.1006090] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Revised: 05/03/2018] [Accepted: 03/14/2018] [Indexed: 12/27/2022] Open
Abstract
Genome-wide in vivo protein-DNA interactions are routinely mapped using high-throughput chromatin immunoprecipitation (ChIP). ChIP-reported regions are typically investigated for enriched sequence-motifs, which are likely to model the DNA-binding specificity of the profiled protein and/or of co-occurring proteins. However, simple enrichment analyses can miss insights into the binding-activity of the protein. Note that ChIP reports regions making direct contact with the protein as well as those binding through intermediaries. For example, consider a ChIP experiment targeting protein X, which binds DNA at its cognate sites, but simultaneously interacts with four other proteins. Each of these proteins also binds to its own specific cognate sites along distant parts of the genome, a scenario consistent with the current view of transcriptional hubs and chromatin loops. Since ChIP will pull down all X-associated regions, the final reported data will be a union of five distinct sets of regions, each containing binding sites of one of the five proteins, respectively. Characterizing all five different motifs and the corresponding sets is important to interpret the ChIP experiment and ultimately, the role of X in regulation. We present diversity which attempts exactly this: it partitions the data so that each partition can be characterized with its own de novo motif. Diversity uses a Bayesian approach to identify the optimal number of motifs and the associated partitions, which together explain the entire dataset. This is in contrast to standard motif finders, which report motifs individually enriched in the data, but do not necessarily explain all reported regions. We show that the different motifs and associated regions identified by diversity give insights into the various complexes that may be forming along the chromatin, something that has so far not been attempted from ChIP data. Webserver at http://diversity.ncl.res.in/; standalone (Mac OS X/Linux) from https://github.com/NarlikarLab/DIVERSITY/releases/tag/v1.0.0. A high-throughput chromatin immunoprecipitation (ChIP) experiment identifies genomic regions bound by a protein in vivo. Current motif-discovery approaches seek an enriched motif signature in the reported regions, which they can attribute to the protein’s binding preferences. However, Diversity models the fact that since a ChIP experiment pulls down regions participating in all complexes involving the profiled protein, the reported regions are in all likelihood, a collection of different types of protein-DNA contacts. Diversity asks a different question: what sequence component caused a specific region to be reported in a ChIP experiment? The answer, in combination with additional data such as sequence conservation, SNPs, chromatin structure, downstream gene-expression, etc. can yield insights into the diverse regulatory mechanisms at play. The added benefits of a webserver and a standalone parallel version make diversity a practical tool for discovering new biology from ChIP experiments.
Collapse
Affiliation(s)
- Sneha Mitra
- Department of Chemical Engineering, CSIR-National Chemical Laboratory, Pune, India
| | - Anushua Biswas
- Department of Chemical Engineering, CSIR-National Chemical Laboratory, Pune, India
| | - Leelavati Narlikar
- Department of Chemical Engineering, CSIR-National Chemical Laboratory, Pune, India
- * E-mail:
| |
Collapse
|
153
|
Gu B, Swigut T, Spencley A, Bauer MR, Chung M, Meyer T, Wysocka J. Transcription-coupled changes in nuclear mobility of mammalian cis-regulatory elements. Science 2018; 359:1050-1055. [PMID: 29371426 PMCID: PMC6590518 DOI: 10.1126/science.aao3136] [Citation(s) in RCA: 250] [Impact Index Per Article: 35.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2017] [Accepted: 01/16/2018] [Indexed: 12/15/2022]
Abstract
To achieve guide RNA (gRNA) multiplexing and an efficient delivery of tens of distinct gRNAs into single cells, we developed a molecular assembly strategy termed chimeric array of gRNA oligonucleotides (CARGO). We coupled CARGO with dCas9 (catalytically dead Cas9) imaging to quantitatively measure the movement of enhancers and promoters that undergo differentiation-associated activity changes in live embryonic stem cells. Whereas all examined functional elements exhibited subdiffusive behavior, their relative mobility increased concurrently with transcriptional activation. Furthermore, acute perturbation of RNA polymerase II activity can reverse these activity-linked increases in loci mobility. Through quantitative CARGO-dCas9 imaging, we provide direct measurements of cis-regulatory element dynamics in living cells and distinct cellular and activity states and uncover an intrinsic connection between cis-regulatory element mobility and transcription.
Collapse
Affiliation(s)
- Bo Gu
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Tomek Swigut
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Andrew Spencley
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
- Cancer Biology Program, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Matthew R Bauer
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Mingyu Chung
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Tobias Meyer
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA 94305, USA.
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| |
Collapse
|
154
|
Yan J, Chen SAA, Local A, Liu T, Qiu Y, Dorighi KM, Preissl S, Rivera CM, Wang C, Ye Z, Ge K, Hu M, Wysocka J, Ren B. Histone H3 lysine 4 monomethylation modulates long-range chromatin interactions at enhancers. Cell Res 2018; 28:204-220. [PMID: 29313530 PMCID: PMC5799818 DOI: 10.1038/cr.2018.1] [Citation(s) in RCA: 104] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Revised: 04/05/2017] [Accepted: 11/14/2017] [Indexed: 12/22/2022] Open
Abstract
Long-range chromatin interactions between enhancers and promoters are essential for transcription of many developmentally controlled genes in mammals and other metazoans. Currently, the exact mechanisms that connect distal enhancers to their specific target promoters remain to be fully elucidated. Here, we show that the enhancer-specific histone H3 lysine 4 monomethylation (H3K4me1) and the histone methyltransferases MLL3 and MLL4 (MLL3/4) play an active role in this process. We demonstrate that in differentiating mouse embryonic stem cells, MLL3/4-dependent deposition of H3K4me1 at enhancers correlates with increased levels of chromatin interactions, whereas loss of this histone modification leads to reduced levels of chromatin interactions and defects in gene activation during differentiation. H3K4me1 facilitates recruitment of the Cohesin complex, a known regulator of chromatin organization, to chromatin in vitro and in vivo, providing a potential mechanism for MLL3/4 to promote chromatin interactions between enhancers and promoters. Taken together, our results support a role for MLL3/4-dependent H3K4me1 in orchestrating long-range chromatin interactions at enhancers in mammalian cells.
Collapse
Affiliation(s)
- Jian Yan
- Ludwig Institute for Cancer Research, 9500 Gilman Dr., La Jolla, CA 92093, USA
- Department of Medical Biochemistry and Biophysics, Division of Functional Genomics and Systems Biology, Karolinska Institutet, 171 65 Stockholm, Sweden
| | - Shi-An A Chen
- Ludwig Institute for Cancer Research, 9500 Gilman Dr., La Jolla, CA 92093, USA
| | - Andrea Local
- Ludwig Institute for Cancer Research, 9500 Gilman Dr., La Jolla, CA 92093, USA
- Current address: Aptose Biosciences Inc., 3550 General Atomics Ct, San Diego, CA 92122, USA
| | - Tristin Liu
- Ludwig Institute for Cancer Research, 9500 Gilman Dr., La Jolla, CA 92093, USA
| | - Yunjiang Qiu
- Ludwig Institute for Cancer Research, 9500 Gilman Dr., La Jolla, CA 92093, USA
| | - Kristel M Dorighi
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Sebastian Preissl
- Ludwig Institute for Cancer Research, 9500 Gilman Dr., La Jolla, CA 92093, USA
| | - Chloe M Rivera
- Ludwig Institute for Cancer Research, 9500 Gilman Dr., La Jolla, CA 92093, USA
| | - Chaochen Wang
- Laboratory of Endocrinology and Receptor Biology, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, MD 20892, USA
| | - Zhen Ye
- Ludwig Institute for Cancer Research, 9500 Gilman Dr., La Jolla, CA 92093, USA
| | - Kai Ge
- Laboratory of Endocrinology and Receptor Biology, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, MD 20892, USA
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH 44195, USA
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Bing Ren
- Ludwig Institute for Cancer Research, 9500 Gilman Dr., La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, School of Medicine, Institute of Genomic Medicine, 9500 Gilman Dr., La Jolla, CA 92093, USA
| |
Collapse
|
155
|
Henriques T, Scruggs BS, Inouye MO, Muse GW, Williams LH, Burkholder AB, Lavender CA, Fargo DC, Adelman K. Widespread transcriptional pausing and elongation control at enhancers. Genes Dev 2018; 32:26-41. [PMID: 29378787 PMCID: PMC5828392 DOI: 10.1101/gad.309351.117] [Citation(s) in RCA: 233] [Impact Index Per Article: 33.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 12/21/2017] [Indexed: 02/07/2023]
Abstract
In this study, Henriques et al. demonstrate that transcription is a nearly universal feature of enhancers in Drosophila and mammalian cells and that nascent RNA sequencing strategies are optimal for identification of both enhancers and superenhancers. Their findings provide insights into the unique characteristics of superenhancers, which stimulate high-level gene expression through rapid pause release; interestingly, this property renders associated genes resistant to loss of factors that stabilize paused RNAPII. Regulation by gene-distal enhancers is critical for cell type-specific and condition-specific patterns of gene expression. Thus, to understand the basis of gene activity in a given cell type or tissue, we must identify the precise locations of enhancers and functionally characterize their behaviors. Here, we demonstrate that transcription is a nearly universal feature of enhancers in Drosophila and mammalian cells and that nascent RNA sequencing strategies are optimal for identification of both enhancers and superenhancers. We dissect the mechanisms governing enhancer transcription and discover remarkable similarities to transcription at protein-coding genes. We show that RNA polymerase II (RNAPII) undergoes regulated pausing and release at enhancers. However, as compared with mRNA genes, RNAPII at enhancers is less stable and more prone to early termination. Furthermore, we found that the level of histone H3 Lys4 (H3K4) methylation at enhancers corresponds to transcriptional activity such that highly active enhancers display H3K4 trimethylation rather than the H3K4 monomethylation considered a hallmark of enhancers. Finally, our work provides insights into the unique characteristics of superenhancers, which stimulate high-level gene expression through rapid pause release; interestingly, this property renders associated genes resistant to the loss of factors that stabilize paused RNAPII.
Collapse
Affiliation(s)
- Telmo Henriques
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA.,Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Benjamin S Scruggs
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Michiko O Inouye
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA.,Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Ginger W Muse
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Lucy H Williams
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Adam B Burkholder
- Center for Integrative Bioinformatics, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Christopher A Lavender
- Center for Integrative Bioinformatics, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - David C Fargo
- Center for Integrative Bioinformatics, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Karen Adelman
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA.,Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA
| |
Collapse
|
156
|
Worley MI, Alexander LA, Hariharan IK. CtBP impedes JNK- and Upd/STAT-driven cell fate misspecifications in regenerating Drosophila imaginal discs. eLife 2018; 7:30391. [PMID: 29372681 PMCID: PMC5823544 DOI: 10.7554/elife.30391] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 01/19/2018] [Indexed: 12/27/2022] Open
Abstract
Regeneration following tissue damage often necessitates a mechanism for cellular re-programming, so that surviving cells can give rise to all cell types originally found in the damaged tissue. This process, if unchecked, can also generate cell types that are inappropriate for a given location. We conducted a screen for genes that negatively regulate the frequency of notum-to-wing transformations following genetic ablation and regeneration of the wing pouch, from which we identified mutations in the transcriptional co-repressor C-terminal Binding Protein (CtBP). When CtBP function is reduced, ablation of the pouch can activate the JNK/AP-1 and JAK/STAT pathways in the notum to destabilize cell fates. Ectopic expression of Wingless and Dilp8 precede the formation of the ectopic pouch, which is subsequently generated by recruitment of both anterior and posterior cells near the compartment boundary. Thus, CtBP stabilizes cell fates following damage by opposing the destabilizing effects of the JNK/AP-1 and JAK/STAT pathways.
Collapse
Affiliation(s)
- Melanie I Worley
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
| | - Larissa A Alexander
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
| | - Iswar K Hariharan
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
| |
Collapse
|
157
|
Abstract
We developed a predictive, stable, and interpretable tool: the iterative random forest algorithm (iRF). iRF discovers high-order interactions among biomolecules with the same order of computational cost as random forests. We demonstrate the efficacy of iRF by finding known and promising interactions among biomolecules, of up to fifth and sixth order, in two data examples in transcriptional regulation and alternative splicing. Genomics has revolutionized biology, enabling the interrogation of whole transcriptomes, genome-wide binding sites for proteins, and many other molecular processes. However, individual genomic assays measure elements that interact in vivo as components of larger molecular machines. Understanding how these high-order interactions drive gene expression presents a substantial statistical challenge. Building on random forests (RFs) and random intersection trees (RITs) and through extensive, biologically inspired simulations, we developed the iterative random forest algorithm (iRF). iRF trains a feature-weighted ensemble of decision trees to detect stable, high-order interactions with the same order of computational cost as the RF. We demonstrate the utility of iRF for high-order interaction discovery in two prediction problems: enhancer activity in the early Drosophila embryo and alternative splicing of primary transcripts in human-derived cell lines. In Drosophila, among the 20 pairwise transcription factor interactions iRF identifies as stable (returned in more than half of bootstrap replicates), 80% have been previously reported as physical interactions. Moreover, third-order interactions, e.g., between Zelda (Zld), Giant (Gt), and Twist (Twi), suggest high-order relationships that are candidates for follow-up experiments. In human-derived cells, iRF rediscovered a central role of H3K36me3 in chromatin-mediated splicing regulation and identified interesting fifth- and sixth-order interactions, indicative of multivalent nucleosomes with specific roles in splicing regulation. By decoupling the order of interactions from the computational cost of identification, iRF opens additional avenues of inquiry into the molecular mechanisms underlying genome biology.
Collapse
|
158
|
Roeske MJ, Camino EM, Grover S, Rebeiz M, Williams TM. Cis-regulatory evolution integrated the Bric-à-brac transcription factors into a novel fruit fly gene regulatory network. eLife 2018; 7. [PMID: 29297463 PMCID: PMC5752203 DOI: 10.7554/elife.32273] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2017] [Accepted: 12/19/2017] [Indexed: 11/13/2022] Open
Abstract
Gene expression evolution through gene regulatory network (GRN) changes has gained appreciation as a driver of morphological evolution. However, understanding how GRNs evolve is hampered by finding relevant cis-regulatory element (CRE) mutations, and interpreting the protein-DNA interactions they alter. We investigated evolutionary changes in the duplicated Bric-à-brac (Bab) transcription factors and a key Bab target gene in a GRN underlying the novel dimorphic pigmentation of D. melanogaster and its relatives. It has remained uncertain how Bab was integrated within the pigmentation GRN. Here, we show that the ancestral transcription factor activity of Bab gained a role in sculpting sex-specific pigmentation through the evolution of binding sites in a CRE of the pigment-promoting yellow gene. This work demonstrates how a new trait can evolve by incorporating existing transcription factors into a GRN through CRE evolution, an evolutionary path likely to predominate newly evolved functions of transcription factors.
Collapse
Affiliation(s)
- Maxwell J Roeske
- Department of Biology, University of Dayton, Dayton, United States
| | - Eric M Camino
- Department of Biology, University of Dayton, Dayton, United States
| | - Sumant Grover
- Department of Biology, University of Dayton, Dayton, United States
| | - Mark Rebeiz
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, United States
| | - Thomas Michael Williams
- Department of Biology, University of Dayton, Dayton, United States.,Center for Tissue Regeneration and Engineering at Dayton, University of Dayton, Dayton, United States
| |
Collapse
|
159
|
Maguire JE, Pandey A, Wu Y, Di Gregorio A. Investigating Evolutionarily Conserved Molecular Mechanisms Controlling Gene Expression in the Notochord. TRANSGENIC ASCIDIANS 2018. [DOI: 10.1007/978-981-10-7545-2_8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
|
160
|
Sundaram V, Wang T. Transposable Element Mediated Innovation in Gene Regulatory Landscapes of Cells: Re-Visiting the "Gene-Battery" Model. Bioessays 2018; 40:10.1002/bies.201700155. [PMID: 29206283 PMCID: PMC5912915 DOI: 10.1002/bies.201700155] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Revised: 10/25/2017] [Indexed: 01/31/2023]
Abstract
Transposable elements (TEs) are no longer considered to be "junk" DNA. Here, we review how TEs can impact gene regulation systematically. TEs encode various regulatory elements that enables them to regulate gene expression. RJ Britten and EH Davidson hypothesized that TEs can integrate the function of various transcriptional regulators into gene regulatory networks. Uniquely TEs can deposit regulatory sites across the genome when they transpose, and thereby bring multiple genes under control of the same regulatory logic. Several studies together have robustly established that TEs participate in embryonic development and oncogenesis. We discuss the regulatory characteristics of TEs in context of evolution to understand the extent of their impact on gene networks. Understanding these features of TEs is central to future investigations of TEs in cellular processes and phenotypic presentations, which are applicable to development and disease studies. We re-visit the Britten-Davidson "gene-battery" model and understand the genetic and transcriptional impact of TEs in innovating gene regulatory networks.
Collapse
Affiliation(s)
- Vasavi Sundaram
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Ting Wang
- Department of Genetics, Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, Missouri 63110, United States of America
| |
Collapse
|
161
|
Blinka S, Rao S. Nanog Expression in Embryonic Stem Cells - An Ideal Model System to Dissect Enhancer Function. Bioessays 2017; 39:10.1002/bies.201700086. [PMID: 28977693 PMCID: PMC5878941 DOI: 10.1002/bies.201700086] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2017] [Revised: 08/31/2017] [Indexed: 01/17/2023]
Abstract
Embryonic stem cells (ESCs) are derived from the preimplantation embryo and can differentiate into virtually any other cell type (termed pluripotency), which is governed by lineage specific transcriptions factors (TFs) binding to cis regulatory elements (CREs) to mediate changes in gene expression. The reliance on transcriptional regulation to maintain pluripotency makes ESCs a valuable model to study the role of distal CREs such as enhancers in modulating gene expression to affect cell fate decisions. This review will highlight recent advance on transcriptional enhancers, focusing on studies performed in ESCs. In addition, we argue that the Nanog locus, which encodes for an ESC-critical TF, is particularly informative because it contains multiple co-regulated genes and enhancers in close proximity to one another. The unique landscape at Nanog permits the study of ongoing questions including whether multiple enhancers function additively versus synergistically, determinants of gene specificity, and cell-to-cell variability in gene expression.
Collapse
Affiliation(s)
- Steven Blinka
- Department of Cell Biology, Neurobiology, and Anatomy, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Blood Research Institute, Blood Center of Wisconsin, 8733 West Watertown Plank Road, Milwaukee, WI 53226, USA
| | - Sridhar Rao
- Department of Cell Biology, Neurobiology, and Anatomy, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Blood Research Institute, Blood Center of Wisconsin, 8733 West Watertown Plank Road, Milwaukee, WI 53226, USA
- Department of Pediatrics, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| |
Collapse
|
162
|
Barr KA, Martinez C, Moran JR, Kim AR, Ramos AF, Reinitz J. Synthetic enhancer design by in silico compensatory evolution reveals flexibility and constraint in cis-regulation. BMC SYSTEMS BIOLOGY 2017; 11:116. [PMID: 29187214 PMCID: PMC5708098 DOI: 10.1186/s12918-017-0485-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Accepted: 11/09/2017] [Indexed: 11/12/2022]
Abstract
BACKGROUND Models that incorporate specific chemical mechanisms have been successful in describing the activity of Drosophila developmental enhancers as a function of underlying transcription factor binding motifs. Despite this, the minimum set of mechanisms required to reconstruct an enhancer from its constituent parts is not known. Synthetic biology offers the potential to test the sufficiency of known mechanisms to describe the activity of enhancers, as well as to uncover constraints on the number, order, and spacing of motifs. RESULTS Using a functional model and in silico compensatory evolution, we generated putative synthetic even-skipped stripe 2 enhancers with varying degrees of similarity to the natural enhancer. These elements represent the evolutionary trajectories of the natural stripe 2 enhancer towards two synthetic enhancers designed ab initio. In the first trajectory, spatially regulated expression was maintained, even after more than a third of binding sites were lost. In the second, sequences with high similarity to the natural element did not drive expression, but a highly diverged sequence about half the length of the minimal stripe 2 enhancer drove ten times greater expression. Additionally, homotypic clusters of Zelda or Stat92E motifs, but not Bicoid, drove expression in developing embryos. CONCLUSIONS Here, we present a functional model of gene regulation to test the degree to which the known transcription factors and their interactions explain the activity of the Drosophila even-skipped stripe 2 enhancer. Initial success in the first trajectory showed that the gene regulation model explains much of the function of the stripe 2 enhancer. Cases where expression deviated from prediction indicates that undescribed factors likely act to modulate expression. We also showed that activation driven Bicoid and Hunchback is highly sensitive to spatial organization of binding motifs. In contrast, Zelda and Stat92E drive expression from simple homotypic clusters, suggesting that activation driven by these factors is less constrained. Collectively, the 40 sequences generated in this work provides a powerful training set for building future models of gene regulation.
Collapse
Affiliation(s)
- Kenneth A Barr
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Zoology 111, 1101 E 57th St, Chicago, 60637, Illinois, USA.
- Department of Ecology and Evolution, The University of Chicago, Chicago, 60637, Illinois, USA.
| | - Carlos Martinez
- Department Biochemistry and Molecular Genetics, Northwestern University, Chicago, 60611, Illinois, USA
| | - Jennifer R Moran
- Department Human Genetics, The University of Chicago, Chicago, 60637, Illinois, USA
- Institute for Genomics & Systems Biology, The University of Chicago, Chicago, 60637, Illinois, USA
| | - Ah-Ram Kim
- School of Life Science, Handong Global University, Pohang, 37554, Gyeongbuk, South Korea
| | - Alexandre F Ramos
- Departamento de Radiologia - Faculdade de Medicina, Universidade de São Paulo & Instituto do Câncer do Estado de São Paulo, São Paulo, SP CEP, 05403-911, Brazil
- Escola de Artes, Ciências e Humanidades & Núcleo de Estudos Interdisciplinares em Sistemas Complexos, Universidade de São Paulo, Av. Arlindo Béttio, São Paulo, 1000 CEP 03828-000, SP, Brazil
| | - John Reinitz
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Zoology 111, 1101 E 57th St, Chicago, 60637, Illinois, USA
- Department of Ecology and Evolution, The University of Chicago, Chicago, 60637, Illinois, USA
- Institute for Genomics & Systems Biology, The University of Chicago, Chicago, 60637, Illinois, USA
- Department Statistics, The University of Chicago, 5747 S. Ellis Avenue Jones 312, Chicago, 60637, IL, USA
| |
Collapse
|
163
|
Abstract
Animal development depends on not only the linear genome sequence that embeds millions of cis-regulatory elements, but also the three-dimensional (3D) chromatin architecture that orchestrates the interplay between cis-regulatory elements and their target genes. Compared to our knowledge of the cis-regulatory sequences, the understanding of the 3D genome organization in human and other eukaryotes is still limited. Recent advances in technologies to map the 3D genome architecture have greatly accelerated the pace of discovery. Here, we review emerging concepts of chromatin organization in mammalian cells, discuss the dynamics of chromatin conformation during development, and highlight important roles for chromatin organization in cancer and other human diseases.
Collapse
Affiliation(s)
- Miao Yu
- Ludwig Institute for Cancer Research, La Jolla, California 92093;
| | - Bing Ren
- Ludwig Institute for Cancer Research, La Jolla, California 92093;
- Center for Epigenomics, Department of Cellular and Molecular Medicine, and Institute of Genomic Medicine, and Moores Cancer Center, University of California at San Diego, La Jolla, California 92093
| |
Collapse
|
164
|
Abstract
What made us human? Gene expression changes clearly played a significant part in human evolution, but pinpointing the causal regulatory mutations is hard. Comparative genomics enabled the identification of human accelerated regions (HARs) and other human-specific genome sequences. The major challenge in the past decade has been to link diverged sequences to uniquely human biology. This review discusses approaches to this problem, progress made at the molecular level, and prospects for moving towards genetic causes for uniquely human biology.
Collapse
Affiliation(s)
- Lucía F Franchini
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular (INGEBI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Katherine S Pollard
- Gladstone Institutes, San Francisco, CA, 94158, USA. .,Department of Epidemiology & Biostatistics, Institute for Human Genetics, Institute for Computational Health Sciences, University of California, San Francisco, CA, 94158, USA.
| |
Collapse
|
165
|
Smith AF, Posakony JW, Rebeiz M. Automated tools for comparative sequence analysis of genic regions using the GenePalette application. Dev Biol 2017; 429:158-164. [PMID: 28673819 PMCID: PMC5623810 DOI: 10.1016/j.ydbio.2017.06.033] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2017] [Revised: 06/28/2017] [Accepted: 06/28/2017] [Indexed: 10/19/2022]
Abstract
Comparative sequence analysis methods, such as phylogenetic footprinting, represent one of the most effective ways to decode regulatory sequence functions based upon DNA sequence information alone. The laborious task of assembling orthologous sequences to perform these comparisons is a hurdle to these analyses, which is further aggravated by the relative paucity of tools for visualization of sequence comparisons in large genic regions. Here, we describe a second-generation implementation of the GenePalette DNA sequence analysis software to facilitate comparative studies of gene function and regulation. We have developed an automated module called OrthologGrabber (OG) that performs BLAT searches against the UC Santa Cruz genome database to identify and retrieve segments homologous to a region of interest. Upon acquisition, sequences are compared to identify high-confidence anchor-points, which are graphically displayed. The visualization of anchor-points alongside other DNA features, such as transcription factor binding sites, allows users to precisely examine whether a binding site of interest is conserved, even if the surrounding region exhibits poor sequence identity. This approach also aids in identifying orthologous segments of regulatory DNA, facilitating studies of regulatory sequence evolution. As with previous versions of the software, GenePalette 2.1 takes the form of a platform-independent, single-windowed interface that is simple to use.
Collapse
Affiliation(s)
- Andrew F Smith
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - James W Posakony
- Division of Biological Sciences/CDB, University of California San Diego, La Jolla, CA 92093, USA
| | - Mark Rebeiz
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA.
| |
Collapse
|
166
|
Simple Expression Domains Are Regulated by Discrete CRMs During Drosophila Oogenesis. G3-GENES GENOMES GENETICS 2017. [PMID: 28634244 PMCID: PMC5555475 DOI: 10.1534/g3.117.043810] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Eggshell patterning has been extensively studied in Drosophila melanogaster. However, the cis-regulatory modules (CRMs), which control spatiotemporal expression of these patterns, are vastly unexplored. The FlyLight collection contains >7000 intergenic and intronic DNA fragments that, if containing CRMs, can drive the transcription factor GAL4. We cross-listed the 84 genes known to be expressed during D. melanogaster oogenesis with the ∼1200 listed genes of the FlyLight collection, and found 22 common genes that are represented by 281 FlyLight fly lines. Of these lines, 54 show expression patterns during oogenesis when crossed to an UAS-GFP reporter. Of the 54 lines, 16 recapitulate the full or partial pattern of the associated gene pattern. Interestingly, while the average DNA fragment size is ∼3 kb in length, the vast majority of fragments show one type of spatiotemporal pattern in oogenesis. Mapping the distribution of all 54 lines, we found a significant enrichment of CRMs in the first intron of the associated genes’ model. In addition, we demonstrate the use of different anteriorly active FlyLight lines as tools to disrupt eggshell patterning in a targeted manner. Our screen provides further evidence that complex gene patterns are assembled combinatorially by different CRMs controlling the expression of genes in simple domains.
Collapse
|
167
|
Crocker J, Stern DL. Functional regulatory evolution outside of the minimal even-skipped stripe 2 enhancer. Development 2017; 144:3095-3101. [PMID: 28760812 DOI: 10.1242/dev.149427] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2017] [Accepted: 07/19/2017] [Indexed: 12/27/2022]
Abstract
Transcriptional enhancers are regions of DNA that drive precise patterns of gene expression. Although many studies have elucidated how individual enhancers can evolve, most of this work has focused on what are called 'minimal' enhancers, the smallest DNA regions that drive expression that approximates an aspect of native gene expression. Here, we explore how the Drosophila erecta even-skipped (eve) locus has evolved by testing its activity in the divergent D. melanogaster genome. We found, as has been reported previously, that the D. erecta eve stripe 2 enhancer (eveS2) fails to drive appreciable expression in D. melanogaster However, we found that a large transgene carrying the entire D. erecta eve locus drives normal eve expression, including in stripe 2. We performed a functional dissection of the region upstream of the D. erecta eveS2 region and found multiple Zelda motifs that are required for normal expression. Our results illustrate how sequences outside of minimal enhancer regions can evolve functionally through mechanisms other than changes in transcription factor-binding sites that drive patterning.
Collapse
Affiliation(s)
- Justin Crocker
- Janelia Research Campus, Howard Hughes Medical Institute, 19700 Helix Drive, Ashburn, VA 20147, USA
| | - David L Stern
- Janelia Research Campus, Howard Hughes Medical Institute, 19700 Helix Drive, Ashburn, VA 20147, USA
| |
Collapse
|
168
|
Colbran LL, Chen L, Capra JA. Short DNA sequence patterns accurately identify broadly active human enhancers. BMC Genomics 2017; 18:536. [PMID: 28716036 PMCID: PMC5512948 DOI: 10.1186/s12864-017-3934-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Accepted: 07/09/2017] [Indexed: 12/25/2022] Open
Abstract
Background Enhancers are DNA regulatory elements that influence gene expression. There is substantial diversity in enhancers’ activity patterns: some enhancers drive expression in a single cellular context, while others are active across many. Sequence characteristics, such as transcription factor (TF) binding motifs, influence the activity patterns of regulatory sequences; however, the regulatory logic through which specific sequences drive enhancer activity patterns is poorly understood. Recent analysis of Drosophila enhancers suggested that short dinucleotide repeat motifs (DRMs) are general enhancer sequence features that drive broad regulatory activity. However, it is not known whether the regulatory role of DRMs is conserved across species. Results We performed a comprehensive analysis of the relationship between short DNA sequence patterns, including DRMs, and human enhancer activity in 38,538 enhancers across 411 different contexts. In a machine-learning framework, the occurrence patterns of short sequence motifs accurately predicted broadly active human enhancers. However, DRMs alone were weakly predictive of broad enhancer activity in humans and showed different enrichment patterns than in Drosophila. In general, GC-rich sequence motifs were significantly associated with broad enhancer activity, and consistent with this enrichment, broadly active human TFs recognize GC-rich motifs. Conclusions Our results reveal the importance of specific sequence motifs in broadly active human enhancers, demonstrate the lack of evolutionary conservation of the role of DRMs, and provide a computational framework for investigating the logic of enhancer sequences. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3934-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Laura L Colbran
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA
| | - Ling Chen
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA
| | - John A Capra
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA. .,Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA. .,Center for Structural Biology, Departments of Biomedical Informatics and Computer Science, Vanderbilt University, Nashville, TN, 37235, USA.
| |
Collapse
|
169
|
Frank TD, Kiyatkin A, Cheong A, Kholodenko BN. Three-factor models versus time series models: quantifying time-dependencies of interactions between stimuli in cell biology and psychobiology for short longitudinal data. MATHEMATICAL MEDICINE AND BIOLOGY-A JOURNAL OF THE IMA 2017; 34:177-191. [PMID: 27079221 DOI: 10.1093/imammb/dqw001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/21/2013] [Accepted: 01/04/2016] [Indexed: 11/14/2022]
Abstract
Signal integration determines cell fate on the cellular level, affects cognitive processes and affective responses on the behavioural level, and is likely to be involved in psychoneurobiological processes underlying mood disorders. Interactions between stimuli may subjected to time effects. Time-dependencies of interactions between stimuli typically lead to complex cell responses and complex responses on the behavioural level. We show that both three-factor models and time series models can be used to uncover such time-dependencies. However, we argue that for short longitudinal data the three factor modelling approach is more suitable. In order to illustrate both approaches, we re-analysed previously published short longitudinal data sets. We found that in human embryonic kidney 293 cells cells the interaction effect in the regulation of extracellular signal-regulated kinase (ERK) 1 signalling activation by insulin and epidermal growth factor is subjected to a time effect and dramatically decays at peak values of ERK activation. In contrast, we found that the interaction effect induced by hypoxia and tumour necrosis factor-alpha for the transcriptional activity of the human cyclo-oxygenase-2 promoter in HEK293 cells is time invariant at least in the first 12-h time window after stimulation. Furthermore, we applied the three-factor model to previously reported animal studies. In these studies, memory storage was found to be subjected to an interaction effect of the beta-adrenoceptor agonist clenbuterol and certain antagonists acting on the alpha-1-adrenoceptor / glucocorticoid-receptor system. Our model-based analysis suggests that only if the antagonist drug is administer in a critical time window, then the interaction effect is relevant.
Collapse
Affiliation(s)
- Till D Frank
- Department of Psychology, University of Connecticut, Storrs, CT 06269, USA
| | - Anatoly Kiyatkin
- Department of Pathology, Thomas Jefferson University, Philadelphia, PA 19107, USA
| | - Alex Cheong
- Systems Biology Ireland, University College Dublin, Belfield, Dublin 4, Ireland
| | - Boris N Kholodenko
- Systems Biology Ireland, University College Dublin, Belfield, Dublin 4, Ireland
| |
Collapse
|
170
|
Liu F. Enhancer-derived RNA: A Primer. GENOMICS PROTEOMICS & BIOINFORMATICS 2017; 15:196-200. [PMID: 28533025 PMCID: PMC5487531 DOI: 10.1016/j.gpb.2016.12.006] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/03/2016] [Revised: 12/16/2016] [Accepted: 12/26/2016] [Indexed: 12/16/2022]
Abstract
Enhancer-derived RNAs (eRNAs) are a group of RNAs transcribed by RNA polymerase II from the domain of transcription enhancers, a major type of cis-regulatory elements in the genome. The correlation between eRNA production and enhancer activity has stimulated studies on the potential role of eRNAs in transcriptional regulation. Additionally, eRNA has also served as a marker for global identification of enhancers. Here I review the brief history and fascinating properties of eRNAs.
Collapse
Affiliation(s)
- Feng Liu
- National Research Center for Translational Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China.
| |
Collapse
|
171
|
Rebeiz M, Tsiantis M. Enhancer evolution and the origins of morphological novelty. Curr Opin Genet Dev 2017; 45:115-123. [PMID: 28527813 DOI: 10.1016/j.gde.2017.04.006] [Citation(s) in RCA: 71] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2016] [Revised: 03/25/2017] [Accepted: 04/27/2017] [Indexed: 01/07/2023]
Abstract
A central goal of evolutionary biology is to understand the genetic origin of morphological novelties-i.e. anatomical structures unique to a taxonomic group. Elaboration of morphology during development depends on networks of regulatory genes that activate patterned gene expression through transcriptional enhancer regions. We summarize recent case studies and genome-wide investigations that have uncovered diverse mechanisms though which new enhancers arise. We also discuss how these enhancer-originating mechanisms have clarified the history of genetic networks underlying diversification of genital structures in flies, limbs and neural crest in chordates, and plant leaves. These studies have identified enhancers that were pivotal for morphological divergence and highlighted how novel genetic networks shaping form emerged from pre-existing ones.
Collapse
Affiliation(s)
- Mark Rebeiz
- Department of Biological Sciences, University of Pittsburgh, 4249 Fifth Avenue, Pittsburgh, PA 15215, USA.
| | - Miltos Tsiantis
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Carl-von-Linne-Weg 10, 50829 Köln, Germany.
| |
Collapse
|
172
|
Phan AT, Goldrath AW, Glass CK. Metabolic and Epigenetic Coordination of T Cell and Macrophage Immunity. Immunity 2017; 46:714-729. [PMID: 28514673 PMCID: PMC5505665 DOI: 10.1016/j.immuni.2017.04.016] [Citation(s) in RCA: 218] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Revised: 04/25/2017] [Accepted: 04/26/2017] [Indexed: 02/08/2023]
Abstract
Recognition of pathogens by innate and adaptive immune cells instructs rapid alterations of cellular processes to promote effective resolution of infection. To accommodate increased bioenergetic and biosynthetic demands, metabolic pathways are harnessed to maximize proliferation and effector molecule production. In parallel, activation initiates context-specific gene-expression programs that drive effector functions and cell fates that correlate with changes in epigenetic landscapes. Many chromatin- and DNA-modifying enzymes make use of substrates and cofactors that are intermediates of metabolic pathways, providing potential cross talk between metabolism and epigenetic regulation of gene expression. In this review, we discuss recent studies of T cells and macrophages supporting a role for metabolic activity in integrating environmental signals with activation-induced gene-expression programs through modulation of the epigenome and speculate as to how this may influence context-specific macrophage and T cell responses to infection.
Collapse
Affiliation(s)
- Anthony T Phan
- Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Ananda W Goldrath
- Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA.
| | - Christopher K Glass
- Department of Cellular and Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
173
|
Abstract
The first animals evolved from an unknown single-celled ancestor in the Precambrian period. Recently, the identification and characterization of the genomic and cellular traits of the protists most closely related to animals have shed light on the origin of animals. Comparisons of animals with these unicellular relatives allow us to reconstruct the first evolutionary steps towards animal multicellularity. Here, we review the results of these investigations and discuss their implications for understanding the earliest stages of animal evolution, including the origin of metazoan genes and genome function.
Collapse
|
174
|
Mll3 and Mll4 Facilitate Enhancer RNA Synthesis and Transcription from Promoters Independently of H3K4 Monomethylation. Mol Cell 2017; 66:568-576.e4. [PMID: 28483418 DOI: 10.1016/j.molcel.2017.04.018] [Citation(s) in RCA: 269] [Impact Index Per Article: 33.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Revised: 03/30/2017] [Accepted: 04/25/2017] [Indexed: 01/24/2023]
Abstract
Monomethylation of histone H3 at lysine 4 (H3K4me1) and acetylation of histone H3 at lysine 27 (H3K27ac) are correlated with transcriptionally engaged enhancer elements, but the functional impact of these modifications on enhancer activity is not well understood. Here we used CRISPR/Cas9 genome editing to separate catalytic activity-dependent and independent functions of Mll3 (Kmt2c) and Mll4 (Kmt2d, Mll2), the major enhancer H3K4 monomethyltransferases. Loss of H3K4me1 from enhancers in Mll3/4 catalytically deficient cells causes partial reduction of H3K27ac, but has surprisingly minor effects on transcription from either enhancers or promoters. In contrast, loss of Mll3/4 proteins leads to strong depletion of enhancer Pol II occupancy and eRNA synthesis, concomitant with downregulation of target genes. Interestingly, downregulated genes exhibit reduced polymerase levels in gene bodies, but not at promoters, suggestive of pause-release defects. Altogether, our results suggest that enhancer H3K4me1 provides only a minor contribution to the long-range coactivator function of Mll3/4.
Collapse
|
175
|
Holloway DM, Spirov AV. Transcriptional bursting in Drosophila development: Stochastic dynamics of eve stripe 2 expression. PLoS One 2017; 12:e0176228. [PMID: 28437444 PMCID: PMC5402966 DOI: 10.1371/journal.pone.0176228] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 04/08/2017] [Indexed: 01/17/2023] Open
Abstract
Anterior-posterior (AP) body segmentation of the fruit fly (Drosophila) is first seen in the 7-stripe spatial expression patterns of the pair-rule genes, which regulate downstream genes determining specific segment identities. Regulation of pair-rule expression has been extensively studied for the even-skipped (eve) gene. Recent live imaging, of a reporter for the 2ndeve stripe, has demonstrated the stochastic nature of this process, with ‘bursts’ in the number of RNA transcripts being made over time. We developed a stochastic model of the spatial and temporal expression of eve stripe 2 (binding by transcriptional activators (Bicoid and Hunchback proteins) and repressors (Giant and Krüppel proteins), transcriptional initiation and termination; with all rate parameters constrained by features of the experimental data) in order to analyze the noisy experimental time series and test hypotheses for how eve transcription is regulated. These include whether eve transcription is simply OFF or ON, with a single ON rate, or whether it proceeds by a more complex mechanism, with multiple ON rates. We find that both mechanisms can produce long (multi-minute) RNA bursts, but that the short-time (minute-to-minute) statistics of the data is indicative of eve being transcribed with at least two distinct ON rates, consistent with data on the joint activation of eve by Bicoid and Hunchback. We also predict distinct statistical signatures for cases in which eve is repressed (e.g. along the edges of the stripe) vs. cases in which activation is reduced (e.g. by mutagenesis of transcription factor binding sites). Fundamental developmental processes such as gene transcription are intrinsically noisy; our approach presents a new way to quantify and analyze time series data during developmental patterning in order to understand regulatory mechanisms and how they propagate noise and impact embryonic robustness.
Collapse
Affiliation(s)
- David M. Holloway
- Mathematics Department, British Columbia Institute of Technology, Burnaby, B.C., Canada
- Biology Department, University of Victoria, Victoria, B.C., Canada
- * E-mail:
| | - Alexander V. Spirov
- Computer Science, and Center of Excellence in Wireless and Information Technology, State University of New York, Stony Brook, New York, United States of America
- Sechenov Institute of Evolutionary Physiology and Biochemistry, St. Petersburg, Russia
| |
Collapse
|
176
|
Gaiti F, Jindrich K, Fernandez-Valverde SL, Roper KE, Degnan BM, Tanurdžić M. Landscape of histone modifications in a sponge reveals the origin of animal cis-regulatory complexity. eLife 2017; 6:22194. [PMID: 28395144 PMCID: PMC5429095 DOI: 10.7554/elife.22194] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2016] [Accepted: 03/27/2017] [Indexed: 01/24/2023] Open
Abstract
Combinatorial patterns of histone modifications regulate developmental and cell type-specific gene expression and underpin animal complexity, but it is unclear when this regulatory system evolved. By analysing histone modifications in a morphologically-simple, early branching animal, the sponge Amphimedonqueenslandica, we show that the regulatory landscape used by complex bilaterians was already in place at the dawn of animal multicellularity. This includes distal enhancers, repressive chromatin and transcriptional units marked by H3K4me3 that vary with levels of developmental regulation. Strikingly, Amphimedon enhancers are enriched in metazoan-specific microsyntenic units, suggesting that their genomic location is extremely ancient and likely to place constraints on the evolution of surrounding genes. These results suggest that the regulatory foundation for spatiotemporal gene expression evolved prior to the divergence of sponges and eumetazoans, and was necessary for the evolution of animal multicellularity.
Collapse
Affiliation(s)
- Federico Gaiti
- School of Biological Sciences, University of Queensland, Brisbane, Australia
| | - Katia Jindrich
- School of Biological Sciences, University of Queensland, Brisbane, Australia
| | | | - Kathrein E Roper
- School of Biological Sciences, University of Queensland, Brisbane, Australia
| | - Bernard M Degnan
- School of Biological Sciences, University of Queensland, Brisbane, Australia
| | - Miloš Tanurdžić
- School of Biological Sciences, University of Queensland, Brisbane, Australia
| |
Collapse
|
177
|
Charney RM, Paraiso KD, Blitz IL, Cho KWY. A gene regulatory program controlling early Xenopus mesendoderm formation: Network conservation and motifs. Semin Cell Dev Biol 2017; 66:12-24. [PMID: 28341363 DOI: 10.1016/j.semcdb.2017.03.003] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2017] [Revised: 03/12/2017] [Accepted: 03/20/2017] [Indexed: 02/08/2023]
Abstract
Germ layer formation is among the earliest differentiation events in metazoan embryos. In triploblasts, three germ layers are formed, among which the endoderm gives rise to the epithelial lining of the gut tube and associated organs including the liver, pancreas and lungs. In frogs (Xenopus), where early germ layer formation has been studied extensively, the process of endoderm specification involves the interplay of dozens of transcription factors. Here, we review the interactions between these factors, summarized in a transcriptional gene regulatory network (GRN). We highlight regulatory connections conserved between frog, fish, mouse, and human endodermal lineages. Especially prominent is the conserved role and regulatory targets of the Nodal signaling pathway and the T-box transcription factors, Vegt and Eomes. Additionally, we highlight network topologies and motifs, and speculate on their possible roles in development.
Collapse
Affiliation(s)
- Rebekah M Charney
- Department of Developmental and Cell Biology, Ayala School of Biological Sciences, University of California, Irvine, CA 92697, USA
| | - Kitt D Paraiso
- Department of Developmental and Cell Biology, Ayala School of Biological Sciences, University of California, Irvine, CA 92697, USA
| | - Ira L Blitz
- Department of Developmental and Cell Biology, Ayala School of Biological Sciences, University of California, Irvine, CA 92697, USA
| | - Ken W Y Cho
- Department of Developmental and Cell Biology, Ayala School of Biological Sciences, University of California, Irvine, CA 92697, USA.
| |
Collapse
|
178
|
Maricque BB, Dougherty JD, Cohen BA. A genome-integrated massively parallel reporter assay reveals DNA sequence determinants of cis-regulatory activity in neural cells. Nucleic Acids Res 2017; 45:e16. [PMID: 28204611 PMCID: PMC5389540 DOI: 10.1093/nar/gkw942] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2016] [Revised: 10/05/2016] [Accepted: 10/11/2016] [Indexed: 11/12/2022] Open
Abstract
Recent large-scale genomics efforts to characterize the cis-regulatory sequences that orchestrate genome-wide expression patterns have produced impressive catalogues of putative regulatory elements. Most of these sequences have not been functionally tested, and our limited understanding of the non-coding genome prevents us from predicting which sequences are bona fide cis-regulatory elements. Recently, massively parallel reporter assays (MPRAs) have been deployed to measure the activity of putative cis-regulatory sequences in several biological contexts, each with specific advantages and distinct limitations. We developed LV-MPRA, a novel lentiviral-based, massively parallel reporter gene assay, to study the function of genome-integrated regulatory elements in any mammalian cell type; thus, making it possible to apply MPRAs in more biologically relevant contexts. We measured the activity of 2,600 sequences in U87 glioblastoma cells and human neural progenitor cells (hNPCs) and explored how regulatory activity is encoded in DNA sequence. We demonstrate that LV-MPRA can be applied to estimate the effects of local DNA sequence and regional chromatin on regulatory activity. Our data reveal that primary DNA sequence features, such as GC content and dinucleotide composition, accurately distinguish sequences with high activity from sequences with low activity in a full chromosomal context, and may also function in combination with different transcription factor binding sites to determine cell type specificity. We conclude that LV-MPRA will be an important tool for identifying cis-regulatory elements and stimulating new understanding about how the non-coding genome encodes information.
Collapse
Affiliation(s)
- Brett B. Maricque
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63108, USA
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63108, USA
| | - Joseph D. Dougherty
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63108, USA
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO 63108, USA
| | - Barak A. Cohen
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63108, USA
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63108, USA
| |
Collapse
|
179
|
Rainbow Enhancers Regulate Restrictive Transcription in Teleost Green, Red, and Blue Cones. J Neurosci 2017; 37:2834-2848. [PMID: 28193687 DOI: 10.1523/jneurosci.3421-16.2017] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2016] [Revised: 12/31/2016] [Accepted: 01/27/2017] [Indexed: 01/24/2023] Open
Abstract
Photoreceptor-specific transcription of individual genes collectively constitutes the transcriptional profile that orchestrates the structural and functional characteristics of each photoreceptor type. It is challenging, however, to study the transcriptional specificity of individual photoreceptor genes because each gene's distinct spatiotemporal transcription patterns are determined by the unique interactions between a specific set of transcription factors and the gene's own cis-regulatory elements (CREs), which remain unknown for most of the genes. For example, it is unknown what CREs underlie the zebrafish mpp5bponli (ponli) and crumbs2b (crb2b) apical polarity genes' restrictive transcription in the red, green, and blue (RGB) cones in the retina, but not in other retinal cell types. Here we show that the intronic enhancers of both the ponli and crb2b genes are conserved among teleost species and that they share sequence motifs that are critical for RGB cone-specific transcription. Given their similarities in sequences and functions, we name the ponli and crb2b enhancers collectively rainbow enhancers. Rainbow enhancers may represent a cis-regulatory mechanism to turn on a group of genes that are commonly and restrictively expressed in RGB cones, which largely define the beginning of the color vision pathway.SIGNIFICANCE STATEMENT Dim-light achromatic vision and bright-light color vision are initiated in rod and several types of cone photoreceptors, respectively; these photoreceptors are structurally distinct from each other. In zebrafish, although quite different from rods and UV cones, RGB cones (red, green, and blue cones) are structurally similar and unite into mirror-symmetric pentamers (G-R-B-R-G) by adhesion. This structural commonality and unity suggest that a set of genes is commonly expressed only in RGB cones but not in other cells. Here, we report that the rainbow enhancers activate RGB cone-specific transcription of the ponli and crb2b genes. This study provides a starting point to study how RGB cone-specific transcription defines RGB cones' distinct functions for color vision.
Collapse
|
180
|
Tschopp P, Tabin CJ. Deep homology in the age of next-generation sequencing. Philos Trans R Soc Lond B Biol Sci 2017; 372:20150475. [PMID: 27994118 PMCID: PMC5182409 DOI: 10.1098/rstb.2015.0475] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/08/2016] [Indexed: 12/14/2022] Open
Abstract
The principle of homology is central to conceptualizing the comparative aspects of morphological evolution. The distinctions between homologous or non-homologous structures have become blurred, however, as modern evolutionary developmental biology (evo-devo) has shown that novel features often result from modification of pre-existing developmental modules, rather than arising completely de novo. With this realization in mind, the term 'deep homology' was coined, in recognition of the remarkably conserved gene expression during the development of certain animal structures that would not be considered homologous by previous strict definitions. At its core, it can help to formulate an understanding of deeper layers of ontogenetic conservation for anatomical features that lack any clear phylogenetic continuity. Here, we review deep homology and related concepts in the context of a gene expression-based homology discussion. We then focus on how these conceptual frameworks have profited from the recent rise of high-throughput next-generation sequencing. These techniques have greatly expanded the range of organisms amenable to such studies. Moreover, they helped to elevate the traditional gene-by-gene comparison to a transcriptome-wide level. We will end with an outlook on the next challenges in the field and how technological advances might provide exciting new strategies to tackle these questions.This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'.
Collapse
Affiliation(s)
- Patrick Tschopp
- Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, USA
| | - Clifford J Tabin
- Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, USA
| |
Collapse
|
181
|
Abstract
The leap from simple unicellularity to complex multicellularity remains one of life's major enigmas. The origins of metazoan developmental gene regulatory mechanisms are sought by analyzing gene regulation in extant eumetazoans, sponges, and unicellular organisms. The main hypothesis of this manuscript is that, developmental enhancers evolved from unicellular inducible promoters that diversified the expression of regulatory genes during metazoan evolution. Promoters and enhancers are functionally similar; both can regulate the transcription of distal promoters and both direct local transcription. Additionally, enhancers have experimentally characterized structural features that reveal their origin from inducible promoters. The distal co-operative regulation among promoters identified in unicellular opisthokonts possibly represents the precursor of distal regulation of promoters by enhancers. During metazoan evolution, constitutive-type promoters of regulatory genes would have acquired novel receptivity to distal regulatory inputs from promoters of inducible genes that eventually specialized as enhancers. The novel regulatory interactions would have caused constitutively expressed genes controlling differential gene expression in unicellular organisms to become themselves differentially expressed. The consequence of the novel regulatory interactions was that regulatory pathways of unicellular organisms became interlaced and ultimately evolved into the intricate developmental gene regulatory networks (GRNs) of extant metazoans.
Collapse
Affiliation(s)
- César Arenas-Mena
- Department of Biology, College of Staten Island and Graduate Center, The City University of New York (CUNY), Staten Island, NY 10314, USA
| |
Collapse
|
182
|
Shin DH, Hong JW. Transcriptional activity of the short gastrulation primary enhancer in the ventral midline requires its early activity in the presumptive neurogenic ectoderm. BMB Rep 2017; 49:572-577. [PMID: 27616358 PMCID: PMC5227300 DOI: 10.5483/bmbrep.2016.49.10.119] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Indexed: 11/22/2022] Open
Abstract
The short gastrulation (sog) shadow enhancer directs early and late sog expression in the neurogenic ectoderm and the ventral midline of the developing Drosophila embryo, respectively. Here, evidence is presented that the sog primary enhancer also has both activities, with the late enhancer activity dependent on the early activity. Computational analyses showed that the sog primary enhancer contains five Dorsal (Dl)-, four Zelda (Zld)-, three Bicoid (Bcd)-, and no Single-minded (Sim)-binding sites. In contrast to many ventral midline enhancers, the primary enhancer can direct lacZ expression in the ventral midline as well as in the neurogenic ectoderm without a canonical Simbinding site. Intriguingly, the impaired transcriptional synergy between Dl and either Zld or Bcd led to aberrant and abolished lacZ expression in the neurogenic ectoderm and in the ventral midline, respectively. These findings suggest that the two enhancer activities of the sog primary enhancer are functionally consolidated and geographically inseparable. [BMB Reports 2016; 49(10): 572-577]
Collapse
Affiliation(s)
- Dong-Hyeon Shin
- Graduate School of East-West Medical Science, Kyung Hee University, Yongin 17104, Korea
| | - Joung-Woo Hong
- Graduate School of East-West Medical Science, Kyung Hee University, Yongin 17104, Korea
| |
Collapse
|
183
|
Abstract
Ras-associated protein-1 (Rap1), a small GTPase in the Ras-related protein family, is an important regulator of basic cellular functions (e.g., formation and control of cell adhesions and junctions), cellular migration, and polarization. Through its interaction with other proteins, Rap1 plays many roles during cell invasion and metastasis in different cancers. The basic function of Rap1 is straightforward; it acts as a switch during cellular signaling transduction and regulated by its binding to either guanosine triphosphate (GTP) or guanosine diphosphate (GDP). However, its remarkably diverse function is rendered by its interplay with a large number of distinct Rap guanine nucleotide exchange factors and Rap GTPase activating proteins. This review summarizes the mechanisms by which Rap1 signaling can regulate cell invasion and metastasis, focusing on its roles in integrin and cadherin regulation, Rho GTPase control, and matrix metalloproteinase expression.
Collapse
Affiliation(s)
- Yi-Lei Zhang
- Key Laboratory of Molecular Biophysics of Ministry of Education, School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Ruo-Chen Wang
- Key Laboratory of Molecular Biophysics of Ministry of Education, School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Ken Cheng
- Sun Yat-sen University, Guangzhou 510275, China
| | - Brian Z Ring
- Key Laboratory of Molecular Biophysics of Ministry of Education, School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Li Su
- Key Laboratory of Molecular Biophysics of Ministry of Education, School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.,Research Institute of Huazhong University of Science and Technology in Shenzhen, Shenzhen 518063, China
| |
Collapse
|
184
|
He W, Jia C. EnhancerPred2.0: predicting enhancers and their strength based on position-specific trinucleotide propensity and electron–ion interaction potential feature selection. MOLECULAR BIOSYSTEMS 2017; 13:767-774. [DOI: 10.1039/c7mb00054e] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Enhancers arecis-acting elements that play major roles in upregulating eukaryotic gene expression by providing binding sites for transcription factors and their complexes.
Collapse
Affiliation(s)
- Wenying He
- Department of Mathematics
- Dalian Maritime University
- Dalian 116026
- China
| | - Cangzhi Jia
- Department of Mathematics
- Dalian Maritime University
- Dalian 116026
- China
| |
Collapse
|
185
|
Abstract
Animals have modular cis-regulatory regions in their genomes, and expression of a single gene is often regulated by multiple enhancers residing in such a region. In the laboratory, and also in natural populations, loss of an enhancer can result in a loss of gene expression. Although only a few examples have been well characterized to date, some studies have suggested that an evolutionary gain of a new enhancer function can establish a new gene expression domain. Our recent study showed that Drosophila guttifera has more enhancers and additional expression domains of the wingless gene during the pupal stage, compared to D. melanogaster, and that these new features appear to have evolved in the ancestral lineage leading to D. guttifera. (1) Gain of a new expression domain of a developmental regulatory gene (toolkit gene), such as wingless, can cause co-option of the expression of its downstream genes to the new domain, resulting in duplication of a preexisting structure at this new body position. Recently, with the advancement of evo-devo studies, we have learned that the developmental regulatory systems are strikingly similar across various animal taxa, in spite of the great diversity of the animals' morphology. Even behind "new" traits, co-options of essential developmental genes from known systems are very common. We previously provided concrete evidence of gains of enhancer activities of a developmental regulatory gene underlying gains of new traits. (1) Broad occurrence of this scenario is testable and should be validated in the future.
Collapse
Affiliation(s)
- Shigeyuki Koshikawa
- a The Hakubi Center for Advanced Research and Graduate School of Science; Kyoto University; Kitashirakawa-Oiwake-Cho ; Sakyo-Ku , Kyoto 606-8 502 , Japan
| |
Collapse
|
186
|
Jia C, He W. EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features. Sci Rep 2016; 6:38741. [PMID: 27941893 PMCID: PMC5150536 DOI: 10.1038/srep38741] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Accepted: 11/11/2016] [Indexed: 12/31/2022] Open
Abstract
Enhancers are cis elements that play an important role in regulating gene expression by enhancing it. Recent study of modifications revealed that enhancers are a large group of functional elements with many different subgroups, which have different biological activities and regulatory effects on target genes. As powerful auxiliary tools, several computational methods have been proposed to distinguish enhancers from other regulatory elements, but only one method has been considered to clustering them into subgroups. In this study, we developed a predictor (called EnhancerPred) to distinguish between enhancers and nonenhancers and to determine enhancers' strength. A two-step wrapper-based feature selection method was applied in high dimension feature vector from bi-profile Bayes and pseudo-nucleotide composition. Finally, the combination of 104 features from bi-profile Bayes, 1 feature from nucleotide composition and 9 features from pseudo-nucleotide composition yielded the best performance for identifying enhancers and nonenhancers, with overall Acc of 77.39%. The combination of 89 features from bi-profile Bayes and 10 features from pseudo-nucleotide composition yielded the best performance for identifying strong and weak enhancers, with overall Acc of 68.19%. The process and steps of feature optimization illustrated that it is necessary to construct a particular model for identifying strong enhancers and weak enhancers.
Collapse
Affiliation(s)
- Cangzhi Jia
- Department of Mathematics, Dalian Maritime University, No. 1 Linghai Road, Dalian 116026, China
| | - Wenying He
- Department of Mathematics, Dalian Maritime University, No. 1 Linghai Road, Dalian 116026, China
| |
Collapse
|
187
|
Hajdu M, Calle J, Puno A, Haruna A, Arenas-Mena C. Transcriptional and post-transcriptional regulation of histone variantH2A.Zduring sea urchin development. Dev Growth Differ 2016; 58:727-740. [DOI: 10.1111/dgd.12329] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2016] [Revised: 10/06/2016] [Accepted: 10/27/2016] [Indexed: 01/04/2023]
Affiliation(s)
- Mihai Hajdu
- Department of Biology; College of Staten Island and Graduate Center; The City University of New York (CUNY); Staten Island New York 10314 USA
| | - Jasmine Calle
- Department of Biology; College of Staten Island and Graduate Center; The City University of New York (CUNY); Staten Island New York 10314 USA
| | - Andrea Puno
- Department of Biology; College of Staten Island and Graduate Center; The City University of New York (CUNY); Staten Island New York 10314 USA
| | - Aminat Haruna
- Department of Biology; College of Staten Island and Graduate Center; The City University of New York (CUNY); Staten Island New York 10314 USA
| | - César Arenas-Mena
- Department of Biology; College of Staten Island and Graduate Center; The City University of New York (CUNY); Staten Island New York 10314 USA
| |
Collapse
|
188
|
Georgomanolis T, Sofiadis K, Papantonis A. Cutting a Long Intron Short: Recursive Splicing and Its Implications. Front Physiol 2016; 7:598. [PMID: 27965595 PMCID: PMC5126111 DOI: 10.3389/fphys.2016.00598] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Accepted: 11/16/2016] [Indexed: 11/13/2022] Open
Abstract
Over time eukaryotic genomes have evolved to host genes carrying multiple exons separated by increasingly larger intronic, mostly non-protein-coding, sequences. Initially, little attention was paid to these intronic sequences, as they were considered not to contain regulatory information. However, advances in molecular biology, sequencing, and computational tools uncovered that numerous segments within these genomic elements do contribute to the regulation of gene expression. Introns are differentially removed in a cell type-specific manner to produce a range of alternatively-spliced transcripts, and many span tens to hundreds of kilobases. Recent work in human and fruitfly tissues revealed that long introns are extensively processed cotranscriptionally and in a stepwise manner, before their two flanking exons are spliced together. This process, called "recursive splicing," often involves non-canonical splicing elements positioned deep within introns, and different mechanisms for its deployment have been proposed. Still, the very existence and widespread nature of recursive splicing offers a new regulatory layer in the transcript maturation pathway, which may also have implications in human disease.
Collapse
Affiliation(s)
- Theodore Georgomanolis
- Chromatin Systems Biology Laboratory, Center for Molecular Medicine, University of Cologne Cologne, Germany
| | - Konstantinos Sofiadis
- Chromatin Systems Biology Laboratory, Center for Molecular Medicine, University of Cologne Cologne, Germany
| | - Argyris Papantonis
- Chromatin Systems Biology Laboratory, Center for Molecular Medicine, University of Cologne Cologne, Germany
| |
Collapse
|
189
|
Pan-cancer analysis of somatic copy-number alterations implicates IRS4 and IGF2 in enhancer hijacking. Nat Genet 2016; 49:65-74. [PMID: 27869826 DOI: 10.1038/ng.3722] [Citation(s) in RCA: 289] [Impact Index Per Article: 32.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 10/19/2016] [Indexed: 02/06/2023]
Abstract
Extensive prior research focused on somatic copy-number alterations (SCNAs) affecting cancer genes, yet the extent to which recurrent SCNAs exert their influence through rearrangement of cis-regulatory elements (CREs) remains unclear. Here we present a framework for inferring cancer-related gene overexpression resulting from CRE reorganization (e.g., enhancer hijacking) by integrating SCNAs, gene expression data and information on topologically associating domains (TADs). Analysis of 7,416 cancer genomes uncovered several pan-cancer candidate genes, including IRS4, SMARCA1 and TERT. We demonstrate that IRS4 overexpression in lung cancer is associated with recurrent deletions in cis, and we present evidence supporting a tumor-promoting role. We additionally pursued cancer-type-specific analyses and uncovered IGF2 as a target for enhancer hijacking in colorectal cancer. Recurrent tandem duplications intersecting with a TAD boundary mediate de novo formation of a 3D contact domain comprising IGF2 and a lineage-specific super-enhancer, resulting in high-level gene activation. Our framework enables systematic inference of CRE rearrangements mediating dysregulation in cancer.
Collapse
|
190
|
Gaiti F, Calcino AD, Tanurdžić M, Degnan BM. Origin and evolution of the metazoan non-coding regulatory genome. Dev Biol 2016; 427:193-202. [PMID: 27880868 DOI: 10.1016/j.ydbio.2016.11.013] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2016] [Revised: 11/14/2016] [Accepted: 11/18/2016] [Indexed: 02/09/2023]
Abstract
Animals rely on genomic regulatory systems to direct the dynamic spatiotemporal and cell-type specific gene expression that is essential for the development and maintenance of a multicellular lifestyle. Although it is widely appreciated that these systems ultimately evolved from genomic regulatory mechanisms present in single-celled stem metazoans, it remains unclear how this occurred. Here, we focus on the contribution of the non-coding portion of the genome to the evolution of animal gene regulation, specifically on recent insights from non-bilaterian metazoan lineages, and unicellular and colonial holozoan sister taxa. High-throughput next-generation sequencing, largely in bilaterian model species, has led to the discovery of tens of thousands of non-coding RNA genes (ncRNAs), including short, long and circular forms, and uncovered the central roles they play in development. Based on the analysis of non-bilaterian metazoan, unicellular holozoan and fungal genomes, the evolution of some ncRNAs, such as Piwi-interacting RNAs, correlates with the emergence of metazoan multicellularity, while others, including microRNAs, long non-coding RNAs and circular RNAs, appear to be more ancient. Analysis of non-coding regulatory DNA and histone post-translational modifications have revealed that some cis-regulatory mechanisms, such as those associated with proximal promoters, are present in non-animal holozoans, while others appear to be metazoan innovations, most notably distal enhancers. In contrast, the cohesin-CTCF system for regulating higher-order chromatin structure and enhancer-promoter long-range interactions appears to be restricted to bilaterians. Taken together, most bilaterian non-coding regulatory mechanisms appear to have originated before the divergence of crown metazoans. However, differential expansion of non-coding RNA and cis-regulatory DNA repertoires in bilaterians may account for their increased regulatory and morphological complexity relative to non-bilaterians.
Collapse
Affiliation(s)
- Federico Gaiti
- School of Biological Sciences, University of Queensland, Brisbane, Australia.
| | - Andrew D Calcino
- Department of Integrative Zoology, University of Vienna, Vienna, Austria.
| | - Miloš Tanurdžić
- School of Biological Sciences, University of Queensland, Brisbane, Australia.
| | - Bernard M Degnan
- School of Biological Sciences, University of Queensland, Brisbane, Australia.
| |
Collapse
|
191
|
Long HK, Prescott SL, Wysocka J. Ever-Changing Landscapes: Transcriptional Enhancers in Development and Evolution. Cell 2016; 167:1170-1187. [PMID: 27863239 PMCID: PMC5123704 DOI: 10.1016/j.cell.2016.09.018] [Citation(s) in RCA: 607] [Impact Index Per Article: 67.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2016] [Revised: 08/24/2016] [Accepted: 09/07/2016] [Indexed: 12/27/2022]
Abstract
A class of cis-regulatory elements, called enhancers, play a central role in orchestrating spatiotemporally precise gene-expression programs during development. Consequently, divergence in enhancer sequence and activity is thought to be an important mediator of inter- and intra-species phenotypic variation. Here, we give an overview of emerging principles of enhancer function, current models of enhancer architecture, genomic substrates from which enhancers emerge during evolution, and the influence of three-dimensional genome organization on long-range gene regulation. We discuss intricate relationships between distinct elements within complex regulatory landscapes and consider their potential impact on specificity and robustness of transcriptional regulation.
Collapse
Affiliation(s)
- Hannah K Long
- Department of Chemical and Systems Biology, Stanford School of Medicine, Stanford University, Stanford, CA 94305, USA; Institute of Stem Cell Biology and Regenerative Medicine, Stanford School of Medicine, Stanford University, Stanford, CA 94305, USA
| | - Sara L Prescott
- Department of Chemical and Systems Biology, Stanford School of Medicine, Stanford University, Stanford, CA 94305, USA; Department of Developmental Biology, Stanford School of Medicine, Stanford University, Stanford, CA 94305, USA
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford School of Medicine, Stanford University, Stanford, CA 94305, USA; Institute of Stem Cell Biology and Regenerative Medicine, Stanford School of Medicine, Stanford University, Stanford, CA 94305, USA; Department of Developmental Biology, Stanford School of Medicine, Stanford University, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford School of Medicine, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
192
|
Akiyama Y, Koda Y, Byeon SJ, Shimada S, Nishikawaji T, Sakamoto A, Chen Y, Kojima K, Kawano T, Eishi Y, Deng D, Kim WH, Zhu WG, Yuasa Y, Tanaka S. Reduced expression of SET7/9, a histone mono-methyltransferase, is associated with gastric cancer progression. Oncotarget 2016; 7:3966-83. [PMID: 26701885 PMCID: PMC4826183 DOI: 10.18632/oncotarget.6681] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2015] [Accepted: 11/25/2015] [Indexed: 11/25/2022] Open
Abstract
SET7/9, a histone methyltransferase, has two distinct functions for lysine methylation. SET7/9 methylates non-histone proteins, such as p53, and participates in their posttranslational modifications. Although SET7/9 transcriptionally activate the genes via H3K4 mono-methylation, its target genes are poorly understood. To clarify whether or not SET7/9 is related to carcinogenesis, we studied alterations of SET7/9 in gastric cancers (GCs). Among the 376 primary GCs, 129 cases (34.3%) showed loss or weak expression of SET7/9 protein compared to matched non-cancerous tissues by immunohistochemistry. Reduced SET7/9 expression was significantly correlated with clinical aggressiveness and worse prognosis. Knockdown of SET7/9 in GC cells markedly increased cell proliferation, migration and invasion. Expression of SREK1IP1, PGC and CCDC28B were inhibited in GC cells with SET7/9 knockdown, while matrix metalloproteinase genes (MMP1, MMP7 and MMP9) were activated. SET7/9 bound and mono-methylated H3K4 at the region of the approximately 4-6 kb upstream from the SREK1IP1 transcriptional start site and the promoters of PGC and CDC28B. Cell proliferation, migration and invasion, and expression of three MMPs were increased in GC cells with SREK1IP knockdown, which were similar to those of SET7/9 knockdown. These data suggest that SET7/9 has tumor suppressor functions, and loss of SET7/9 may contribute to gastric cancer progression.
Collapse
Affiliation(s)
- Yoshimitsu Akiyama
- Department of Molecular Oncology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| | - Yuki Koda
- Department of Molecular Oncology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| | - Sun-Ju Byeon
- Department of Pathology, Seoul National University College of Medicine, Jongno-gu, Seoul 110-799, Korea
| | - Shu Shimada
- Department of Molecular Oncology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| | - Taketo Nishikawaji
- Department of Molecular Oncology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| | - Ayuna Sakamoto
- Department of Molecular Oncology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| | - Yingxuan Chen
- Division of Gastroenterology and Hepatology, Ren Ji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200001, China
| | - Kazuyuki Kojima
- Department of Surgical Oncology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| | - Tatsuyuki Kawano
- Department of Surgery, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| | - Yoshinobu Eishi
- Department of Human Pathology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| | - Dajun Deng
- Division of Cancer Etiology, Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Woo Ho Kim
- Department of Pathology, Seoul National University College of Medicine, Jongno-gu, Seoul 110-799, Korea
| | - Wei-Guo Zhu
- Department of Biochemistry and Molecular Biology, Peking University Health Science Center, Beijing 100191, China
| | - Yasuhito Yuasa
- Department of Molecular Oncology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| | - Shinji Tanaka
- Department of Molecular Oncology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8519, Japan
| |
Collapse
|
193
|
Sparks EE, Drapek C, Gaudinier A, Li S, Ansariola M, Shen N, Hennacy JH, Zhang J, Turco G, Petricka JJ, Foret J, Hartemink AJ, Gordân R, Megraw M, Brady SM, Benfey PN. Establishment of Expression in the SHORTROOT-SCARECROW Transcriptional Cascade through Opposing Activities of Both Activators and Repressors. Dev Cell 2016; 39:585-596. [PMID: 27923776 DOI: 10.1016/j.devcel.2016.09.031] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2015] [Revised: 05/27/2016] [Accepted: 09/29/2016] [Indexed: 12/28/2022]
Abstract
Tissue-specific gene expression is often thought to arise from spatially restricted transcriptional cascades. However, it is unclear how expression is established at the top of these cascades in the absence of pre-existing specificity. We generated a transcriptional network to explore how transcription factor expression is established in the Arabidopsis thaliana root ground tissue. Regulators of the SHORTROOT-SCARECROW transcriptional cascade were validated in planta. At the top of this cascade, we identified both activators and repressors of SHORTROOT. The aggregate spatial expression of these regulators is not sufficient to predict transcriptional specificity. Instead, modeling, transcriptional reporters, and synthetic promoters support a mechanism whereby expression at the top of the SHORTROOT-SCARECROW cascade is established through opposing activities of activators and repressors.
Collapse
Affiliation(s)
- Erin E Sparks
- Department of Biology, Duke University, Durham, NC 27708, USA
| | - Colleen Drapek
- Department of Biology, Duke University, Durham, NC 27708, USA
| | - Allison Gaudinier
- Department of Plant Biology and Genome Center, University of California Davis, Davis, CA 95616, USA
| | - Song Li
- Department of Crop and Soil Environmental Sciences, Virginia Tech, Blacksburg, VA 24061, USA
| | - Mitra Ansariola
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | - Ning Shen
- Department of Pharmacology and Cancer Biology, Duke University, Durham, NC 27710, USA; Center for Genomic and Computational Biology, Duke University, Durham, NC 27708, USA
| | | | - Jingyuan Zhang
- Department of Biology, Duke University, Durham, NC 27708, USA
| | - Gina Turco
- Department of Plant Biology and Genome Center, University of California Davis, Davis, CA 95616, USA
| | | | - Jessica Foret
- Department of Plant Biology and Genome Center, University of California Davis, Davis, CA 95616, USA
| | - Alexander J Hartemink
- Department of Biology, Duke University, Durham, NC 27708, USA; Center for Genomic and Computational Biology, Duke University, Durham, NC 27708, USA; Department of Computer Science, Duke University, Durham, NC 27708, USA
| | - Raluca Gordân
- Center for Genomic and Computational Biology, Duke University, Durham, NC 27708, USA; Department of Computer Science, Duke University, Durham, NC 27708, USA; Department of Biostatistics and Bioinformatics, Duke University, Durham, NC 27710, USA
| | - Molly Megraw
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | - Siobhan M Brady
- Department of Plant Biology and Genome Center, University of California Davis, Davis, CA 95616, USA
| | - Philip N Benfey
- Department of Biology, Duke University, Durham, NC 27708, USA; Howard Hughes Medical Institute, Duke University, Durham, NC 27708, USA.
| |
Collapse
|
194
|
Abstract
Mutations in enhancer-associated chromatin-modifying components and genomic alterations in non-coding regions of the genome occur frequently in cancer, and other diseases pointing to the importance of enhancer fidelity to ensure proper tissue homeostasis. In this review, I will use specific examples to discuss how mutations in chromatin-modifying factors might affect enhancer activity of disease-relevant genes. I will then consider direct evidence from single nucleotide polymorphisms, small insertions, or deletions but also larger genomic rearrangements such as duplications, deletions, translocations, and inversions of specific enhancers to demonstrate how they have the ability to impact enhancer activity of disease genes including oncogenes and tumor suppressor genes. Considering that the scientific community only fairly recently has begun to focus its attention on "enhancer malfunction" in disease, I propose that multiple new enhancer-regulated and disease-relevant processes will be uncovered in the near future that will constitute the mechanistic basis for novel therapeutic avenues.
Collapse
Affiliation(s)
- Hans-Martin Herz
- Department of Cell & Molecular Biology, St. Jude Children's Research Hospital, Memphis, TN, USA.
| |
Collapse
|
195
|
Shin DH, Hong JW. Midline enhancer activity of the short gastrulation shadow enhancer is characterized by three unusual features for cis-regulatory DNA. BMB Rep 2016; 48:589-94. [PMID: 26277983 PMCID: PMC4911187 DOI: 10.5483/bmbrep.2015.48.10.155] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Indexed: 01/10/2023] Open
Abstract
The shadow enhancer of the short gastrulation
(sog) gene directs its sequential expression in the
neurogenic ectoderm and the ventral midline of the developing
Drosophila embryo. Here, we characterize three unusual
features of the shadow enhancer midline activity. First, the minimal regions for
the two different enhancer activities exhibit high overlap within the shadow
enhancer, meaning that one developmental enhancer possesses dual enhancer
activities. Second, the midline enhancer activity relies on five Single-minded
(Sim)-binding sites, two of which have not been found in any Sim target
enhancers. Finally, two linked Dorsal (Dl)- and Zelda (Zld)-binding sites,
critical for the neurogenic ectoderm enhancer activity, are also required for
the midline enhancer activity. These results suggest that early activation by Dl
and Zld may facilitate late activation via the noncanonical sites occupied by
Sim. We discuss a model for Zld as a pioneer factor and speculate its role in
midline enhancer activity. [BMB Reports 2015; 48(10): 589-594]
Collapse
Affiliation(s)
- Dong-Hyeon Shin
- Graduate School of East-West Medical Science, Kyung Hee University, Yongin 17104, Korea
| | - Joung-Woo Hong
- Graduate School of East-West Medical Science, Kyung Hee University, Yongin 17104, Korea
| |
Collapse
|
196
|
Savic D, Ramaker RC, Roberts BS, Dean EC, Burwell TC, Meadows SK, Cooper SJ, Garabedian MJ, Gertz J, Myers RM. Distinct gene regulatory programs define the inhibitory effects of liver X receptors and PPARG on cancer cell proliferation. Genome Med 2016; 8:74. [PMID: 27401066 PMCID: PMC4940857 DOI: 10.1186/s13073-016-0328-6] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2016] [Accepted: 06/14/2016] [Indexed: 12/28/2022] Open
Abstract
Background The liver X receptors (LXRs, NR1H2 and NR1H3) and peroxisome proliferator-activated receptor gamma (PPARG, NR1C3) nuclear receptor transcription factors (TFs) are master regulators of energy homeostasis. Intriguingly, recent studies suggest that these metabolic regulators also impact tumor cell proliferation. However, a comprehensive temporal molecular characterization of the LXR and PPARG gene regulatory responses in tumor cells is still lacking. Methods To better define the underlying molecular processes governing the genetic control of cellular growth in response to extracellular metabolic signals, we performed a comprehensive, genome-wide characterization of the temporal regulatory cascades mediated by LXR and PPARG signaling in HT29 colorectal cancer cells. For this analysis, we applied a multi-tiered approach that incorporated cellular phenotypic assays, gene expression profiles, chromatin state dynamics, and nuclear receptor binding patterns. Results Our results illustrate that the activation of both nuclear receptors inhibited cell proliferation and further decreased glutathione levels, consistent with increased cellular oxidative stress. Despite a common metabolic reprogramming, the gene regulatory network programs initiated by these nuclear receptors were widely distinct. PPARG generated a rapid and short-term response while maintaining a gene activator role. By contrast, LXR signaling was prolonged, with initial, predominantly activating functions that transitioned to repressive gene regulatory activities at late time points. Conclusions Through the use of a multi-tiered strategy that integrated various genomic datasets, our data illustrate that distinct gene regulatory programs elicit common phenotypic effects, highlighting the complexity of the genome. These results further provide a detailed molecular map of metabolic reprogramming in cancer cells through LXR and PPARG activation. As ligand-inducible TFs, these nuclear receptors can potentially serve as attractive therapeutic targets for the treatment of various cancers. Electronic supplementary material The online version of this article (doi:10.1186/s13073-016-0328-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Daniel Savic
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Ryne C Ramaker
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA.,Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
| | - Brian S Roberts
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Emma C Dean
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Todd C Burwell
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Sarah K Meadows
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Sara J Cooper
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Michael J Garabedian
- Departments of Microbiology and Urology, New York University, New York, NY, 10016, USA
| | - Jason Gertz
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, 84112, USA
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA.
| |
Collapse
|
197
|
Liu F, Li H, Ren C, Bo X, Shu W. PEDLA: predicting enhancers with a deep learning-based algorithmic framework. Sci Rep 2016; 6:28517. [PMID: 27329130 PMCID: PMC4916453 DOI: 10.1038/srep28517] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2016] [Accepted: 06/02/2016] [Indexed: 01/08/2023] Open
Abstract
Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer prediction, they still suffer from many issues. We developed a deep learning-based algorithmic framework named PEDLA (https://github.com/wenjiegroup/PEDLA), which can directly learn an enhancer predictor from massively heterogeneous data and generalize in ways that are mostly consistent across various cell types/tissues. We first trained PEDLA with 1,114-dimensional heterogeneous features in H1 cells, and demonstrated that PEDLA framework integrates diverse heterogeneous features and gives state-of-the-art performance relative to five existing methods for enhancer prediction. We further extended PEDLA to iteratively learn from 22 training cell types/tissues. Our results showed that PEDLA manifested superior performance consistency in both training and independent test sets. On average, PEDLA achieved 95.0% accuracy and a 96.8% geometric mean (GM) of sensitivity and specificity across 22 training cell types/tissues, as well as 95.7% accuracy and a 96.8% GM across 20 independent test cell types/tissues. Together, our work illustrates the power of harnessing state-of-the-art deep learning techniques to consistently identify regulatory elements at a genome-wide scale from massively heterogeneous data across diverse cell types/tissues.
Collapse
Affiliation(s)
- Feng Liu
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Hao Li
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Chao Ren
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Xiaochen Bo
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Wenjie Shu
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| |
Collapse
|
198
|
Engel KL, Mackiewicz M, Hardigan AA, Myers RM, Savic D. Decoding transcriptional enhancers: Evolving from annotation to functional interpretation. Semin Cell Dev Biol 2016; 57:40-50. [PMID: 27224938 DOI: 10.1016/j.semcdb.2016.05.014] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2016] [Revised: 05/06/2016] [Accepted: 05/18/2016] [Indexed: 12/18/2022]
Abstract
Deciphering the intricate molecular processes that orchestrate the spatial and temporal regulation of genes has become an increasingly major focus of biological research. The differential expression of genes by diverse cell types with a common genome is a hallmark of complex cellular functions, as well as the basis for multicellular life. Importantly, a more coherent understanding of gene regulation is critical for defining developmental processes, evolutionary principles and disease etiologies. Here we present our current understanding of gene regulation by focusing on the role of enhancer elements in these complex processes. Although functional genomic methods have provided considerable advances to our understanding of gene regulation, these assays, which are usually performed on a genome-wide scale, typically provide correlative observations that lack functional interpretation. Recent innovations in genome editing technologies have placed gene regulatory studies at an exciting crossroads, as systematic, functional evaluation of enhancers and other transcriptional regulatory elements can now be performed in a coordinated, high-throughput manner across the entire genome. This review provides insights on transcriptional enhancer function, their role in development and disease, and catalogues experimental tools commonly used to study these elements. Additionally, we discuss the crucial role of novel techniques in deciphering the complex gene regulatory landscape and how these studies will shape future research.
Collapse
Affiliation(s)
- Krysta L Engel
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States
| | - Mark Mackiewicz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States
| | - Andrew A Hardigan
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States; Department of Genetics, University of Alabama at Birmingham, Birmingham, AL 35294, United States
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States
| | - Daniel Savic
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States.
| |
Collapse
|
199
|
Sebé-Pedrós A, Ballaré C, Parra-Acero H, Chiva C, Tena JJ, Sabidó E, Gómez-Skarmeta JL, Di Croce L, Ruiz-Trillo I. The Dynamic Regulatory Genome of Capsaspora and the Origin of Animal Multicellularity. Cell 2016; 165:1224-1237. [PMID: 27114036 PMCID: PMC4877666 DOI: 10.1016/j.cell.2016.03.034] [Citation(s) in RCA: 113] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2015] [Revised: 02/03/2016] [Accepted: 03/18/2016] [Indexed: 12/16/2022]
Abstract
The unicellular ancestor of animals had a complex repertoire of genes linked to multicellular processes. This suggests that changes in the regulatory genome, rather than in gene innovation, were key to the origin of animals. Here, we carry out multiple functional genomic assays in Capsaspora owczarzaki, the unicellular relative of animals with the largest known gene repertoire for transcriptional regulation. We show that changing chromatin states, differential lincRNA expression, and dynamic cis-regulatory sites are associated with life cycle transitions in Capsaspora. Moreover, we demonstrate conservation of animal developmental transcription-factor networks and extensive network interconnection in this premetazoan organism. In contrast, however, Capsaspora lacks animal promoter types, and its regulatory sites are small, proximal, and lack signatures of animal enhancers. Overall, our results indicate that the emergence of animal multicellularity was linked to a major shift in genome cis-regulatory complexity, most notably the appearance of distal enhancer regulation.
Collapse
Affiliation(s)
- Arnau Sebé-Pedrós
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37-49, 08003 Barcelona, Spain.
| | - Cecilia Ballaré
- Center for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Doctor Aiguader 88, 08003 Barcelona, Spain
| | - Helena Parra-Acero
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37-49, 08003 Barcelona, Spain
| | - Cristina Chiva
- Center for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Doctor Aiguader 88, 08003 Barcelona, Spain
| | - Juan J Tena
- Centro Andaluz de Biología del Desarrollo (CABD), CSIC-Universidad Pablo de Olavide-Junta de Andalucía, Carretera de Utrera Km1, 41013 Sevilla, Spain
| | - Eduard Sabidó
- Center for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Doctor Aiguader 88, 08003 Barcelona, Spain
| | - José Luis Gómez-Skarmeta
- Centro Andaluz de Biología del Desarrollo (CABD), CSIC-Universidad Pablo de Olavide-Junta de Andalucía, Carretera de Utrera Km1, 41013 Sevilla, Spain
| | - Luciano Di Croce
- Center for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Doctor Aiguader 88, 08003 Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats, Pg Lluis Companys 23, 08010 Barcelona, Spain
| | - Iñaki Ruiz-Trillo
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37-49, 08003 Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats, Pg Lluis Companys 23, 08010 Barcelona, Spain; Departament de Genètica, Universitat de Barcelona, 08028 Barcelona, Spain.
| |
Collapse
|
200
|
Sayal R, Dresch JM, Pushel I, Taylor BR, Arnosti DN. Quantitative perturbation-based analysis of gene expression predicts enhancer activity in early Drosophila embryo. eLife 2016; 5. [PMID: 27152947 PMCID: PMC4859806 DOI: 10.7554/elife.08445] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2015] [Accepted: 04/04/2016] [Indexed: 01/02/2023] Open
Abstract
Enhancers constitute one of the major components of regulatory machinery of metazoans. Although several genome-wide studies have focused on finding and locating enhancers in the genomes, the fundamental principles governing their internal architecture and cis-regulatory grammar remain elusive. Here, we describe an extensive, quantitative perturbation analysis targeting the dorsal-ventral patterning gene regulatory network (GRN) controlled by Drosophila NF-κB homolog Dorsal. To understand transcription factor interactions on enhancers, we employed an ensemble of mathematical models, testing effects of cooperativity, repression, and factor potency. Models trained on the dataset correctly predict activity of evolutionarily divergent regulatory regions, providing insights into spatial relationships between repressor and activator binding sites. Importantly, the collective predictions of sets of models were effective at novel enhancer identification and characterization. Our study demonstrates how experimental dataset and modeling can be effectively combined to provide quantitative insights into cis-regulatory information on a genome-wide scale. DOI:http://dx.doi.org/10.7554/eLife.08445.001 DNA contains regions known as genes, which may be “transcribed” to produce the RNA molecules that act as templates for building proteins and regulate cell activity. Proteins called transcription factors can bind to specific sequences of DNA to influence whether nearby genes are transcribed. For example, so-called enhancer regions of DNA contain several binding sites for transcription factors, and this binding activates gene transcription. Little is known about how the transcription factor binding sites are organized in enhancer regions, which makes it difficult to use DNA sequence information alone to predict the regulation of genes. A transcription factor called Dorsal controls the activity of a network of genes that plays a crucial role in the development of fruit fly embryos. Dorsal binds to the enhancer region of a gene called rhomboid, which has been well studied and is known to be a fairly typical example of an enhancer region. To understand the regulatory information encoded in the DNA sequences of enhancers, Sayal, Dresch et al. have now used a technique called perturbation analysis to investigate the interactions that are likely to occur between Dorsal and other transcription factors as they bind to the rhomboid enhancer. This technique involves systematically mutating the enhancer to remove different combinations of transcription factor binding sites and quantitatively investigating the effect this has on gene activity. A large set of mathematical models were then trained using this data and shown to correctly predict the activity of a range of other gene regulatory regions. The collective predictions of the models identified new enhancer regions and revealed details about how different types of transcription factor binding sites are arranged within enhancers. As we enter an era where the DNA sequences of entire human populations are increasingly accessible, we would like to know the functional significance of changes in gene regulatory regions. Sayal, Dresch et al. show that the regulatory properties of specific control proteins are accessible by employing quantitative experiments and mathematical models. Similar studies will be required to learn how mutations found across the genome may alter gene expression, leading to better diagnosis and treatment of disease. DOI:http://dx.doi.org/10.7554/eLife.08445.002
Collapse
Affiliation(s)
- Rupinder Sayal
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, United States.,Department of Biochemistry, DAV University, Jalandhar, India
| | - Jacqueline M Dresch
- Department of Mathematics, Michigan State University, East Lansing, United States.,Department of Mathematics and Computer Science, Clark University, Worcester, United States
| | - Irina Pushel
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, United States.,Stowers Institute for Medical Research, Kansas City, United States
| | - Benjamin R Taylor
- Department of Computer Science and Engineering, Michigan State University, East Lansing, United States.,School of Computer Science, Georgia Institute of Technology, Atlanta, United States
| | - David N Arnosti
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, United States
| |
Collapse
|