1
|
Dickson ZW, Golding GB. Evolution of Transcript Abundance is Influenced by Indels in Protein Low Complexity Regions. J Mol Evol 2024; 92:153-168. [PMID: 38485789 DOI: 10.1007/s00239-024-10158-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 01/24/2024] [Indexed: 04/02/2024]
Abstract
Protein Protein low complexity regions (LCRs) are compositionally biased amino acid sequences, many of which have significant evolutionary impacts on the proteins which contain them. They are mutationally unstable experiencing higher rates of indels and substitutions than higher complexity regions. LCRs also impact the expression of their proteins, likely through multiple effects along the path from gene transcription, through translation, and eventual protein degradation. It has been observed that proteins which contain LCRs are associated with elevated transcript abundance (TAb), despite having lower protein abundance. We have gathered and integrated human data to investigate the co-evolution of TAb and LCRs through ancestral reconstructions and model inference using an approximate Bayesian calculation based method. We observe that on short evolutionary timescales TAb evolution is significantly impacted by changes in LCR length, with insertions driving TAb down. But in contrast, the observed data is best explained by indel rates in LCRs which are unaffected by shifts in TAb. Our work demonstrates a coupling between LCR and TAb evolution, and the utility of incorporating multiple responses into evolutionary analyses.
Collapse
Affiliation(s)
| | - G Brian Golding
- Department of Biology, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
2
|
Rosales-Vega M, Reséndez-Pérez D, Vázquez M. Antennapedia: The complexity of a master developmental transcription factor. Genesis 2024; 62:e23561. [PMID: 37830148 DOI: 10.1002/dvg.23561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 09/25/2023] [Accepted: 09/27/2023] [Indexed: 10/14/2023]
Abstract
Hox genes encode transcription factors that play an important role in establishing the basic body plan of animals. In Drosophila, Antennapedia is one of the five genes that make up the Antennapedia complex (ANT-C). Antennapedia determines the identity of the second thoracic segment, known as the mesothorax. Misexpression of Antennapedia at different developmental stages changes the identity of the mesothorax, including the muscles, nervous system, and cuticle. In Drosophila, Antennapedia has two distinct promoters highly regulated throughout development by several transcription factors. Antennapedia proteins are found with other transcription factors in different ANTENNAPEDIA transcriptional complexes to regulate multiple subsets of target genes. In this review, we describe the different mechanisms that regulate the expression and function of Antennapedia and the role of this Hox gene in the development of Drosophila.
Collapse
Affiliation(s)
- Marco Rosales-Vega
- Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Diana Reséndez-Pérez
- Facultad de Ciencias Biológicas, Departamento de Inmunología y Virología, Universidad Autónoma de Nuevo León, San Nicolás de los Garza, Nuevo León, Mexico
| | - Martha Vázquez
- Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| |
Collapse
|
3
|
Harris SE, Alexis MS, Giri G, Cavazos FF, Murn J, Aleman MM, Burge CB, Dominguez D. Understanding species-specific and conserved RNA-protein interactions in vivo and in vitro. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.29.577729. [PMID: 38352439 PMCID: PMC10862761 DOI: 10.1101/2024.01.29.577729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/22/2024]
Abstract
While evolution is often considered from a DNA- and protein-centric view, RNA-based regulation can also impact gene expression and protein sequences. Here we examined interspecies differences in RNA-protein interactions using the conserved neuronal RNA binding protein, Unkempt (UNK) as model. We find that roughly half of mRNAs bound in human are also bound in mouse. Unexpectedly, even when transcript-level binding was conserved across species differential motif usage was prevalent. To understand the biochemical basis of UNK-RNA interactions, we reconstituted the human and mouse UNK-RNA interactomes using a high-throughput biochemical assay. We uncover detailed features driving binding, show that in vivo patterns are captured in vitro, find that highly conserved sites are the strongest bound, and associate binding strength with downstream regulation. Furthermore, subtle sequence differences surrounding motifs are key determinants of species-specific binding. We highlight the complex features driving protein-RNA interactions and how these evolve to confer species-specific regulation.
Collapse
Affiliation(s)
- Sarah E. Harris
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC
- Department of Pharmacology, University of North Carolina, Chapel Hill, NC
| | - Maria S. Alexis
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA
- Current address: Remix Therapeutics, Cambridge, MA
| | - Gilbert Giri
- Department of Pharmacology, University of North Carolina, Chapel Hill, NC
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC
| | | | - Jernej Murn
- Department of Biochemistry, University of California, Riverside, CA
- Center for RNA Biology and Medicine, Riverside, CA
| | - Maria M. Aleman
- Department of Pharmacology, University of North Carolina, Chapel Hill, NC
| | | | - Daniel Dominguez
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC
- Department of Pharmacology, University of North Carolina, Chapel Hill, NC
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC
- RNA Discovery Center, University of North Carolina, Chapel Hill, NC
| |
Collapse
|
4
|
Selvakumar P, Siddharthan R. Position-specific evolution in transcription factor binding sites, and a fast likelihood calculation for the F81 model. ROYAL SOCIETY OPEN SCIENCE 2024; 11:231088. [PMID: 38269075 PMCID: PMC10805598 DOI: 10.1098/rsos.231088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 12/20/2023] [Indexed: 01/26/2024]
Abstract
Transcription factor binding sites (TFBS), like other DNA sequence, evolve via mutation and selection relating to their function. Models of nucleotide evolution describe DNA evolution via single-nucleotide mutation. A stationary vector of such a model is the long-term distribution of nucleotides, unchanging under the model. Neutrally evolving sites may have uniform stationary vectors, but one expects that sites within a TFBS instead have stationary vectors reflective of the fitness of various nucleotides at those positions. We introduce 'position-specific stationary vectors' (PSSVs), the collection of stationary vectors at each site in a TFBS locus, analogous to the position weight matrix (PWM) commonly used to describe TFBS. We infer PSSVs for human TFs using two evolutionary models (Felsenstein 1981 and Hasegawa-Kishino-Yano 1985). We find that PSSVs reflect the nucleotide distribution from PWMs, but with reduced specificity. We infer ancestral nucleotide distributions at individual positions and calculate 'conditional PSSVs' conditioned on specific choices of majority ancestral nucleotide. We find that certain ancestral nucleotides exert a strong evolutionary pressure on neighbouring sequence while others have a negligible effect. Finally, we present a fast likelihood calculation for the F81 model on moderate-sized trees that makes this approach feasible for large-scale studies along these lines.
Collapse
Affiliation(s)
- Pavitra Selvakumar
- The Institute of Mathematical Sciences, Chennai, India
- Homi Bhabha National Institute, Mumbai, India
| | - Rahul Siddharthan
- The Institute of Mathematical Sciences, Chennai, India
- Homi Bhabha National Institute, Mumbai, India
| |
Collapse
|
5
|
Carelli FN, Cerrato C, Dong Y, Appert A, Dernburg A, Ahringer J. Widespread transposon co-option in the Caenorhabditis germline regulatory network. SCIENCE ADVANCES 2022; 8:eabo4082. [PMID: 36525485 PMCID: PMC9757741 DOI: 10.1126/sciadv.abo4082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Accepted: 11/18/2022] [Indexed: 06/17/2023]
Abstract
The movement of selfish DNA elements can lead to widespread genomic alterations with potential to create novel functions. We show that transposon expansions in Caenorhabditis nematodes led to extensive rewiring of germline transcriptional regulation. We find that about one-third of Caenorhabditis elegans germline-specific promoters have been co-opted from two related miniature inverted repeat transposable elements (TEs), CERP2 and CELE2. These promoters are regulated by HIM-17, a THAP domain-containing transcription factor related to a transposase. Expansion of CERP2 occurred before radiation of the Caenorhabditis genus, as did fixation of mutations in HIM-17 through positive selection, whereas CELE2 expanded only in C. elegans. Through comparative analyses in Caenorhabditis briggsae, we find not only evolutionary conservation of most CERP2 co-opted promoters but also a substantial fraction that are species-specific. Our work reveals the emergence and evolutionary conservation of a novel transcriptional network driven by TE co-option with a major impact on regulatory evolution.
Collapse
Affiliation(s)
- Francesco Nicola Carelli
- Wellcome Trust/Cancer Research UK Gurdon Institute, Cambridge, UK
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Chiara Cerrato
- Wellcome Trust/Cancer Research UK Gurdon Institute, Cambridge, UK
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Yan Dong
- Wellcome Trust/Cancer Research UK Gurdon Institute, Cambridge, UK
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Alex Appert
- Wellcome Trust/Cancer Research UK Gurdon Institute, Cambridge, UK
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Abby Dernburg
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720-3200, USA
- Howard Hughes Medical Institute, 4000 Jones Bridge Road, Chevy Chase, MD 20815, USA
- Biological Sciences and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- California Institute for Quantitative Biosciences, Berkeley, CA 94720, USA
| | - Julie Ahringer
- Wellcome Trust/Cancer Research UK Gurdon Institute, Cambridge, UK
- Department of Genetics, University of Cambridge, Cambridge, UK
| |
Collapse
|
6
|
Krieger G, Lupo O, Wittkopp P, Barkai N. Evolution of transcription factor binding through sequence variations and turnover of binding sites. Genome Res 2022; 32:1099-1111. [PMID: 35618416 DOI: 10.1101/gr.276715.122] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 05/20/2022] [Indexed: 01/08/2023]
Abstract
Variations in noncoding regulatory sequences play a central role in evolution. Interpreting such variations, however, remains difficult even in the context of defined attributes such as transcription factor (TF) binding sites. Here, we systematically link variations in cis-regulatory sequences to TF binding by profiling the allele-specific binding of 27 TFs expressed in a yeast hybrid, in which two related genomes are present within the same nucleus. TFs localize preferentially to sites containing their known consensus motifs but occupy only a small fraction of the motif-containing sites available within the genomes. Differential binding of TFs to the orthologous alleles was well explained by variations that alter motif sequence, whereas differences in chromatin accessibility between alleles were of little apparent effect. Motif variations that abolished binding when present in only one allele were still bound when present in both alleles, suggesting evolutionary compensation, with a potential role for sequence conservation at the motif's vicinity. At the level of the full promoter, we identify cases of binding-site turnover, in which binding sites are reciprocally gained and lost, yet most interspecific differences remained uncompensated. Our results show the flexibility of TFs to bind imprecise motifs and the fast evolution of TF binding sites between related species.
Collapse
Affiliation(s)
- Gat Krieger
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Offir Lupo
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Patricia Wittkopp
- Department of Ecology and Evolutionary Biology, Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
7
|
Vatov E, Ludewig U, Zentgraf U. Disparate Dynamics of Gene Body and cis-Regulatory Element Evolution Illustrated for the Senescence-Associated Cysteine Protease Gene SAG12 of Plants. PLANTS 2021; 10:plants10071380. [PMID: 34371583 PMCID: PMC8309469 DOI: 10.3390/plants10071380] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 07/01/2021] [Accepted: 07/02/2021] [Indexed: 11/16/2022]
Abstract
Gene regulation networks precisely orchestrate the expression of genes that are closely associated with defined physiological and developmental processes such as leaf senescence in plants. The Arabidopsis thaliana senescence-associated gene 12 (AtSAG12) encodes a cysteine protease that is (i) involved in the degradation of chloroplast proteins and (ii) almost exclusively expressed during senescence. Transcription factors, such as WRKY53 and WRKY45, bind to W-boxes in the promoter region of AtSAG12 and play key roles in its activation. Other transcription factors, such as bZIPs, might have accessory functions in their gene regulation, as several A-boxes have been identified and appear to be highly overrepresented in the promoter region compared to the whole genome distribution but are not localized within the regulatory regions driving senescence-associated expression. To address whether these two regulatory elements exhibiting these different properties are conserved in other closely related species, we constructed phylogenetic trees of the coding sequences of orthologs of AtSAG12 and screened their respective 2000 bp promoter regions for the presence of conserved cis-regulatory elements, such as bZIP and WRKY binding sites. Interestingly, the functional relevant upstream located W-boxes were absent in plant species as closely related as Arabidopsis lyrata, whereas an A-box cluster appeared to be conserved in the Arabidopsis species but disappeared in Brassica napus. Several orthologs were present in other species, possibly because of local or whole genome duplication events, but with distinct cis-regulatory sites in different locations. However, at least one gene copy in each family analyzed carried one W-box and one A-box in its promoter. These gene differences in SAG12 orthologs are discussed in the framework of cis- and trans-regulatory factors, of promoter and gene evolution, of genetic variation, and of the enhancement of the adaptability of plants to changing environmental conditions.
Collapse
Affiliation(s)
- Emil Vatov
- Center for Plant Molecular Biology (ZMBP), University of Tübingen, Auf der Morgenstelle 32, 72076 Tübingen, Germany;
- Institute of Crop Science, Nutritional Crop Physiology, University of Hohenheim, Fruwirthstr. 20, 70599 Stuttgart, Germany;
| | - Uwe Ludewig
- Institute of Crop Science, Nutritional Crop Physiology, University of Hohenheim, Fruwirthstr. 20, 70599 Stuttgart, Germany;
| | - Ulrike Zentgraf
- Center for Plant Molecular Biology (ZMBP), University of Tübingen, Auf der Morgenstelle 32, 72076 Tübingen, Germany;
- Correspondence:
| |
Collapse
|
8
|
Dai A, Wang Y, Greenberg A, Liufu Z, Tang T. Rapid Evolution of Autosomal Binding Sites of the Dosage Compensation Complex in Drosophila melanogaster and Its Association With Transcription Divergence. Front Genet 2021; 12:675027. [PMID: 34194473 PMCID: PMC8238462 DOI: 10.3389/fgene.2021.675027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 05/19/2021] [Indexed: 11/25/2022] Open
Abstract
How pleiotropy influences evolution of protein sequence remains unclear. The male-specific lethal (MSL) complex in Drosophila mediates dosage compensation by 2-fold upregulation of the X chromosome in males. Nevertheless, several MSL proteins also bind autosomes and likely perform functions not related to dosage compensation. Here, we study the evolution of MOF, MSL1, and MSL2 biding sites in Drosophila melanogaster and its close relative Drosophila simulans. We found pervasive expansion of the MSL binding sites in D. melanogaster, particularly on autosomes. The majority of these newly-bound regions are unlikely to function in dosage compensation and associated with an increase in expression divergence between D. melanogaster and D. simulans. While dosage-compensation related sites show clear signatures of adaptive evolution, these signatures are even more marked among autosomal regions. Our study points to an intriguing avenue of investigation of pleiotropy as a mechanism promoting rapid protein sequence evolution.
Collapse
Affiliation(s)
- Aimei Dai
- State Key Laboratory of Biocontrol and Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Yushuai Wang
- State Key Laboratory of Biocontrol and Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | | | - Zhongqi Liufu
- State Key Laboratory of Biocontrol and Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Tian Tang
- State Key Laboratory of Biocontrol and Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
9
|
Coulcher JF, Roure A, Chowdhury R, Robert M, Lescat L, Bouin A, Carvajal Cadavid J, Nishida H, Darras S. Conservation of peripheral nervous system formation mechanisms in divergent ascidian embryos. eLife 2020; 9:e59157. [PMID: 33191918 PMCID: PMC7710358 DOI: 10.7554/elife.59157] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 11/13/2020] [Indexed: 01/23/2023] Open
Abstract
Ascidians with very similar embryos but highly divergent genomes are thought to have undergone extensive developmental system drift. We compared, in four species (Ciona and Phallusia for Phlebobranchia, Molgula and Halocynthia for Stolidobranchia), gene expression and gene regulation for a network of six transcription factors regulating peripheral nervous system (PNS) formation in Ciona. All genes, but one in Molgula, were expressed in the PNS with some differences correlating with phylogenetic distance. Cross-species transgenesis indicated strong levels of conservation, except in Molgula, in gene regulation despite lack of sequence conservation of the enhancers. Developmental system drift in ascidians is thus higher for gene regulation than for gene expression and is impacted not only by phylogenetic distance, but also in a clade-specific manner and unevenly within a network. Finally, considering that Molgula is divergent in our analyses, this suggests deep conservation of developmental mechanisms in ascidians after 390 My of separate evolution.
Collapse
Affiliation(s)
- Joshua F Coulcher
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins (BIOM)Banyuls-sur-MerFrance
| | - Agnès Roure
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins (BIOM)Banyuls-sur-MerFrance
| | - Rafath Chowdhury
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins (BIOM)Banyuls-sur-MerFrance
| | - Méryl Robert
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins (BIOM)Banyuls-sur-MerFrance
| | - Laury Lescat
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins (BIOM)Banyuls-sur-MerFrance
| | - Aurélie Bouin
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins (BIOM)Banyuls-sur-MerFrance
| | - Juliana Carvajal Cadavid
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins (BIOM)Banyuls-sur-MerFrance
| | - Hiroki Nishida
- Department of Biological Sciences, Graduate School of Science, Osaka UniversityToyonakaJapan
| | - Sébastien Darras
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins (BIOM)Banyuls-sur-MerFrance
| |
Collapse
|
10
|
Liu J, Shively CA, Mitra RD. Quantitative analysis of transcription factor binding and expression using calling cards reporter arrays. Nucleic Acids Res 2020; 48:e50. [PMID: 32133534 PMCID: PMC7229839 DOI: 10.1093/nar/gkaa141] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 01/31/2020] [Accepted: 02/25/2020] [Indexed: 12/13/2022] Open
Abstract
We report a tool, Calling Cards Reporter Arrays (CCRA), that measures transcription factor (TF) binding and the consequences on gene expression for hundreds of synthetic promoters in yeast. Using Cbf1p and MAX, we demonstrate that the CCRA method is able to detect small changes in binding free energy with a sensitivity comparable to in vitro methods, enabling the measurement of energy landscapes in vivo. We then demonstrate the quantitative analysis of cooperative interactions by measuring Cbf1p binding at synthetic promoters with multiple sites. We find that the cooperativity between Cbf1p dimers varies sinusoidally with a period of 10.65 bp and energetic cost of 1.37 KBT for sites that are positioned ‘out of phase’. Finally, we characterize the binding and expression of a group of TFs, Tye7p, Gcr1p and Gcr2p, that act together as a ‘TF collective’, an important but poorly characterized model of TF cooperativity. We demonstrate that Tye7p often binds promoters without its recognition site because it is recruited by other collective members, whereas these other members require their recognition sites, suggesting a hierarchy where these factors recruit Tye7p but not vice versa. Our experiments establish CCRA as a useful tool for quantitative investigations into TF binding and function.
Collapse
Affiliation(s)
- Jiayue Liu
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63108, USA.,The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63108, USA
| | - Christian A Shively
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63108, USA.,The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63108, USA
| | - Robi D Mitra
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63108, USA.,The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63108, USA.,McDonnell Genome Institute, Washington University School of Medicine in St. Louis, St. Louis, MO 63108, USA
| |
Collapse
|
11
|
Dukler N, Huang YF, Siepel A. Phylogenetic Modeling of Regulatory Element Turnover Based on Epigenomic Data. Mol Biol Evol 2020; 37:2137-2152. [PMID: 32176292 PMCID: PMC7306682 DOI: 10.1093/molbev/msaa073] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Evolutionary changes in gene expression are often driven by gains and losses of cis-regulatory elements (CREs). The dynamics of CRE evolution can be examined using multispecies epigenomic data, but so far such analyses have generally been descriptive and model-free. Here, we introduce a probabilistic modeling framework for the evolution of CREs that operates directly on raw chromatin immunoprecipitation and sequencing (ChIP-seq) data and fully considers the phylogenetic relationships among species. Our framework includes a phylogenetic hidden Markov model, called epiPhyloHMM, for identifying the locations of multiply aligned CREs, and a combined phylogenetic and generalized linear model, called phyloGLM, for accounting for the influence of a rich set of genomic features in describing their evolutionary dynamics. We apply these methods to previously published ChIP-seq data for the H3K4me3 and H3K27ac histone modifications in liver tissue from nine mammals. We find that enhancers are gained and lost during mammalian evolution at about twice the rate of promoters, and that turnover rates are negatively correlated with DNA sequence conservation, expression level, and tissue breadth, and positively correlated with distance from the transcription start site, consistent with previous findings. In addition, we find that the predicted dosage sensitivity of target genes positively correlates with DNA sequence constraint in CREs but not with turnover rates, perhaps owing to differences in the effect sizes of the relevant mutations. Altogether, our probabilistic modeling framework enables a variety of powerful new analyses.
Collapse
Affiliation(s)
- Noah Dukler
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
- Physiology, Biophysics, and Systems Biology, Weill Cornell Medical College, New York, NY
| | - Yi-Fei Huang
- Department of Biology and Huck Institute of Life Sciences, Pennsylvania State University, University Park, PA
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
| |
Collapse
|
12
|
Shokri L, Inukai S, Hafner A, Weinand K, Hens K, Vedenko A, Gisselbrecht SS, Dainese R, Bischof J, Furger E, Feuz JD, Basler K, Deplancke B, Bulyk ML. A Comprehensive Drosophila melanogaster Transcription Factor Interactome. Cell Rep 2020; 27:955-970.e7. [PMID: 30995488 PMCID: PMC6485956 DOI: 10.1016/j.celrep.2019.03.071] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 02/04/2019] [Accepted: 03/18/2019] [Indexed: 12/14/2022] Open
Abstract
Combinatorial interactions among transcription factors (TFs) play essential roles in generating gene expression specificity and diversity in metazoans. Using yeast 2-hybrid (Y2H) assays on nearly all sequence-specific Drosophila TFs, we identified 1,983 protein-protein interactions (PPIs), more than doubling the number of currently known PPIs among Drosophila TFs. For quality assessment, we validated a subset of our interactions using MITOMI and bimolecular fluorescence complementation assays. We combined our interactome with prior PPI data to generate an integrated Drosophila TF-TF binary interaction network. Our analysis of ChIP-seq data, integrating PPI and gene expression information, uncovered different modes by which interacting TFs are recruited to DNA. We further demonstrate the utility of our Drosophila interactome in shedding light on human TF-TF interactions. This study reveals how TFs interact to bind regulatory elements in vivo and serves as a resource of Drosophila TF-TF binary PPIs for understanding tissue-specific gene regulation. Combinatorial regulation by transcription factors (TFs) is one mechanism for achieving condition and tissue-specific gene regulation. Shokri et al. mapped TF-TF interactions between most Drosophila TFs, reporting a comprehensive TF-TF network integrated with previously known interactions. They used this network to discern distinct TF-DNA binding modes.
Collapse
Affiliation(s)
- Leila Shokri
- Department of Medicine, Division of Genetics, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Sachi Inukai
- Department of Medicine, Division of Genetics, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Antonina Hafner
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA; Systems Biology Graduate Program, Harvard University, Cambridge, MA 02138, USA; Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Kathryn Weinand
- Department of Medicine, Division of Genetics, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA; Bioinformatics and Integrative Genomics Ph.D. Program, Harvard University, Cambridge, MA 02138, USA
| | - Korneel Hens
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Anastasia Vedenko
- Department of Medicine, Division of Genetics, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Stephen S Gisselbrecht
- Department of Medicine, Division of Genetics, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Riccardo Dainese
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Johannes Bischof
- Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Edy Furger
- Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Jean-Daniel Feuz
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Konrad Basler
- Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland.
| | - Martha L Bulyk
- Department of Medicine, Division of Genetics, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA; Systems Biology Graduate Program, Harvard University, Cambridge, MA 02138, USA; Bioinformatics and Integrative Genomics Ph.D. Program, Harvard University, Cambridge, MA 02138, USA; Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
13
|
Arginine Deiminase and Biotin Metabolism Signaling Pathways Play an Important Role in Human-Derived Serotype V, ST1 Streptococcus agalactiae Virulent Strain upon Infected Tilapia. Animals (Basel) 2020; 10:ani10050849. [PMID: 32423070 PMCID: PMC7278441 DOI: 10.3390/ani10050849] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2020] [Revised: 04/14/2020] [Accepted: 04/15/2020] [Indexed: 01/18/2023] Open
Abstract
Simple Summary Patients who were infected with Streptococcus agalactiae (ST1) were mainly associated with asymptomatic carriage. However, the invasive diseases in non-pregnant adults caused by S. agalactiae (serotype V, ST1) have increased recently. We have previously reported that human-derived S. agalactiae (serotype V, ST1) could infect tilapia with virulence and pathologic characteristics similar to highly virulent tilapia-derived S. agalactiae (ST7) strains. The potential risk of cross-species infection cannot be ignored. Therefore, our research provided a multi-omics analysis of the human-derived serotype V ST1 S. agalactiae strains, which were virulent and non-virulent to tilapia and provided a more comprehensive understanding of the virulence mechanism. Abstract Our previous study showed that human-derived Streptococcus agalactiae (serotype V) could infect tilapia, but the mechanism underlying the cross-species infection remains unrecognized. In this study, a multi-omics analysis was performed on human-derived S. agalactiae strain NNA048 (virulent to tilapia, serotype V, ST1) and human-derived S. agalactiae strain NNA038 (non-virulent to tilapia, serotype V, ST1). The results showed that 907 genes (504 up/403 down) and 89 proteins (51 up/38 down) were differentially expressed (p < 0.05) between NNA038 and NNA048. Among them, 56 genes (proteins) were altered with similar trends at both mRNA and protein levels. Functional annotation of them showed that the main differences were enriched in the arginine deiminase system signaling pathway and biotin metabolism signaling pathway: gdhA, glnA, ASL, ADI, OTC, arcC, FabF, FabG, FabZ, BioB and BirA genes may have been important factors leading to the pathogenicity differences between NNA038 and NNA048. We aimed to provide a comprehensive analysis of the human-derived serotype V ST1 S. agalactiae strains, which were virulent and non-virulent to tilapia, and provide a more comprehensive understanding of the virulence mechanism.
Collapse
|
14
|
Tran H, Walczak AM, Dostatni N. Constraints and limitations on the transcriptional response downstream of the Bicoid morphogen gradient. Curr Top Dev Biol 2020; 137:119-142. [PMID: 32143741 DOI: 10.1016/bs.ctdb.2019.12.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Abstract
The regulation of the hunchback promoter expression by the maternal Bicoid gradient has been studied as a model system in development for many years. Yet, at the level of quantitative agreement between data and theoretical models, even the first step of this regulation, transcription, continues to be challenging. This situation is slowly progressing, thanks to quantitative live-imaging techniques coupled to advanced statistical data analysis and modeling. Here, we outline the current state of our knowledge of this apparently "simple" step, highlighting the newly appreciated role of bursty transcription dynamics and its regulation.
Collapse
Affiliation(s)
- Huy Tran
- Institut Curie, PSL Research University, CNRS, Sorbonne Université, Nuclear Dynamics, Paris, France; Ecole Normale Supérieure, PSL Research University, CNRS, Sorbonne Université, Laboratoire de Physique, Paris, France
| | - Aleksandra M Walczak
- Ecole Normale Supérieure, PSL Research University, CNRS, Sorbonne Université, Laboratoire de Physique, Paris, France.
| | - Nathalie Dostatni
- Institut Curie, PSL Research University, CNRS, Sorbonne Université, Nuclear Dynamics, Paris, France.
| |
Collapse
|
15
|
Peng PC, Khoueiry P, Girardot C, Reddington JP, Garfield DA, Furlong EEM, Sinha S. The Role of Chromatin Accessibility in cis-Regulatory Evolution. Genome Biol Evol 2020; 11:1813-1828. [PMID: 31114856 PMCID: PMC6601868 DOI: 10.1093/gbe/evz103] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/13/2019] [Indexed: 02/07/2023] Open
Abstract
Transcription factor (TF) binding is determined by sequence as well as chromatin accessibility. Although the role of accessibility in shaping TF-binding landscapes is well recorded, its role in evolutionary divergence of TF binding, which in turn can alter cis-regulatory activities, is not well understood. In this work, we studied the evolution of genome-wide binding landscapes of five major TFs in the core network of mesoderm specification, between Drosophila melanogaster and Drosophila virilis, and examined its relationship to accessibility and sequence-level changes. We generated chromatin accessibility data from three important stages of embryogenesis in both Drosophila melanogaster and Drosophila virilis and recorded conservation and divergence patterns. We then used multivariable models to correlate accessibility and sequence changes to TF-binding divergence. We found that accessibility changes can in some cases, for example, for the master regulator Twist and for earlier developmental stages, more accurately predict binding change than is possible using TF-binding motif changes between orthologous enhancers. Accessibility changes also explain a significant portion of the codivergence of TF pairs. We noted that accessibility and motif changes offer complementary views of the evolution of TF binding and developed a combined model that captures the evolutionary data much more accurately than either view alone. Finally, we trained machine learning models to predict enhancer activity from TF binding and used these functional models to argue that motif and accessibility-based predictors of TF-binding change can substitute for experimentally measured binding change, for the purpose of predicting evolutionary changes in enhancer activity.
Collapse
Affiliation(s)
- Pei-Chen Peng
- Department of Computer Science, University of Illinois at Urbana-Champaign.,Center for Bioinformatics and Functional Genomics, Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA
| | - Pierre Khoueiry
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.,American University of Beirut (AUB), Department of Biochemistry and Molecular Genetics, Beirut, Lebanon
| | - Charles Girardot
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - James P Reddington
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - David A Garfield
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.,IRI-Life Sciences, Humboldt Universität zu Berlin, Berlin, Germany
| | - Eileen E M Furlong
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign
| |
Collapse
|
16
|
Bogan SN, Place SP. Accelerated evolution at chaperone promoters among Antarctic notothenioid fishes. BMC Evol Biol 2019; 19:205. [PMID: 31694524 PMCID: PMC6836667 DOI: 10.1186/s12862-019-1524-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Accepted: 10/01/2019] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND Antarctic fishes of the Notothenioidei suborder constitutively upregulate multiple inducible chaperones, a highly derived adaptation that preserves proteostasis in extreme cold, and represent a system for studying the evolution of gene frontloading. We screened for Hsf1-binding sites, as Hsf1 is a master transcription factor of the heat shock response, and highly-conserved non-coding elements within proximal promoters of chaperone genes across 10 Antarctic notothens, 2 subpolar notothens, and 17 perciform fishes. We employed phylogenetic models of molecular evolution to determine whether (i) changes in motifs associated with Hsf1-binding and/or (ii) relaxed purifying selection or exaptation at ancestral cis-regulatory elements coincided with the evolution of chaperone frontloading in Antarctic notothens. RESULTS Antarctic notothens exhibited significantly fewer Hsf1-binding sites per bp at chaperone promoters than subpolar notothens and Serranoidei, the most closely-related suborder to Notothenioidei included in this study. 90% of chaperone promoters exhibited accelerated substitution rates among Antarctic notothens relative to other perciformes. The proportion of bases undergoing accelerated evolution (i) was significantly greater in Antarctic notothens than in subpolar notothens and Perciformes in 70% of chaperone genes and (ii) increased among bases that were more conserved among perciformes. Lastly, we detected evidence of relaxed purifying selection and exaptation acting on ancestrally conserved cis-regulatory elements in the Antarctic notothen lineage and its major branches. CONCLUSION A large degree of turnover has occurred in Notothenioidei at chaperone promoter regions that are conserved among perciform fishes following adaptation to the cooling of the Southern Ocean. Additionally, derived reductions in Hsf1-binding site frequency suggest cis-regulatory modifications to the classical heat shock response. Of note, turnover events within chaperone promoters were less frequent in the ancestral node of Antarctic notothens relative to younger Antarctic lineages. This suggests that cis-regulatory divergence at chaperone promoters may be greater between Antarctic notothen lineages than between subpolar and Antarctic clades. These findings demonstrate that strong selective forces have acted upon cis-regulatory elements of chaperone genes among Antarctic notothens.
Collapse
Affiliation(s)
- Samuel N Bogan
- Department of Biology, Sonoma State University, Rohnert Park, CA, 94928, USA.
- Department of Ecology, Evolution and Marine Biology, University of California, Santa Barbara, CA, 93106, USA.
| | - Sean P Place
- Department of Biology, Sonoma State University, Rohnert Park, CA, 94928, USA
| |
Collapse
|
17
|
Li L, Liu Y, Huang T, Liang W, Chen M. Development of an attenuated oral vaccine strain of tilapia Group B Streptococci serotype Ia by gene knockout technology. FISH & SHELLFISH IMMUNOLOGY 2019; 93:924-933. [PMID: 31374315 DOI: 10.1016/j.fsi.2019.07.081] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Revised: 07/26/2019] [Accepted: 07/29/2019] [Indexed: 06/10/2023]
Abstract
Our previous studies demonstrated that the deletion of D2 fragment in tilapia Streptococcus agalactiae(GBS) attenuated strain YM001 is the main reason for the loss of virulence to tilapia. In this study, a Δ2 mutant that deletion of D2 fragment in parental virulent strain HN016 was constructed, and the safety, stability, immunogenicity, and growth characteristics, as well as the virulence mechanism of Δ2 mutant were evaluated. The results showed that Δ2 mutant was not pathogenic to tilapia, and the virulent revertants were not observed after 50 generations of passage. The RPS reached 96.11% at 15 days and 93.05% at 30 days, respectively, after intraperitoneal injection, while RPS reached 74.80% at 15 days and 53.16% at 30 days, respectively, after oral immunization. The growth of Δ2 mutant was significantly faster than YM001, and genes that were enriched in the nitrogen metabolism and arginine biosynthesis signaling pathway (arc, glnA, and gdhA) were identified as important candidate genes responsible for growth rate of S. agalactiae. The absence of D2 fragment affected the expression of Sip, therefore influencing the bacterial virulence. Altogether, this study demonstrated that deletion of D2 fragment in HN016 causes the loss of virulence to tilapia, and Δ2 mutant is a promising, better attenuated oral vaccine strain of S. agalactiae compared to YM001.
Collapse
Affiliation(s)
- Liping Li
- Guangxi Academy of Fishery Sciences, Qingshan Road NO.8, Nanning, 530021, China
| | - Yu Liu
- Guangxi Academy of Fishery Sciences, Qingshan Road NO.8, Nanning, 530021, China
| | - Ting Huang
- Guangxi Academy of Fishery Sciences, Qingshan Road NO.8, Nanning, 530021, China
| | - Wanwen Liang
- Guangxi Academy of Fishery Sciences, Qingshan Road NO.8, Nanning, 530021, China
| | - Ming Chen
- Guangxi Academy of Fishery Sciences, Qingshan Road NO.8, Nanning, 530021, China.
| |
Collapse
|
18
|
Liu Y, Li L, Huang T, Wang R, Liang W, Yang Q, Lei A, Chen M. Comparative multi-omics systems analysis reveal the glycolysis / gluconeogenesis signal pathway play an important role in virulence attenuation in fish-derived GBS YM001. PLoS One 2019; 14:e0221634. [PMID: 31449567 PMCID: PMC6709914 DOI: 10.1371/journal.pone.0221634] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Accepted: 08/12/2019] [Indexed: 01/18/2023] Open
Abstract
Streptococcus agalactiae(GBS) is a seriously threat to the farmed tilapia, and oral vaccination was considered to be the most desirable means which requires deep understanding of virulence mechanism of the fish-derived GBS. Our previous genome study of the fish-derived attenuated strain YM001 showed that there were two large deletions in YM001 compared to its parental virulent strain HN016. In this study, a combined transcriptomic and proteomic analysis was performed on YM001 and HN016 strains, and the important genes were verified by RT-qPCR in bacteria strains and infected-tilapia tissues. Overall, we have shown that a total of 958 genes and 331 proteins were significantly differential expressed between YM001 and HN016. By functional annotation of these DEGs and DEPs, genes that were enriched in pentose phosphate pathway(pgm, ptsG, pgi pfkA, fbaA and FBP3) and pyruvate metabolism pathway(pdhA, pdhB, pdhC and pdhD) were identifed as important candidate genes for leads low growth ability in attenuated strain, which may be an important reasons leading virulence attenuation in the end. The expression levels the candidate genes in pentose phosphate pathway and pyruvate metabolism pathway were significant differential expressed in tilapia’ brain and spleen when infected with YM001 and HN016. Our study indicated that the pentose phosphate pathway and pyruvate metabolism pathway that affecting the growth of the strain may be one of the important reasons for the virulence attenuation in HN016.
Collapse
Affiliation(s)
- Yu Liu
- Guangxi Academy of Fishery Sciences, Nanning,China,P.R. China
| | - Liping Li
- Guangxi Academy of Fishery Sciences, Nanning,China,P.R. China
| | - Ting Huang
- Guangxi Academy of Fishery Sciences, Nanning,China,P.R. China
| | - Rui Wang
- Guangxi Academy of Fishery Sciences, Nanning,China,P.R. China
| | - Wanwen Liang
- Guangxi Academy of Fishery Sciences, Nanning,China,P.R. China
| | - Qiong Yang
- Guangxi Academy of Fishery Sciences, Nanning,China,P.R. China
| | - Aiying Lei
- Guangxi Academy of Fishery Sciences, Nanning,China,P.R. China
| | - Ming Chen
- Guangxi Academy of Fishery Sciences, Nanning,China,P.R. China
- * E-mail:
| |
Collapse
|
19
|
Boltz TA, Khuri S, Wuchty S. Promoter conservation in HDACs points to functional implications. BMC Genomics 2019; 20:613. [PMID: 31351464 PMCID: PMC6660948 DOI: 10.1186/s12864-019-5973-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 07/12/2019] [Indexed: 01/05/2023] Open
Abstract
Background Histone deacetylases (HDACs) are the proteins responsible for removing the acetyl group from lysine residues of core histones in chromosomes, a crucial component of gene regulation. Eleven known HDACs exist in humans and most other vertebrates. While the basic function of HDACs has been well characterized and new discoveries are still being made, the transcriptional regulation of their corresponding genes is still poorly understood. Results Here, we conducted a computational analysis of the eleven HDAC promoter sequences in 25 vertebrate species to determine whether transcription factor binding sites (TFBSs) are conserved in HDAC evolution, and if so, whether they provide useful information about HDAC expression and function. Furthermore, we used tissue-specific information of transcription factors to investigate the potential expression patterns of HDACs in different human tissues based on their transcription factor binding sites. We found that the TFBS profiles of most of the HDACs were well conserved in closely related species for all HDAC promoters except HDAC7 and HDAC10. HDAC5 had particularly strong conservation across over half of the species studied, with nearly identical profiles in the primate species. Our comparisons of TFBSs with the tissue specific gene expression profiles of their corresponding TFs showed that most HDACs had the ability to be ubiquitously expressed. A few HDAC promoters exhibited the potential for preferential expression in certain tissues, most notably HDAC11 in gall bladder, while HDAC9 seemed to have less propensity for expression in the nervous system. Conclusions In general, we found evolutionary conservation in HDAC promoters that seems to be more prominent for the ubiquitously expressed HDACs. In turn, when conservation did not follow usual phylogeny, human TFBS patterns indicated possible functional relevance. While we found that HDACs appear to uniformly expressed, we confirm that the functional differences in HDACs may be less a matter of location of activity than a question of which proteins and which acetyl groups they may be acting on. Electronic supplementary material The online version of this article (10.1186/s12864-019-5973-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Toni A Boltz
- Department of Computer Science, University of Miami, Coral Gables, FL, USA.,Present address: University of California, Los Angeles, Los Angeles, CA, USA
| | - Sawsan Khuri
- University of Exeter College of Medicine and Health, Exeter, UK
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL, USA. .,Department of Biology, University of Miami, Coral Gables, FL, USA. .,Center of Computational Science, University of Miami, Coral Gables, FL, USA. .,Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA.
| |
Collapse
|
20
|
Lamrabet O, Plumbridge J, Martin M, Lenski RE, Schneider D, Hindré T. Plasticity of Promoter-Core Sequences Allows Bacteria to Compensate for the Loss of a Key Global Regulatory Gene. Mol Biol Evol 2019; 36:1121-1133. [PMID: 30825312 DOI: 10.1093/molbev/msz042] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Transcription regulatory networks (TRNs) are of central importance for both short-term phenotypic adaptation in response to environmental fluctuations and long-term evolutionary adaptation, with global regulatory genes often being targets of natural selection in laboratory experiments. Here, we combined evolution experiments, whole-genome resequencing, and molecular genetics to investigate the driving forces, genetic constraints, and molecular mechanisms that dictate how bacteria can cope with a drastic perturbation of their TRNs. The crp gene, encoding a major global regulator in Escherichia coli, was deleted in four different genetic backgrounds, all derived from the Long-Term Evolution Experiment (LTEE) but with different TRN architectures. We confirmed that crp deletion had a more deleterious effect on growth rate in the LTEE-adapted genotypes; and we showed that the ptsG gene, which encodes the major glucose-PTS transporter, gained CRP (cyclic AMP receptor protein) dependence over time in the LTEE. We then further evolved the four crp-deleted genotypes in glucose minimal medium, and we found that they all quickly recovered from their growth defects by increasing glucose uptake. We showed that this recovery was specific to the selective environment and consistently relied on mutations in the cis-regulatory region of ptsG, regardless of the initial genotype. These mutations affected the interplay of transcription factors acting at the promoters, changed the intrinsic properties of the existing promoters, or produced new transcription initiation sites. Therefore, the plasticity of even a single promoter region can compensate by three different mechanisms for the loss of a key regulatory hub in the E. coli TRN.
Collapse
Affiliation(s)
- Otmane Lamrabet
- Université Grenoble Alpes, CNRS, Grenoble INP, TIMC-IMAG, Grenoble, France
| | - Jacqueline Plumbridge
- CNRS UMR8261, Université Paris Diderot, Sorbonne Paris Cité, Institut de Biologie Physico-chimique, Paris, France
| | - Mikaël Martin
- Université Grenoble Alpes, CNRS, Grenoble INP, TIMC-IMAG, Grenoble, France
| | - Richard E Lenski
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI.,BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, MI
| | | | - Thomas Hindré
- Université Grenoble Alpes, CNRS, Grenoble INP, TIMC-IMAG, Grenoble, France
| |
Collapse
|
21
|
Berger MJ, Wenger AM, Guturu H, Bejerano G. Independent erosion of conserved transcription factor binding sites points to shared hindlimb, vision and external testes loss in different mammals. Nucleic Acids Res 2019; 46:9299-9308. [PMID: 30137416 PMCID: PMC6182171 DOI: 10.1093/nar/gky741] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Accepted: 08/21/2018] [Indexed: 02/05/2023] Open
Abstract
Genetic variation in cis-regulatory elements is thought to be a major driving force in morphological and physiological changes. However, identifying transcription factor binding events that code for complex traits remains a challenge, motivating novel means of detecting putatively important binding events. Using a curated set of 1154 high-quality transcription factor motifs, we demonstrate that independently eroded binding sites are enriched for independently lost traits in three distinct pairs of placental mammals. We show that these independently eroded events pinpoint the loss of hindlimbs in dolphin and manatee, degradation of vision in naked mole-rat and star-nosed mole, and the loss of external testes in white rhinoceros and Weddell seal. We additionally show that our method may also be utilized with more than two species. Our study exhibits a novel methodology to detect cis-regulatory mutations which help explain a portion of the molecular mechanism underlying complex trait formation and loss.
Collapse
Affiliation(s)
- Mark J Berger
- Department of Computer Science, Stanford University, Stanford, CA 94305-5329, USA
| | - Aaron M Wenger
- Department of Computer Science, Stanford University, Stanford, CA 94305-5329, USA
| | - Harendra Guturu
- Department of Electrical Engineering, Stanford University, Stanford, CA 94305-5008, USA
| | - Gill Bejerano
- Department of Computer Science, Stanford University, Stanford, CA 94305-5329, USA.,Department of Developmental Biology, Stanford University, Stanford, CA 94305-5329, USA.,Department of Pediatrics, Stanford University, Stanford, CA 94305-5208, USA.,Department of Biomedical Data Science, Stanford University, Stanford, CA 94305-5464, USA
| |
Collapse
|
22
|
Bozek M, Cortini R, Storti AE, Unnerstall U, Gaul U, Gompel N. ATAC-seq reveals regional differences in enhancer accessibility during the establishment of spatial coordinates in the Drosophila blastoderm. Genome Res 2019; 29:771-783. [PMID: 30962180 PMCID: PMC6499308 DOI: 10.1101/gr.242362.118] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 03/26/2019] [Indexed: 12/21/2022]
Abstract
Establishment of spatial coordinates during Drosophila embryogenesis relies on differential regulatory activity of axis patterning enhancers. Concentration gradients of activator and repressor transcription factors (TFs) provide positional information to each enhancer, which in turn promotes transcription of a target gene in a specific spatial pattern. However, the interplay between an enhancer regulatory activity and its accessibility as determined by local chromatin organization is not well understood. We profiled chromatin accessibility with ATAC-seq in narrow, genetically tagged domains along the antero-posterior axis in the Drosophila blastoderm. We demonstrate that one-quarter of the accessible genome displays significant regional variation in its ATAC-seq signal immediately after zygotic genome activation. Axis patterning enhancers are enriched among the most variable intervals, and their accessibility changes correlate with their regulatory activity. In an embryonic domain where an enhancer receives a net activating TF input and promotes transcription, it displays elevated accessibility in comparison to a domain where it receives a net repressive input. We propose that differential accessibility is a signature of patterning cis-regulatory elements in the Drosophila blastoderm and discuss potential mechanisms by which accessibility of enhancers may be modulated by activator and repressor TFs.
Collapse
Affiliation(s)
- Marta Bozek
- Ludwig-Maximilians-Universität München, Department Biochemie, Genzentrum, 81377 München, Germany
| | - Roberto Cortini
- Ludwig-Maximilians-Universität München, Department Biochemie, Genzentrum, 81377 München, Germany
| | - Andrea Ennio Storti
- Ludwig-Maximilians-Universität München, Department Biochemie, Genzentrum, 81377 München, Germany
| | - Ulrich Unnerstall
- Ludwig-Maximilians-Universität München, Department Biochemie, Genzentrum, 81377 München, Germany
| | - Ulrike Gaul
- Ludwig-Maximilians-Universität München, Department Biochemie, Genzentrum, 81377 München, Germany
| | - Nicolas Gompel
- Ludwig-Maximilians Universität München, Fakultät für Biologie, Biozentrum, 82152 Planegg-Martinsried, Germany
| |
Collapse
|
23
|
Farré M, Kim J, Proskuryakova AA, Zhang Y, Kulemzina AI, Li Q, Zhou Y, Xiong Y, Johnson JL, Perelman PL, Johnson WE, Warren WC, Kukekova AV, Zhang G, O'Brien SJ, Ryder OA, Graphodatsky AS, Ma J, Lewin HA, Larkin DM. Evolution of gene regulation in ruminants differs between evolutionary breakpoint regions and homologous synteny blocks. Genome Res 2019; 29:576-589. [PMID: 30760546 PMCID: PMC6442394 DOI: 10.1101/gr.239863.118] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 02/08/2019] [Indexed: 02/02/2023]
Abstract
The role of chromosome rearrangements in driving evolution has been a long-standing question of evolutionary biology. Here we focused on ruminants as a model to assess how rearrangements may have contributed to the evolution of gene regulation. Using reconstructed ancestral karyotypes of Cetartiodactyls, Ruminants, Pecorans, and Bovids, we traced patterns of gross chromosome changes. We found that the lineage leading to the ruminant ancestor after the split from other cetartiodactyls was characterized by mostly intrachromosomal changes, whereas the lineage leading to the pecoran ancestor (including all livestock ruminants) included multiple interchromosomal changes. We observed that the liver cell putative enhancers in the ruminant evolutionary breakpoint regions are highly enriched for DNA sequences under selective constraint acting on lineage-specific transposable elements (TEs) and a set of 25 specific transcription factor (TF) binding motifs associated with recently active TEs. Coupled with gene expression data, we found that genes near ruminant breakpoint regions exhibit more divergent expression profiles among species, particularly in cattle, which is consistent with the phylogenetic origin of these breakpoint regions. This divergence was significantly greater in genes with enhancers that contain at least one of the 25 specific TF binding motifs and located near bovidae-to-cattle lineage breakpoint regions. Taken together, by combining ancestral karyotype reconstructions with analysis of cis regulatory element and gene expression evolution, our work demonstrated that lineage-specific regulatory elements colocalized with gross chromosome rearrangements may have provided valuable functional modifications that helped to shape ruminant evolution.
Collapse
Affiliation(s)
- Marta Farré
- Royal Veterinary College, University of London, London NW1 0TU, United Kingdom
| | - Jaebum Kim
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Korea
| | - Anastasia A Proskuryakova
- Institute of Molecular and Cellular Biology, SB RAS, Novosibirsk 630090, Russia.,Synthetic Biology Unit, Novosibirsk State University, Novosibirsk 630090, Russia
| | - Yang Zhang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | | | - Qiye Li
- China National GeneBank, BGI-Shenzhen, Shenzhen 518083, China
| | - Yang Zhou
- China National GeneBank, BGI-Shenzhen, Shenzhen 518083, China
| | - Yingqi Xiong
- China National GeneBank, BGI-Shenzhen, Shenzhen 518083, China
| | - Jennifer L Johnson
- Department of Animal Sciences, College of Agricultural, Consumer and Environmental Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Polina L Perelman
- Institute of Molecular and Cellular Biology, SB RAS, Novosibirsk 630090, Russia.,Synthetic Biology Unit, Novosibirsk State University, Novosibirsk 630090, Russia
| | - Warren E Johnson
- Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, Virginia 22630, USA.,Walter Reed Biosystematics Unit, Museum Support Center, Smithsonian Institution, Suitland, Maryland 20746, USA
| | - Wesley C Warren
- Bond Life Sciences Center, University of Missouri, Columbia, Missouri 63201, USA
| | - Anna V Kukekova
- Department of Animal Sciences, College of Agricultural, Consumer and Environmental Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Guojie Zhang
- China National GeneBank, BGI-Shenzhen, Shenzhen 518083, China.,State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,Centre for Social Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark
| | - Stephen J O'Brien
- Theodosius Dobzhansky Center for Genome Bioinformatics, St. Petersburg State University, St. Petersburg 199004, Russia.,Guy Harvey Oceanographic Center, Halmos College of Natural Sciences and Oceanography, Nova Southeastern University, Fort Lauderdale, Florida 33004, USA
| | - Oliver A Ryder
- Institute for Conservation Research, San Diego Zoo, Escondido, California 92027, USA
| | - Alexander S Graphodatsky
- Institute of Molecular and Cellular Biology, SB RAS, Novosibirsk 630090, Russia.,Synthetic Biology Unit, Novosibirsk State University, Novosibirsk 630090, Russia
| | - Jian Ma
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | - Harris A Lewin
- Department of Evolution and Ecology and the UC Davis Genome Center, University of California, Davis, California 95616, USA
| | - Denis M Larkin
- Royal Veterinary College, University of London, London NW1 0TU, United Kingdom.,The Federal Research Center Institute of Cytology and Genetics, The Siberian Branch of the Russian Academy of Sciences (ICG SB RAS), Novosibirsk 630090, Russia
| |
Collapse
|
24
|
Sen SQ, Chanchani S, Southall TD, Doe CQ. Neuroblast-specific open chromatin allows the temporal transcription factor, Hunchback, to bind neuroblast-specific loci. eLife 2019; 8:44036. [PMID: 30694180 PMCID: PMC6377230 DOI: 10.7554/elife.44036] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Accepted: 01/24/2019] [Indexed: 12/12/2022] Open
Abstract
Spatial and temporal cues are required to specify neuronal diversity, but how these cues are integrated in neural progenitors remains unknown. Drosophila progenitors (neuroblasts) are a good model: they are individually identifiable with relevant spatial and temporal transcription factors known. Here we test whether spatial/temporal factors act independently or sequentially in neuroblasts. We used Targeted DamID to identify genomic binding sites of the Hunchback temporal factor in two neuroblasts (NB5-6 and NB7-4) that make different progeny. Hunchback targets were different in each neuroblast, ruling out the independent specification model. Moreover, each neuroblast had distinct open chromatin domains, which correlated with differential Hb-bound loci in each neuroblast. Importantly, the Gsb/Pax3 spatial factor, expressed in NB5-6 but not NB7-4, had genomic binding sites correlated with open chromatin in NB5-6, but not NB7-4. Our data support a model in which early-acting spatial factors like Gsb establish neuroblast-specific open chromatin domains, leading to neuroblast-specific temporal factor binding and the production of different neurons in each neuroblast lineage. The human brain is considered to be the most complicated object in the universe, but it only takes a handful of stem cells to make one. The process depends on two types of information: signals separated across space and time. Spatial cues tell a stem cell what type of cell it is going to be, while temporal cues work as molecular clocks to generate a sequence of different neurons over time. Together, these cues generate the large array of cell types in the nervous system. Each stem cell occupies its own space in the developing body and receives its own spatial cues, but they all follow the same timeline. For example, proteins called transcription factors act as molecular clocks and interact with specific genes, telling the cell when to turn them on or off. The same series of transcription factors operates in different stem cells, but they have different effects. So far, it has been unclear whether spatial and temporal signals work independently or sequentially to generate new cell types. To find out, Sen et al. studied two distinct, developing stem cells in fruit flies, which receive different spatial signals. Transcription factors only work if they are able to get to their target genes. Cells can open or close access to different genes by changing the structure of the chromatin wrapping that surrounds the genes. In the experiments, a marker was used to reveal the areas of open chromatin in each of the cells. Another marker was used to track the transcription factors. The results showed that the areas of open chromatin varied between stem cells. Moreover, although both cells used the same transcription factor called Hunchback, it targeted different genes in each stem cell. This was due to changes in the chromatin wrapping: Hunchback only acted in areas where the chromatin was open. This suggests that the spatial cues first sculpt the chromatin, making some genes easier to get to than others. Then, the same transcription factors go to the accessible gene, which will differ from one stem cell to another. These findings help us to understand how different types of brain cells develop, which may also aid us in finding a way how to engineer specific cell types. If we could turn stem cells into different types of brain cells, it might help us to treat brain diseases. This may involve giving the right spatial signal before starting the temporal cues.
Collapse
Affiliation(s)
- Sonia Q Sen
- Institute of Neuroscience, Institute of Molecular Biology, Howard Hughes Medical Institute, University of Oregon, Eugene, United States
| | - Sachin Chanchani
- Institute of Neuroscience, Institute of Molecular Biology, Howard Hughes Medical Institute, University of Oregon, Eugene, United States
| | - Tony D Southall
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Chris Q Doe
- Institute of Neuroscience, Institute of Molecular Biology, Howard Hughes Medical Institute, University of Oregon, Eugene, United States
| |
Collapse
|
25
|
Evidence for Stabilizing Selection Driving Mutational Turnover of Short Motifs in the Eukaryotic Complementary Sex Determiner (Csd) Protein. G3-GENES GENOMES GENETICS 2018; 8:3803-3812. [PMID: 30287489 PMCID: PMC6288827 DOI: 10.1534/g3.118.200527] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Short linear motifs (SLiMs) can play pivotal functional roles in proteins, such as targeting proteins to specific subcellular localizations, modulating the efficiency of translation and tagging proteins for degradation. Until recently we had little knowledge about SLiM evolution. Only a few amino acids in these motifs are functionally important, making them likely to evolve ex nihilo and suggesting that they can play key roles in protein evolution. Several reports now suggest that these motifs can appear and disappear while their function in the protein is preserved, a process sometimes referred to as “turnover”. However, there has been a lack of specific experiments to determine whether independently evolved motifs do indeed have the same function, which would conclusively determine whether the process of turnover actually occurs. In this study, we experimentally detected evidence for such a mutational turnover process for nuclear localization signals (NLS) during the post-duplication divergence of the Complementary sex determiner (Csd) and Feminizer (Fem) proteins in the honeybee (Apis mellifera) lineage. Experiments on the nuclear transport activity of protein segments and those of the most recent common ancestor (MRCA) sequences revealed that three new NLS motifs evolved in the Csd protein during the post-duplication divergence while other NLS motifs were lost that existed before duplication. A screen for essential and newly evolved amino acids revealed that new motifs in the Csd protein evolved by one or two missense mutations coding for lysine. Amino acids that were predating the duplication were also essential in the acquisition of the C1 motif suggesting that the ex nihilo origin was constrained by preexisting amino acids in the physical proximity. Our data support a model in which stabilizing selection maintains the constancy of nuclear transport function but allowed mutational turnover of the encoding NLS motifs.
Collapse
|
26
|
Hamm DC, Harrison MM. Regulatory principles governing the maternal-to-zygotic transition: insights from Drosophila melanogaster. Open Biol 2018; 8:180183. [PMID: 30977698 PMCID: PMC6303782 DOI: 10.1098/rsob.180183] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Accepted: 11/09/2018] [Indexed: 12/19/2022] Open
Abstract
The onset of metazoan development requires that two terminally differentiated germ cells, a sperm and an oocyte, become reprogrammed to the totipotent embryo, which can subsequently give rise to all the cell types of the adult organism. In nearly all animals, maternal gene products regulate the initial events of embryogenesis while the zygotic genome remains transcriptionally silent. Developmental control is then passed from mother to zygote through a process known as the maternal-to-zygotic transition (MZT). The MZT comprises an intimately connected set of molecular events that mediate degradation of maternally deposited mRNAs and transcriptional activation of the zygotic genome. This essential developmental transition is conserved among metazoans but is perhaps best understood in the fruit fly, Drosophila melanogaster. In this article, we will review our understanding of the events that drive the MZT in Drosophila embryos and highlight parallel mechanisms driving this transition in other animals.
Collapse
Affiliation(s)
| | - Melissa M. Harrison
- Department of Biomolecular Chemistry, University of Wisconsin School of Medicine and Public Health, Madison, WI 53706, USA
| |
Collapse
|
27
|
Vincent BJ, Staller MV, Lopez-Rivera F, Bragdon MDJ, Pym ECG, Biette KM, Wunderlich Z, Harden TT, Estrada J, DePace AH. Hunchback is counter-repressed to regulate even-skipped stripe 2 expression in Drosophila embryos. PLoS Genet 2018; 14:e1007644. [PMID: 30192762 PMCID: PMC6145585 DOI: 10.1371/journal.pgen.1007644] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 09/19/2018] [Accepted: 08/17/2018] [Indexed: 01/18/2023] Open
Abstract
Hunchback is a bifunctional transcription factor that can activate and repress gene expression in Drosophila development. We investigated the regulatory DNA sequence features that control Hunchback function by perturbing enhancers for one of its target genes, even-skipped (eve). While Hunchback directly represses the eve stripe 3+7 enhancer, we found that in the eve stripe 2+7 enhancer, Hunchback repression is prevented by nearby sequences-this phenomenon is called counter-repression. We also found evidence that Caudal binding sites are responsible for counter-repression, and that this interaction may be a conserved feature of eve stripe 2 enhancers. Our results alter the textbook view of eve stripe 2 regulation wherein Hb is described as a direct activator. Instead, to generate stripe 2, Hunchback repression must be counteracted. We discuss how counter-repression may influence eve stripe 2 regulation and evolution.
Collapse
Affiliation(s)
- Ben J. Vincent
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Max V. Staller
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Francheska Lopez-Rivera
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Meghan D. J. Bragdon
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Edward C. G. Pym
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Kelly M. Biette
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Zeba Wunderlich
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Timothy T. Harden
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
28
|
Haines JE, Eisen MB. Patterns of chromatin accessibility along the anterior-posterior axis in the early Drosophila embryo. PLoS Genet 2018; 14:e1007367. [PMID: 29727464 PMCID: PMC5955596 DOI: 10.1371/journal.pgen.1007367] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 05/16/2018] [Accepted: 04/17/2018] [Indexed: 12/20/2022] Open
Abstract
As the Drosophila embryo transitions from the use of maternal RNAs to zygotic transcription, domains of open chromatin, with relatively low nucleosome density and specific histone marks, are established at promoters and enhancers involved in patterned embryonic transcription. However it remains unclear how regions of activity are established during early embryogenesis, and if they are the product of spatially restricted or ubiquitous processes. To shed light on this question, we probed chromatin accessibility across the anterior-posterior axis (A-P) of early Drosophila melanogaster embryos by applying a transposon based assay for chromatin accessibility (ATAC-seq) to anterior and posterior halves of hand-dissected, cellular blastoderm embryos. We find that genome-wide chromatin accessibility is highly similar between the two halves, with regions that manifest significant accessibility in one half of the embryo almost always accessible in the other half, even for promoters that are active in exclusively one half of the embryo. These data support previous studies that show that chromatin accessibility is not a direct result of activity, and point to a role for ubiquitous factors or processes in establishing chromatin accessibility at promoters in the early embryo. However, in concordance with similar works, we find that at enhancers active exclusively in one half of the embryo, we observe a significant skew towards greater accessibility in the region of their activity, highlighting the role of patterning factors such as Bicoid in this process.
Collapse
Affiliation(s)
- Jenna E. Haines
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States of America
| | - Michael B. Eisen
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States of America
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States of America
- Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, United States of America
| |
Collapse
|
29
|
True equilibrium measurement of transcription factor-DNA binding affinities using automated polarization microscopy. Nat Commun 2018; 9:1605. [PMID: 29686282 PMCID: PMC5913336 DOI: 10.1038/s41467-018-03977-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Accepted: 03/16/2018] [Indexed: 01/31/2023] Open
Abstract
The complex patterns of gene expression in metazoans are controlled by selective binding of transcription factors (TFs) to regulatory DNA. To improve the quantitative understanding of this process, we have developed a novel method that uses fluorescence anisotropy measurements in a controlled delivery system to determine TF-DNA binding energies in solution with high sensitivity and throughput. Owing to its large dynamic range, the method, named high performance fluorescence anisotropy (HiP-FA), allows for reliable quantification of both weak and strong binding; binding specificities are calculated on the basis of equilibrium constant measurements for mutational DNA variants. We determine the binding preference landscapes for 26 TFs and measure high absolute affinities, but mostly lower binding specificities than reported by other methods. The revised binding preferences give rise to improved predictions of in vivo TF occupancy and enhancer expression. Our approach provides a powerful new tool for the systems-biological analysis of gene regulation. Methods to measure selective transcription factor-DNA binding often lack sensitivity and are not performed in solution. Here the authors develop a method to perform fluorescence anisotropy measurements of transcription factor-DNA binding energies with high sensitivity and throughput.
Collapse
|
30
|
Marinov GK, Kundaje A. ChIP-ping the branches of the tree: functional genomics and the evolution of eukaryotic gene regulation. Brief Funct Genomics 2018; 17:116-137. [PMID: 29529131 PMCID: PMC5889016 DOI: 10.1093/bfgp/ely004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Advances in the methods for detecting protein-DNA interactions have played a key role in determining the directions of research into the mechanisms of transcriptional regulation. The most recent major technological transformation happened a decade ago, with the move from using tiling arrays [chromatin immunoprecipitation (ChIP)-on-Chip] to high-throughput sequencing (ChIP-seq) as a readout for ChIP assays. In addition to the numerous other ways in which it is superior to arrays, by eliminating the need to design and manufacture them, sequencing also opened the door to carrying out comparative analyses of genome-wide transcription factor occupancy across species and studying chromatin biology in previously less accessible model and nonmodel organisms, thus allowing us to understand the evolution and diversity of regulatory mechanisms in unprecedented detail. Here, we review the biological insights obtained from such studies in recent years and discuss anticipated future developments in the field.
Collapse
Affiliation(s)
- Georgi K Marinov
- Corresponding author: Georgi K. Marinov, Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA. E-mail:
| | | |
Collapse
|
31
|
Dynamic evolution of regulatory element ensembles in primate CD4 + T cells. Nat Ecol Evol 2018; 2:537-548. [PMID: 29379187 DOI: 10.1038/s41559-017-0447-5] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Accepted: 12/08/2017] [Indexed: 12/12/2022]
Abstract
How evolutionary changes at enhancers affect the transcription of target genes remains an important open question. Previous comparative studies of gene expression have largely measured the abundance of messenger RNA, which is affected by post-transcriptional regulatory processes, hence limiting inferences about the mechanisms underlying expression differences. Here, we directly measured nascent transcription in primate species, allowing us to separate transcription from post-transcriptional regulation. We used precision run-on and sequencing to map RNA polymerases in resting and activated CD4+ T cells in multiple human, chimpanzee and rhesus macaque individuals, with rodents as outgroups. We observed general conservation in coding and non-coding transcription, punctuated by numerous differences between species, particularly at distal enhancers and non-coding RNAs. Genes regulated by larger numbers of enhancers are more frequently transcribed at evolutionarily stable levels, despite reduced conservation at individual enhancers. Adaptive nucleotide substitutions are associated with lineage-specific transcription and at one locus, SGPP2, we predict and experimentally validate that multiple substitutions contribute to human-specific transcription. Collectively, our findings suggest a pervasive role for evolutionary compensation across ensembles of enhancers that jointly regulate target genes.
Collapse
|
32
|
A conserved maternal-specific repressive domain in Zelda revealed by Cas9-mediated mutagenesis in Drosophila melanogaster. PLoS Genet 2017; 13:e1007120. [PMID: 29261646 PMCID: PMC5752043 DOI: 10.1371/journal.pgen.1007120] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Revised: 01/03/2018] [Accepted: 11/20/2017] [Indexed: 12/13/2022] Open
Abstract
In nearly all metazoans, the earliest stages of development are controlled by maternally deposited mRNAs and proteins. The zygotic genome becomes transcriptionally active hours after fertilization. Transcriptional activation during this maternal-to-zygotic transition (MZT) is tightly coordinated with the degradation of maternally provided mRNAs. In Drosophila melanogaster, the transcription factor Zelda plays an essential role in widespread activation of the zygotic genome. While Zelda expression is required both maternally and zygotically, the mechanisms by which it functions to remodel the embryonic genome and prepare the embryo for development remain unclear. Using Cas9-mediated genome editing to generate targeted mutations in the endogenous zelda locus, we determined the functional relevance of protein domains conserved amongst Zelda orthologs. We showed that neither a conserved N-terminal zinc finger nor an acidic patch were required for activity. Similarly, a previously identified splice isoform of zelda is dispensable for viability. By contrast, we identified a highly conserved zinc-finger domain that is essential for the maternal, but not zygotic functions of Zelda. Animals homozygous for mutations in this domain survived to adulthood, but embryos inheriting these loss-of-function alleles from their mothers died late in embryogenesis. These mutations did not interfere with the capacity of Zelda to activate transcription in cell culture. Unexpectedly, these mutations generated a hyperactive form of the protein and enhanced Zelda-dependent gene expression. These data have defined a protein domain critical for controlling Zelda activity during the MZT, but dispensable for its roles later in development, for the first time separating the maternal and zygotic requirements for Zelda. This demonstrates that highly regulated levels of Zelda activity are required for establishing the developmental program during the MZT. We propose that tightly regulated gene expression is essential to navigate the MZT and that failure to precisely execute this developmental program leads to embryonic lethality. Following fertilization, the one-celled zygote must be rapidly reprogrammed to enable the development of a new, unique organism. During these initial stages of development there is little or no transcription of the zygotic genome, and maternally deposited products control this process. Among the essential maternal products are mRNAs that encode transcription factors required for preparing the zygotic genome for transcriptional activation. This ensures that there is a precisely coordinated hand-off from maternal to zygotic control. In Drosophila melanogaster, the transcription factor Zelda is essential for activating the zygotic genome and coupling this activation to the degradation of the maternally deposited products. Nonetheless, the mechanism by which Zelda functions remains unclear. Here we used Cas9-mediated genome engineering to determine the functional requirements for highly conserved domains within Zelda. We identified a domain required specifically for Zelda’s role in reprogramming the early embryonic genome, but not essential for its functions later in development. Surprisingly, this domain restricts the ability of Zelda to activate transcription. These data demonstrate that Zelda activity is tightly regulated, and we propose that precise regulation of both the timing and levels of genome activation is required for the embryo to successfully transition from maternal to zygotic control.
Collapse
|
33
|
Characterization of dFOXO binding sites upstream of the Insulin Receptor P2 promoter across the Drosophila phylogeny. PLoS One 2017; 12:e0188357. [PMID: 29200426 PMCID: PMC5714339 DOI: 10.1371/journal.pone.0188357] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2017] [Accepted: 11/06/2017] [Indexed: 01/01/2023] Open
Abstract
The insulin/TOR signal transduction pathway plays a critical role in determining such important traits as body and organ size, metabolic homeostasis and life span. Although this pathway is highly conserved across the animal kingdom, the affected traits can exhibit important differences even between closely related species. Evolutionary studies of regulatory regions require the reliable identification of transcription factor binding sites. Here we have focused on the Insulin Receptor (InR) expression from its P2 promoter in the Drosophila genus, which in D. melanogaster is up-regulated by hypophosphorylated Drosophila FOXO (dFOXO). We have finely characterized this transcription factor binding sites in vitro along the 1.3 kb region upstream of the InR P2 promoter in five Drosophila species. Moreover, we have tested the effect of mutations in the characterized dFOXO sites of D. melanogaster in transgenic flies. The number of experimentally established binding sites varies across the 1.3 kb region of any particular species, and their distribution also differs among species. In D. melanogaster, InR expression from P2 is differentially affected by dFOXO binding sites at the proximal and distal halves of the species 1.3 kb fragment. The observed uneven distribution of binding sites across this fragment might underlie their differential contribution to regulate InR transcription.
Collapse
|
34
|
Divergence of regulatory networks governed by the orthologous transcription factors FLC and PEP1 in Brassicaceae species. Proc Natl Acad Sci U S A 2017; 114:E11037-E11046. [PMID: 29203652 PMCID: PMC5754749 DOI: 10.1073/pnas.1618075114] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Genome-wide landscapes of transcription factor (TF) binding sites (BSs) diverge during evolution, conferring species-specific transcriptional patterns. The rate of divergence varies in different metazoan lineages but has not been widely studied in plants. We identified the BSs and assessed the effects on transcription of FLOWERING LOCUS C (FLC) and PERPETUAL FLOWERING 1 (PEP1), two orthologous MADS-box TFs that repress flowering and confer vernalization requirement in the Brassicaceae species Arabidopsis thaliana and Arabis alpina, respectively. We found that only 14% of their BSs were conserved in both species and that these contained a CArG-box that is recognized by MADS-box TFs. The CArG-box consensus at conserved BSs was extended compared with the core motif. By contrast, species-specific BSs usually lacked the CArG-box in the other species. Flowering-time genes were highly overrepresented among conserved targets, and their CArG-boxes were widely conserved among Brassicaceae species. Cold-regulated (COR) genes were also overrepresented among targets, but the cognate BSs and the identity of the regulated genes were usually different in each species. In cold, COR gene transcript levels were increased in flc and pep1-1 mutants compared with WT, and this correlated with reduced growth in pep1-1 Therefore, FLC orthologs regulate a set of conserved target genes mainly involved in reproductive development and were later independently recruited to modulate stress responses in different Brassicaceae lineages. Analysis of TF BSs in these lineages thus distinguishes widely conserved targets representing the core function of the TF from those that were recruited later in evolution.
Collapse
|
35
|
Momen-Roknabadi A, Di Talia S, Wieschaus E. Transcriptional Timers Regulating Mitosis in Early Drosophila Embryos. Cell Rep 2017; 16:2793-2801. [PMID: 27626650 DOI: 10.1016/j.celrep.2016.08.034] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Revised: 06/30/2016] [Accepted: 08/11/2016] [Indexed: 11/30/2022] Open
Abstract
The development of an embryo requires precise spatiotemporal regulation of cellular processes. During Drosophila gastrulation, a precise temporal pattern of cell division is encoded through transcriptional regulation of cdc25(string) in 25 distinct mitotic domains. Using a genetic screen, we demonstrate that the same transcription factors that regulate the spatial pattern of cdc25(string) transcription encode its temporal activation. We identify buttonhead and empty spiracles as the major activators of cdc25(string) expression in mitotic domain 2. The effect of these activators is balanced through repression by hairy, sloppy paired 1, and huckebein. Within the mitotic domain, temporal precision of mitosis is robust and unaffected by changing dosage of rate-limiting transcriptional factors. However, precision can be disrupted by altering the levels of the two activators or two repressors. We propose that the additive and balanced action of activators and repressors is a general strategy for precise temporal regulation of cellular transitions during development.
Collapse
Affiliation(s)
- Amir Momen-Roknabadi
- Howard Hughes Medical Institute, Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
| | - Stefano Di Talia
- Department of Cell Biology, Duke University Medical Center, Durham, NC 27710, USA.
| | - Eric Wieschaus
- Howard Hughes Medical Institute, Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA.
| |
Collapse
|
36
|
Yang B, Wittkopp PJ. Structure of the Transcriptional Regulatory Network Correlates with Regulatory Divergence in Drosophila. Mol Biol Evol 2017; 34:1352-1362. [PMID: 28333240 DOI: 10.1093/molbev/msx068] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Transcriptional control of gene expression is regulated by biochemical interactions between cis-regulatory DNA sequences and trans-acting factors that form complex regulatory networks. Genetic changes affecting both cis- and trans-acting sequences in these networks have been shown to alter patterns of gene expression as well as higher-order organismal phenotypes. Here, we investigate how the structure of these regulatory networks relates to patterns of polymorphism and divergence in gene expression. To do this, we compared a transcriptional regulatory network inferred for Drosophila melanogaster to differences in gene regulation observed between two strains of D. melanogaster as well as between two pairs of closely related species: Drosophila sechellia and Drosophila simulans, and D. simulans and D. melanogaster. We found that the number of transcription factors predicted to directly regulate a gene ("in-degree") was negatively correlated with divergence in both gene expression (mRNA abundance) and cis-regulation. This observation suggests that the number of transcription factors directly regulating a gene's expression affects the conservation of cis-regulation and gene expression over evolutionary time. We also tested the hypothesis that transcription factors regulating more target genes (higher "out-degree") are less likely to evolve changes in their cis-regulation and expression (presumably due to increased pleiotropy), but found little support for this predicted relationship. Taken together, these data show how the architecture of regulatory networks can influence regulatory evolution.
Collapse
Affiliation(s)
- Bing Yang
- Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI
| | - Patricia J Wittkopp
- Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI.,Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| |
Collapse
|
37
|
Mir M, Reimer A, Haines JE, Li XY, Stadler M, Garcia H, Eisen MB, Darzacq X. Dense Bicoid hubs accentuate binding along the morphogen gradient. Genes Dev 2017; 31:1784-1794. [PMID: 28982761 PMCID: PMC5666676 DOI: 10.1101/gad.305078.117] [Citation(s) in RCA: 122] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2017] [Accepted: 09/06/2017] [Indexed: 11/24/2022]
Abstract
Morphogen gradients direct the spatial patterning of developing embryos; however, the mechanisms by which these gradients are interpreted remain elusive. Here we used lattice light-sheet microscopy to perform in vivo single-molecule imaging in early Drosophila melanogaster embryos of the transcription factor Bicoid that forms a gradient and initiates patterning along the anteroposterior axis. In contrast to canonical models, we observed that Bicoid binds to DNA with a rapid off rate throughout the embryo such that its average occupancy at target loci is on-rate-dependent. We further observed Bicoid forming transient "hubs" of locally high density that facilitate binding as factor levels drop, including in the posterior, where we observed Bicoid binding despite vanishingly low protein levels. We propose that localized modulation of transcription factor on rates via clustering provides a general mechanism to facilitate binding to low-affinity targets and that this may be a prevalent feature of other developmental transcription factors.
Collapse
Affiliation(s)
- Mustafa Mir
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, 94720, USA
| | - Armando Reimer
- Biophysics Graduate Group, University of California at Berkeley, Berkeley, California, 94720, USA
| | - Jenna E Haines
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, 94720, USA
| | - Xiao-Yong Li
- Howard Hughes Medical Institute, University of California at Berkeley, Berkeley, California, 94720, USA
| | - Michael Stadler
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, 94720, USA
| | - Hernan Garcia
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, 94720, USA
- Biophysics Graduate Group, University of California at Berkeley, Berkeley, California, 94720, USA
- Department of Physics, University of California at Berkeley, Berkeley, California, 94720, USA
| | - Michael B Eisen
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, 94720, USA
- Biophysics Graduate Group, University of California at Berkeley, Berkeley, California, 94720, USA
- Howard Hughes Medical Institute, University of California at Berkeley, Berkeley, California, 94720, USA
- Department of Integrative Biology, University of California at Berkeley, Berkeley, California, 94720, USA
| | - Xavier Darzacq
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, 94720, USA
| |
Collapse
|
38
|
Wang Y, Ung MH, Xia T, Cheng W, Cheng C. Cancer cell line specific co-factors modulate the FOXM1 cistrome. Oncotarget 2017; 8:76498-76515. [PMID: 29100329 PMCID: PMC5652723 DOI: 10.18632/oncotarget.20405] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Accepted: 08/14/2017] [Indexed: 12/11/2022] Open
Abstract
ChIP-seq has been commonly applied to identify genomic occupation of transcription factors (TFs) in a context-specific manner. It is generally assumed that a TF should have similar binding patterns in cells from the same or closely related tissues. Surprisingly, this assumption has not been carefully examined. To this end, we systematically compared the genomic binding of the cell cycle regulator FOXM1 in eight cell lines from seven different human tissues at binding signal, peaks and target genes levels. We found that FOXM1 binding in ER-positive breast cancer cell line MCF-7 are distinct comparing to those in not only other non-breast cell lines, but also MDA-MB-231, ER-negative breast cancer cell line. However, binding sites in MDA-MB-231 and non-breast cell lines were highly consistent. The recruitment of estrogen receptor alpha (ERα) caused the unique FOXM1 binding patterns in MCF-7. Moreover, the activity of FOXM1 in MCF-7 reflects the regulatory functions of ERα, while in MDA-MB-231 and non-breast cell lines, FOXM1 activities regulate cell proliferation. Our results suggest that tissue similarity, in some specific contexts, does not hold precedence over TF-cofactors interactions in determining transcriptional states and that the genomic binding of a TF can be dramatically affected by a particular co-factor under certain conditions.
Collapse
Affiliation(s)
- Yue Wang
- School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China.,Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA
| | - Matthew H Ung
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA
| | - Tian Xia
- School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Wenqing Cheng
- School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Chao Cheng
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA.,Norris Cotton Cancer Center, Geisel School of Medicine at Dartmouth, Lebanon, NH 03766, USA.,Department of Biomedical Data Sciences, Geisel School of Medicine at Dartmouth, Lebanon, NH 03766, USA
| |
Collapse
|
39
|
Khoueiry P, Girardot C, Ciglar L, Peng PC, Gustafson EH, Sinha S, Furlong EE. Uncoupling evolutionary changes in DNA sequence, transcription factor occupancy and enhancer activity. eLife 2017; 6. [PMID: 28792889 PMCID: PMC5550276 DOI: 10.7554/elife.28440] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 07/21/2017] [Indexed: 12/15/2022] Open
Abstract
Sequence variation within enhancers plays a major role in both evolution and disease, yet its functional impact on transcription factor (TF) occupancy and enhancer activity remains poorly understood. Here, we assayed the binding of five essential TFs over multiple stages of embryogenesis in two distant Drosophila species (with 1.4 substitutions per neutral site), identifying thousands of orthologous enhancers with conserved or diverged combinatorial occupancy. We used these binding signatures to dissect two properties of developmental enhancers: (1) potential TF cooperativity, using signatures of co-associations and co-divergence in TF occupancy. This revealed conserved combinatorial binding despite sequence divergence, suggesting protein-protein interactions sustain conserved collective occupancy. (2) Enhancer in-vivo activity, revealing orthologous enhancers with conserved activity despite divergence in TF occupancy. Taken together, we identify enhancers with diverged motifs yet conserved occupancy and others with diverged occupancy yet conserved activity, emphasising the need to functionally measure the effect of divergence on enhancer activity.
Collapse
Affiliation(s)
- Pierre Khoueiry
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Charles Girardot
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Lucia Ciglar
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Pei-Chen Peng
- Carl R. Woese Institute of Genomic Biology, University of Illinois, Champaign, United States
| | - E Hilary Gustafson
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Saurabh Sinha
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.,Carl R. Woese Institute of Genomic Biology, University of Illinois, Champaign, United States
| | - Eileen Em Furlong
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| |
Collapse
|
40
|
Moshe A, Kaplan T. Genome-wide search for Zelda-like chromatin signatures identifies GAF as a pioneer factor in early fly development. Epigenetics Chromatin 2017; 10:33. [PMID: 28676122 PMCID: PMC5496641 DOI: 10.1186/s13072-017-0141-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2017] [Accepted: 06/28/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The protein Zelda was shown to play a key role in early Drosophila development, binding thousands of promoters and enhancers prior to maternal-to-zygotic transition (MZT), and marking them for transcriptional activation. Recently, we showed that Zelda acts through specific chromatin patterns of histone modifications to mark developmental enhancers and active promoters. Intriguingly, some Zelda sites still maintain these chromatin patterns in Drosophila embryos lacking maternal Zelda protein. This suggests that additional Zelda-like pioneer factors may act in early fly embryos. RESULTS We developed a computational method to analyze and refine the chromatin landscape surrounding early Zelda peaks, using a multichannel spectral clustering. This allowed us to characterize their chromatin patterns through MZT (mitotic cycles 8-14). Specifically, we focused on H3K4me1, H3K4me3, H3K18ac, H3K27ac, and H3K27me3 and identified three different classes of chromatin signatures, matching "promoters," "enhancers" and "transiently bound" Zelda peaks. We then further scanned the genome using these chromatin patterns and identified additional loci-with no Zelda binding-that show similar chromatin patterns, resulting with hundreds of Zelda-independent putative enhancers. These regions were found to be enriched with GAGA factor (GAF, Trl) and are typically located near early developmental zygotic genes. Overall our analysis suggests that GAF, together with Zelda, plays an important role in activating the zygotic genome. CONCLUSIONS As we show, our computational approach offers an efficient algorithm for characterizing chromatin signatures around some loci of interest and allows a genome-wide identification of additional loci with similar chromatin patterns.
Collapse
Affiliation(s)
- Arbel Moshe
- School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, 91904, Israel
| | - Tommy Kaplan
- School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, 91904, Israel.
| |
Collapse
|
41
|
Lee KB, Wang J, Palme J, Escalante-Chong R, Hua B, Springer M. Polymorphisms in the yeast galactose sensor underlie a natural continuum of nutrient-decision phenotypes. PLoS Genet 2017; 13:e1006766. [PMID: 28542190 PMCID: PMC5464677 DOI: 10.1371/journal.pgen.1006766] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2016] [Revised: 06/08/2017] [Accepted: 04/19/2017] [Indexed: 01/26/2023] Open
Abstract
In nature, microbes often need to "decide" which of several available nutrients to utilize, a choice that depends on a cell's inherent preference and external nutrient levels. While natural environments can have mixtures of different nutrients, phenotypic variation in microbes' decisions of which nutrient to utilize is poorly studied. Here, we quantified differences in the concentration of glucose and galactose required to induce galactose-responsive (GAL) genes across 36 wild S. cerevisiae strains. Using bulk segregant analysis, we found that a locus containing the galactose sensor GAL3 was associated with differences in GAL signaling in eight different crosses. Using allele replacements, we confirmed that GAL3 is the major driver of GAL induction variation, and that GAL3 allelic variation alone can explain as much as 90% of the variation in GAL induction in a cross. The GAL3 variants we found modulate the diauxic lag, a selectable trait. These results suggest that ecological constraints on the galactose pathway may have led to variation in a single protein, allowing cells to quantitatively tune their response to nutrient changes in the environment.
Collapse
Affiliation(s)
- Kayla B. Lee
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Jue Wang
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
- Systems Biology Graduate Program, Harvard University, Cambridge, Massachusetts, United States of America
- Ginkgo Bioworks, Boston, Massachusetts, United States of America
| | - Julius Palme
- Plant Systems Biology, School of Life Sciences Weihenstephan, Technische Universität, München, Freising, Germany
| | | | - Bo Hua
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
- Systems Biology Graduate Program, Harvard University, Cambridge, Massachusetts, United States of America
| | - Michael Springer
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
42
|
Chertkova AA, Schiffman JS, Nuzhdin SV, Kozlov KN, Samsonova MG, Gursky VV. In silico evolution of the Drosophila gap gene regulatory sequence under elevated mutational pressure. BMC Evol Biol 2017; 17:4. [PMID: 28251865 PMCID: PMC5333172 DOI: 10.1186/s12862-016-0866-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Cis-regulatory sequences are often composed of many low-affinity transcription factor binding sites (TFBSs). Determining the evolutionary and functional importance of regulatory sequence composition is impeded without a detailed knowledge of the genotype-phenotype map. RESULTS We simulate the evolution of regulatory sequences involved in Drosophila melanogaster embryo segmentation during early development. Natural selection evaluates gene expression dynamics produced by a computational model of the developmental network. We observe a dramatic decrease in the total number of transcription factor binding sites through the course of evolution. Despite a decrease in average sequence binding energies through time, the regulatory sequences tend towards organisations containing increased high affinity transcription factor binding sites. Additionally, the binding energies of separate sequence segments demonstrate ubiquitous mutual correlations through time. Fewer than 10% of initial TFBSs are maintained throughout the entire simulation, deemed 'core' sites. These sites have increased functional importance as assessed under wild-type conditions and their binding energy distributions are highly conserved. Furthermore, TFBSs within close proximity of core sites exhibit increased longevity, reflecting functional regulatory interactions with core sites. CONCLUSION In response to elevated mutational pressure, evolution tends to sample regulatory sequence organisations with fewer, albeit on average, stronger functional transcription factor binding sites. These organisations are also shaped by the regulatory interactions among core binding sites with sites in their local vicinity.
Collapse
Affiliation(s)
- Aleksandra A. Chertkova
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
| | - Joshua S. Schiffman
- Molecular and Computational Biology, University of Southern California, Los Angeles, 90089 CA USA
| | - Sergey V. Nuzhdin
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
- Molecular and Computational Biology, University of Southern California, Los Angeles, 90089 CA USA
| | - Konstantin N. Kozlov
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
| | - Maria G. Samsonova
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
| | - Vitaly V. Gursky
- Systems Biology and Bioinformatics Laboratory, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, St. Petersburg, 195251 Russia
- Theoretical Department, Ioffe Institute, Polytechnicheskaya, 26, St. Petersburg, 194021 Russia
| |
Collapse
|
43
|
|
44
|
Ren C, Chen H, Yang B, Liu F, Ouyang Z, Bo X, Shu W. iFORM: Incorporating Find Occurrence of Regulatory Motifs. PLoS One 2016; 11:e0168607. [PMID: 27992540 PMCID: PMC5167396 DOI: 10.1371/journal.pone.0168607] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2016] [Accepted: 12/02/2016] [Indexed: 11/18/2022] Open
Abstract
Accurately identifying the binding sites of transcription factors (TFs) is crucial to understanding the mechanisms of transcriptional regulation and human disease. We present incorporating Find Occurrence of Regulatory Motifs (iFORM), an easy-to-use and efficient tool for scanning DNA sequences with TF motifs described as position weight matrices (PWMs). Both performance assessment with a receiver operating characteristic (ROC) curve and a correlation-based approach demonstrated that iFORM achieves higher accuracy and sensitivity by integrating five classical motif discovery programs using Fisher’s combined probability test. We have used iFORM to provide accurate results on a variety of data in the ENCODE Project and the NIH Roadmap Epigenomics Project, and the tool has demonstrated its utility in further elucidating individual roles of functional elements. Both the source and binary codes for iFORM can be freely accessed at https://github.com/wenjiegroup/iFORM. The identified TF binding sites across human cell and tissue types using iFORM have been deposited in the Gene Expression Omnibus under the accession ID GSE53962.
Collapse
Affiliation(s)
- Chao Ren
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, China
| | - Hebing Chen
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, China
| | - Bite Yang
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, China
| | - Feng Liu
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, China
| | - Zhangyi Ouyang
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, China
| | - Xiaochen Bo
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, China
- * E-mail: (WS); (XB)
| | - Wenjie Shu
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, China
- * E-mail: (WS); (XB)
| |
Collapse
|
45
|
Thompson DA, Cubillos FA. Natural gene expression variation studies in yeast. Yeast 2016; 34:3-17. [PMID: 27668700 DOI: 10.1002/yea.3210] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2016] [Revised: 09/16/2016] [Accepted: 09/18/2016] [Indexed: 11/06/2022] Open
Abstract
The rise of sequence information across different yeast species and strains is driving an increasing number of studies in the emerging field of genomics to associate polymorphic variants, mRNA abundance and phenotypic differences between individuals. Here, we gathered evidence from recent studies covering several layers that define the genotype-phenotype gap, such as mRNA abundance, allele-specific expression and translation efficiency to demonstrate how genetic variants co-evolve and define an individual's genome. Moreover, we exposed several antecedents where inter- and intra-specific studies led to opposite conclusions, probably owing to genetic divergence. Future studies in this area will benefit from the access to a massive array of well-annotated genomes and new sequencing technologies, which will allow the fine breakdown of the complex layers that delineate the genotype-phenotype map. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
| | - Francisco A Cubillos
- Centro de Estudios en Ciencia y Tecnología de Alimentos, Universidad de Santiago de Chile, Santiago, Chile.,Millennium Nucleus for Fungal Integrative and Synthetic Biology.,Departamento de Biología, Facultad de Química y Biología, Universidad de Santiago de Chile, Santiago, Chile
| |
Collapse
|
46
|
Laarits T, Bordalo P, Lemos B. Genes under weaker stabilizing selection increase network evolvability and rapid regulatory adaptation to an environmental shift. J Evol Biol 2016; 29:1602-16. [DOI: 10.1111/jeb.12897] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2016] [Revised: 05/03/2016] [Accepted: 05/13/2016] [Indexed: 11/28/2022]
Affiliation(s)
| | - P. Bordalo
- Department of Systems Biology; Harvard Medical School; Boston MA USA
| | - B. Lemos
- Program in Molecular and Integrative Physiological Sciences; Department of Environmental Health; Harvard T. H. Chan School of Public Health; Boston MA USA
| |
Collapse
|
47
|
Qidwai T, Khan MY. Impact of genetic variations in C-C chemokine receptors and ligands on infectious diseases. Hum Immunol 2016; 77:961-971. [PMID: 27316325 DOI: 10.1016/j.humimm.2016.06.010] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2016] [Revised: 06/13/2016] [Accepted: 06/13/2016] [Indexed: 12/24/2022]
Abstract
Chemokine receptors and ligands are crucial for extensive immune response against infectious diseases such as malaria, leishmaniasis, HIV and tuberculosis and a wide variety of other diseases. Role of chemokines are evidenced in the activation and regulation of immune cell migration which is important for immune response against diseases. Outcome of disease is determined by complex interaction among pathogen, host genetic variability and surrounding milieu. Variation in expression or function of chemokines caused by genetic polymorphisms could be associated with attenuated immune responses. Exploration of chemokine genetic polymorphisms in therapeutic response, gene regulation and disease outcome is important. Infectious agents in human host alter the expression of chemokines via epigenetic alterations and thus contribute to disease pathogenesis. Although some fragmentary data are available on chemokine genetic variations and their contribution in diseases, no unequivocal conclusion has been arrived as yet. We therefore, aim to investigate the association of CCR5-CCL5 and CCR2-CCL2 genetic polymorphisms with different infectious diseases, transcriptional regulation of gene, disease severity and response to therapy. Furthermore, the role of epigenetics in genes related to chemokines and infectious disease are also discussed.
Collapse
Affiliation(s)
- Tabish Qidwai
- Department of Biotechnology, Babasaheb Bhimrao Ambedkar University, Lucknow 226 025, India.
| | - M Y Khan
- Department of Biotechnology, Babasaheb Bhimrao Ambedkar University, Lucknow 226 025, India.
| |
Collapse
|
48
|
Dresch JM, Zellers RG, Bork DK, Drewell RA. Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome. GENE REGULATION AND SYSTEMS BIOLOGY 2016; 10:21-33. [PMID: 27330274 PMCID: PMC4907338 DOI: 10.4137/grsb.s38462] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/05/2016] [Revised: 04/17/2016] [Accepted: 04/28/2016] [Indexed: 01/14/2023]
Abstract
A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development.
Collapse
Affiliation(s)
- Jacqueline M. Dresch
- Department of Mathematics and Computer Science, Clark University, Worcester, MA, USA
| | - Rowan G. Zellers
- Computer Science Department, Harvey Mudd College, Claremont, CA, USA
- Mathematics Department, Harvey Mudd College, Claremont, CA, USA
| | - Daniel K. Bork
- Computer Science Department, Harvey Mudd College, Claremont, CA, USA
- Mathematics Department, Harvey Mudd College, Claremont, CA, USA
| | | |
Collapse
|
49
|
Young RS. Lineage-specific genomics: Frequent birth and death in the human genome: The human genome contains many lineage-specific elements created by both sequence and functional turnover. Bioessays 2016; 38:654-63. [PMID: 27231054 PMCID: PMC4949557 DOI: 10.1002/bies.201500192] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Frequent evolutionary birth and death events have created a large quantity of biologically important, lineage‐specific DNA within mammalian genomes. The birth and death of DNA sequences is so frequent that the total number of these insertions and deletions in the human population remains unknown, although there are differences between these groups, e.g. transposable elements contribute predominantly to sequence insertion. Functional turnover – where the activity of a locus is specific to one lineage, but the underlying DNA remains conserved – can also drive birth and death. However, this does not appear to be a major driver of divergent transcriptional regulation. Both sequence and functional turnover have contributed to the birth and death of thousands of functional promoters in the human and mouse genomes. These findings reveal the pervasive nature of evolutionary birth and death and suggest that lineage‐specific regions may play an important but previously underappreciated role in human biology and disease.
Collapse
Affiliation(s)
- Robert S Young
- MRC Human Genetics Unit, MRC IGMM, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
50
|
Knox DA, Dowell RD. A Modeling Framework for Generation of Positional and Temporal Simulations of Transcriptional Regulation. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2016; 13:459-471. [PMID: 27295631 DOI: 10.1109/tcbb.2015.2459708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
We present a modeling framework aimed at capturing both the positional and temporal behavior of transcriptional regulatory proteins in eukaryotic cells. There is growing evidence that transcriptional regulation is the complex behavior that emerges not solely from the individual components, but rather from their collective behavior, including competition and cooperation. Our framework describes individual regulatory components using generic action oriented descriptions of their biochemical interactions with a DNA sequence. All the possible actions are based on the current state of factors bound to the DNA. We developed a rule builder to automatically generate the complete set of biochemical interaction rules for any given DNA sequence. Off-the-shelf stochastic simulation engines can model the behavior of a system of rules and the resulting changes in the configuration of bound factors can be visualized. We compared our model to experimental data at well-studied loci in yeast, confirming that our model captures both the positional and temporal behavior of transcriptional regulation.
Collapse
Affiliation(s)
- David A Knox
- Computational Bioscience Program, University of Colorado, School of Medicine, Anschutz Medical Campus, Aurora, CO
| | - Robin D Dowell
- Molecular, Cellular, Developmental Biology Department, BioFrontiers Institute, University of Colorado, Boulder, CO
| |
Collapse
|