1
|
Wang F, Ding P, Liang X, Ding X, Brandt CB, Sjöstedt E, Zhu J, Bolund S, Zhang L, de Rooij LPMH, Luo L, Wei Y, Zhao W, Lv Z, Haskó J, Li R, Qin Q, Jia Y, Wu W, Yuan Y, Pu M, Wang H, Wu A, Xie L, Liu P, Chen F, Herold J, Kalucka J, Karlsson M, Zhang X, Helmig RB, Fagerberg L, Lindskog C, Pontén F, Uhlen M, Bolund L, Jessen N, Jiang H, Xu X, Yang H, Carmeliet P, Mulder J, Chen D, Lin L, Luo Y. Endothelial cell heterogeneity and microglia regulons revealed by a pig cell landscape at single-cell level. Nat Commun 2022; 13:3620. [PMID: 35750885 PMCID: PMC9232580 DOI: 10.1038/s41467-022-31388-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 06/16/2022] [Indexed: 11/23/2022] Open
Abstract
Pigs are valuable large animal models for biomedical and genetic research, but insights into the tissue- and cell-type-specific transcriptome and heterogeneity remain limited. By leveraging single-cell RNA sequencing, we generate a multiple-organ single-cell transcriptomic map containing over 200,000 pig cells from 20 tissues/organs. We comprehensively characterize the heterogeneity of cells in tissues and identify 234 cell clusters, representing 58 major cell types. In-depth integrative analysis of endothelial cells reveals a high degree of heterogeneity. We identify several functionally distinct endothelial cell phenotypes, including an endothelial to mesenchymal transition subtype in adipose tissues. Intercellular communication analysis predicts tissue- and cell type-specific crosstalk between endothelial cells and other cell types through the VEGF, PDGF, TGF-β, and BMP pathways. Regulon analysis of single-cell transcriptome of microglia in pig and 12 other species further identifies MEF2C as an evolutionally conserved regulon in the microglia. Our work describes the landscape of single-cell transcriptomes within diverse pig organs and identifies the heterogeneity of endothelial cells and evolutionally conserved regulon in microglia.
Collapse
Affiliation(s)
- Fei Wang
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- BGI-Shenzhen, Shenzhen, China
| | - Peiwen Ding
- BGI-Shenzhen, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Xue Liang
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Xiangning Ding
- BGI-Shenzhen, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Camilla Blunk Brandt
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
| | - Evelina Sjöstedt
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Jiacheng Zhu
- BGI-Shenzhen, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Saga Bolund
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Lijing Zhang
- BGI-Shenzhen, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- MGI, BGI-Shenzhen, Shenzhen, China
| | - Laura P M H de Rooij
- Laboratory of Angiogenesis and Vascular Metabolism, Center for Cancer Biology, VIB, Leuven, Belgium
- Department of Oncology, Leuven Cancer Institute, KU Leuven, Leuven, Belgium
| | - Lihua Luo
- BGI-Shenzhen, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Yanan Wei
- BGI-Shenzhen, Shenzhen, China
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Wandong Zhao
- BGI-Shenzhen, Shenzhen, China
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Zhiyuan Lv
- BGI-Shenzhen, Shenzhen, China
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - János Haskó
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
| | - Runchu Li
- BGI-Shenzhen, Shenzhen, China
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Qiuyu Qin
- BGI-Shenzhen, Shenzhen, China
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Yi Jia
- BGI-Shenzhen, Shenzhen, China
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Wendi Wu
- BGI-Shenzhen, Shenzhen, China
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Yuting Yuan
- School of Basic Medical Sciences, Binzhou Medical University, Yantai, China
| | - Mingyi Pu
- BGI-Shenzhen, Shenzhen, China
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Haoyu Wang
- BGI-Shenzhen, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Aiping Wu
- Institute of Systems Medicine, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, China
- Suzhou Institute of Systems Medicine, Suzhou, China
| | - Lin Xie
- MGI, BGI-Shenzhen, Shenzhen, China
| | - Ping Liu
- MGI, BGI-Shenzhen, Shenzhen, China
| | | | | | - Joanna Kalucka
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
- Aarhus University of Advanced Studies (AIAS), Aarhus University, Aarhus, Denmark
| | - Max Karlsson
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Xiuqing Zhang
- BGI-Shenzhen, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Rikke Bek Helmig
- Department of Obstetrics and Gynecology, Aarhus University Hospital, Aarhus, Denmark
| | - Linn Fagerberg
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Cecilia Lindskog
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Fredrik Pontén
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Mathias Uhlen
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Lars Bolund
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
| | - Niels Jessen
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
| | | | - Xun Xu
- BGI-Shenzhen, Shenzhen, China
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen, China
- IBMC-BGI Center, the Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, China
| | - Peter Carmeliet
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- Laboratory of Angiogenesis and Vascular Metabolism, Center for Cancer Biology, VIB, Leuven, Belgium
- Department of Oncology, Leuven Cancer Institute, KU Leuven, Leuven, Belgium
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Jan Mulder
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Dongsheng Chen
- BGI-Shenzhen, Shenzhen, China.
- Institute of Systems Medicine, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, China.
- Suzhou Institute of Systems Medicine, Suzhou, China.
| | - Lin Lin
- Department of Biomedicine, Aarhus University, Aarhus, Denmark.
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark.
| | - Yonglun Luo
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, BGI-Shenzhen, Qingdao, China.
- Department of Biomedicine, Aarhus University, Aarhus, Denmark.
- BGI-Shenzhen, Shenzhen, China.
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark.
- IBMC-BGI Center, the Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, China.
| |
Collapse
|
2
|
Baumgart LA, Lee JE, Salamov A, Dilworth DJ, Na H, Mingay M, Blow MJ, Zhang Y, Yoshinaga Y, Daum CG, O'Malley RC. Persistence and plasticity in bacterial gene regulation. Nat Methods 2021; 18:1499-1505. [PMID: 34824476 DOI: 10.1038/s41592-021-01312-2] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 09/24/2021] [Indexed: 11/09/2022]
Abstract
Organisms orchestrate cellular functions through transcription factor (TF) interactions with their target genes, although these regulatory relationships are largely unknown in most species. Here we report a high-throughput approach for characterizing TF-target gene interactions across species and its application to 354 TFs across 48 bacteria, generating 17,000 genome-wide binding maps. This dataset revealed themes of ancient conservation and rapid evolution of regulatory modules. We observed rewiring, where the TF sensing and regulatory role is maintained while the arrangement and identity of target genes diverges, in some cases encoding entirely new functions. We further integrated phenotypic information to define new functional regulatory modules and pathways. Finally, we identified 242 new TF DNA binding motifs, including a 70% increase of known Escherichia coli motifs and the first annotation in Pseudomonas simiae, revealing deep conservation in bacterial promoter architecture. Our method provides a versatile tool for functional characterization of genetic pathways in prokaryotes and eukaryotes.
Collapse
Affiliation(s)
- Leo A Baumgart
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Ji Eun Lee
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Asaf Salamov
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - David J Dilworth
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Hyunsoo Na
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Matthew Mingay
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Matthew J Blow
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.,Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Yu Zhang
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Yuko Yoshinaga
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Chris G Daum
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Ronan C O'Malley
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA. .,Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
3
|
Tayara H, Chong KT. Improved Predicting of The Sequence Specificities of RNA Binding Proteins by Deep Learning. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2526-2534. [PMID: 32191896 DOI: 10.1109/tcbb.2020.2981335] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
RNA-binding proteins (RBPs) have a significant role in various regulatory tasks. However, the mechanism by which RBPs identify the subsequence target RNAs is still not clear. In recent years, several machine and deep learning-based computational models have been proposed for understanding the binding preferences of RBPs. These methods required integrating multiple features with raw RNA sequences such as secondary structure and their performances can be further improved. In this paper, we propose an efficient and simple convolution neural network, RBPCNN, that relies on the combination of the raw RNA sequence and evolutionary information. We show that conservation scores (evolutionary information) for the RNA sequences can significantly improve the overall performance of the proposed predictor. In addition, the automatic extraction of the binding sequence motifs can enhance our understanding of the binding specificities of RBPs. The experimental results show that RBPCNN outperforms significantly the current state-of-the-art methods. More specifically, the average area under the receiver operator curve was improved by 2.67 percent and the mean average precision was improved by 8.03 percent. The datasets and results can be downloaded from https://home.jbnu.ac.kr/NSCL/RBPCNN.htm.
Collapse
|
4
|
Abstract
BACKGROUND RNA-binding proteins (RBPs) are crucial in modulating RNA metabolism in eukaryotes thereby controlling an extensive network of RBP-RNA interactions. Although previous studies on the conservation of RBP targets have been carried out in lower eukaryotes such as yeast, relatively little is known about the extent of conservation of the binding sites of RBPs across mammalian species. RESULTS In this study, we employ CLIP-seq datasets for 60 human RBPs and demonstrate that most binding sites for a third of these RBPs are conserved in at least 50% of the studied vertebrate species. Across the studied RBPs, binding sites were found to exhibit a median conservation of 58%, ~ 20% higher than random genomic locations, suggesting a significantly higher preservation of RBP-RNA interaction networks across vertebrates. RBP binding sites were highly conserved across primates with weak conservation profiles in birds and fishes. We also note that phylogenetic relationship between members of an RBP family does not explain the extent of conservation of their binding sites across species. Multivariate analysis to uncover features contributing to differences in the extents of conservation of binding sites across RBPs revealed RBP expression level and number of post-transcriptional targets to be the most prominent factors. Examination of the location of binding sites at the gene level confirmed that binding sites occurring on the 3' region of a gene are highly conserved across species with 90% of the RBPs exhibiting a significantly higher conservation of binding sites in 3' regions of a gene than those occurring in the 5'. Gene set enrichment analysis on the extent of conservation of binding sites to identify significantly associated human phenotypes revealed an enrichment for multiple developmental abnormalities. CONCLUSIONS Our results suggest that binding sites of human RBPs are highly conserved across primates with weak conservation profiles in lower vertebrates and evolutionary relationship between members of an RBP family does not explain the extent of conservation of their binding sites. Expression level and number of targets of an RBP are important factors contributing to the differences in the extent of conservation of binding sites. RBP binding sites on 3' ends of a gene are the most conserved across species. Phenotypic analysis on the extent of conservation of binding sites revealed the importance of lineage-specific developmental events in post-transcriptional regulatory network evolution.
Collapse
Affiliation(s)
- Aarthi Ramakrishnan
- Department of Biohealth Informatics, School of Informatics and Computing, Indiana University Purdue University, Indianapolis, IN, 46202, USA
| | - Sarath Chandra Janga
- Department of Biohealth Informatics, School of Informatics and Computing, Indiana University Purdue University, Indianapolis, IN, 46202, USA. .,Centre for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA. .,Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.
| |
Collapse
|
5
|
Sonawane AR, Weiss ST, Glass K, Sharma A. Network Medicine in the Age of Biomedical Big Data. Front Genet 2019; 10:294. [PMID: 31031797 PMCID: PMC6470635 DOI: 10.3389/fgene.2019.00294] [Citation(s) in RCA: 111] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Accepted: 03/19/2019] [Indexed: 12/13/2022] Open
Abstract
Network medicine is an emerging area of research dealing with molecular and genetic interactions, network biomarkers of disease, and therapeutic target discovery. Large-scale biomedical data generation offers a unique opportunity to assess the effect and impact of cellular heterogeneity and environmental perturbations on the observed phenotype. Marrying the two, network medicine with biomedical data provides a framework to build meaningful models and extract impactful results at a network level. In this review, we survey existing network types and biomedical data sources. More importantly, we delve into ways in which the network medicine approach, aided by phenotype-specific biomedical data, can be gainfully applied. We provide three paradigms, mainly dealing with three major biological network archetypes: protein-protein interaction, expression-based, and gene regulatory networks. For each of these paradigms, we discuss a broad overview of philosophies under which various network methods work. We also provide a few examples in each paradigm as a test case of its successful application. Finally, we delineate several opportunities and challenges in the field of network medicine. We hope this review provides a lexicon for researchers from biological sciences and network theory to come on the same page to work on research areas that require interdisciplinary expertise. Taken together, the understanding gained from combining biomedical data with networks can be useful for characterizing disease etiologies and identifying therapeutic targets, which, in turn, will lead to better preventive medicine with translational impact on personalized healthcare.
Collapse
Affiliation(s)
- Abhijeet R. Sonawane
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- Department of Medicine, Harvard Medical School, Boston, MA, United States
| | - Scott T. Weiss
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- Department of Medicine, Harvard Medical School, Boston, MA, United States
| | - Kimberly Glass
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- Department of Medicine, Harvard Medical School, Boston, MA, United States
| | - Amitabh Sharma
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- Department of Medicine, Harvard Medical School, Boston, MA, United States
- Center for Interdisciplinary Cardiovascular Sciences, Cardiovascular Division, Brigham and Women’s Hospital, Boston, MA, United States
| |
Collapse
|
6
|
Kelle D, Kırımtay K, Selçuk E, Karabay A. Elk1 affects katanin and spastin proteins via differential transcriptional and post-transcriptional regulations. PLoS One 2019; 14:e0212518. [PMID: 30789974 PMCID: PMC6383945 DOI: 10.1371/journal.pone.0212518] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Accepted: 02/04/2019] [Indexed: 01/06/2023] Open
Abstract
Microtubule severing, which is highly critical for the survival of both mitotic and post-mitotic cells, has to be precisely adjusted by regulating the expression levels of severing proteins, katanin and spastin. Even though severing mechanism is relatively well-studied, there are limited studies for the transcriptional regulation of microtubule severing proteins. In this study, we identified the main regulatory region of KATNA1 gene encoding katanin-p60 as 5’ UTR, which has a key role for its expression, and showed Elk1 binding to KATNA1. Furthermore, we identified that Elk1 decreased katanin-p60 and spastin protein expressions, while mRNA levels were increased upon Elk1 overexpression. In addition, SUMOylation is a known post-translational modification regulating Elk1 activity. A previous study suggested that K230, K249, K254 amino acids in the R domain are the main SUMOylation sites; however, we identified that these amino acids are neither essential nor substantial for Elk1 SUMOylation. Also, we determined that KATNA1 methylation results in the reduction of Elk1 binding whereas SPG4 methylation does not. Together, our findings emphasizing the impacts of both transcriptional and post-transcriptional regulations of katanin-p60 and spastin suggest that Elk1 has a key role for differential expression patterns of microtubule severing proteins, thereby regulating cellular functions through alterations of microtubule organization.
Collapse
Affiliation(s)
- Dolunay Kelle
- Department of Molecular Biology and Genetics, Istanbul Technical University, Maslak, Istanbul, Turkey
| | - Koray Kırımtay
- Department of Molecular Biology and Genetics, Istanbul Technical University, Maslak, Istanbul, Turkey
| | - Ece Selçuk
- Department of Molecular Biology and Genetics, Istanbul Technical University, Maslak, Istanbul, Turkey
| | - Arzu Karabay
- Department of Molecular Biology and Genetics, Istanbul Technical University, Maslak, Istanbul, Turkey
- * E-mail:
| |
Collapse
|
7
|
Niu B, Coslo DM, Bataille AR, Albert I, Pugh BF, Omiecinski CJ. In vivo genome-wide binding interactions of mouse and human constitutive androstane receptors reveal novel gene targets. Nucleic Acids Res 2018; 46:8385-8403. [PMID: 30102401 PMCID: PMC6144799 DOI: 10.1093/nar/gky692] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 07/17/2018] [Accepted: 07/20/2018] [Indexed: 12/13/2022] Open
Abstract
The constitutive androstane receptor (CAR; NR1I3) is a nuclear receptor orchestrating complex roles in cell and systems biology. Species differences in CAR's effector pathways remain poorly understood, including its role in regulating liver tumor promotion. We developed transgenic mouse models to assess genome-wide binding of mouse and human CAR, following receptor activation in liver with direct ligands and with phenobarbital, an indirect CAR activator. Genomic interaction profiles were integrated with transcriptional and biological pathway analyses. Newly identified CAR target genes included Gdf15 and Foxo3, important regulators of the carcinogenic process. Approximately 1000 genes exhibited differential binding interactions between mouse and human CAR, including the proto-oncogenes, Myc and Ikbke, which demonstrated preferential binding by mouse CAR as well as mouse CAR-selective transcriptional enhancement. The ChIP-exo analyses also identified distinct binding motifs for the respective mouse and human receptors. Together, the results provide new insights into the important roles that CAR contributes as a key modulator of numerous signaling pathways in mammalian organisms, presenting a genomic context that specifies species variation in biological processes under CAR's control, including liver cell proliferation and tumor promotion.
Collapse
Affiliation(s)
- Ben Niu
- Center for Molecular Toxicology and Carcinogenesis, Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802, USA
| | - Denise M Coslo
- Center for Molecular Toxicology and Carcinogenesis, Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802, USA
| | - Alain R Bataille
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Istvan Albert
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - B Franklin Pugh
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Curtis J Omiecinski
- Center for Molecular Toxicology and Carcinogenesis, Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
8
|
Khoueiry P, Girardot C, Ciglar L, Peng PC, Gustafson EH, Sinha S, Furlong EE. Uncoupling evolutionary changes in DNA sequence, transcription factor occupancy and enhancer activity. eLife 2017; 6. [PMID: 28792889 PMCID: PMC5550276 DOI: 10.7554/elife.28440] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 07/21/2017] [Indexed: 12/15/2022] Open
Abstract
Sequence variation within enhancers plays a major role in both evolution and disease, yet its functional impact on transcription factor (TF) occupancy and enhancer activity remains poorly understood. Here, we assayed the binding of five essential TFs over multiple stages of embryogenesis in two distant Drosophila species (with 1.4 substitutions per neutral site), identifying thousands of orthologous enhancers with conserved or diverged combinatorial occupancy. We used these binding signatures to dissect two properties of developmental enhancers: (1) potential TF cooperativity, using signatures of co-associations and co-divergence in TF occupancy. This revealed conserved combinatorial binding despite sequence divergence, suggesting protein-protein interactions sustain conserved collective occupancy. (2) Enhancer in-vivo activity, revealing orthologous enhancers with conserved activity despite divergence in TF occupancy. Taken together, we identify enhancers with diverged motifs yet conserved occupancy and others with diverged occupancy yet conserved activity, emphasising the need to functionally measure the effect of divergence on enhancer activity.
Collapse
Affiliation(s)
- Pierre Khoueiry
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Charles Girardot
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Lucia Ciglar
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Pei-Chen Peng
- Carl R. Woese Institute of Genomic Biology, University of Illinois, Champaign, United States
| | - E Hilary Gustafson
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Saurabh Sinha
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.,Carl R. Woese Institute of Genomic Biology, University of Illinois, Champaign, United States
| | - Eileen Em Furlong
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| |
Collapse
|
9
|
Ruffalo M, Bar-Joseph Z. Genome wide predictions of miRNA regulation by transcription factors. Bioinformatics 2017; 32:i746-i754. [PMID: 27587697 DOI: 10.1093/bioinformatics/btw452] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
MOTIVATION Reconstructing regulatory networks from expression and interaction data is a major goal of systems biology. While much work has focused on trying to experimentally and computationally determine the set of transcription-factors (TFs) and microRNAs (miRNAs) that regulate genes in these networks, relatively little work has focused on inferring the regulation of miRNAs by TFs. Such regulation can play an important role in several biological processes including development and disease. The main challenge for predicting such interactions is the very small positive training set currently available. Another challenge is the fact that a large fraction of miRNAs are encoded within genes making it hard to determine the specific way in which they are regulated. RESULTS To enable genome wide predictions of TF-miRNA interactions, we extended semi-supervised machine-learning approaches to integrate a large set of different types of data including sequence, expression, ChIP-seq and epigenetic data. As we show, the methods we develop achieve good performance on both a labeled test set, and when analyzing general co-expression networks. We next analyze mRNA and miRNA cancer expression data, demonstrating the advantage of using the predicted set of interactions for identifying more coherent and relevant modules, genes, and miRNAs. The complete set of predictions is available on the supporting website and can be used by any method that combines miRNAs, genes, and TFs. AVAILABILITY AND IMPLEMENTATION Code and full set of predictions are available from the supporting website: http://cs.cmu.edu/~mruffalo/tf-mirna/ CONTACT zivbj@cs.cmu.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Matthew Ruffalo
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA 15213
| | - Ziv Bar-Joseph
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA 15213
| |
Collapse
|
10
|
Becker MG, Walker PL, Pulgar-Vidal NC, Belmonte MF. SeqEnrich: A tool to predict transcription factor networks from co-expressed Arabidopsis and Brassica napus gene sets. PLoS One 2017; 12:e0178256. [PMID: 28575075 PMCID: PMC5456048 DOI: 10.1371/journal.pone.0178256] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2017] [Accepted: 05/10/2017] [Indexed: 01/08/2023] Open
Abstract
Transcription factors and their associated DNA binding sites are key regulatory elements of cellular differentiation, development, and environmental response. New tools that predict transcriptional regulation of biological processes are valuable to researchers studying both model and emerging-model plant systems. SeqEnrich predicts transcription factor networks from co-expressed Arabidopsis or Brassica napus gene sets. The networks produced by SeqEnrich are supported by existing literature and predicted transcription factor–DNA interactions that can be functionally validated at the laboratory bench. The program functions with gene sets of varying sizes and derived from diverse tissues and environmental treatments. SeqEnrich presents as a powerful predictive framework for the analysis of Arabidopsis and Brassica napus co-expression data, and is designed so that researchers at all levels can easily access and interpret predicted transcriptional circuits. The program outperformed its ancestral program ChipEnrich, and produced detailed transcription factor networks from Arabidopsis and Brassica napus gene expression data. The SeqEnrich program is ideal for generating new hypotheses and distilling biological information from large-scale expression data.
Collapse
Affiliation(s)
- Michael G. Becker
- Department of Biological Sciences, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Philip L. Walker
- Department of Biological Sciences, University of Manitoba, Winnipeg, Manitoba, Canada
| | | | - Mark F. Belmonte
- Department of Biological Sciences, University of Manitoba, Winnipeg, Manitoba, Canada
- * E-mail:
| |
Collapse
|
11
|
Abstract
Motivation: Eukaryotic gene expression is controlled through molecular logic circuits that combine regulatory signals of many different factors. In particular, complexation of transcription factors (TFs) and other regulatory proteins is a prevailing and highly conserved mechanism of signal integration within critical regulatory pathways and enables us to infer controlled genes as well as the exerted regulatory mechanism. Common approaches for protein complex prediction that only use protein interaction networks, however, are designed to detect self-contained functional complexes and have difficulties to reveal dynamic combinatorial assemblies of physically interacting proteins. Results: We developed the novel algorithm DACO that combines protein–protein interaction networks and domain–domain interaction networks with the cluster-quality metric cohesiveness. The metric is locally maximized on the holistic level of protein interactions, and connectivity constraints on the domain level are used to account for the exclusive and thus inherently combinatorial nature of the interactions within such assemblies. When applied to predicting TF complexes in the yeast Saccharomyces cerevisiae, the proposed approach outperformed popular complex prediction methods by far. Furthermore, we were able to assign many of the predictions to target genes, as well as to a potential regulatory effect in agreement with literature evidence. Availability and implementation: A prototype implementation is freely available at https://sourceforge.net/projects/dacoalgorithm/. Contact:volkhard.helms@bioinformatik.uni-saarland.de Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Thorsten Will
- Center for Bioinformatics, Campus Building E2.1, Saarland University, D-66123 Saarbrücken, Germany
| | - Volkhard Helms
- Center for Bioinformatics, Campus Building E2.1, Saarland University, D-66123 Saarbrücken, Germany
| |
Collapse
|
12
|
Rhrissorrakrai K, Belcastro V, Bilal E, Norel R, Poussin C, Mathis C, Dulize RHJ, Ivanov NV, Alexopoulos L, Rice JJ, Peitsch MC, Stolovitzky G, Meyer P, Hoeng J. Understanding the limits of animal models as predictors of human biology: lessons learned from the sbv IMPROVER Species Translation Challenge. Bioinformatics 2014; 31:471-83. [PMID: 25236459 PMCID: PMC4325540 DOI: 10.1093/bioinformatics/btu611] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Motivation: Inferring how humans respond to external cues such as drugs, chemicals, viruses or hormones is an essential question in biomedicine. Very often, however, this question cannot be addressed because it is not possible to perform experiments in humans. A reasonable alternative consists of generating responses in animal models and ‘translating’ those results to humans. The limitations of such translation, however, are far from clear, and systematic assessments of its actual potential are urgently needed. sbv IMPROVER (systems biology verification for Industrial Methodology for PROcess VErification in Research) was designed as a series of challenges to address translatability between humans and rodents. This collaborative crowd-sourcing initiative invited scientists from around the world to apply their own computational methodologies on a multilayer systems biology dataset composed of phosphoproteomics, transcriptomics and cytokine data derived from normal human and rat bronchial epithelial cells exposed in parallel to 52 different stimuli under identical conditions. Our aim was to understand the limits of species-to-species translatability at different levels of biological organization: signaling, transcriptional and release of secreted factors (such as cytokines). Participating teams submitted 49 different solutions across the sub-challenges, two-thirds of which were statistically significantly better than random. Additionally, similar computational methods were found to range widely in their performance within the same challenge, and no single method emerged as a clear winner across all sub-challenges. Finally, computational methods were able to effectively translate some specific stimuli and biological processes in the lung epithelial system, such as DNA synthesis, cytoskeleton and extracellular matrix, translation, immune/inflammation and growth factor/proliferation pathways, better than the expected response similarity between species. Contact:pmeyerr@us.ibm.com or Julia.Hoeng@pmi.com Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kahn Rhrissorrakrai
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Vincenzo Belcastro
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Erhan Bilal
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Raquel Norel
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Carine Poussin
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Carole Mathis
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Rémi H J Dulize
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Nikolai V Ivanov
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Leonidas Alexopoulos
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - J Jeremy Rice
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Manuel C Peitsch
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Gustavo Stolovitzky
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Pablo Meyer
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| | - Julia Hoeng
- IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, USA, Philip Morris International R&D, Philip Morris Products S.A., 2000 Neuchâtel, Switzerland, Telethon Institute of Genetics and Medicine, Via Pietro Castellino, 111, 80131 Naples, Italy, ProtATonce Ltd, Scientific Park Lefkippos, Patriarchou Grigoriou & Neapoleos 15343 Ag. Paraskevi, Attiki and National Technical University of Athens, Heroon Polytechniou 9, Zografou 15780, Greece
| |
Collapse
|
13
|
Relative specificity: all substrates are not created equal. GENOMICS PROTEOMICS & BIOINFORMATICS 2014; 12:1-7. [PMID: 24491634 PMCID: PMC4411342 DOI: 10.1016/j.gpb.2014.01.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/26/2013] [Revised: 12/21/2013] [Accepted: 01/07/2014] [Indexed: 11/24/2022]
Abstract
A biological molecule, e.g., an enzyme, tends to interact with its many cognate substrates, targets, or partners differentially. Such a property is termed relative specificity and has been proposed to regulate important physiological functions, even though it has not been examined explicitly in most complex biochemical systems. This essay reviews several recent large-scale studies that investigate protein folding, signal transduction, RNA binding, translation and transcription in the context of relative specificity. These results and others support a pervasive role of relative specificity in diverse biological processes. It is becoming clear that relative specificity contributes fundamentally to the diversity and complexity of biological systems, which has significant implications in disease processes as well.
Collapse
|
14
|
Goode DK, Elgar G. Capturing the regulatory interactions of eukaryote genomes. Brief Funct Genomics 2012; 12:142-60. [PMID: 23117864 DOI: 10.1093/bfgp/els041] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
A key finding from early genomics research is the remarkable consistency in the number of protein-coding regions across diverse species. This has led many researchers to look to the cis-regulatory elements of genes as the fundamental influence behind evolving gene function and subsequent species diversification. Historically, since these elements are often located in vast intergenic and intronic regions of the genome, their identification has been recalcitrant. Now, with the deluge of whole-genome data from representatives of numerous eukaryotic lineages, various approaches have enabled us to begin to recognize features that characterize regulatory regions of the genome. Here we endeavour to collate these approaches in order to give an overview of the complexities involved in extrapolating regulatory signatures. The resource provided by the escalating richness of whole-genome datasets enables more sophisticated modelling of these regulatory signatures yet at the same time introduces increasing potential for noise. While we are only at the advent of making these discoveries, the next decade promises to be a very exciting and rewarding time for genome researchers.
Collapse
Affiliation(s)
- Debbie K Goode
- Cambridge Institute for Medical Research, Deptartment of Haematology, Addenbrooke's Hospital, Hills Road, Cambridge, UK
| | | |
Collapse
|
15
|
Cooperativity of stress-responsive transcription factors in core hypoxia-inducible factor binding regions. PLoS One 2012; 7:e45708. [PMID: 23029193 PMCID: PMC3454324 DOI: 10.1371/journal.pone.0045708] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2012] [Accepted: 08/22/2012] [Indexed: 11/19/2022] Open
Abstract
The transcriptional response driven by Hypoxia-inducible factor (HIF) is central to the adaptation to oxygen restriction. Despite recent characterization of genome-wide HIF DNA binding locations and hypoxia-regulated transcripts in different cell types, the molecular bases of HIF target selection remain unresolved. Herein, we combined multi-level experimental data and computational predictions to identify sequence motifs that may contribute to HIF target selectivity. We obtained a core set of bona fide HIF binding regions by integrating multiple HIF1 DNA binding and hypoxia expression profiling datasets. This core set exhibits evolutionarily conserved binding regions and is enriched in functional responses to hypoxia. Computational prediction of enriched transcription factor binding sites identified sequence motifs corresponding to several stress-responsive transcription factors, such as activator protein 1 (AP1), cAMP response element-binding (CREB), or CCAAT-enhancer binding protein (CEBP). Experimental validations on HIF-regulated promoters suggest a functional role of the identified motifs in modulating HIF-mediated transcription. Accordingly, transcriptional targets of these factors are over-represented in a sorted list of hypoxia-regulated genes. Altogether, our results implicate cooperativity among stress-responsive transcription factors in fine-tuning the HIF transcriptional response.
Collapse
|
16
|
Schmidt SF, Jørgensen M, Sandelin A, Mandrup S. Cross-species ChIP-seq studies provide insights into regulatory strategies of PPARγ in adipocytes. Transcription 2012; 3:19-24. [PMID: 22456316 DOI: 10.4161/trns.3.1.19302] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Three recent studies have investigated interspecies retention of binding sites of peroxisome proliferator-activated receptor γ (PPARγ), the master regulator of adipocyte differention, between mouse and human adipocytes. Here we discuss the major findings and demonstrate that retention of binding events is highly context-dependent.
Collapse
Affiliation(s)
- Søren F Schmidt
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | | | | | | |
Collapse
|
17
|
Herrmann C, Van de Sande B, Potier D, Aerts S. i-cisTarget: an integrative genomics method for the prediction of regulatory features and cis-regulatory modules. Nucleic Acids Res 2012; 40:e114. [PMID: 22718975 PMCID: PMC3424583 DOI: 10.1093/nar/gks543] [Citation(s) in RCA: 129] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
The field of regulatory genomics today is characterized by the generation of high-throughput data sets that capture genome-wide transcription factor (TF) binding, histone modifications, or DNAseI hypersensitive regions across many cell types and conditions. In this context, a critical question is how to make optimal use of these publicly available datasets when studying transcriptional regulation. Here, we address this question in Drosophila melanogaster for which a large number of high-throughput regulatory datasets are available. We developed i-cisTarget (where the 'i' stands for integrative), for the first time enabling the discovery of different types of enriched 'regulatory features' in a set of co-regulated sequences in one analysis, being either TF motifs or 'in vivo' chromatin features, or combinations thereof. We have validated our approach on 15 co-expressed gene sets, 21 ChIP data sets, 628 curated gene sets and multiple individual case studies, and show that meaningful regulatory features can be confidently discovered; that bona fide enhancers can be identified, both by in vivo events and by TF motifs; and that combinations of in vivo events and TF motifs further increase the performance of enhancer prediction.
Collapse
Affiliation(s)
- Carl Herrmann
- TAGC - Inserm U1090 and Aix-Marseille Université, Campus de Luminy, 13288 Marseille, France.
| | | | | | | |
Collapse
|
18
|
Hemberg M, Gray JM, Cloonan N, Kuersten S, Grimmond S, Greenberg ME, Kreiman G. Integrated genome analysis suggests that most conserved non-coding sequences are regulatory factor binding sites. Nucleic Acids Res 2012; 40:7858-69. [PMID: 22684627 PMCID: PMC3439890 DOI: 10.1093/nar/gks477] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
More than 98% of a typical vertebrate genome does not code for proteins. Although non-coding regions are sprinkled with short (<200 bp) islands of evolutionarily conserved sequences, the function of most of these unannotated conserved islands remains unknown. One possibility is that unannotated conserved islands could encode non-coding RNAs (ncRNAs); alternatively, unannotated conserved islands could serve as promoter-distal regulatory factor binding sites (RFBSs) like enhancers. Here we assess these possibilities by comparing unannotated conserved islands in the human and mouse genomes to transcribed regions and to RFBSs, relying on a detailed case study of one human and one mouse cell type. We define transcribed regions by applying a novel transcript-calling algorithm to RNA-Seq data obtained from total cellular RNA, and we define RFBSs using ChIP-Seq and DNAse-hypersensitivity assays. We find that unannotated conserved islands are four times more likely to coincide with RFBSs than with unannotated ncRNAs. Thousands of conserved RFBSs can be categorized as insulators based on the presence of CTCF or as enhancers based on the presence of p300/CBP and H3K4me1. While many unannotated conserved RFBSs are transcriptionally active to some extent, the transcripts produced tend to be unspliced, non-polyadenylated and expressed at levels 10 to 100-fold lower than annotated coding or ncRNAs. Extending these findings across multiple cell types and tissues, we propose that most conserved non-coding genomic DNA in vertebrate genomes corresponds to promoter-distal regulatory elements.
Collapse
Affiliation(s)
- Martin Hemberg
- Department of Ophthalmology, Children's Hospital Boston, Harvard Medical School, Boston, MA 02215, USA.
| | | | | | | | | | | | | |
Collapse
|
19
|
Göke J, Jung M, Behrens S, Chavez L, O'Keeffe S, Timmermann B, Lehrach H, Adjaye J, Vingron M. Combinatorial binding in human and mouse embryonic stem cells identifies conserved enhancers active in early embryonic development. PLoS Comput Biol 2011; 7:e1002304. [PMID: 22215994 PMCID: PMC3245296 DOI: 10.1371/journal.pcbi.1002304] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2011] [Accepted: 10/30/2011] [Indexed: 12/31/2022] Open
Abstract
Transcription factors are proteins that regulate gene expression by binding to cis-regulatory sequences such as promoters and enhancers. In embryonic stem (ES) cells, binding of the transcription factors OCT4, SOX2 and NANOG is essential to maintain the capacity of the cells to differentiate into any cell type of the developing embryo. It is known that transcription factors interact to regulate gene expression. In this study we show that combinatorial binding is strongly associated with co-localization of the transcriptional co-activator Mediator, H3K27ac and increased expression of nearby genes in embryonic stem cells. We observe that the same loci bound by Oct4, Nanog and Sox2 in ES cells frequently drive expression in early embryonic development. Comparison of mouse and human ES cells shows that less than 5% of individual binding events for OCT4, SOX2 and NANOG are shared between species. In contrast, about 15% of combinatorial binding events and even between 53% and 63% of combinatorial binding events at enhancers active in early development are conserved. Our analysis suggests that the combination of OCT4, SOX2 and NANOG binding is critical for transcription in ES cells and likely plays an important role for embryogenesis by binding at conserved early developmental enhancers. Our data suggests that the fast evolutionary rewiring of regulatory networks mainly affects individual binding events, whereas “gene regulatory hotspots” which are bound by multiple factors and active in multiple tissues throughout early development are under stronger evolutionary constraints. The mammalian body is composed of hundreds of distinct cell types. During embryogenesis, this diversity is created by multiple cell fate decisions and differentiation events. Embryonic stem (ES) cells provide the in vitro model to study differentiation and early development. Their pluripotent state is maintained by transcription factors such as OCT4, SOX2 and NANOG which bind to regulatory elements within the genome. Understanding the interplay between transcription factor binding, gene expression and cellular differentiation is key to understanding the development of the mammalian embryo. In this study we find that combinatorial binding of OCT4, SOX2 and NANOG in ES cells identifies enhancers which are associated with active transcription. We observe that these enhancers also frequently show activity at later developmental stages. Using data from mouse and human ES cells we find that these combinatorially bound enhancers which are active in pluripotent cells and development show extraordinarily high levels of binding conservation (>50%). Our analysis suggests that these conserved “gene regulatory hotspots” integrate the transcriptional network that promotes pluripotency into the gene regulatory networks that promote cell fate decisions and differentiation during early embryonic development.
Collapse
Affiliation(s)
- Jonathan Göke
- Department of Computational Molecular Biology, Max-Planck Institute for Molecular Genetics, Berlin, Germany.
| | | | | | | | | | | | | | | | | |
Collapse
|