101
|
Wang L, Michoel T. Efficient and accurate causal inference with hidden confounders from genome-transcriptome variation data. PLoS Comput Biol 2017; 13:e1005703. [PMID: 28821014 PMCID: PMC5576763 DOI: 10.1371/journal.pcbi.1005703] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2017] [Revised: 08/30/2017] [Accepted: 07/26/2017] [Indexed: 02/07/2023] Open
Abstract
Mapping gene expression as a quantitative trait using whole genome-sequencing and transcriptome analysis allows to discover the functional consequences of genetic variation. We developed a novel method and ultra-fast software Findr for higly accurate causal inference between gene expression traits using cis-regulatory DNA variations as causal anchors, which improves current methods by taking into consideration hidden confounders and weak regulations. Findr outperformed existing methods on the DREAM5 Systems Genetics challenge and on the prediction of microRNA and transcription factor targets in human lymphoblastoid cells, while being nearly a million times faster. Findr is publicly available at https://github.com/lingfeiwang/findr. Understanding how genetic variation between individuals determines variation in observable traits or disease risk is one of the core aims of genetics. It is known that genetic variation often affects gene regulatory DNA elements and directly causes variation in expression of nearby genes. This effect in turn cascades down to other genes via the complex pathways and gene interaction networks that ultimately govern how cells operate in an ever changing environment. In theory, when genetic variation and gene expression levels are measured simultaneously in a large number of individuals, the causal effects of genes on each other can be inferred using statistical models similar to those used in randomized controlled trials. We developed a novel method and ultra-fast software Findr which, unlike existing methods, takes into account the complex but unknown network context when predicting causality between specific gene pairs. Findr’s predictions have a significantly higher overlap with known gene networks compared to existing methods, using both simulated and real data. Findr is also nearly a million times faster, and hence the only software in its class that can handle modern datasets where the expression levels of ten-thousands of genes are simultaneously measured in hundreds to thousands of individuals.
Collapse
Affiliation(s)
- Lingfei Wang
- Division of Genetics and Genomics, The Roslin Institute, The University of Edinburgh, Easter Bush, Midlothian, United Kingdom
| | - Tom Michoel
- Division of Genetics and Genomics, The Roslin Institute, The University of Edinburgh, Easter Bush, Midlothian, United Kingdom
- * E-mail:
| |
Collapse
|
102
|
Abstract
Strong DNA conservation among divergent species is an indicator of enduring functionality. With weaker sequence conservation we enter a vast ‘twilight zone’ in which sequence subject to transient or lower constraint cannot be distinguished easily from neutrally evolving, non-functional sequence. Twilight zone functional sequence is illuminated instead by principles of selective constraint and positive selection using genomic data acquired from within a species’ population. Application of these principles reveals that despite being biochemically active, most twilight zone sequence is not functional.
Collapse
Affiliation(s)
- Chris P Ponting
- MRC Human Genetics Unit, The Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU, UK.
| |
Collapse
|
103
|
Arthur RK, An N, Khan S, McNerney ME. The haploinsufficient tumor suppressor, CUX1, acts as an analog transcriptional regulator that controls target genes through distal enhancers that loop to target promoters. Nucleic Acids Res 2017; 45:6350-6361. [PMID: 28369554 PMCID: PMC5499738 DOI: 10.1093/nar/gkx218] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2016] [Revised: 03/21/2017] [Accepted: 03/24/2017] [Indexed: 01/19/2023] Open
Abstract
One third of tumor suppressors are haploinsufficient transcriptional regulators, yet it remains unknown how a 50% reduction of a transcription factor is translated at the cis-regulatory level into a malignant transcriptional program. We studied CUX1, a haploinsufficient transcription factor that is recurrently mutated in hematopoietic and solid tumors. We determined CUX1 DNA-binding and target gene regulation in the wildtype and haploinsufficient states. CUX1 binds with transcriptional activators and cohesin at distal enhancers across three different human cell types. Haploinsufficiency of CUX1 altered the expression of a large number of genes, including cell cycle regulators, with concomitant increased cellular proliferation. Surprisingly, CUX1 occupancy decreased genome-wide in the haploinsufficient state, and binding site affinity did not correlate with differential gene expression. Instead, differentially expressed genes had multiple, low-affinity CUX1 binding sites, features of analog gene regulation. A machine-learning algorithm determined that chromatin accessibility, enhancer activity, and distance to the transcription start site are features of dose-sensitive CUX1 transcriptional regulation. Moreover, CUX1 is enriched at sites of DNA looping, as determined by Hi-C analysis, and these loops connect CUX1 to the promoters of regulated genes. We propose an analog model for haploinsufficient transcriptional deregulation mediated by higher order genome architecture.
Collapse
Affiliation(s)
- Robert K. Arthur
- Department of Pathology, Department of Pediatrics, Section of Hematology/Oncology, and The University of Chicago Medicine Comprehensive Cancer Center, The University of Chicago, Chicago, IL 60637, USA
| | - Ningfei An
- Department of Pathology, Department of Pediatrics, Section of Hematology/Oncology, and The University of Chicago Medicine Comprehensive Cancer Center, The University of Chicago, Chicago, IL 60637, USA
| | - Saira Khan
- Department of Pathology, Department of Pediatrics, Section of Hematology/Oncology, and The University of Chicago Medicine Comprehensive Cancer Center, The University of Chicago, Chicago, IL 60637, USA
| | - Megan E. McNerney
- Department of Pathology, Department of Pediatrics, Section of Hematology/Oncology, and The University of Chicago Medicine Comprehensive Cancer Center, The University of Chicago, Chicago, IL 60637, USA
| |
Collapse
|
104
|
Gallone G, Haerty W, Disanto G, Ramagopalan SV, Ponting CP, Berlanga-Taylor AJ. Identification of genetic variants affecting vitamin D receptor binding and associations with autoimmune disease. Hum Mol Genet 2017; 26:2164-2176. [PMID: 28335003 PMCID: PMC5886188 DOI: 10.1093/hmg/ddx092] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2016] [Revised: 02/28/2017] [Accepted: 03/07/2017] [Indexed: 01/24/2023] Open
Abstract
Large numbers of statistically significant associations between sentinel SNPs and case-control status have been replicated by genome-wide association studies. Nevertheless, few underlying molecular mechanisms of complex disease are currently known. We investigated whether variation in binding of a transcription factor, the vitamin D receptor (VDR), whose activating ligand vitamin D has been proposed as a modifiable factor in multiple disorders, could explain any of these associations. VDR modifies gene expression by binding DNA as a heterodimer with the Retinoid X receptor (RXR). We identified 43,332 genetic variants significantly associated with altered VDR binding affinity (VDR-BVs) using a high-resolution (ChIP-exo) genome-wide analysis of 27 HapMap lymphoblastoid cell lines. VDR-BVs are enriched in consensus RXR::VDR binding motifs, yet most fell outside of these motifs, implying that genetic variation often affects the binding affinity only indirectly. Finally, we compared 341 VDR-BVs replicating by position in multiple individuals against background sets of variants lying within VDR-binding regions that had been matched in allele frequency and were independent with respect to linkage disequilibrium. In this stringent test, these replicated VDR-BVs were significantly (q < 0.1) and substantially (>2-fold) enriched in genomic intervals associated with autoimmune and other diseases, including inflammatory bowel disease, Crohn's disease and rheumatoid arthritis. The approach's validity is underscored by RXR::VDR motif sequence being predictive of binding strength and being evolutionarily constrained. Our findings are consistent with altered RXR::VDR binding contributing to immunity-related diseases. Replicated VDR-BVs associated with these disorders could represent causal disease risk alleles whose effect may be modifiable by vitamin D levels.
Collapse
Affiliation(s)
- Giuseppe Gallone
- MRC Functional Genomics Unit
- Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3PT, UK
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK
| | - Wilfried Haerty
- MRC Functional Genomics Unit
- Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3PT, UK
| | - Giulio Disanto
- Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3PT, UK
| | | | - Chris P. Ponting
- MRC Functional Genomics Unit
- Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3PT, UK
- MRC Human Genetics Unit, The Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh EH4 2XU, UK
| | - Antonio J. Berlanga-Taylor
- Wellcome Trust Centre for Human Genetics, Nuffield Department of Clinical Medicine, University of Oxford, Oxford OX3 7BN, UK
- CGAT, MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford OX1 3PT, UK
- MRC-PHE Centre for Environment and Health, Department of Epidemiology & Biostatistics, School of Public Health, Faculty of Medicine, Imperial College London, St Mary’s Campus, Norfolk Place, London W2 1PG, UK
| |
Collapse
|
105
|
Bussemaker HJ, Causton HC, Fazlollahi M, Lee E, Muroff I. Network-based approaches that exploit inferred transcription factor activity to analyze the impact of genetic variation on gene expression. ACTA ACUST UNITED AC 2017; 2:98-102. [PMID: 28691107 DOI: 10.1016/j.coisb.2017.04.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Over the past decade, a number of methods have emerged for inferring protein-level transcription factor activities in individual samples based on prior information about the structure of the gene regulatory network. We discuss how this has enabled new methods for dissecting trans-acting mechanisms that underpin genetic variation in gene expression.
Collapse
Affiliation(s)
- Harmen J Bussemaker
- Department of Biological Sciences, Columbia University, New York, NY 10027.,Department of Systems Biology, Columbia University, New York, NY 10032
| | - Helen C Causton
- Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032
| | - Mina Fazlollahi
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, NY 10029
| | - Eunjee Lee
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, NY 10029
| | - Ivor Muroff
- Department of Biological Sciences, Columbia University, New York, NY 10027
| |
Collapse
|
106
|
Dekhang R, Wu C, Smith KM, Lamb TM, Peterson M, Bredeweg EL, Ibarra O, Emerson JM, Karunarathna N, Lyubetskaya A, Azizi E, Hurley JM, Dunlap JC, Galagan JE, Freitag M, Sachs MS, Bell-Pedersen D. The Neurospora Transcription Factor ADV-1 Transduces Light Signals and Temporal Information to Control Rhythmic Expression of Genes Involved in Cell Fusion. G3 (BETHESDA, MD.) 2017; 7:129-142. [PMID: 27856696 PMCID: PMC5217103 DOI: 10.1534/g3.116.034298] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/05/2016] [Accepted: 11/01/2016] [Indexed: 12/20/2022]
Abstract
Light and the circadian clock have a profound effect on the biology of organisms through the regulation of large sets of genes. Toward understanding how light and the circadian clock regulate gene expression, we used genome-wide approaches to identify the direct and indirect targets of the light-responsive and clock-controlled transcription factor ADV-1 in Neurospora crassa A large proportion of ADV-1 targets were found to be light- and/or clock-controlled, and enriched for genes involved in development, metabolism, cell growth, and cell fusion. We show that ADV-1 is necessary for transducing light and/or temporal information to its immediate downstream targets, including controlling rhythms in genes critical to somatic cell fusion. However, while ADV-1 targets are altered in predictable ways in Δadv-1 cells in response to light, this is not always the case for rhythmic target gene expression. These data suggest that a complex regulatory network downstream of ADV-1 functions to generate distinct temporal dynamics of target gene expression relative to the central clock mechanism.
Collapse
Affiliation(s)
- Rigzin Dekhang
- Department of Biology, Texas A&M University, College Station, Texas 77843
| | - Cheng Wu
- Department of Biology, Texas A&M University, College Station, Texas 77843
| | - Kristina M Smith
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon 97331
| | - Teresa M Lamb
- Department of Biology, Texas A&M University, College Station, Texas 77843
| | | | - Erin L Bredeweg
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon 97331
| | - Oneida Ibarra
- Department of Biology, Texas A&M University, College Station, Texas 77843
| | - Jillian M Emerson
- Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire 03755
| | | | | | - Elham Azizi
- Bioinformatics Program, Boston University, Massachusetts 02215
| | - Jennifer M Hurley
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, New York 12180
| | - Jay C Dunlap
- Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire 03755
| | - James E Galagan
- Bioinformatics Program, Boston University, Massachusetts 02215
- National Emerging Infectious Diseases Laboratories, Boston University, Massachusetts 02118
- Department of Microbiology, Boston University, Massachusetts 02215
- Department of Biomedical Engineering, Boston University, Massachusetts 02215
| | - Michael Freitag
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon 97331
| | - Matthew S Sachs
- Department of Biology, Texas A&M University, College Station, Texas 77843
| | | |
Collapse
|
107
|
Handel AE, Gallone G, Zameel Cader M, Ponting CP. Most brain disease-associated and eQTL haplotypes are not located within transcription factor DNase-seq footprints in brain. Hum Mol Genet 2017; 26:79-89. [PMID: 27798116 PMCID: PMC5351933 DOI: 10.1093/hmg/ddw369] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2016] [Revised: 09/19/2016] [Accepted: 10/24/2016] [Indexed: 11/20/2022] Open
Abstract
Dense genotyping approaches have revealed much about the genetic architecture both of gene expression and disease susceptibility. However, assigning causality to genetic variants associated with a transcriptomic or phenotypic trait presents a far greater challenge. The development of epigenomic resources by ENCODE, the Epigenomic Roadmap and others has led to strategies that seek to infer the likely functional variants underlying these genome-wide association signals. It is known, for example, that such variants tend to be located within areas of open chromatin, as detected by techniques such as DNase-seq and FAIRE-seq. We aimed to assess what proportion of variants associated with phenotypic or transcriptomic traits in the human brain are located within transcription factor binding sites. The bioinformatic tools, Wellington and HINT, were used to infer transcription factor footprints from existing DNase-seq data derived from central nervous system tissues with high spatial resolution. This dataset was then employed to assess the likely contribution of altered transcription factor binding to both expression quantitative trait loci (eQTL) and genome-wide association study (GWAS) signals. Surprisingly, we show that most haplotypes associated with GWAS or eQTL phenotypes are located outside of DNase-seq footprints. This could imply that DNase-seq footprinting is too insensitive an approach to identify a large proportion of true transcription factor binding sites. Importantly, this suggests that prioritising variants for genome engineering studies to establish causality will continue to be frustrated by an inability of footprinting to identify the causative variant within a haplotype.
Collapse
Affiliation(s)
- Adam E. Handel
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics
- Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, Oxfordshire, UK
| | - Giuseppe Gallone
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics
| | - M. Zameel Cader
- Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, Oxfordshire, UK
| | - Chris P. Ponting
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics
| |
Collapse
|
108
|
Deplancke B, Alpern D, Gardeux V. The Genetics of Transcription Factor DNA Binding Variation. Cell 2016; 166:538-554. [PMID: 27471964 DOI: 10.1016/j.cell.2016.07.012] [Citation(s) in RCA: 260] [Impact Index Per Article: 28.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2016] [Indexed: 12/23/2022]
Abstract
Most complex trait-associated variants are located in non-coding regulatory regions of the genome, where they have been shown to disrupt transcription factor (TF)-DNA binding motifs. Variable TF-DNA interactions are therefore increasingly considered as key drivers of phenotypic variation. However, recent genome-wide studies revealed that the majority of variable TF-DNA binding events are not driven by sequence alterations in the motif of the studied TF. This observation implies that the molecular mechanisms underlying TF-DNA binding variation and, by extrapolation, inter-individual phenotypic variation are more complex than originally anticipated. Here, we summarize the findings that led to this important paradigm shift and review proposed mechanisms for local, proximal, or distal genetic variation-driven variable TF-DNA binding. In addition, we discuss the biomedical implications of these findings for our ability to dissect the molecular role(s) of non-coding genetic variants in complex traits, including disease susceptibility.
Collapse
Affiliation(s)
- Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland.
| | - Daniel Alpern
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Vincent Gardeux
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
109
|
Chen CY, Shi W, Balaton BP, Matthews AM, Li Y, Arenillas DJ, Mathelier A, Itoh M, Kawaji H, Lassmann T, Hayashizaki Y, Carninci P, Forrest ARR, Brown CJ, Wasserman WW. YY1 binding association with sex-biased transcription revealed through X-linked transcript levels and allelic binding analyses. Sci Rep 2016; 6:37324. [PMID: 27857184 PMCID: PMC5114649 DOI: 10.1038/srep37324] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2016] [Accepted: 10/24/2016] [Indexed: 12/27/2022] Open
Abstract
Sex differences in susceptibility and progression have been reported in numerous diseases. Female cells have two copies of the X chromosome with X-chromosome inactivation imparting mono-allelic gene silencing for dosage compensation. However, a subset of genes, named escapees, escape silencing and are transcribed bi-allelically resulting in sexual dimorphism. Here we conducted in silico analyses of the sexes using human datasets to gain perspectives into such regulation. We identified transcription start sites of escapees (escTSSs) based on higher transcription levels in female cells using FANTOM5 CAGE data. Significant over-representations of YY1 transcription factor binding motif and ChIP-seq peaks around escTSSs highlighted its positive association with escapees. Furthermore, YY1 occupancy is significantly biased towards the inactive X (Xi) at long non-coding RNA loci that are frequent contacts of Xi-specific superloops. Our study suggests a role for YY1 in transcriptional activity on Xi in general through sequence-specific binding, and its involvement at superloop anchors.
Collapse
Affiliation(s)
- Chih-Yu Chen
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, Vancouver, British Columbia, Canada.,Graduate Program in Bioinformatics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Wenqiang Shi
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, Vancouver, British Columbia, Canada.,Graduate Program in Bioinformatics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Bradley P Balaton
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Allison M Matthews
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, Vancouver, British Columbia, Canada
| | - Yifeng Li
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, Vancouver, British Columbia, Canada
| | - David J Arenillas
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, Vancouver, British Columbia, Canada
| | - Anthony Mathelier
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, Vancouver, British Columbia, Canada
| | - Masayoshi Itoh
- RIKEN Omics Science Center, Yokohama, Japan.,RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Japan.,RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako, Saitama, Japan
| | - Hideya Kawaji
- RIKEN Omics Science Center, Yokohama, Japan.,RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Japan.,RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako, Saitama, Japan
| | - Timo Lassmann
- RIKEN Omics Science Center, Yokohama, Japan.,RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Japan
| | - Yoshihide Hayashizaki
- RIKEN Omics Science Center, Yokohama, Japan.,RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako, Saitama, Japan
| | - Piero Carninci
- RIKEN Omics Science Center, Yokohama, Japan.,RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Japan
| | - Alistair R R Forrest
- RIKEN Omics Science Center, Yokohama, Japan.,RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Japan.,Harry Perkins Institute of Medical Research, QEII Medical Centre and Centre for Medical Research, the University of Western Australia, Nedlands, Western Australia, Australia
| | - Carolyn J Brown
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Wyeth W Wasserman
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, Vancouver, British Columbia, Canada.,Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
| |
Collapse
|
110
|
Modeling Gene Regulation in Liver Hepatocellular Carcinoma with Random Forests. BIOMED RESEARCH INTERNATIONAL 2016; 2016:1035945. [PMID: 27818995 PMCID: PMC5080476 DOI: 10.1155/2016/1035945] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Accepted: 09/21/2016] [Indexed: 11/29/2022]
Abstract
Liver hepatocellular carcinoma (HCC) remains a leading cause of cancer-related death. Poor understanding of the mechanisms underlying HCC prevents early detection and leads to high mortality. We developed a random forest model that incorporates copy-number variation, DNA methylation, transcription factor, and microRNA binding information as features to predict gene expression in HCC. Our model achieved a highly significant correlation between predicted and measured expression of held-out genes. Furthermore, we identified potential regulators of gene expression in HCC. Many of these regulators have been previously found to be associated with cancer and are differentially expressed in HCC. We also evaluated our predicted target sets for these regulators by making comparison with experimental results. Lastly, we found that the transcription factor E2F6, one of the candidate regulators inferred by our model, is predictive of survival rate in HCC. Results of this study will provide directions for future prospective studies in HCC.
Collapse
|
111
|
Yalamanchili N, Kriete A, Alfego D, Danowski KM, Kari C, Rodeck U. Distinct Cell Stress Responses Induced by ATP Restriction in Quiescent Human Fibroblasts. Front Genet 2016; 7:171. [PMID: 27757122 PMCID: PMC5047886 DOI: 10.3389/fgene.2016.00171] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Accepted: 09/09/2016] [Indexed: 12/22/2022] Open
Abstract
Quiescence is the prevailing state of many cell types under homeostatic conditions. Yet, surprisingly little is known about how quiescent cells respond to energetic and metabolic challenges. To better understand compensatory responses of quiescent cells to metabolic stress, we established, in human primary dermal fibroblasts, an experimental ‘energy restriction’ model. Quiescence was achieved by short-term culture in serum-deprived media and ATP supply restricted using a combination of glucose transport inhibitors and mitochondrial uncouplers. In aggregate, these measures led to markedly reduced intracellular ATP levels while not compromising cell viability over the observation period of 48 h. Analysis of the transcription factor (TF) landscape induced by this treatment revealed alterations in several signal transduction nodes beyond the expected biosynthetic adaptations. These included increased abundance of NF-κB regulated TFs and altered TF subsets regulated by Akt and p53. The observed changes in gene regulation and corresponding alterations in key signaling nodes are likely to contribute to cell survival at intracellular ATP concentrations substantially below those achieved by growth factor deprivation alone. This experimental model provides a benchmark for the investigation of cell survival pathways and related molecular targets that are associated with restricted energy supply associated with biological aging and metabolic diseases.
Collapse
Affiliation(s)
- Nirupama Yalamanchili
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia PA, USA
| | - Andres Kriete
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia PA, USA
| | - David Alfego
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia PA, USA
| | - Kelli M Danowski
- Department of Dermatology, St. Joseph Mercy Health System, Michigan State University, East Lansing MI, USA
| | - Csaba Kari
- Department of Dermatology and Cutaneous Biology, Thomas Jefferson University, Philadelphia PA, USA
| | - Ulrich Rodeck
- Department of Dermatology and Cutaneous Biology, Thomas Jefferson University, Philadelphia PA, USA
| |
Collapse
|
112
|
Awah CU, Tamm S, Hedtfeld S, Steinemann D, Tümmler B, Tsiavaliaris G, Stanke F. Mechanism of allele specific assembly and disruption of master regulator transcription factor complexes of NF-KBp50, NF-KBp65 and HIF1a on a non-coding FAS SNP. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2016; 1859:1411-1428. [PMID: 27616356 DOI: 10.1016/j.bbagrm.2016.09.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Revised: 08/22/2016] [Accepted: 09/07/2016] [Indexed: 12/30/2022]
Abstract
A challenging question in genetics is to understand the molecular function of non-coding variants of the genome. By using differential EMSA, ChIP and functional genome analysis, we have found that changes in transcription factors (TF) apparent binding affinity and dissociation rates are responsible for allele specific assembly or disruption of master TFs: we observed that NF-KBp50, NF-KBp65 and HIF1a bind with an affinity of up to 10 fold better to the C-allele than to the T-allele of rs7901656 both in vivo and in vitro. Furthermore, we showed that NF-KBp50, p65 and HIF1a form higher order heteromultimeric complexes overlapping rs7901656, implying synergism of action among TFs governing cellular response to infection and hypoxia. With rs7901656 on the FAS gene as a paradigm, we show how allele specific transcription factor complex assembly and disruption by a causal variant contributes to disease and phenotypic diversity. This finding provides the highly needed mechanistic insight into how the molecular etiology of regulatory SNPs can be understood in functional terms.
Collapse
Affiliation(s)
- Chidiebere U Awah
- Department of Paediatric Pneumology, Neonatology and Allergology, Hannover Medical School, Hannover, Germany; Graduate School of Excellence, MD/PhD Programme Molecular Medicine Hannover Biomedical Research School, Hannover Biomedical Research School, Hannover Medical School, Hannover, Germany
| | - Stephanie Tamm
- Department of Paediatric Pneumology, Neonatology and Allergology, Hannover Medical School, Hannover, Germany
| | - Silke Hedtfeld
- Department of Paediatric Pneumology, Neonatology and Allergology, Hannover Medical School, Hannover, Germany
| | - Doris Steinemann
- Institute for Human Genetics, Hannover Medical School, Hannover, Germany
| | - Burkhard Tümmler
- Department of Paediatric Pneumology, Neonatology and Allergology, Hannover Medical School, Hannover, Germany; Biomedical Research in Endstage and Obstructive Lung Disease Hannover (BREATH), Member of the German Centre for Lung Research, Germany
| | | | - Frauke Stanke
- Department of Paediatric Pneumology, Neonatology and Allergology, Hannover Medical School, Hannover, Germany; Biomedical Research in Endstage and Obstructive Lung Disease Hannover (BREATH), Member of the German Centre for Lung Research, Germany.
| |
Collapse
|
113
|
Ehrlichia chaffeensis TRP32 is a Nucleomodulin that Directly Regulates Expression of Host Genes Governing Differentiation and Proliferation. Infect Immun 2016; 84:3182-3194. [PMID: 27572329 DOI: 10.1128/iai.00657-16] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Ehrlichia chaffeensis is an obligately intracellular bacterium that reprograms the mononuclear phagocyte through diverse effector-host interactions to modulate numerous host cell processes, including transcription. In a previous study, we reported that E. chaffeensis TRP32, a type 1 secreted effector, interacts with multiple host nucleus-associated proteins and also auto-activates reporter gene expression in yeast. In this study, we demonstrate that TRP32 is a nucleomodulin that binds host DNA and alters host gene transcription. TRP32 enters the host cell nucleus via a noncanonical translocation mechanism that involves phosphorylation of Y179 located in a C-terminal tri-tyrosine motif. Both genistein and mutation of Y179 inhibited TRP32 nuclear entry. An electromobility shift assay (EMSA) demonstrated TRP32 host DNA binding via its tandem repeat domain. TRP32 DNA binding and motif preference were further confirmed by supershift assays, as well as competition and mutant probe analyses. Using ChIP-Seq, we determined that TRP32 binds a G-rich motif primarily within ±500 bp of the gene transcription start site. An ontology analysis identified genes involved in processes such as immune cell differentiation, chromatin remodeling, and RNA transcription and processing, as primary TRP32 targets. TRP32 bound genes (n=1223) were distributed on all chromosomes and included several global regulators of proliferation and inflammation such as FOS and JUN, AKT3 and NRAS, and non-coding RNA genes, miRNA 21 and miRNA 142. TRP32 target genes were differentially regulated during infection, the majority of which were repressed, and direct repression/activation of these genes by TRP32 was confirmed in vitro with a cellular luciferase reporter assay.
Collapse
|
114
|
Humburg P, Maugeri N, Lee W, Mohr B, Knight JC. Characterisation of the global transcriptional response to heat shock and the impact of individual genetic variation. Genome Med 2016; 8:87. [PMID: 27553423 PMCID: PMC4995779 DOI: 10.1186/s13073-016-0345-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2016] [Accepted: 08/09/2016] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND The heat shock transcriptional response is essential to effective cellular function under stress. This is a highly heritable trait but the nature and extent of inter-individual variation in heat shock response remains unresolved. METHODS We determined global transcription profiles of the heat shock response for a panel of lymphoblastoid cell lines established from 60 founder individuals in the Yoruba HapMap population. We explore the observed differentially expressed gene sets following heat shock, establishing functional annotations, underlying networks and nodal genes involving heat shock factor 1 recruitment. We define a multivariate phenotype for the global transcriptional response to heat shock using partial least squares regression and map this quantitative trait to associated genetic variation in search of the major genomic modulators. RESULTS A comprehensive dataset of differentially expressed genes following heat shock in humans is presented. We identify nodal genes downstream of heat shock factor 1 in this gene set, notably involving ubiquitin C and small ubiquitin-like modifiers together with transcription factors. We dissect a multivariate phenotype for the global heat shock response which reveals distinct clustering of individuals in terms of variance of the heat shock response and involves differential expression of genes involved in DNA replication and cell division in some individuals. We find evidence of genetic associations for this multivariate response phenotype that involves trans effects modulating expression of genes following heat shock, including HSF1 and UBQLN1. CONCLUSION This study defines gene expression following heat shock for a cohort of individuals, establishing insights into the biology of the heat shock response and hypotheses for how variation in this may be modulated by underlying genetic diversity.
Collapse
Affiliation(s)
- Peter Humburg
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Narelle Maugeri
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
- Queensland Institute of Medical Research, Brisbane, 4029 Queensland Australia
| | - Wanseon Lee
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Bert Mohr
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
- Hatter Institute for Cardiovascular Research in Africa, Department of Medicine, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa
| | - Julian C. Knight
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| |
Collapse
|
115
|
Integrating Transcriptomic and Proteomic Data Using Predictive Regulatory Network Models of Host Response to Pathogens. PLoS Comput Biol 2016; 12:e1005013. [PMID: 27403523 PMCID: PMC4942116 DOI: 10.1371/journal.pcbi.1005013] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2015] [Accepted: 06/06/2016] [Indexed: 12/17/2022] Open
Abstract
Mammalian host response to pathogenic infections is controlled by a complex regulatory network connecting regulatory proteins such as transcription factors and signaling proteins to target genes. An important challenge in infectious disease research is to understand molecular similarities and differences in mammalian host response to diverse sets of pathogens. Recently, systems biology studies have produced rich collections of omic profiles measuring host response to infectious agents such as influenza viruses at multiple levels. To gain a comprehensive understanding of the regulatory network driving host response to multiple infectious agents, we integrated host transcriptomes and proteomes using a network-based approach. Our approach combines expression-based regulatory network inference, structured-sparsity based regression, and network information flow to infer putative physical regulatory programs for expression modules. We applied our approach to identify regulatory networks, modules and subnetworks that drive host response to multiple influenza infections. The inferred regulatory network and modules are significantly enriched for known pathways of immune response and implicate apoptosis, splicing, and interferon signaling processes in the differential response of viral infections of different pathogenicities. We used the learned network to prioritize regulators and study virus and time-point specific networks. RNAi-based knockdown of predicted regulators had significant impact on viral replication and include several previously unknown regulators. Taken together, our integrated analysis identified novel module level patterns that capture strain and pathogenicity-specific patterns of expression and helped identify important regulators of host response to influenza infection. An important challenge in infectious disease research is to understand how the human immune system responds to different types of pathogenic infections. An important component of mounting proper response is the transcriptional regulatory network that specifies the context-specific gene expression program in the host cell. However, our understanding of this regulatory network and how it drives context-specific transcriptional programs is incomplete. To address this gap, we performed a network-based analysis of host response to influenza viruses that integrated high-throughput mRNA- and protein measurements and protein-protein interaction networks to identify virus and pathogenicity-specific modules and their upstream physical regulatory programs. We inferred regulatory networks for human cell line and mouse host systems, which recapitulated several known regulators and pathways of the immune response and viral life cycle. We used the networks to study time point and strain-specific subnetworks and to prioritize important regulators of host response. We predicted several novel regulators, both at the mRNA and protein levels, and experimentally verified their role in the virus life cycle based on their ability to significantly impact viral replication.
Collapse
|
116
|
Smith NC, Matthews JM. Mechanisms of DNA-binding specificity and functional gene regulation by transcription factors. Curr Opin Struct Biol 2016; 38:68-74. [PMID: 27295424 DOI: 10.1016/j.sbi.2016.05.006] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2016] [Revised: 05/14/2016] [Accepted: 05/17/2016] [Indexed: 10/21/2022]
Abstract
Eukaryotic transcription factors up-regulate and down-regulate the expression of genes in a very controlled manner. The DNA-binding domains of these proteins have quite well established mechanisms for binding to DNA, but a surprisingly poor intrinsic ability to discriminate target and variant non-target DNA sequences. Here, we summarise established mechanisms of protein-DNA recognition, as specified by both macromolecules. We also review recent advances in the fields of genome binding, molecular dynamics and biomolecular interaction studies that bring us close to a full understanding of how eukaryotic transcription factors find and target DNA in vivo to form functional centres of gene regulation through networks of protein-protein and protein-DNA interactions.
Collapse
Affiliation(s)
- Ngaio C Smith
- School of Life and Environmental Science, The University of Sydney, NSW 2006, Australia
| | - Jacqueline M Matthews
- School of Life and Environmental Science, The University of Sydney, NSW 2006, Australia.
| |
Collapse
|
117
|
Binding Sites in the EFG1 Promoter for Transcription Factors in a Proposed Regulatory Network: A Functional Analysis in the White and Opaque Phases of Candida albicans. G3-GENES GENOMES GENETICS 2016; 6:1725-37. [PMID: 27172219 PMCID: PMC4889668 DOI: 10.1534/g3.116.029785] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In Candida albicans the transcription factor Efg1, which is differentially expressed in the white phase of the white-opaque transition, is essential for expression of the white phenotype. It is one of six transcription factors included in a proposed interactive transcription network regulating white-opaque switching and maintenance of the alternative phenotypes. Ten sites were identified in the EFG1 promoter that differentially bind one or more of the network transcription factors in the white and/or opaque phase. To explore the functionality of these binding sites in the differential expression of EFG1, we generated targeted deletions of each of the 10 binding sites, combinatorial deletions, and regional deletions using a Renillareniformis luciferase reporter system. Individually targeted deletion of only four of the 10 sites had minor effects consistent with differential expression of EFG1, and only in the opaque phase. Alternative explanations are considered.
Collapse
|
118
|
Chen X, Jung JG, Shajahan-Haq AN, Clarke R, Shih IM, Wang Y, Magnani L, Wang TL, Xuan J. ChIP-BIT: Bayesian inference of target genes using a novel joint probabilistic model of ChIP-seq profiles. Nucleic Acids Res 2016; 44:e65. [PMID: 26704972 PMCID: PMC4838354 DOI: 10.1093/nar/gkv1491] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2015] [Revised: 11/16/2015] [Accepted: 12/09/2015] [Indexed: 11/16/2022] Open
Abstract
Chromatin immunoprecipitation with massively parallel DNA sequencing (ChIP-seq) has greatly improved the reliability with which transcription factor binding sites (TFBSs) can be identified from genome-wide profiling studies. Many computational tools are developed to detect binding events or peaks, however the robust detection of weak binding events remains a challenge for current peak calling tools. We have developed a novel Bayesian approach (ChIP-BIT) to reliably detect TFBSs and their target genes by jointly modeling binding signal intensities and binding locations of TFBSs. Specifically, a Gaussian mixture model is used to capture both binding and background signals in sample data. As a unique feature of ChIP-BIT, background signals are modeled by a local Gaussian distribution that is accurately estimated from the input data. Extensive simulation studies showed a significantly improved performance of ChIP-BIT in target gene prediction, particularly for detecting weak binding signals at gene promoter regions. We applied ChIP-BIT to find target genes from NOTCH3 and PBX1 ChIP-seq data acquired from MCF-7 breast cancer cells. TF knockdown experiments have initially validated about 30% of co-regulated target genes identified by ChIP-BIT as being differentially expressed in MCF-7 cells. Functional analysis on these genes further revealed the existence of crosstalk between Notch and Wnt signaling pathways.
Collapse
Affiliation(s)
- Xi Chen
- Bradley Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, 900 North Glebe Road, Arlington, VA 22203, USA
| | - Jin-Gyoung Jung
- Department of Pathology, Johns Hopkins Medical Institutions, 1550 Orleans Street, CRB-II, Baltimore, MD 21231, USA
| | - Ayesha N Shajahan-Haq
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, 3970 Reservoir Road NW, Washington, DC 20057, USA
| | - Robert Clarke
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, 3970 Reservoir Road NW, Washington, DC 20057, USA
| | - Ie-Ming Shih
- Department of Pathology, Johns Hopkins Medical Institutions, 1550 Orleans Street, CRB-II, Baltimore, MD 21231, USA
| | - Yue Wang
- Bradley Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, 900 North Glebe Road, Arlington, VA 22203, USA
| | - Luca Magnani
- Department of Surgery and Cancer, Imperial College London, ICTEM building, Hammersmith Hospital, DuCane Road, London W120NN, UK
| | - Tian-Li Wang
- Department of Pathology, Johns Hopkins Medical Institutions, 1550 Orleans Street, CRB-II, Baltimore, MD 21231, USA
| | - Jianhua Xuan
- Bradley Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, 900 North Glebe Road, Arlington, VA 22203, USA
| |
Collapse
|
119
|
Cusanovich DA, Caliskan M, Billstrand C, Michelini K, Chavarria C, De Leon S, Mitrano A, Lewellyn N, Elias JA, Chupp GL, Lang RM, Shah SJ, Decara JM, Gilad Y, Ober C. Integrated analyses of gene expression and genetic association studies in a founder population. Hum Mol Genet 2016; 25:2104-2112. [PMID: 26931462 PMCID: PMC5062579 DOI: 10.1093/hmg/ddw061] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2015] [Accepted: 02/21/2016] [Indexed: 12/17/2022] Open
Abstract
Genome-wide association studies (GWASs) have become a standard tool for dissecting genetic contributions to disease risk. However, these studies typically require extraordinarily large sample sizes to be adequately powered. Strategies that incorporate functional information alongside genetic associations have proved successful in increasing GWAS power. Following this paradigm, we present the results of 20 different genetic association studies for quantitative traits related to complex diseases, conducted in the Hutterites of South Dakota. To boost the power of these association studies, we collected RNA-sequencing data from lymphoblastoid cell lines for 431 Hutterite individuals. We then used Sherlock, a tool that integrates GWAS and expression quantitative trait locus (eQTL) data, to identify weak GWAS signals that are also supported by eQTL data. Using this approach, we found novel associations with quantitative phenotypes related to cardiovascular disease, including carotid intima-media thickness, left atrial volume index, monocyte count and serum YKL-40 levels.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Jack A Elias
- Division of Biology and Medicine, Brown University, Providence, RI 02912, USA and
| | - Geoffrey L Chupp
- Pulmonary and Critical Care, Yale School of Medicine, New Haven, CT 06519, USA
| | - Roberto M Lang
- Department of Medicine, Section of Cardiology, University of Chicago, Chicago, IL 60637, USA
| | - Sanjiv J Shah
- Department of Medicine, Section of Cardiology, University of Chicago, Chicago, IL 60637, USA
| | - Jeanne M Decara
- Department of Medicine, Section of Cardiology, University of Chicago, Chicago, IL 60637, USA
| | | | | |
Collapse
|
120
|
Douglas GM, Wilson MD, Moses AM. Decreased Transcription Factor Binding Levels Nearby Primate Pseudogenes Suggest Regulatory Degeneration. Mol Biol Evol 2016; 33:1478-85. [PMID: 26882985 PMCID: PMC4868113 DOI: 10.1093/molbev/msw030] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Characteristics of pseudogene degeneration at the coding level are well-known, such as a shift toward neutral rates of nonsynonymous substitutions and gain of frameshift mutations. In contrast, degeneration of pseudogene transcriptional regulation is not well understood. Here, we test two predictions of regulatory degeneration along a pseudogenized lineage: 1) Decreased transcription factor (TF) binding and 2) accelerated evolution in putative cis-regulatory regions.We find evidence for decreased TF binding levels nearby two primate pseudogenes compared with functional liver genes. However, the majority of TF-bound sequences nearby pseudogenes do not show evidence for lineage-specific accelerated rates of evolution. We conclude that decreases in TF binding level could be a marker for regulatory degeneration, while sequence degeneration in primate cis-regulatory modules may be obscured by background rates of TF binding site turnover.
Collapse
Affiliation(s)
- Gavin M Douglas
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - Michael D Wilson
- Genetics and Genome Biology Program, SickKids Research Institute, Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Alan M Moses
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada Department of Cell and Systems Biology, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
121
|
Functional transcription factor target discovery via compendia of binding and expression profiles. Sci Rep 2016; 6:20649. [PMID: 26857150 PMCID: PMC4746627 DOI: 10.1038/srep20649] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Accepted: 01/07/2016] [Indexed: 12/21/2022] Open
Abstract
Genome-wide experiments to map the DNA-binding locations of transcription-associated factors (TFs) have shown that the number of genes bound by a TF far exceeds the number of possible direct target genes. Distinguishing functional from non-functional binding is therefore a major challenge in the study of transcriptional regulation. We hypothesized that functional targets can be discovered by correlating binding and expression profiles across multiple experimental conditions. To test this hypothesis, we obtained ChIP-seq and RNA-seq data from matching cell types from the human ENCODE resource, considered promoter-proximal and distal cumulative regulatory models to map binding sites to genes, and used a combination of linear and non-linear measures to correlate binding and expression data. We found that a high degree of correlation between a gene’s TF-binding and expression profiles was significantly more predictive of the gene being differentially expressed upon knockdown of that TF, compared to using binding sites in the cell type of interest only. Remarkably, TF targets predicted from correlation across a compendium of cell types were also predictive of functional targets in other cell types. Finally, correlation across a time course of ChIP-seq and RNA-seq experiments was also predictive of functional TF targets in that tissue.
Collapse
|
122
|
Liao BY, Weng MP. Functionalities of expressed messenger RNAs revealed from mutant phenotypes. WILEY INTERDISCIPLINARY REVIEWS-RNA 2016; 7:416-27. [PMID: 26748449 DOI: 10.1002/wrna.1329] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Revised: 11/23/2015] [Accepted: 12/02/2015] [Indexed: 12/11/2022]
Abstract
Total messenger RNAs mRNAs that are produced from a given gene under a certain set of conditions include both functional and nonfunctional transcripts. The high prevalence of nonfunctional mRNAs that have been detected in cells has raised questions regarding the functional implications of mRNA expression patterns and divergences. Phenotypes that result from the mutagenesis of protein-coding genes have provided the most straightforward descriptions of gene functions, and such data obtained from model organisms have facilitated investigations of the functionalities of expressed mRNAs. Mutant phenotype data from mouse tissues have revealed various attributes of functional mRNAs, including tissue-specificity, strength of expression, and evolutionary conservation. In addition, the role that mRNA expression evolution plays in driving morphological evolution has been revealed from studies designed to exploit morphological and physiological phenotypes of mouse mutants. Investigations into yeast essential genes (defined by an absence of colony growth after gene deletion) have further described gene regulatory strategies that reduce protein expression noise by mediating the rates of transcription and translation. In addition to the functional significance of expressed mRNAs as described in the abovementioned findings, the functionalities of other type of RNAs (i.e., noncoding RNAs) remain to be characterized with systematic mutations and phenotyping of the DNA regions that encode these RNA molecules. WIREs RNA 2016, 7:416-427. doi: 10.1002/wrna.1329 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Ben-Yang Liao
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County, Taiwan, Republic of China
| | - Meng-Pin Weng
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County, Taiwan, Republic of China
| |
Collapse
|
123
|
Kaur S, Mirza AH, Brorsson CA, Fløyel T, Størling J, Mortensen HB, Pociot F. The genetic and regulatory architecture of ERBB3-type 1 diabetes susceptibility locus. Mol Cell Endocrinol 2016; 419:83-91. [PMID: 26450151 DOI: 10.1016/j.mce.2015.10.002] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Revised: 09/29/2015] [Accepted: 10/01/2015] [Indexed: 12/11/2022]
Abstract
The study aimed to explore the role of ERBB3 in type 1 diabetes (T1D). We examined whether genetic variation of ERBB3 (rs2292239) affects residual β-cell function in T1D cases. Furthermore, we examined the expression of ERBB3 in human islets, the effect of ERBB3 knockdown on apoptosis in insulin-producing INS-1E cells and the genetic and regulatory architecture of the ERBB3 locus to provide insights to how rs2292239 may confer disease susceptibility. rs2292239 strongly correlated with residual β-cell function and metabolic control in children with T1D. ERBB3 locus associated lncRNA (NONHSAG011351) was found to be expressed in human islets. ERBB3 was expressed and down-regulated by pro-inflammatory cytokines in human islets and INS-1E cells; knockdown of ERBB3 in INS-1E cells decreased basal and cytokine-induced apoptosis. Our data suggests an important functional role of ERBB3 and its potential regulators in the β-cells and may constitute novel targets to prevent β-cell destruction in T1D.
Collapse
Affiliation(s)
- Simranjeet Kaur
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics, Herlev University Hospital, Herlev Ringvej 75, DK-2730 Herlev, Denmark; Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Aashiq H Mirza
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics, Herlev University Hospital, Herlev Ringvej 75, DK-2730 Herlev, Denmark; Faculty of Health and Medical Sciences, University of Copenhagen, Denmark; Center for Non-coding RNA in Technology and Health, University of Copenhagen, Denmark
| | - Caroline A Brorsson
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics, Herlev University Hospital, Herlev Ringvej 75, DK-2730 Herlev, Denmark
| | - Tina Fløyel
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics, Herlev University Hospital, Herlev Ringvej 75, DK-2730 Herlev, Denmark
| | - Joachim Størling
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics, Herlev University Hospital, Herlev Ringvej 75, DK-2730 Herlev, Denmark
| | - Henrik B Mortensen
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics, Herlev University Hospital, Herlev Ringvej 75, DK-2730 Herlev, Denmark; Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Flemming Pociot
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics, Herlev University Hospital, Herlev Ringvej 75, DK-2730 Herlev, Denmark; Faculty of Health and Medical Sciences, University of Copenhagen, Denmark; Center for Non-coding RNA in Technology and Health, University of Copenhagen, Denmark.
| |
Collapse
|
124
|
Wu WS, Lai FJ. Functional redundancy of transcription factors explains why most binding targets of a transcription factor are not affected when the transcription factor is knocked out. BMC SYSTEMS BIOLOGY 2015; 9 Suppl 6:S2. [PMID: 26678747 PMCID: PMC4674858 DOI: 10.1186/1752-0509-9-s6-s2] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Background Biologists are puzzled by the extremely low percentage (3%) of the binding targets of a yeast transcription factor (TF) affected when the TF is knocked out, a phenomenon observed by comparing the TF binding dataset and TF knockout effect dataset. Results This study gives a plausible biological explanation of this counterintuitive phenomenon. Our analyses find that TFs with high functional redundancy show significantly lower percentage than do TFs with low functional redundancy. This suggests that functional redundancy may lead to one TF compensating for another, thus masking the TF knockout effect on the binding targets of the knocked-out TF. In addition, we show that seven classes of genes (lowly expressed genes, TATA box-less genes, genes containing a nucleosome-free region immediately upstream of the transcriptional start site (TSS), genes with low transcriptional plasticity, genes with a low number of bound TFs, genes with a low number of TFBSs, and genes with a short average distance of TFBSs to the TSS) are insensitive to the knockout of their promoter-binding TFs, providing clues for finding other biological explanations of the surprisingly low percentage of the binding targets of a TF affected when the TF is knocked out. Conclusions This study shows that one property of TFs (functional redundancy) and seven properties of genes (expression level, TATA box, nucleosome, transcriptional plasticity, the number of bound TFs, the number of TFBSs, and the average distance of TFBSs to the TSS) may be useful for explaining a counterintuitive phenomenon: most binding targets of a yeast transcription factor are not affected when the transcription factor is knocked out.
Collapse
|
125
|
Svetlichnyy D, Imrichova H, Fiers M, Kalender Atak Z, Aerts S. Identification of High-Impact cis-Regulatory Mutations Using Transcription Factor Specific Random Forest Models. PLoS Comput Biol 2015; 11:e1004590. [PMID: 26562774 PMCID: PMC4642938 DOI: 10.1371/journal.pcbi.1004590] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2015] [Accepted: 10/10/2015] [Indexed: 02/02/2023] Open
Abstract
Cancer genomes contain vast amounts of somatic mutations, many of which are passenger mutations not involved in oncogenesis. Whereas driver mutations in protein-coding genes can be distinguished from passenger mutations based on their recurrence, non-coding mutations are usually not recurrent at the same position. Therefore, it is still unclear how to identify cis-regulatory driver mutations, particularly when chromatin data from the same patient is not available, thus relying only on sequence and expression information. Here we use machine-learning methods to predict functional regulatory regions using sequence information alone, and compare the predicted activity of the mutated region with the reference sequence. This way we define the Predicted Regulatory Impact of a Mutation in an Enhancer (PRIME). We find that the recently identified driver mutation in the TAL1 enhancer has a high PRIME score, representing a “gain-of-target” for MYB, whereas the highly recurrent TERT promoter mutation has a surprisingly low PRIME score. We trained Random Forest models for 45 cancer-related transcription factors, and used these to score variations in the HeLa genome and somatic mutations across more than five hundred cancer genomes. Each model predicts only a small fraction of non-coding mutations with a potential impact on the function of the encompassing regulatory region. Nevertheless, as these few candidate driver mutations are often linked to gains in chromatin activity and gene expression, they may contribute to the oncogenic program by altering the expression levels of specific oncogenes and tumor suppressor genes. Precise regulation of gene expression is controlled by cis-regulatory modules (CRM) containing binding sites for transcription factors (TF). The genome-wide location of all TF binding sites can often be obtained by ChIP-seq (chromatin immunoprecipitation followed by deep sequencing), yet in most cases only a minority of the binding peaks actually represent functional CRMs that control the transcription initiation of a bona fide TF target gene. Here, we investigated for 45 cancer-related TFs how machine-learning approaches can be used to predict functional TF target CRMs. After careful evaluation of their performance, we used these TF-target classifiers to predict which cis-regulatory mutations may have a significant impact on gene regulation by evaluating whether the mutation causes a significant gain or loss in the probability that the CRM is a functional TF target. We found that Random Forest classifiers can achieve more than 100-fold higher specificity for mutation prediction compared to the simple approaches based on scanning with position weight matrices. By scanning somatic mutations in breast cancer genomes and in the HeLa genome, we finally show that our TF-target classifiers can identify high impact non-coding mutations that are associated with concordant TF binding, gene expression changes and chromatin activity. In conclusion, TF-specific Random Forest classifiers can be used to prioritize cis-regulatory mutations in cancer genomes with high accuracy.
Collapse
Affiliation(s)
- Dmitry Svetlichnyy
- Laboratory of Computational Biology, KU Leuven Center for Human Genetics, Leuven, Belgium
| | - Hana Imrichova
- Laboratory of Computational Biology, KU Leuven Center for Human Genetics, Leuven, Belgium
| | - Mark Fiers
- VIB Center for the Biology of Disease, Leuven, Belgium
| | - Zeynep Kalender Atak
- Laboratory of Computational Biology, KU Leuven Center for Human Genetics, Leuven, Belgium
| | - Stein Aerts
- Laboratory of Computational Biology, KU Leuven Center for Human Genetics, Leuven, Belgium
- * E-mail:
| |
Collapse
|
126
|
Moyerbrailean GA, Davis GO, Harvey CT, Watza D, Wen X, Pique-Regi R, Luca F. A high-throughput RNA-seq approach to profile transcriptional responses. Sci Rep 2015; 5:14976. [PMID: 26510397 PMCID: PMC4625130 DOI: 10.1038/srep14976] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2015] [Accepted: 09/01/2015] [Indexed: 12/12/2022] Open
Abstract
In recent years RNA-seq protocols have been developed to investigate a variety of biological problems by measuring the abundance of different RNAs. Many study designs involve performing expensive preliminary studies to screen or optimize experimental conditions. Testing a large number of conditions in parallel may be more cost effective. For example, analyzing tissue/environment-specific gene expression generally implies screening a large number of cellular conditions and samples, without prior knowledge of which conditions are most informative (e.g., some cell types may not respond to certain treatments). To circumvent these challenges, we have established a new two-step high-throughput RNA-seq approach: the first step consists of gene expression screening of a large number of conditions, while the second step focuses on deep sequencing of the most relevant conditions (e.g., largest number of differentially expressed genes). This study design allows for a fast and economical screen in step one, with a more efficient allocation of resources for the deep sequencing of the most biologically relevant libraries in step two. We have applied this approach to study the response to 23 treatments in three lymphoblastoid cell lines demonstrating that it should also be useful for other high-throughput transcriptome profiling applications requiring iterative refinement or screening.
Collapse
Affiliation(s)
- G A Moyerbrailean
- Wayne State University, Center for Molecular Medicine and Genetics, Detroit, 48201, USA
| | - G O Davis
- Wayne State University, Center for Molecular Medicine and Genetics, Detroit, 48201, USA
| | - C T Harvey
- Wayne State University, Center for Molecular Medicine and Genetics, Detroit, 48201, USA
| | - D Watza
- Wayne State University, Center for Molecular Medicine and Genetics, Detroit, 48201, USA
| | - X Wen
- University of Michigan, Department of Biostatistics, Ann Arbor, postcode, USA
| | - R Pique-Regi
- Wayne State University, Center for Molecular Medicine and Genetics, Detroit, 48201, USA.,Wayne State University, Department of Obstetrics and Gynecology, Detroit, 48201, USA
| | - F Luca
- Wayne State University, Center for Molecular Medicine and Genetics, Detroit, 48201, USA.,Wayne State University, Department of Obstetrics and Gynecology, Detroit, 48201, USA
| |
Collapse
|
127
|
Liscovitch N, Bazak L, Levanon EY, Chechik G. Positive correlation between ADAR expression and its targets suggests a complex regulation mediated by RNA editing in the human brain. RNA Biol 2015; 11:1447-56. [PMID: 25692240 DOI: 10.4161/15476286.2014.992286] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
A-to-I RNA editing by adenosine deaminases acting on RNA is a post-transcriptional modification that is crucial for normal life and development in vertebrates. RNA editing has been shown to be very abundant in the human transcriptome, specifically at the primate-specific Alu elements. The functional role of this wide-spread effect is still not clear; it is believed that editing of transcripts is a mechanism for their down-regulation via processes such as nuclear retention or RNA degradation. Here we combine 2 neural gene expression datasets with genome-level editing information to examine the relation between the expression of ADAR genes with the expression of their target genes. Specifically, we computed the spatial correlation across structures of post-mortem human brains between ADAR and a large set of targets that were found to be edited in their Alu repeats. Surprisingly, we found that a large fraction of the edited genes are positively correlated with ADAR, opposing the assumption that editing would reduce expression. When considering the correlations between ADAR and its targets over development, 2 gene subsets emerge, positively correlated and negatively correlated with ADAR expression. Specifically, in embryonic time points, ADAR is positively correlated with many genes related to RNA processing and regulation of gene expression. These findings imply that the suggested mechanism of regulation of expression by editing is probably not a global one; ADAR expression does not have a genome wide effect reducing the expression of editing targets. It is possible, however, that RNA editing by ADAR in non-coding regions of the gene might be a part of a more complex expression regulation mechanism.
Collapse
Affiliation(s)
- Noa Liscovitch
- a Gonda Multidisiplinary Brain Research Center ; Bar-Ilan University ; Ramat Gan , Israel
| | | | | | | |
Collapse
|
128
|
Muiño JM, de Bruijn S, Pajoro A, Geuten K, Vingron M, Angenent GC, Kaufmann K. Evolution of DNA-Binding Sites of a Floral Master Regulatory Transcription Factor. Mol Biol Evol 2015; 33:185-200. [PMID: 26429922 PMCID: PMC4693976 DOI: 10.1093/molbev/msv210] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Flower development is controlled by the action of key regulatory transcription factors of the MADS-domain family. The function of these factors appears to be highly conserved among species based on mutant phenotypes. However, the conservation of their downstream processes is much less well understood, mostly because the evolutionary turnover and variation of their DNA-binding sites (BSs) among plant species have not yet been experimentally determined. Here, we performed comparative ChIP (chromatin immunoprecipitation)-seq experiments of the MADS-domain transcription factor SEPALLATA3 (SEP3) in two closely related Arabidopsis species: Arabidopsis thaliana and A. lyrata which have very similar floral organ morphology. We found that BS conservation is associated with DNA sequence conservation, the presence of the CArG-box BS motif and on the relative position of the BS to its potential target gene. Differences in genome size and structure can explain that SEP3 BSs in A. lyrata can be located more distantly to their potential target genes than their counterparts in A. thaliana. In A. lyrata, we identified transposition as a mechanism to generate novel SEP3 binding locations in the genome. Comparative gene expression analysis shows that the loss/gain of BSs is associated with a change in gene expression. In summary, this study investigates the evolutionary dynamics of DNA BSs of a floral key-regulatory transcription factor and explores factors affecting this phenomenon.
Collapse
Affiliation(s)
- Jose M Muiño
- Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany Laboratory of Bioinformatics, Wageningen University, Wageningen, The Netherlands
| | - Suzanne de Bruijn
- Institute for Biochemistry and Biology, Potsdam University, Potsdam, Germany Laboratory of Molecular Biology, Wageningen University, Wageningen, The Netherlands
| | - Alice Pajoro
- Laboratory of Molecular Biology, Wageningen University, Wageningen, The Netherlands
| | - Koen Geuten
- Laboratory of Molecular Plant Biology, Department of Biology, University of Leuven (KU Leuven), Leuven, Belgium
| | - Martin Vingron
- Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Gerco C Angenent
- Laboratory of Molecular Biology, Wageningen University, Wageningen, The Netherlands Bioscience, Plant Research International, Wageningen, The Netherlands
| | - Kerstin Kaufmann
- Institute for Biochemistry and Biology, Potsdam University, Potsdam, Germany
| |
Collapse
|
129
|
Genetic Variation Determines PPARγ Function and Anti-diabetic Drug Response In Vivo. Cell 2015; 162:33-44. [PMID: 26140591 DOI: 10.1016/j.cell.2015.06.025] [Citation(s) in RCA: 98] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2014] [Revised: 03/25/2015] [Accepted: 05/21/2015] [Indexed: 12/25/2022]
Abstract
SNPs affecting disease risk often reside in non-coding genomic regions. Here, we show that SNPs are highly enriched at mouse strain-selective adipose tissue binding sites for PPARγ, a nuclear receptor for anti-diabetic drugs. Many such SNPs alter binding motifs for PPARγ or cooperating factors and functionally regulate nearby genes whose expression is strain selective and imbalanced in heterozygous F1 mice. Moreover, genetically determined binding of PPARγ accounts for mouse strain-specific transcriptional effects of TZD drugs, providing proof of concept for personalized medicine related to nuclear receptor genomic occupancy. In human fat, motif-altering SNPs cause differential PPARγ binding, provide a molecular mechanism for some expression quantitative trait loci, and are risk factors for dysmetabolic traits in genome-wide association studies. One PPARγ motif-altering SNP is associated with HDL levels and other metabolic syndrome parameters. Thus, natural genetic variation in PPARγ genomic occupancy determines individual disease risk and drug response.
Collapse
|
130
|
Kok K, Ay A, Li LM, Arnosti DN. Genome-wide errant targeting by Hairy. eLife 2015; 4. [PMID: 26305409 PMCID: PMC4547095 DOI: 10.7554/elife.06394] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2015] [Accepted: 07/24/2015] [Indexed: 01/08/2023] Open
Abstract
Metazoan transcriptional repressors regulate chromatin through diverse histone modifications. Contributions of individual factors to the chromatin landscape in development is difficult to establish, as global surveys reflect multiple changes in regulators. Therefore, we studied the conserved Hairy/Enhancer of Split family repressor Hairy, analyzing histone marks and gene expression in Drosophila embryos. This long-range repressor mediates histone acetylation and methylation in large blocks, with highly context-specific effects on target genes. Most strikingly, Hairy exhibits biochemical activity on many loci that are uncoupled to changes in gene expression. Rather than representing inert binding sites, as suggested for many eukaryotic factors, many regions are targeted errantly by Hairy to modify the chromatin landscape. Our findings emphasize that identification of active cis-regulatory elements must extend beyond the survey of prototypical chromatin marks. We speculate that this errant activity may provide a path for creation of new regulatory elements, facilitating the evolution of novel transcriptional circuits.
Collapse
Affiliation(s)
- Kurtulus Kok
- Genetics Program, Michigan State University, East Lansing, United States
| | - Ahmet Ay
- Departments of Biology and Mathematics, Colgate University, Hamilton, United States
| | - Li M Li
- Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, United States
| | - David N Arnosti
- Genetics Program, Michigan State University, East Lansing, United States
| |
Collapse
|
131
|
Waszak SM, Delaneau O, Gschwind AR, Kilpinen H, Raghav SK, Witwicki RM, Orioli A, Wiederkehr M, Panousis NI, Yurovsky A, Romano-Palumbo L, Planchon A, Bielser D, Padioleau I, Udin G, Thurnheer S, Hacker D, Hernandez N, Reymond A, Deplancke B, Dermitzakis ET. Population Variation and Genetic Control of Modular Chromatin Architecture in Humans. Cell 2015; 162:1039-50. [PMID: 26300124 DOI: 10.1016/j.cell.2015.08.001] [Citation(s) in RCA: 156] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Revised: 03/17/2015] [Accepted: 07/29/2015] [Indexed: 12/12/2022]
Abstract
Chromatin state variation at gene regulatory elements is abundant across individuals, yet we understand little about the genetic basis of this variability. Here, we profiled several histone modifications, the transcription factor (TF) PU.1, RNA polymerase II, and gene expression in lymphoblastoid cell lines from 47 whole-genome sequenced individuals. We observed that distinct cis-regulatory elements exhibit coordinated chromatin variation across individuals in the form of variable chromatin modules (VCMs) at sub-Mb scale. VCMs were associated with thousands of genes and preferentially cluster within chromosomal contact domains. We mapped strong proximal and weak, yet more ubiquitous, distal-acting chromatin quantitative trait loci (cQTL) that frequently explain this variation. cQTLs were associated with molecular activity at clusters of cis-regulatory elements and mapped preferentially within TF-bound regions. We propose that local, sequence-independent chromatin variation emerges as a result of genetic perturbations in cooperative interactions between cis-regulatory elements that are located within the same genomic domain.
Collapse
Affiliation(s)
- Sebastian M Waszak
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland; Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Olivier Delaneau
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland; Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva 1211, Switzerland; Institute of Genetics and Genomics in Geneva, University of Geneva, Geneva 1211, Switzerland
| | - Andreas R Gschwind
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland; Center for Integrative Genomics, Faculty of Biology and Medicine, University of Lausanne, Lausanne 1015, Switzerland
| | - Helena Kilpinen
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland; Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva 1211, Switzerland; Institute of Genetics and Genomics in Geneva, University of Geneva, Geneva 1211, Switzerland
| | - Sunil K Raghav
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| | - Robert M Witwicki
- Center for Integrative Genomics, Faculty of Biology and Medicine, University of Lausanne, Lausanne 1015, Switzerland
| | - Andrea Orioli
- Center for Integrative Genomics, Faculty of Biology and Medicine, University of Lausanne, Lausanne 1015, Switzerland
| | - Michael Wiederkehr
- Center for Integrative Genomics, Faculty of Biology and Medicine, University of Lausanne, Lausanne 1015, Switzerland
| | - Nikolaos I Panousis
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland; Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva 1211, Switzerland; Institute of Genetics and Genomics in Geneva, University of Geneva, Geneva 1211, Switzerland
| | - Alisa Yurovsky
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland; Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva 1211, Switzerland; Institute of Genetics and Genomics in Geneva, University of Geneva, Geneva 1211, Switzerland
| | - Luciana Romano-Palumbo
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva 1211, Switzerland
| | - Alexandra Planchon
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva 1211, Switzerland
| | - Deborah Bielser
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva 1211, Switzerland
| | - Ismael Padioleau
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland; Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva 1211, Switzerland; Institute of Genetics and Genomics in Geneva, University of Geneva, Geneva 1211, Switzerland
| | - Gilles Udin
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| | - Sarah Thurnheer
- Protein Expression Core Facility, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| | - David Hacker
- Protein Expression Core Facility, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| | - Nouria Hernandez
- Center for Integrative Genomics, Faculty of Biology and Medicine, University of Lausanne, Lausanne 1015, Switzerland
| | - Alexandre Reymond
- Center for Integrative Genomics, Faculty of Biology and Medicine, University of Lausanne, Lausanne 1015, Switzerland
| | - Bart Deplancke
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland; Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland.
| | - Emmanouil T Dermitzakis
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland; Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva 1211, Switzerland; Institute of Genetics and Genomics in Geneva, University of Geneva, Geneva 1211, Switzerland.
| |
Collapse
|
132
|
Ground State Conditions Induce Rapid Reorganization of Core Pluripotency Factor Binding before Global Epigenetic Reprogramming. Cell Stem Cell 2015; 17:462-70. [PMID: 26235340 DOI: 10.1016/j.stem.2015.07.005] [Citation(s) in RCA: 90] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Revised: 03/25/2015] [Accepted: 07/08/2015] [Indexed: 12/17/2022]
Abstract
Mouse embryonic stem cells (mESCs) cultured under serum/LIF conditions exhibit heterogeneous expression of pluripotency-associated factors that can be overcome by two inhibitors (2i) of the MEK and GSK3 pathways. Several studies have shown that the "ground state" induced by 2i is characterized by global hypomethylation and specific transcriptional profiles, but little is known about the contributing effectors. Here we show that 2i conditions rapidly alter the global binding landscape of OCT4, SOX2, and NANOG. The dynamic binding influences enhancer activity and shows enrichment for regulators linked to Wnt and Erk signaling. Epigenomic characterization provided limited insights to the immediate transcriptional dynamics, suggesting that these are likely more secondary effects. Likewise, loss of the PRC2 component EED to prevent H3K27me3 deposition had minimal effect on the transcriptome, implying that it is largely dispensable for continued repression of bivalent genes and de novo silencing in 2i.
Collapse
|
133
|
Liu S, Kracher B, Ziegler J, Birkenbihl RP, Somssich IE. Negative regulation of ABA signaling by WRKY33 is critical for Arabidopsis immunity towards Botrytis cinerea 2100. eLife 2015. [PMID: 26076231 DOI: 10.7554/elife.07295.033] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/14/2023] Open
Abstract
The Arabidopsis mutant wrky33 is highly susceptible to Botrytis cinerea. We identified >1680 Botrytis-induced WRKY33 binding sites associated with 1576 Arabidopsis genes. Transcriptional profiling defined 318 functional direct target genes at 14 hr post inoculation. Comparative analyses revealed that WRKY33 possesses dual functionality acting either as a repressor or as an activator in a promoter-context dependent manner. We confirmed known WRKY33 targets involved in hormone signaling and phytoalexin biosynthesis, but also uncovered a novel negative role of abscisic acid (ABA) in resistance towards B. cinerea 2100. The ABA biosynthesis genes NCED3 and NCED5 were identified as direct targets required for WRKY33-mediated resistance. Loss-of-WRKY33 function resulted in elevated ABA levels and genetic studies confirmed that WRKY33 acts upstream of NCED3/NCED5 to negatively regulate ABA biosynthesis. This study provides the first detailed view of the genome-wide contribution of a specific plant transcription factor in modulating the transcriptional network associated with plant immunity.
Collapse
Affiliation(s)
- Shouan Liu
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Barbara Kracher
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Jörg Ziegler
- Department of Molecular Signal Processing, Leibniz Institute of Plant Biochemistry, Halle, Germany
| | - Rainer P Birkenbihl
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Imre E Somssich
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| |
Collapse
|
134
|
Liu S, Kracher B, Ziegler J, Birkenbihl RP, Somssich IE. Negative regulation of ABA signaling by WRKY33 is critical for Arabidopsis immunity towards Botrytis cinerea 2100. eLife 2015. [PMID: 26076231 DOI: 10.7554/elife.07295.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/15/2023] Open
Abstract
The Arabidopsis mutant wrky33 is highly susceptible to Botrytis cinerea. We identified >1680 Botrytis-induced WRKY33 binding sites associated with 1576 Arabidopsis genes. Transcriptional profiling defined 318 functional direct target genes at 14 hr post inoculation. Comparative analyses revealed that WRKY33 possesses dual functionality acting either as a repressor or as an activator in a promoter-context dependent manner. We confirmed known WRKY33 targets involved in hormone signaling and phytoalexin biosynthesis, but also uncovered a novel negative role of abscisic acid (ABA) in resistance towards B. cinerea 2100. The ABA biosynthesis genes NCED3 and NCED5 were identified as direct targets required for WRKY33-mediated resistance. Loss-of-WRKY33 function resulted in elevated ABA levels and genetic studies confirmed that WRKY33 acts upstream of NCED3/NCED5 to negatively regulate ABA biosynthesis. This study provides the first detailed view of the genome-wide contribution of a specific plant transcription factor in modulating the transcriptional network associated with plant immunity.
Collapse
Affiliation(s)
- Shouan Liu
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Barbara Kracher
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Jörg Ziegler
- Department of Molecular Signal Processing, Leibniz Institute of Plant Biochemistry, Halle, Germany
| | - Rainer P Birkenbihl
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Imre E Somssich
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| |
Collapse
|
135
|
Liu S, Kracher B, Ziegler J, Birkenbihl RP, Somssich IE. Negative regulation of ABA signaling by WRKY33 is critical for Arabidopsis immunity towards Botrytis cinerea 2100. eLife 2015; 4:e07295. [PMID: 26076231 PMCID: PMC4487144 DOI: 10.7554/elife.07295] [Citation(s) in RCA: 169] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2015] [Accepted: 06/13/2015] [Indexed: 02/07/2023] Open
Abstract
The Arabidopsis mutant wrky33 is highly susceptible to Botrytis cinerea. We identified >1680 Botrytis-induced WRKY33 binding sites associated with 1576 Arabidopsis genes. Transcriptional profiling defined 318 functional direct target genes at 14 hr post inoculation. Comparative analyses revealed that WRKY33 possesses dual functionality acting either as a repressor or as an activator in a promoter-context dependent manner. We confirmed known WRKY33 targets involved in hormone signaling and phytoalexin biosynthesis, but also uncovered a novel negative role of abscisic acid (ABA) in resistance towards B. cinerea 2100. The ABA biosynthesis genes NCED3 and NCED5 were identified as direct targets required for WRKY33-mediated resistance. Loss-of-WRKY33 function resulted in elevated ABA levels and genetic studies confirmed that WRKY33 acts upstream of NCED3/NCED5 to negatively regulate ABA biosynthesis. This study provides the first detailed view of the genome-wide contribution of a specific plant transcription factor in modulating the transcriptional network associated with plant immunity.
Collapse
Affiliation(s)
- Shouan Liu
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Barbara Kracher
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Jörg Ziegler
- Department of Molecular Signal Processing, Leibniz Institute of Plant Biochemistry, Halle, Germany
| | - Rainer P Birkenbihl
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Imre E Somssich
- Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research, Köln, Germany
| |
Collapse
|
136
|
Dailey L. High throughput technologies for the functional discovery of mammalian enhancers: new approaches for understanding transcriptional regulatory network dynamics. Genomics 2015; 106:151-158. [PMID: 26072436 DOI: 10.1016/j.ygeno.2015.06.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2015] [Revised: 04/18/2015] [Accepted: 06/08/2015] [Indexed: 01/08/2023]
Abstract
Completion of the human and mouse genomes has inspired new initiatives to obtain a global understanding of the functional regulatory networks governing gene expression. Enhancers are primary regulatory DNA elements determining precise spatio- and temporal gene expression patterns, but the observation that they can function at any distance from the gene(s) they regulate has made their genome-wide characterization challenging. Since traditional, single reporter approaches would be unable to accomplish this enormous task, high throughput technologies for mapping chromatin features associated with enhancers have emerged as an effective surrogate for enhancer discovery. However, the last few years have witnessed the development of several new innovative approaches that can effectively screen for and discover enhancers based on their functional activation of transcription using massively parallel reporter systems. In addition to their application for genome annotation, these new high throughput functional approaches open new and exciting avenues for modeling gene regulatory networks.
Collapse
Affiliation(s)
- Lisa Dailey
- NYU School of Medicine, Department of Microbiology, Kimmel Center for Stem Cell Biology, 550 First Avenue, MSB 252, New York, NY 10016, United States.
| |
Collapse
|
137
|
White MA. Understanding how cis-regulatory function is encoded in DNA sequence using massively parallel reporter assays and designed sequences. Genomics 2015; 106:165-170. [PMID: 26072432 DOI: 10.1016/j.ygeno.2015.06.003] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2015] [Revised: 05/09/2015] [Accepted: 06/08/2015] [Indexed: 01/07/2023]
Abstract
Genome-scale methods have identified thousands of candidate cis-regulatory elements (CREs), but methods to directly assay the regulatory function of these elements on a comparably large scale have not been available. The inability to directly test and perturb the regulatory activity of large numbers of DNA sequences has hindered efforts to discover how cis-regulatory function is encoded in genomic sequence. Recently developed massively parallel reporter gene assays combine next generation sequencing with high-throughput oligonucleotide synthesis to offer the capacity to test and mutationally perturb thousands of specifically chosen or designed cis-regulatory sequences in a single experiment. These assays are the basis of recent studies that include large-scale functional validation of genomic CREs, exhaustive mutational analyses of individual regulatory sequences, and tests of large libraries of synthetic CREs. The results demonstrate how massively parallel reporter assays with libraries of designed sequences provide the statistical power required to address previously intractable questions about cis-regulatory function.
Collapse
Affiliation(s)
- Michael A White
- Center for Genome Sciences and Systems Biology, Department of Genetics, Washington University in St. Louis School of Medicine, St. Louis, MO 63108, USA.
| |
Collapse
|
138
|
Kim K, Yang W, Lee KS, Bang H, Jang K, Kim SC, Yang JO, Park S, Park K, Choi JK. Global transcription network incorporating distal regulator binding reveals selective cooperation of cancer drivers and risk genes. Nucleic Acids Res 2015; 43:5716-29. [PMID: 26001967 PMCID: PMC4499150 DOI: 10.1093/nar/gkv532] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 05/09/2015] [Indexed: 01/04/2023] Open
Abstract
Global network modeling of distal regulatory interactions is essential in understanding the overall architecture of gene expression programs. Here, we developed a Bayesian probabilistic model and computational method for global causal network construction with breast cancer as a model. Whereas physical regulator binding was well supported by gene expression causality in general, distal elements in intragenic regions or loci distant from the target gene exhibited particularly strong functional effects. Modeling the action of long-range enhancers was critical in recovering true biological interactions with increased coverage and specificity overall and unraveling regulatory complexity underlying tumor subclasses and drug responses in particular. Transcriptional cancer drivers and risk genes were discovered based on the network analysis of somatic and genetic cancer-related DNA variants. Notably, we observed that the risk genes were functionally downstream of the cancer drivers and were selectively susceptible to network perturbation by tumorigenic changes in their upstream drivers. Furthermore, cancer risk alleles tended to increase the susceptibility of the transcription of their associated genes. These findings suggest that transcriptional cancer drivers selectively induce a combinatorial misregulation of downstream risk genes, and that genetic risk factors, mostly residing in distal regulatory regions, increase transcriptional susceptibility to upstream cancer-driving somatic changes.
Collapse
Affiliation(s)
- Kwoneel Kim
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, Republic of Korea
| | - Woojin Yang
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, Republic of Korea
| | - Kang Seon Lee
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, Republic of Korea
| | - Hyoeun Bang
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, Republic of Korea
| | - Kiwon Jang
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, Republic of Korea
| | - Sang Cheol Kim
- Samsung Genome Institute, Samsung Medical Center, Seoul 135-710, Republic of Korea
| | - Jin Ok Yang
- Korean Bioinformation Center, KRIBB, Daejeon 305-806, Republic of Korea
| | - Seongjin Park
- Korean Bioinformation Center, KRIBB, Daejeon 305-806, Republic of Korea
| | - Kiejung Park
- Korean Bioinformation Center, KRIBB, Daejeon 305-806, Republic of Korea
| | - Jung Kyoon Choi
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, Republic of Korea
| |
Collapse
|
139
|
Danko CG, Hyland SL, Core LJ, Martins AL, Waters CT, Lee HW, Cheung VG, Kraus WL, Lis JT, Siepel A. Identification of active transcriptional regulatory elements from GRO-seq data. Nat Methods 2015; 12:433-8. [PMID: 25799441 PMCID: PMC4507281 DOI: 10.1038/nmeth.3329] [Citation(s) in RCA: 127] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2014] [Accepted: 02/24/2015] [Indexed: 12/26/2022]
Abstract
Modifications to the global run-on and sequencing (GRO-seq) protocol that enrich for 5'-capped RNAs can be used to reveal active transcriptional regulatory elements (TREs) with high accuracy. Here, we introduce discriminative regulatory-element detection from GRO-seq (dREG), a sensitive machine learning method that uses support vector regression to identify active TREs from GRO-seq data without requiring cap-based enrichment (https://github.com/Danko-Lab/dREG/). This approach allows TREs to be assayed together with gene expression levels and other transcriptional features in a single experiment. Predicted TREs are more enriched for several marks of transcriptional activation—including expression quantitative trait loci, disease-associated polymorphisms, acetylated histone 3 lysine 27 (H3K27ac) and transcription factor binding—than those identified by alternative functional assays. Using dREG, we surveyed TREs in eight human cell types and provide new insights into global patterns of TRE function.
Collapse
Affiliation(s)
- Charles G. Danko
- Baker Institute for Animal Health, Cornell University, Ithaca, NY, USA
- Department of Biomedical Sciences, Cornell University, Ithaca, NY, USA
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY,USA
| | - Stephanie L. Hyland
- Tri-Institutional Training Program in Computational Biology and Medicine, New York, New York, USA
| | - Leighton J. Core
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Andre L. Martins
- Graduate Field in Computational Biology, Cornell University, Ithaca, NY, USA
| | - Colin T Waters
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Hyung Won Lee
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Vivian G. Cheung
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - W. Lee Kraus
- Laboratory of Signaling and Gene Regulation, Cecil H. and Ida Green Center for Reproductive Biology Sciences, University of Texas Southwestern Medical Center, Dallas, TX, USA
- Division of Basic Research, Department of Obstetrics and Gynecology, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - John T. Lis
- Graduate Field in Computational Biology, Cornell University, Ithaca, NY, USA
| | - Adam Siepel
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY,USA
| |
Collapse
|
140
|
Mathelier A, Lefebvre C, Zhang AW, Arenillas DJ, Ding J, Wasserman WW, Shah SP. Cis-regulatory somatic mutations and gene-expression alteration in B-cell lymphomas. Genome Biol 2015; 16:84. [PMID: 25903198 PMCID: PMC4467049 DOI: 10.1186/s13059-015-0648-7] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2014] [Accepted: 04/07/2015] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND With the rapid increase of whole-genome sequencing of human cancers, an important opportunity to analyze and characterize somatic mutations lying within cis-regulatory regions has emerged. A focus on protein-coding regions to identify nonsense or missense mutations disruptive to protein structure and/or function has led to important insights; however, the impact on gene expression of mutations lying within cis-regulatory regions remains under-explored. We analyzed somatic mutations from 84 matched tumor-normal whole genomes from B-cell lymphomas with accompanying gene expression measurements to elucidate the extent to which these cancers are disrupted by cis-regulatory mutations. RESULTS We characterize mutations overlapping a high quality set of well-annotated transcription factor binding sites (TFBSs), covering a similar portion of the genome as protein-coding exons. Our results indicate that cis-regulatory mutations overlapping predicted TFBSs are enriched in promoter regions of genes involved in apoptosis or growth/proliferation. By integrating gene expression data with mutation data, our computational approach culminates with identification of cis-regulatory mutations most likely to participate in dysregulation of the gene expression program. The impact can be measured along with protein-coding mutations to highlight key mutations disrupting gene expression and pathways in cancer. CONCLUSIONS Our study yields specific genes with disrupted expression triggered by genomic mutations in either the coding or the regulatory space. It implies that mutated regulatory components of the genome contribute substantially to cancer pathways. Our analyses demonstrate that identifying genomically altered cis-regulatory elements coupled with analysis of gene expression data will augment biological interpretation of mutational landscapes of cancers.
Collapse
Affiliation(s)
- Anthony Mathelier
- Department of Medical Genetics, Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, 950 West 28th Avenue, V5Z 4H4, Vancouver, BC, Canada.
| | - Calvin Lefebvre
- Department of Molecular Oncology, British Columbia Cancer Agency, Vancouver, V5Z 1L3, BC, Canada. .,Bioinformatics Graduate Program, University of British Columbia, Vancouver, V5Z 1L3, BC, Canada.
| | - Allen W Zhang
- Department of Medical Genetics, Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, 950 West 28th Avenue, V5Z 4H4, Vancouver, BC, Canada. .,Bioinformatics Graduate Program, University of British Columbia, Vancouver, V5Z 1L3, BC, Canada.
| | - David J Arenillas
- Department of Medical Genetics, Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, 950 West 28th Avenue, V5Z 4H4, Vancouver, BC, Canada.
| | - Jiarui Ding
- Department of Molecular Oncology, British Columbia Cancer Agency, Vancouver, V5Z 1L3, BC, Canada. .,Department of Computer Science, University of British Columbia, Vancouver, V6T 1Z4, BC, Canada.
| | - Wyeth W Wasserman
- Department of Medical Genetics, Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, 950 West 28th Avenue, V5Z 4H4, Vancouver, BC, Canada.
| | - Sohrab P Shah
- Department of Molecular Oncology, British Columbia Cancer Agency, Vancouver, V5Z 1L3, BC, Canada. .,Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, G227-2211, BC, Canada.
| |
Collapse
|
141
|
Ignatieva EV, Podkolodnaya OA, Orlov YL, Vasiliev GV, Kolchanov NA. Regulatory genomics: Combined experimental and computational approaches. RUSS J GENET+ 2015. [DOI: 10.1134/s1022795415040067] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
142
|
Ibrahim-Alobaide MA, Abdelsalam AG, Alobydi H, Rasul KI, Zhang R, Srivenugopal KS. Characterization of regulatory sequences in alternative promoters of hypermethylated genes associated with tumor resistance to cisplatin. Mol Clin Oncol 2015; 3:408-414. [PMID: 25798277 DOI: 10.3892/mco.2014.468] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2014] [Accepted: 10/23/2014] [Indexed: 01/03/2023] Open
Abstract
The development of cisplatin resistance in human cancers is controlled by multiple genes and leads to therapeutic failure. Hypermethylation of specific gene promoters is a key event in clinical resistance to cisplatin. Although the usage of multiple promoters is frequent in the transcription of human genes, the role of alternative promoters and their regulatory sequences have not yet been investigated in cisplatin resistance genes. In a new approach, we hypothesized that human cancers exploit the specific transcription factor-binding sites (TFBS) and CpG islands (CGIs) located in the alternative promoters of certain genes to acquire platinum drug resistance. To provide a useful resource of regulatory elements associated with cisplatin resistance, we investigated the TFBS and CGIs in 48 alternative promoters of 14 hypermethylated cisplatin resistance genes previously reported. CGIs prone to methylation were identified in 28 alternative promoters of 11 hypermethylated genes. The majority of alternative promoters harboring CGIs (93%) were clustered in one phylogenetic subclass, whereas the ones lacking CGIs were distributed in two unrelated subclasses. Regulatory sequences, initiator and TATA-532 prevailed over TATA-8 and were found in all the promoters. B recognition element (BRE) sequences were present only in alternative promoters harboring CGIs, but CCAAT and TAACC were found in both types of alternative promoters, whereas downstream promoter element sequences were significantly less frequent. Therefore, it was hypothesized that BRE and CGI sequences co-localized in alternative promoters of cisplatin resistance genes may be used to design molecular markers for drug resistance. A more extensive knowledge of alternative promoters and their regulatory elements in clinical resistance to cisplatin is likely to usher novel avenues for sensitizing human cancers to treatment.
Collapse
Affiliation(s)
- Mohammed A Ibrahim-Alobaide
- Department of Biomedical Sciences and Cancer Biology Research Center, School of Pharmacy, Texas Tech University Health Sciences Center, Amarillo, TX 79106, USA
| | - Abdelsalam G Abdelsalam
- Department of Mathematics, Statistics and Physics, College of Arts and Sciences, Qatar University, Doha, Qatar ; Department of Statistics, Faculty of Economics and Political Sciences, Cairo University, Giza 12613, Egypt
| | | | | | - Ruiwen Zhang
- Department of Pharmaceutical Sciences and Cancer Biology Research Center, School of Pharmacy, Texas Tech University Health Sciences Center, Amarillo, TX 79106, USA
| | - Kalkunte S Srivenugopal
- Department of Biomedical Sciences and Cancer Biology Research Center, School of Pharmacy, Texas Tech University Health Sciences Center, Amarillo, TX 79106, USA
| |
Collapse
|
143
|
Fontaine F, Overman J, François M. Pharmacological manipulation of transcription factor protein-protein interactions: opportunities and obstacles. CELL REGENERATION (LONDON, ENGLAND) 2015; 4:2. [PMID: 25848531 PMCID: PMC4365538 DOI: 10.1186/s13619-015-0015-x] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Accepted: 02/10/2015] [Indexed: 12/19/2022]
Abstract
Much research on transcription factor biology and their genetic pathways has been undertaken over the last 30 years, especially in the field of developmental biology and cancer. Yet, very little is known about the molecular modalities of highly dynamic interactions between transcription factors, genomic DNA, and protein partners. Methodological breakthroughs such as RNA-seq (RNA-sequencing), ChIP-seq (chromatin immunoprecipitation sequencing), RIME (rapid immunoprecipitation mass spectrometry of endogenous proteins), and single-molecule imaging will dramatically accelerate the discovery rate of their molecular mode of action in the next few years. From a pharmacological viewpoint, conventional methods used to target transcription factor activity with molecules mimicking endogenous ligands fail to achieve high specificity and are limited by a lack of identification of new molecular targets. Protein-protein interactions are likely to represent one of the next major classes of therapeutic targets. Transcription factors, known to act mostly via protein-protein interaction, may well be at the forefront of this type of drug development. One hurdle in this field remains the difficulty to collate structural data into meaningful information for rational drug design. Another hurdle is the lack of chemical libraries meeting the structural requirements of protein-protein interaction disruption. As more attempts at modulating transcription factor activity are undertaken, valuable knowledge will be accumulated on the modality of action required to modulate transcription and how these findings can be applied to developing transcription factor drugs. Key discoveries will spawn into new therapeutic approaches not only as anticancer targets but also for other indications, such as those with an inflammatory component including neurodegenerative disorders, diabetes, and chronic liver and kidney diseases.
Collapse
Affiliation(s)
- Frank Fontaine
- Division of Genomics of Development and Diseases, Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, St Lucia, QLD 4072 Australia
| | - Jeroen Overman
- Division of Genomics of Development and Diseases, Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, St Lucia, QLD 4072 Australia
| | - Mathias François
- Division of Genomics of Development and Diseases, Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, St Lucia, QLD 4072 Australia
| |
Collapse
|
144
|
Cloonan N. Re-thinking miRNA-mRNA interactions: intertwining issues confound target discovery. Bioessays 2015; 37:379-88. [PMID: 25683051 PMCID: PMC4671252 DOI: 10.1002/bies.201400191] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2014] [Revised: 12/19/2014] [Accepted: 12/19/2014] [Indexed: 12/20/2022]
Abstract
Despite a library full of literature on miRNA biology, core issues relating to miRNA target detection, biological effect, and mode of action remain controversial. This essay proposes that the predominant mechanism of direct miRNA action is translational inhibition, whereas the bulk of miRNA effects are mRNA based. It explores several issues confounding miRNA target detection, and discusses their impact on the dominance of “miRNA seed” dogma and the exploration of non-canonical binding sites. Finally, it makes comparisons between miRNA target prediction and transcription factor binding prediction, and questions the value of characterizing miRNA binding sites based on which miRNA nucleotides are paired with an mRNA.
Collapse
Affiliation(s)
- Nicole Cloonan
- QIMR Berghofer Medical Research Institute, Genomic Biology Lab, Herston, QLD, Australia
| |
Collapse
|
145
|
Wong ES, Thybert D, Schmitt BM, Stefflova K, Odom DT, Flicek P. Decoupling of evolutionary changes in transcription factor binding and gene expression in mammals. Genome Res 2015; 25:167-78. [PMID: 25394363 PMCID: PMC4315291 DOI: 10.1101/gr.177840.114] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2014] [Accepted: 11/12/2014] [Indexed: 11/25/2022]
Abstract
To understand the evolutionary dynamics between transcription factor (TF) binding and gene expression in mammals, we compared transcriptional output and the binding intensities for three tissue-specific TFs in livers from four closely related mouse species. For each transcription factor, TF-dependent genes and the TF binding sites most likely to influence mRNA expression were identified by comparing mRNA expression levels between wild-type and TF knockout mice. Independent evolution was observed genome-wide between the rate of change in TF binding and the rate of change in mRNA expression across taxa, with the exception of a small number of TF-dependent genes. We also found that binding intensities are preferentially conserved near genes whose expression is dependent on the TF, and the conservation is shared among binding peaks in close proximity to each other near the TSS. Expression of TF-dependent genes typically showed an increased sensitivity to changes in binding levels as measured by mRNA abundance. Taken together, these results highlight a significant tolerance to evolutionary changes in TF binding intensity in mammalian transcriptional networks and suggest that some TF-dependent genes may be largely regulated by a single TF across evolution.
Collapse
Affiliation(s)
- Emily S Wong
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - David Thybert
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Bianca M Schmitt
- University of Cambridge, Cancer Research UK - Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, United Kingdom
| | - Klara Stefflova
- University of Cambridge, Cancer Research UK - Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, United Kingdom
| | - Duncan T Odom
- University of Cambridge, Cancer Research UK - Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, United Kingdom; Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom; Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| |
Collapse
|
146
|
Abstract
It is now well established that noncoding regulatory variants play a central role in the genetics of common diseases and in evolution. However, until recently, we have known little about the mechanisms by which most regulatory variants act. For instance, what types of functional elements in DNA, RNA, or proteins are most often affected by regulatory variants? Which stages of gene regulation are typically altered? How can we predict which variants are most likely to impact regulation in a given cell type? Recent studies, in many cases using quantitative trait loci (QTL)-mapping approaches in cell lines or tissue samples, have provided us with considerable insight into the properties of genetic loci that have regulatory roles. Such studies have uncovered novel biochemical regulatory interactions and led to the identification of previously unrecognized regulatory mechanisms. We have learned that genetic variation is often directly associated with variation in regulatory activities (namely, we can map regulatory QTLs, not just expression QTLs [eQTLs]), and we have taken the first steps towards understanding the causal order of regulatory events (for example, the role of pioneer transcription factors). Yet, in most cases, we still do not know how to interpret overlapping combinations of regulatory interactions, and we are still far from being able to predict how variation in regulatory mechanisms is propagated through a chain of interactions to eventually result in changes in gene expression profiles.
Collapse
Affiliation(s)
- Athma A. Pai
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Jonathan K. Pritchard
- Departments of Genetics and Biology, and Howard Hughes Medical Institute; Stanford University, Stanford, California, United States of America
- * E-mail: (JKP); (YG)
| | - Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (JKP); (YG)
| |
Collapse
|
147
|
Nord AS, Pattabiraman K, Visel A, Rubenstein JLR. Genomic perspectives of transcriptional regulation in forebrain development. Neuron 2015; 85:27-47. [PMID: 25569346 PMCID: PMC4438709 DOI: 10.1016/j.neuron.2014.11.011] [Citation(s) in RCA: 106] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
The forebrain is the seat of higher-order brain functions, and many human neuropsychiatric disorders are due to genetic defects affecting forebrain development, making it imperative to understand the underlying genetic circuitry. Recent progress now makes it possible to begin fully elucidating the genomic regulatory mechanisms that control forebrain gene expression. Herein, we discuss the current knowledge of how transcription factors drive gene expression programs through their interactions with cis-acting genomic elements, such as enhancers; how analyses of chromatin and DNA modifications provide insights into gene expression states; and how these approaches yield insights into the evolution of the human brain.
Collapse
Affiliation(s)
- Alex S Nord
- Department of Neurobiology, Physiology, and Behavior and Department of Psychiatry and Behavioral Sciences, Center for Neuroscience, University of California, Davis, Davis, CA 95618, USA.
| | - Kartik Pattabiraman
- Department of Psychiatry, Rock Hall, University of California, San Francisco, San Francisco, CA 94158-2324, USA
| | - Axel Visel
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA; School of Natural Sciences, University of California, Merced, Merced, CA 95343, USA
| | - John L R Rubenstein
- Department of Psychiatry, Rock Hall, University of California, San Francisco, San Francisco, CA 94158-2324, USA
| |
Collapse
|
148
|
Burren OS, Guo H, Wallace C. VSEAMS: a pipeline for variant set enrichment analysis using summary GWAS data identifies IKZF3, BATF and ESRRA as key transcription factors in type 1 diabetes. Bioinformatics 2014; 30:3342-8. [PMID: 25170024 PMCID: PMC4296156 DOI: 10.1093/bioinformatics/btu571] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2014] [Revised: 08/18/2014] [Accepted: 08/20/2014] [Indexed: 02/06/2023] Open
Abstract
MOTIVATION Genome-wide association studies (GWAS) have identified many loci implicated in disease susceptibility. Integration of GWAS summary statistics (P-values) and functional genomic datasets should help to elucidate mechanisms. RESULTS We extended a non-parametric SNP set enrichment method to test for enrichment of GWAS signals in functionally defined loci to a situation where only GWAS P-values are available. The approach is implemented in VSEAMS, a freely available software pipeline. We use VSEAMS to identify enrichment of type 1 diabetes (T1D) GWAS associations near genes that are targets for the transcription factors IKZF3, BATF and ESRRA. IKZF3 lies in a known T1D susceptibility region, while BATF and ESRRA overlap other immune disease susceptibility regions, validating our approach and suggesting novel avenues of research for T1D. AVAILABILITY AND IMPLEMENTATION VSEAMS is available for download (http://github.com/ollyburren/vseams).
Collapse
Affiliation(s)
- Oliver S Burren
- Department of Medical Genetics, JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, NIHR Cambridge Biomedical Research Centre, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Cambridge Biomedical Campus, Cambridge, CB2 0XY, UK and MRC Biostatistics Unit, Cambridge Institute of Public Health, Forvie Site, Robinson Way, Cambridge Biomedical Campus, Cambridge, CB2 0SR, UK
| | - Hui Guo
- Department of Medical Genetics, JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, NIHR Cambridge Biomedical Research Centre, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Cambridge Biomedical Campus, Cambridge, CB2 0XY, UK and MRC Biostatistics Unit, Cambridge Institute of Public Health, Forvie Site, Robinson Way, Cambridge Biomedical Campus, Cambridge, CB2 0SR, UK
| | - Chris Wallace
- Department of Medical Genetics, JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, NIHR Cambridge Biomedical Research Centre, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Cambridge Biomedical Campus, Cambridge, CB2 0XY, UK and MRC Biostatistics Unit, Cambridge Institute of Public Health, Forvie Site, Robinson Way, Cambridge Biomedical Campus, Cambridge, CB2 0SR, UK Department of Medical Genetics, JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, NIHR Cambridge Biomedical Research Centre, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Cambridge Biomedical Campus, Cambridge, CB2 0XY, UK and MRC Biostatistics Unit, Cambridge Institute of Public Health, Forvie Site, Robinson Way, Cambridge Biomedical Campus, Cambridge, CB2 0SR, UK
| |
Collapse
|
149
|
Chalei V, Sansom SN, Kong L, Lee S, Montiel JF, Vance KW, Ponting CP. The long non-coding RNA Dali is an epigenetic regulator of neural differentiation. eLife 2014; 3:e04530. [PMID: 25415054 PMCID: PMC4383022 DOI: 10.7554/elife.04530] [Citation(s) in RCA: 123] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2014] [Accepted: 11/21/2014] [Indexed: 12/11/2022] Open
Abstract
Many intergenic long noncoding RNA (lncRNA) loci regulate the expression of adjacent protein coding genes. Less clear is whether intergenic lncRNAs commonly regulate transcription by modulating chromatin at genomically distant loci. Here, we report both genomically local and distal RNA-dependent roles of Dali, a conserved central nervous system expressed intergenic lncRNA. Dali is transcribed downstream of the Pou3f3 transcription factor gene and its depletion disrupts the differentiation of neuroblastoma cells. Locally, Dali transcript regulates transcription of the Pou3f3 locus. Distally, it preferentially targets active promoters and regulates expression of neural differentiation genes, in part through physical association with the POU3F3 protein. Dali interacts with the DNMT1 DNA methyltransferase in mouse and human and regulates DNA methylation status of CpG island-associated promoters in trans. These results demonstrate, for the first time, that a single intergenic lncRNA controls the activity and methylation of genomically distal regulatory elements to modulate large-scale transcriptional programmes.
Collapse
Affiliation(s)
- Vladislava Chalei
- MRC Functional Genomics
Unit, Department of Physiology, Anatomy and Genetics,
University of Oxford, Oxford, United
Kingdom
| | - Stephen N Sansom
- MRC Functional Genomics
Unit, Department of Physiology, Anatomy and Genetics,
University of Oxford, Oxford, United
Kingdom
- Computational Genomics
Analysis and Training Programme, University of
Oxford, Oxford, United Kingdom
| | - Lesheng Kong
- MRC Functional Genomics
Unit, Department of Physiology, Anatomy and Genetics,
University of Oxford, Oxford, United
Kingdom
| | - Sheena Lee
- Department of
Physiology, Anatomy and Genetics, University of
Oxford, Oxford, United Kingdom
| | - Juan F Montiel
- MRC Functional Genomics
Unit, Department of Physiology, Anatomy and Genetics,
University of Oxford, Oxford, United
Kingdom
| | - Keith W Vance
- MRC Functional Genomics
Unit, Department of Physiology, Anatomy and Genetics,
University of Oxford, Oxford, United
Kingdom
| | - Chris P Ponting
- MRC Functional Genomics
Unit, Department of Physiology, Anatomy and Genetics,
University of Oxford, Oxford, United
Kingdom
| |
Collapse
|
150
|
Necsulea A, Kaessmann H. Evolutionary dynamics of coding and non-coding transcriptomes. Nat Rev Genet 2014; 15:734-48. [PMID: 25297727 DOI: 10.1038/nrg3802] [Citation(s) in RCA: 156] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Gene expression changes may underlie much of phenotypic evolution. The development of high-throughput RNA sequencing protocols has opened the door to unprecedented large-scale and cross-species transcriptome comparisons by allowing accurate and sensitive assessments of transcript sequences and expression levels. Here, we review the initial wave of the new generation of comparative transcriptomic studies in mammals and vertebrate outgroup species in the context of earlier work. Together with various large-scale genomic and epigenomic data, these studies have unveiled commonalities and differences in the dynamics of gene expression evolution for various types of coding and non-coding genes across mammalian lineages, organs, developmental stages, chromosomes and sexes. They have also provided intriguing new clues to the regulatory basis and phenotypic implications of evolutionary gene expression changes.
Collapse
Affiliation(s)
- Anamaria Necsulea
- Laboratory of Developmental Genomics, School of Life Sciences, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Henrik Kaessmann
- 1] Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland. [2] Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|