1
|
Han J, Dai Y, Zhou J, Tian J, Chen Q, Kou X, Raza G, Zhang B, Wang K. Tissue-specific chromatin accessibility and transcriptional regulation in maize cold stress response. Genomics 2025; 117:110981. [PMID: 39701501 DOI: 10.1016/j.ygeno.2024.110981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2024] [Revised: 11/19/2024] [Accepted: 12/11/2024] [Indexed: 12/21/2024]
Abstract
Maize, a vital crop globally, faces significant yield losses due to its sensitivity to cold stress, especially in temperate regions. Understanding the molecular mechanisms governing maize response to cold stress is crucial for developing strategies to enhance cold tolerance. However, the precise chromatin-level regulatory mechanisms involved remain largely unknown. In this study, we employed DNase-seq and RNA-seq techniques to investigate chromatin accessibility and gene expression changes in maize root, stem, and leaf tissues subjected to cold treatment. We discovered widespread changes in chromatin accessibility and gene expression across these tissues, with strong tissue specificity. Cold stress-induced DNase I hypersensitive sites (coiDHSs) were associated with differentially expressed genes, suggesting a direct link between chromatin accessibility and gene regulation under cold stress. Motif enrichment analysis identified ERF transcription factors (TFs) as central regulators conserved across tissues, with ERF5 emerging as pivotal in the cold response regulatory network. Additionally, TF co-localization analysis highlighted six TF pairs (ERF115-SHN3, ERF9-LEP, ERF7-SHN3, LEP-SHN3, LOB-SHN3, and AS2-LOB) conserved across tissues but showing tissue-specific binding preferences. These findings indicate intricate regulatory networks in maize cold response. Overall, our study provides insights into the chromatin-level regulatory mechanisms underpinning maize adaptive response to cold stress, offering potential targets for enhancing cold tolerance in agricultural contexts.
Collapse
Affiliation(s)
- Jinlei Han
- School of Life Sciences, Nantong University, Nantong 226019, China.
| | - Yan Dai
- School of Life Sciences, Nantong University, Nantong 226019, China
| | - Jialiang Zhou
- School of Life Sciences, Nantong University, Nantong 226019, China
| | - Jingjing Tian
- School of Life Sciences, Nantong University, Nantong 226019, China
| | - Qi Chen
- School of Life Sciences, Nantong University, Nantong 226019, China
| | - Xiaobing Kou
- School of Life Sciences, Nantong University, Nantong 226019, China
| | - Ghulam Raza
- National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad 38000, Pakistan
| | - Baohong Zhang
- Department of Biology, East Carolina University, Greenville, NC 27858, USA
| | - Kai Wang
- School of Life Sciences, Nantong University, Nantong 226019, China.
| |
Collapse
|
2
|
Lin TC, Tsai CH, Shiau CK, Huang JH, Tsai HK. Predicting splicing patterns from the transcription factor binding sites in the promoter with deep learning. BMC Genomics 2024; 25:830. [PMID: 39227799 PMCID: PMC11373144 DOI: 10.1186/s12864-024-10667-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Accepted: 07/25/2024] [Indexed: 09/05/2024] Open
Abstract
BACKGROUND Alternative splicing is a pivotal mechanism of post-transcriptional modification that contributes to the transcriptome plasticity and proteome diversity in metazoan cells. Although many splicing regulations around the exon/intron regions are known, the relationship between promoter-bound transcription factors and the downstream alternative splicing largely remains unexplored. RESULTS In this study, we present computational approaches to unravel the regulatory relationship between promoter-bound transcription factor binding sites (TFBSs) and the splicing patterns. We curated a fine dataset that includes DNase I hypersensitive site sequencing and transcriptomes across fifteen human tissues from ENCODE. Specifically, we proposed different representations of TF binding context and splicing patterns to examine the associations between the promoter and downstream splicing events. While machine learning models demonstrated potential in predicting splicing patterns based on TFBS occupancies, the limitations in the generalization of predicting the splicing forms of singleton genes across diverse tissues was observed with carefully examination using different cross-validation methods. We further investigated the association between alterations in individual TFBS at promoters and shifts in exon splicing efficiency. Our results demonstrate that the convolutional neural network (CNN) models, trained on TF binding changes in the promoters, can predict the changes in splicing patterns. Furthermore, a systemic in silico substitutions analysis on the CNN models highlighted several potential splicing regulators. Notably, using empirical validation using K562 CTCFL shRNA knock-down data, we showed the significant role of CTCFL in splicing regulation. CONCLUSION In conclusion, our finding highlights the potential role of promoter-bound TFBSs in influencing the regulation of downstream splicing patterns and provides insights for discovering alternative splicing regulations.
Collapse
Affiliation(s)
- Tzu-Chieh Lin
- Institute of Information Science, Academia Sinica, Taipei, 11529, Taiwan
| | - Cheng-Hung Tsai
- Institute of Information Science, Academia Sinica, Taipei, 11529, Taiwan
| | - Cheng-Kai Shiau
- Institute of Information Science, Academia Sinica, Taipei, 11529, Taiwan
| | - Jia-Hsin Huang
- Institute of Information Science, Academia Sinica, Taipei, 11529, Taiwan.
- Taiwan AI Labs & Foundation, Taipei, 10351, Taiwan.
| | - Huai-Kuang Tsai
- Institute of Information Science, Academia Sinica, Taipei, 11529, Taiwan.
- Taiwan AI Labs & Foundation, Taipei, 10351, Taiwan.
| |
Collapse
|
3
|
Biddie SC, Weykopf G, Hird EF, Friman ET, Bickmore WA. DNA-binding factor footprints and enhancer RNAs identify functional non-coding genetic variants. Genome Biol 2024; 25:208. [PMID: 39107801 PMCID: PMC11304670 DOI: 10.1186/s13059-024-03352-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Accepted: 07/25/2024] [Indexed: 08/10/2024] Open
Abstract
BACKGROUND Genome-wide association studies (GWAS) have revealed a multitude of candidate genetic variants affecting the risk of developing complex traits and diseases. However, the highlighted regions are typically in the non-coding genome, and uncovering the functional causative single nucleotide variants (SNVs) is challenging. Prioritization of variants is commonly based on genomic annotation with markers of active regulatory elements, but current approaches still poorly predict functional variants. To address this, we systematically analyze six markers of active regulatory elements for their ability to identify functional variants. RESULTS We benchmark against molecular quantitative trait loci (molQTL) from assays of regulatory element activity that identify allelic effects on DNA-binding factor occupancy, reporter assay expression, and chromatin accessibility. We identify the combination of DNase footprints and divergent enhancer RNA (eRNA) as markers for functional variants. This signature provides high precision, but with a trade-off of low recall, thus substantially reducing candidate variant sets to prioritize variants for functional validation. We present this as a framework called FINDER-Functional SNV IdeNtification using DNase footprints and eRNA. CONCLUSIONS We demonstrate the utility to prioritize variants using leukocyte count trait and analyze variants in linkage disequilibrium with a lead variant to predict a functional variant in asthma. Our findings have implications for prioritizing variants from GWAS, in development of predictive scoring algorithms, and for functionally informed fine mapping approaches.
Collapse
Affiliation(s)
- Simon C Biddie
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK.
- NHS Lothian, Edinburgh, UK.
| | - Giovanna Weykopf
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK
| | | | - Elias T Friman
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK
| | - Wendy A Bickmore
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
4
|
Marie P, Bazire M, Ladet J, Ameur LB, Chahar S, Fontrodona N, Sexton T, Auboeuf D, Bourgeois CF, Mortreux F. Gene-to-gene coordinated regulation of transcription and alternative splicing by 3D chromatin remodeling upon NF-κB activation. Nucleic Acids Res 2024; 52:1527-1543. [PMID: 38272542 PMCID: PMC10899780 DOI: 10.1093/nar/gkae015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 12/13/2023] [Accepted: 01/05/2024] [Indexed: 01/27/2024] Open
Abstract
The NF-κB protein p65/RelA plays a pivotal role in coordinating gene expression in response to diverse stimuli, including viral infections. At the chromatin level, p65/RelA regulates gene transcription and alternative splicing through promoter enrichment and genomic exon occupancy, respectively. The intricate ways in which p65/RelA simultaneously governs these functions across various genes remain to be fully elucidated. In this study, we employed the HTLV-1 Tax oncoprotein, a potent activator of NF-κB, to investigate its influence on the three-dimensional organization of the genome, a key factor in gene regulation. We discovered that Tax restructures the 3D genomic landscape, bringing together genes based on their regulation and splicing patterns. Notably, we found that the Tax-induced gene-gene contact between the two master genes NFKBIA and RELA is associated with their respective changes in gene expression and alternative splicing. Through dCas9-mediated approaches, we demonstrated that NFKBIA-RELA interaction is required for alternative splicing regulation and is caused by an intragenic enrichment of p65/RelA on RELA. Our findings shed light on new regulatory mechanisms upon HTLV-1 Tax and underscore the integral role of p65/RelA in coordinated regulation of NF-κB-responsive genes at both transcriptional and splicing levels in the context of the 3D genome.
Collapse
Affiliation(s)
- Paul Marie
- Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée d’Italie Site Jacques Monod, F-69007 Lyon, France
| | - Matéo Bazire
- Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée d’Italie Site Jacques Monod, F-69007 Lyon, France
| | - Julien Ladet
- Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée d’Italie Site Jacques Monod, F-69007 Lyon, France
| | - Lamya Ben Ameur
- Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée d’Italie Site Jacques Monod, F-69007 Lyon, France
| | - Sanjay Chahar
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), UMR7104, Centre National de la Recherche Scientifique, U1258, Institut National de la Santé et de la Recherche Médicale, University of Strasbourg, 6704 Illkirch, France
| | - Nicolas Fontrodona
- Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée d’Italie Site Jacques Monod, F-69007 Lyon, France
| | - Tom Sexton
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), UMR7104, Centre National de la Recherche Scientifique, U1258, Institut National de la Santé et de la Recherche Médicale, University of Strasbourg, 6704 Illkirch, France
| | - Didier Auboeuf
- Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée d’Italie Site Jacques Monod, F-69007 Lyon, France
| | - Cyril F Bourgeois
- Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée d’Italie Site Jacques Monod, F-69007 Lyon, France
| | - Franck Mortreux
- Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée d’Italie Site Jacques Monod, F-69007 Lyon, France
| |
Collapse
|
5
|
Ullah F, Jabeen S, Salton M, Reddy ASN, Ben-Hur A. Evidence for the role of transcription factors in the co-transcriptional regulation of intron retention. Genome Biol 2023; 24:53. [PMID: 36949544 PMCID: PMC10031921 DOI: 10.1186/s13059-023-02885-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Accepted: 02/16/2023] [Indexed: 03/24/2023] Open
Abstract
BACKGROUND Alternative splicing is a widespread regulatory phenomenon that enables a single gene to produce multiple transcripts. Among the different types of alternative splicing, intron retention is one of the least explored despite its high prevalence in both plants and animals. The recent discovery that the majority of splicing is co-transcriptional has led to the finding that chromatin state affects alternative splicing. Therefore, it is plausible that transcription factors can regulate splicing outcomes. RESULTS We provide evidence for the hypothesis that transcription factors are involved in the regulation of intron retention by studying regions of open chromatin in retained and excised introns. Using deep learning models designed to distinguish between regions of open chromatin in retained introns and non-retained introns, we identified motifs enriched in IR events with significant hits to known human transcription factors. Our model predicts that the majority of transcription factors that affect intron retention come from the zinc finger family. We demonstrate the validity of these predictions using ChIP-seq data for multiple zinc finger transcription factors and find strong over-representation for their peaks in intron retention events. CONCLUSIONS This work opens up opportunities for further studies that elucidate the mechanisms by which transcription factors affect intron retention and other forms of splicing. AVAILABILITY Source code available at https://github.com/fahadahaf/chromir.
Collapse
Affiliation(s)
- Fahad Ullah
- Department of Computer Science, Colorado State University, Fort Collins, CO, USA
| | - Saira Jabeen
- Department of Computer Science, Colorado State University, Fort Collins, CO, USA
| | - Maayan Salton
- Department of Biology, Colorado State University, Fort Collins, CO, USA
| | - Anireddy S N Reddy
- Biochemistry and Molecular Biology Department, The Hebrew University Faculty of Medicine, Jerusalem, Israel
| | - Asa Ben-Hur
- Department of Computer Science, Colorado State University, Fort Collins, CO, USA.
| |
Collapse
|
6
|
Kravatsky YV, Chechetkin VR, Tchurikov NA, Kravatskaya GI. Genome-Wide Study of Colocalization between Genomic Stretches: A Method and Applications to the Regulation of Gene Expression. BIOLOGY 2022; 11:1422. [PMID: 36290327 PMCID: PMC9598420 DOI: 10.3390/biology11101422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 09/25/2022] [Accepted: 09/26/2022] [Indexed: 06/16/2023]
Abstract
In this paper, we describe a method for the study of colocalization effects between stretch-stretch and stretch-point genome tracks based on a set of indices varying within the (-1, +1) interval. The indices combine the distances between the centers of neighboring stretches and their lengths. The extreme boundaries of the interval correspond to the complete colocalization of the genome tracks or its complete absence. We also obtained the relevant criteria of statistical significance for such indices using the complete permutation test. The method is robust with respect to strongly inhomogeneous positioning and length distribution of the genome tracks. On the basis of this approach, we created command-line software, the Genome Track Colocalization Analyzer. The software was tested, compared with other available packages, and applied to particular problems related to gene expression. The package, Genome Track Colocalization Analyzer (GTCA), is freely available to the users. GTCA complements our previous software, the Genome Track Analyzer, intended for the search for pairwise correlations between point-like genome tracks (also freely available). The corresponding details are provided in Data Availability Statement at the end of the text.
Collapse
Affiliation(s)
- Yuri V. Kravatsky
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov Str., 32, 119991 Moscow, Russia
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 119991 Moscow, Russia
| | - Vladimir R. Chechetkin
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov Str., 32, 119991 Moscow, Russia
| | - Nickolai A. Tchurikov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov Str., 32, 119991 Moscow, Russia
| | - Galina I. Kravatskaya
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov Str., 32, 119991 Moscow, Russia
| |
Collapse
|
7
|
Terrone S, Valat J, Fontrodona N, Giraud G, Claude JB, Combe E, Lapendry A, Polvèche H, Ameur LB, Duvermy A, Modolo L, Bernard P, Mortreux F, Auboeuf D, Bourgeois C. RNA helicase-dependent gene looping impacts messenger RNA processing. Nucleic Acids Res 2022; 50:9226-9246. [PMID: 36039747 PMCID: PMC9458439 DOI: 10.1093/nar/gkac717] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Revised: 07/25/2022] [Accepted: 08/25/2022] [Indexed: 12/24/2022] Open
Abstract
DDX5 and DDX17 are DEAD-box RNA helicase paralogs which regulate several aspects of gene expression, especially transcription and splicing, through incompletely understood mechanisms. A transcriptome analysis of DDX5/DDX17-depleted human cells confirmed the large impact of these RNA helicases on splicing and revealed a widespread deregulation of 3' end processing. In silico analyses and experiments in cultured cells showed the binding and functional contribution of the genome organizing factor CTCF to chromatin sites at or near a subset of DDX5/DDX17-dependent exons that are characterized by a high GC content and a high density of RNA Polymerase II. We propose the existence of an RNA helicase-dependent relationship between CTCF and the dynamics of transcription across DNA and/or RNA structured regions, that contributes to the processing of internal and terminal exons. Moreover, local DDX5/DDX17-dependent chromatin loops spatially connect RNA helicase-regulated exons with their cognate promoter, and we provide the first direct evidence that de novo gene looping modifies alternative splicing and polyadenylation. Overall our findings uncover the impact of DDX5/DDX17-dependent chromatin folding on pre-messenger RNA processing.
Collapse
Affiliation(s)
| | | | - Nicolas Fontrodona
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France
| | | | - Jean-Baptiste Claude
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France
| | | | - Audrey Lapendry
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France
| | - Hélène Polvèche
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France,CECS/AFM, I-STEM, 28 rue Henri Desbruères, F-91100, Corbeil-Essonnes, France
| | - Lamya Ben Ameur
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France
| | - Arnaud Duvermy
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France
| | - Laurent Modolo
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France
| | - Pascal Bernard
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France
| | - Franck Mortreux
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France
| | - Didier Auboeuf
- Laboratoire de Biologie et Modelisation de la Cellule, Ecole Normale Superieure de Lyon, CNRS, UMR 5239, Inserm, U1293, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364 Lyon, France
| | - Cyril F Bourgeois
- To whom correspondence should be addressed. Tel: +33 47272 8663; Fax: +33 47272 8674;
| |
Collapse
|
8
|
Rezaie N, Bayati M, Hamidi M, Tahaei MS, Khorasani S, Lovell NH, Breen J, Rabiee HR, Alinejad-Rokny H. Somatic point mutations are enriched in non-coding RNAs with possible regulatory function in breast cancer. Commun Biol 2022; 5:556. [PMID: 35672401 PMCID: PMC9174258 DOI: 10.1038/s42003-022-03528-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Accepted: 05/24/2022] [Indexed: 11/09/2022] Open
Abstract
Non-coding RNAs (ncRNAs) form a large portion of the mammalian genome. However, their biological functions are poorly characterized in cancers. In this study, using a newly developed tool, SomaGene, we analyze de novo somatic point mutations from the International Cancer Genome Consortium (ICGC) whole-genome sequencing data of 1,855 breast cancer samples. We identify 1030 candidates of ncRNAs that are significantly and explicitly mutated in breast cancer samples. By integrating data from the ENCODE regulatory features and FANTOM5 expression atlas, we show that the candidate ncRNAs significantly enrich active chromatin histone marks (1.9 times), CTCF binding sites (2.45 times), DNase accessibility (1.76 times), HMM predicted enhancers (2.26 times) and eQTL polymorphisms (1.77 times). Importantly, we show that the 1030 ncRNAs contain a much higher level (3.64 times) of breast cancer-associated genome-wide association (GWAS) single nucleotide polymorphisms (SNPs) than genome-wide expectation. Such enrichment has not been seen with GWAS SNPs from other cancers. Using breast cell line related Hi-C data, we then show that 82% of our candidate ncRNAs (1.9 times) significantly interact with the promoter of protein-coding genes, including previously known cancer-associated genes, suggesting the critical role of candidate ncRNA genes in the activation of essential regulators of development and differentiation in breast cancer. We provide an extensive web-based resource (https://www.ihealthe.unsw.edu.au/research) to communicate our results with the research community. Our list of breast cancer-specific ncRNA genes has the potential to provide a better understanding of the underlying genetic causes of breast cancer. Lastly, the tool developed in this study can be used to analyze somatic mutations in all cancers. The SomaGene tool is developed to identify non-coding RNAs (ncRNAs) mutated in breast cancer but can be used for other cancers. Candidate ncRNAs are shown to be enriched for regulatory features and to contain specific trait loci polymorphisms.
Collapse
Affiliation(s)
- Narges Rezaie
- Center for Complex Biological Systems, University of California Irvine, Irvine, CA, 92697, USA
| | - Masroor Bayati
- Bioinformatics and Computational Biology Lab, Department of Computer Engineering, Sharif University of Technology, Tehran, 11365, Iran
| | - Mehrab Hamidi
- Bioinformatics and Computational Biology Lab, Department of Computer Engineering, Sharif University of Technology, Tehran, 11365, Iran
| | - Maedeh Sadat Tahaei
- Bioinformatics and Computational Biology Lab, Department of Computer Engineering, Sharif University of Technology, Tehran, 11365, Iran
| | - Sadegh Khorasani
- Bioinformatics and Computational Biology Lab, Department of Computer Engineering, Sharif University of Technology, Tehran, 11365, Iran
| | - Nigel H Lovell
- Tyree Institute of Health Engineering and The Graduate School of Biomedical Engineering, UNSW Sydney, Sydney, NSW, 2052, Australia
| | - James Breen
- South Australian Health & Medical Research Institute, Adelaide, SA, 5000, Australia.,Robinson Research Institute, University of Adelaide, Adelaide, SA, 5006, Australia.,Bioinformatics Hub, University of Adelaide, Adelaide, SA, 5006, Australia
| | - Hamid R Rabiee
- Bioinformatics and Computational Biology Lab, Department of Computer Engineering, Sharif University of Technology, Tehran, 11365, Iran.
| | - Hamid Alinejad-Rokny
- BioMedical Machine Learning Lab (BML), The Graduate School of Biomedical Engineering, UNSW Sydney, Sydney, NSW, 2052, Australia. .,UNSW Data Science Hub, The University of New South Wales (UNSW Sydney), Sydney, NSW, 2052, Australia. .,Health Data Analytics Program, AI-enabled Processes (AIP) Research Centre, Macquarie University, Sydney, NSW, 2109, Australia.
| |
Collapse
|
9
|
Dey P, Mattick JS. High frequency of intron retention and clustered H3K4me3-marked nucleosomes in short first introns of human long non-coding RNAs. Epigenetics Chromatin 2021; 14:45. [PMID: 34579770 PMCID: PMC8477579 DOI: 10.1186/s13072-021-00419-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 08/27/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND It is established that protein-coding exons are preferentially localized in nucleosomes. To examine whether the same is true for non-coding exons, we analysed nucleosome occupancy in and adjacent to internal exons in genes encoding long non-coding RNAs (lncRNAs) in human CD4+ T cells and K562 cells. RESULTS We confirmed that internal exons in lncRNAs are preferentially associated with nucleosomes, but also observed an elevated signal from H3K4me3-marked nucleosomes in the sequences upstream of these exons. Examination of 200 genomic lncRNA loci chosen at random across all chromosomes showed that high-density regions of H3K4me3-marked nucleosomes, which we term 'slabs', are associated with genomic regions exhibiting intron retention. These retained introns occur in over 50% of lncRNAs examined and are mostly first introns with an average length of just 354 bp, compared to the average length of all human introns of 6355 and 7987 bp in mRNAs and lncRNAs, respectively. Removal of short introns from the dataset abrogated the high upstream H3K4me3 signal, confirming that the association of slabs and short lncRNA introns with intron retention holds genome-wide. The high upstream H3K4me3 signal is also associated with alternatively spliced exons, known to be prominent in lncRNAs. This phenomenon was not observed with mRNAs. CONCLUSIONS There is widespread intron retention and clustered H3K4me3-marked nucleosomes in short first introns of human long non-coding RNAs, which raises intriguing questions about the relationship of IR to lncRNA function and chromatin organization.
Collapse
Affiliation(s)
- Pinki Dey
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, 2052, Sydney, Australia
| | - John S Mattick
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, 2052, Sydney, Australia.
| |
Collapse
|
10
|
Jung SB, Lee CY, Lee KH, Heo K, Choi SH. A cleavage-based surrogate reporter for the evaluation of CRISPR-Cas9 cleavage efficiency. Nucleic Acids Res 2021; 49:e85. [PMID: 34086942 PMCID: PMC8421217 DOI: 10.1093/nar/gkab467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 04/30/2021] [Accepted: 05/17/2021] [Indexed: 12/03/2022] Open
Abstract
CRISPR-Cas9 is a powerful tool for genome engineering, but its efficiency largely depends on guide RNA (gRNA). There are multiple methods available to evaluate the efficiency of gRNAs, including the T7E1 assay, surveyor nuclease assay, deep sequencing, and surrogate reporter systems. In the present study, we developed a cleavage-based surrogate that we have named the LacI-reporter to evaluate gRNA cleavage efficiency. The LacI repressor, under the control of the EF-1α promoter, represses luciferase or EGFP reporter expression by binding to the lac operator. Upon CRISPR-Cas9 cleavage at a target site located between the EF-1α promoter and the lacI gene, repressor expression is disrupted, thereby triggering luciferase or EGFP expression. Using this system, we can quantitate gRNA cleavage efficiency by assessing luciferase activity or EGFP expression. We found a strong positive correlation between the cleavage efficiency of gRNAs measured using this reporter and mutation frequency, measured using surveyor and deep sequencing. The genome-editing efficiency of gRNAs was validated in human liver organoids. Our LacI-reporter system provides a useful tool to select efficient gRNAs for genome editing.
Collapse
Affiliation(s)
- Soo Bin Jung
- Research Center, Dongnam Institute of Radiological and Medical Sciences (DIRAMS), Busan, 46033, Republic of Korea
| | - Chae young Lee
- Research Center, Dongnam Institute of Radiological and Medical Sciences (DIRAMS), Busan, 46033, Republic of Korea
| | - Kwang-Ho Lee
- Center for Epigenetics, Van Andel Research Institute, Grand Rapids, MI, 49503, USA
| | - Kyu Heo
- Research Center, Dongnam Institute of Radiological and Medical Sciences (DIRAMS), Busan, 46033, Republic of Korea
| | - Si Ho Choi
- Research Center, Dongnam Institute of Radiological and Medical Sciences (DIRAMS), Busan, 46033, Republic of Korea
| |
Collapse
|
11
|
Atefi A, Kojouri PS, Karamali F, Irani S, Nasr-Esfahani MH. Construction and characterization of EGFP reporter plasmid harboring putative human RAX promoter for in vitro monitoring of retinal progenitor cells identity. BMC Mol Cell Biol 2021; 22:40. [PMID: 34348662 PMCID: PMC8335887 DOI: 10.1186/s12860-021-00378-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2020] [Accepted: 07/12/2021] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND In retinal degenerative disease, progressive and debilitating conditions result in deterioration of retinal cells and visual loss. In human, retina lacks the inherent capacity for regeneration. Therefore, regeneration of retinal layer from human retinal progenitor cells (hRPCs) is a challenging task and restricted in vitro maintenance of hRPCs remains as the main hurdle. Retina and anterior neural fold homeobox gene (RAX) play critical roles in developing retina and maintenance of hRPCs. In this study, for the first time regulatory regions of human RAX gene with potential promoter activity were experimentally investigated. RESULTS For this purpose, after in silico analysis of regulatory regions of human RAX gene, the expression of EGFP reporter derived by putative promoter sequences was first evaluated in 293 T cells and then in hRPCS derived from human embryonic stem cells. The candidate region (RAX-3258 bp) showed the highest EGFP expression in hRPCs. This reporter construct can be used for in vitro monitoring of hRPC identity and verification of an efficient culture medium for maintenance of these cells. CONCLUSIONS Furthermore, our findings provide a platform for better insight into regulatory regions of human RAX gene and molecular mechanisms underlying its vital functions in retina development.
Collapse
Affiliation(s)
- Atefeh Atefi
- Department of Biology, Science and Research Branch, Islamic Azad University, Tehran, Iran
- Department of Animal Biotechnology, Cell Science Research Center, Royan Institute for Biotechnology, ACECR, Isfahan, Iran
| | - Pendar Shojaei Kojouri
- Department of Animal Biotechnology, Cell Science Research Center, Royan Institute for Biotechnology, ACECR, Isfahan, Iran
| | - Fereshteh Karamali
- Department of Animal Biotechnology, Cell Science Research Center, Royan Institute for Biotechnology, ACECR, Isfahan, Iran
| | - Shiva Irani
- Department of Biology, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Mohammad Hossein Nasr-Esfahani
- Department of Animal Biotechnology, Cell Science Research Center, Royan Institute for Biotechnology, ACECR, Isfahan, Iran
| |
Collapse
|
12
|
Zhang Y, Cai Y, Roca X, Kwoh CK, Fullwood MJ. Chromatin loop anchors predict transcript and exon usage. Brief Bioinform 2021; 22:6319936. [PMID: 34263910 PMCID: PMC8575016 DOI: 10.1093/bib/bbab254] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 06/16/2021] [Accepted: 05/25/2021] [Indexed: 11/24/2022] Open
Abstract
Epigenomics and transcriptomics data from high-throughput sequencing techniques such as RNA-seq and ChIP-seq have been successfully applied in predicting gene transcript expression. However, the locations of chromatin loops in the genome identified by techniques such as Chromatin Interaction Analysis with Paired End Tag sequencing (ChIA-PET) have never been used for prediction tasks. Here, we developed machine learning models to investigate if ChIA-PET could contribute to transcript and exon usage prediction. In doing so, we used a large set of transcription factors as well as ChIA-PET data. We developed different Gradient Boosting Trees models according to the different tasks with the integrated datasets from three cell lines, including GM12878, HeLaS3 and K562. We validated the models via 10-fold cross validation, chromosome-split validation and cross-cell validation. Our results show that both transcript and splicing-derived exon usage can be effectively predicted with at least 0.7512 and 0.7459 of accuracy, respectively, on all cell lines from all kinds of validations. Examining the predictive features, we found that RNA Polymerase II ChIA-PET was one of the most important features in both transcript and exon usage prediction, suggesting that chromatin loop anchors are predictive of both transcript and exon usage.
Collapse
Affiliation(s)
- Yu Zhang
- School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
| | - Yichao Cai
- Cancer Science Institute of Singapore, National University of Singapore, 14 Medical Dr, Singapore 117599, Singapore
| | - Xavier Roca
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Dr, Singapore 637551, Singapore
| | - Chee Keong Kwoh
- School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
| | - Melissa Jane Fullwood
- Cancer Science Institute of Singapore, National University of Singapore, 14 Medical Dr, Singapore 117599, Singapore.,School of Biological Sciences, Nanyang Technological University, 637551, Singapore.,Institute of Molecular and Cell Biology, Agency for Science, Technology and Research (A*STAR), 61 Biopolis Dr, Singapore 138673, Singapore
| |
Collapse
|
13
|
Dwyer K, Agarwal N, Pile L, Ansari A. Gene Architecture Facilitates Intron-Mediated Enhancement of Transcription. Front Mol Biosci 2021; 8:669004. [PMID: 33968994 PMCID: PMC8097089 DOI: 10.3389/fmolb.2021.669004] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Accepted: 03/31/2021] [Indexed: 12/28/2022] Open
Abstract
Introns impact several vital aspects of eukaryotic organisms like proteomic plasticity, genomic stability, stress response and gene expression. A role for introns in the regulation of gene expression at the level of transcription has been known for more than thirty years. The molecular basis underlying the phenomenon, however, is still not entirely clear. An important clue came from studies performed in budding yeast that indicate that the presence of an intron within a gene results in formation of a multi-looped gene architecture. When looping is defective, these interactions are abolished, and there is no enhancement of transcription despite normal splicing. In this review, we highlight several potential mechanisms through which looping interactions may enhance transcription. The promoter-5′ splice site interaction can facilitate initiation of transcription, the terminator-3′ splice site interaction can enable efficient termination of transcription, while the promoter-terminator interaction can enhance promoter directionality and expedite reinitiation of transcription. Like yeast, mammalian genes also exhibit an intragenic interaction of the promoter with the gene body, especially exons. Such promoter-exon interactions may be responsible for splicing-dependent transcriptional regulation. Thus, the splicing-facilitated changes in gene architecture may play a critical role in regulation of transcription in yeast as well as in higher eukaryotes.
Collapse
Affiliation(s)
- Katherine Dwyer
- Department of Biological Science, Wayne State University, Detroit, MI, United States
| | - Neha Agarwal
- Department of Biological Science, Wayne State University, Detroit, MI, United States
| | - Lori Pile
- Department of Biological Science, Wayne State University, Detroit, MI, United States
| | - Athar Ansari
- Department of Biological Science, Wayne State University, Detroit, MI, United States
| |
Collapse
|
14
|
Role of promoters in regulating alternative splicing. Gene 2021; 782:145523. [PMID: 33667606 DOI: 10.1016/j.gene.2021.145523] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 12/31/2020] [Accepted: 02/09/2021] [Indexed: 01/19/2023]
Abstract
Alternative splicing (AS) plays a critical role in enhancing proteome complexity in higher eukaryotes. Almost all the multi intron-containing genes undergo AS in humans. Splicing mainly occurs co-transcriptionally, where RNA polymerase II (RNA pol II) plays a crucial role in coordinating transcription and pre-mRNA splicing. Aberrant AS leads to non-functional proteins causative in various pathophysiological conditions such as cancers, neurodegenerative diseases, and muscular dystrophies. Transcription and pre-mRNA splicing are deeply interconnected and can influence each other's functions. Several studies evinced that specific promoters employed by RNA pol II dictate the RNA processing decisions. Promoter-specific recruitment of certain transcriptional factors or transcriptional coactivators influences splicing, and the extent to which these factors affect splicing has not been discussed in detail. Here, in this review, various DNA-binding proteins and their influence on promoter-specific AS are extensively discussed. Besides, this review highlights how the promoter-specific epigenetic changes might regulate AS.
Collapse
|
15
|
Rodin RE, Dou Y, Kwon M, Sherman MA, D'Gama AM, Doan RN, Rento LM, Girskis KM, Bohrson CL, Kim SN, Nadig A, Luquette LJ, Gulhan DC, Park PJ, Walsh CA. The landscape of somatic mutation in cerebral cortex of autistic and neurotypical individuals revealed by ultra-deep whole-genome sequencing. Nat Neurosci 2021; 24:176-185. [PMID: 33432195 PMCID: PMC7983596 DOI: 10.1038/s41593-020-00765-6] [Citation(s) in RCA: 79] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 11/21/2020] [Indexed: 01/29/2023]
Abstract
We characterize the landscape of somatic mutations-mutations occurring after fertilization-in the human brain using ultra-deep (~250×) whole-genome sequencing of prefrontal cortex from 59 donors with autism spectrum disorder (ASD) and 15 control donors. We observe a mean of 26 somatic single-nucleotide variants per brain present in ≥4% of cells, with enrichment of mutations in coding and putative regulatory regions. Our analysis reveals that the first cell division after fertilization produces ~3.4 mutations, followed by 2-3 mutations in subsequent generations. This suggests that a typical individual possesses ~80 somatic single-nucleotide variants present in ≥2% of cells-comparable to the number of de novo germline mutations per generation-with about half of individuals having at least one potentially function-altering somatic mutation somewhere in the cortex. ASD brains show an excess of somatic mutations in neural enhancer sequences compared with controls, suggesting that mosaic enhancer mutations may contribute to ASD risk.
Collapse
Affiliation(s)
- Rachel E Rodin
- Division of Genetics and Genomics, Manton Center for Orphan Disease Research, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA
- Departments of Pediatrics and Neurology, Harvard Medical School, Boston, MA, USA
| | - Yanmei Dou
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Minseok Kwon
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Maxwell A Sherman
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Electrical Engineering and Computer Science, MIT, Cambridge, MA, USA
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Alissa M D'Gama
- Division of Genetics and Genomics, Manton Center for Orphan Disease Research, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA
- Departments of Pediatrics and Neurology, Harvard Medical School, Boston, MA, USA
| | - Ryan N Doan
- Division of Genetics and Genomics, Manton Center for Orphan Disease Research, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA
| | - Lariza M Rento
- Division of Genetics and Genomics, Manton Center for Orphan Disease Research, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA
- Departments of Pediatrics and Neurology, Harvard Medical School, Boston, MA, USA
| | - Kelly M Girskis
- Division of Genetics and Genomics, Manton Center for Orphan Disease Research, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA
- Departments of Pediatrics and Neurology, Harvard Medical School, Boston, MA, USA
| | - Craig L Bohrson
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Sonia N Kim
- Division of Genetics and Genomics, Manton Center for Orphan Disease Research, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA
- Departments of Pediatrics and Neurology, Harvard Medical School, Boston, MA, USA
| | - Ajay Nadig
- Division of Genetics and Genomics, Manton Center for Orphan Disease Research, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA
- Departments of Pediatrics and Neurology, Harvard Medical School, Boston, MA, USA
| | - Lovelace J Luquette
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Doga C Gulhan
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Peter J Park
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA.
| | - Christopher A Walsh
- Division of Genetics and Genomics, Manton Center for Orphan Disease Research, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA.
- Departments of Pediatrics and Neurology, Harvard Medical School, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
16
|
Chen JB, Dong SS, Yao S, Duan YY, Hu WX, Chen H, Wang NN, Chen XF, Hao RH, Thynn HN, Guo MR, Zhang YJ, Rong Y, Chen YX, Zhou FL, Guo Y, Yang TL. Modeling circRNA expression pattern with integrated sequence and epigenetic features demonstrates the potential involvement of H3K79me2 in circRNA expression. Bioinformatics 2020; 36:4739-4748. [PMID: 32539144 DOI: 10.1093/bioinformatics/btaa567] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Revised: 05/07/2020] [Accepted: 06/08/2020] [Indexed: 01/22/2023] Open
Abstract
MOTIVATION CircRNAs are an abundant class of non-coding RNAs with widespread, cell-/tissue-specific patterns. Previous work suggested that epigenetic features might be related to circRNA expression. However, the contribution of epigenetic changes to circRNA expression has not been investigated systematically. Here, we built a machine learning framework named CIRCScan, to predict circRNA expression in various cell lines based on the sequence and epigenetic features. RESULTS The predicted accuracy of the expression status models was high with area under the curve of receiver operating characteristic (ROC) values of 0.89-0.92 and the false-positive rates of 0.17-0.25. Predicted expressed circRNAs were further validated by RNA-seq data. The performance of expression-level prediction models was also good with normalized root-mean-square errors of 0.28-0.30 and Pearson's correlation coefficient r over 0.4 in all cell lines, along with Spearman's correlation coefficient ρ of 0.33-0.46. Noteworthy, H3K79me2 was highly ranked in modeling both circRNA expression status and levels across different cells. Further analysis in additional nine cell lines demonstrated a significant enrichment of H3K79me2 in circRNA flanking intron regions, supporting the potential involvement of H3K79me2 in circRNA expression regulation. AVAILABILITY AND IMPLEMENTATION The CIRCScan assembler is freely available online for academic use at https://github.com/johnlcd/CIRCScan. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jia-Bin Chen
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Shan-Shan Dong
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Shi Yao
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Yuan-Yuan Duan
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Wei-Xin Hu
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Hao Chen
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Nai-Ning Wang
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Xiao-Feng Chen
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Ruo-Han Hao
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Hlaing Nwe Thynn
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Ming-Rui Guo
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Yu-Jie Zhang
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Yu Rong
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Yi-Xiao Chen
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Fu-Ling Zhou
- Department of Hematopathology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Yan Guo
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| | - Tie-Lin Yang
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
| |
Collapse
|
17
|
Zhou X, Im HK, Lee SH. CORE GREML for estimating covariance between random effects in linear mixed models for complex trait analyses. Nat Commun 2020; 11:4208. [PMID: 32826890 PMCID: PMC7442840 DOI: 10.1038/s41467-020-18085-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 08/03/2020] [Indexed: 01/02/2023] Open
Abstract
As a key variance partitioning tool, linear mixed models (LMMs) using genome-based restricted maximum likelihood (GREML) allow both fixed and random effects. Classic LMMs assume independence between random effects, which can be violated, causing bias. Here we introduce a generalized GREML, named CORE GREML, that explicitly estimates the covariance between random effects. Using extensive simulations, we show that CORE GREML outperforms the conventional GREML, providing variance and covariance estimates free from bias due to correlated random effects. Applying CORE GREML to UK Biobank data, we find, for example, that the transcriptome, imputed using genotype data, explains a significant proportion of phenotypic variance for height (0.15, p-value = 1.5e-283), and that these transcriptomic effects correlate with the genomic effects (genome-transcriptome correlation = 0.35, p-value = 1.2e-14). We conclude that the covariance between random effects is a key parameter for estimation, especially when partitioning phenotypic variance by multi-omics layers. Linear mixed models have bias due to the assumed independence between random effects. Here, the authors describe a genome-based restricted maximum likelihood, CORE GREML, which estimates covariance between random effects. Application to UK Biobank data highlights this as an important parameter for multi-omics analyses of phenotypic variance.
Collapse
Affiliation(s)
- Xuan Zhou
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia.,UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia.,South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia
| | - Hae Kyung Im
- Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA
| | - S Hong Lee
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia. .,UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia. .,South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia.
| |
Collapse
|
18
|
Maqbool MA, Pioger L, El Aabidine AZ, Karasu N, Molitor AM, Dao LTM, Charbonnier G, van Laethem F, Fenouil R, Koch F, Lacaud G, Gut I, Gut M, Amigorena S, Joffre O, Sexton T, Spicuglia S, Andrau JC. Alternative Enhancer Usage and Targeted Polycomb Marking Hallmark Promoter Choice during T Cell Differentiation. Cell Rep 2020; 32:108048. [PMID: 32814051 DOI: 10.1016/j.celrep.2020.108048] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 04/13/2020] [Accepted: 07/28/2020] [Indexed: 01/08/2023] Open
Abstract
During thymic development and upon peripheral activation, T cells undergo extensive phenotypic and functional changes coordinated by lineage-specific developmental programs. To characterize the regulatory landscape controlling T cell identity, we perform a wide epigenomic and transcriptional analysis of mouse thymocytes and naive CD4 differentiated T helper cells. Our investigations reveal a dynamic putative enhancer landscape, and we could validate many of the enhancers using the high-throughput CapStarr sequencing (CapStarr-seq) approach. We find that genes using multiple promoters display increased enhancer usage, suggesting that apparent "enhancer redundancy" might relate to isoform selection. Furthermore, we can show that two Runx3 promoters display long-range interactions with specific enhancers. Finally, our analyses suggest a novel function for the PRC2 complex in the control of alternative promoter usage. Altogether, our study has allowed for the mapping of an exhaustive set of active enhancers and provides new insights into their function and that of PRC2 in controlling promoter choice during T cell differentiation.
Collapse
Affiliation(s)
- Muhammad Ahmad Maqbool
- Institut de Génétique Moléculaire de Montpellier, Université de Montpellier, CNRS, 1919 Route de Mende, Montpellier 34293, France.
| | - Léo Pioger
- Institut de Génétique Moléculaire de Montpellier, Université de Montpellier, CNRS, 1919 Route de Mende, Montpellier 34293, France
| | - Amal Zine El Aabidine
- Institut de Génétique Moléculaire de Montpellier, Université de Montpellier, CNRS, 1919 Route de Mende, Montpellier 34293, France
| | - Nezih Karasu
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), 1 rue Laurent Fries, 67404 Illkirch, France; CNRS UMR7104, 1 rue Laurent Fries, 67404 Illkirch, France; INSERM U1258, 1 rue Laurent Fries, 67404 Illkirch, France; University of Strasbourg, 1 rue Laurent Fries, 67404 Illkirch, France
| | - Anne Marie Molitor
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), 1 rue Laurent Fries, 67404 Illkirch, France; CNRS UMR7104, 1 rue Laurent Fries, 67404 Illkirch, France; INSERM U1258, 1 rue Laurent Fries, 67404 Illkirch, France; University of Strasbourg, 1 rue Laurent Fries, 67404 Illkirch, France
| | - Lan T M Dao
- Aix-Marseille University, UMR-S 1090, TAGC, Marseille 13009, France
| | | | - Francois van Laethem
- Institut de Génétique Moléculaire de Montpellier, Université de Montpellier, CNRS, 1919 Route de Mende, Montpellier 34293, France
| | - Romain Fenouil
- Institut de Génétique Moléculaire de Montpellier, Université de Montpellier, CNRS, 1919 Route de Mende, Montpellier 34293, France
| | - Frederic Koch
- Institut de Génétique Moléculaire de Montpellier, Université de Montpellier, CNRS, 1919 Route de Mende, Montpellier 34293, France
| | - Georges Lacaud
- CRUK Stem Cell Biology Group, Cancer Research UK Manchester Institute, The University of Manchester, Aderley Park, Macclesfield SK104TG, UK
| | - Ivo Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Sebastian Amigorena
- Institut Curie, Université Paris Sciences et Lettres, INSERM U932, 75005 Paris, France
| | - Olivier Joffre
- Centre de Physiopathologie de Toulouse Purpan, INSERM UMR1043 CHU Purpan - BP 3028, 31024 Toulouse Cedex 3, France
| | - Thomas Sexton
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), 1 rue Laurent Fries, 67404 Illkirch, France; CNRS UMR7104, 1 rue Laurent Fries, 67404 Illkirch, France; INSERM U1258, 1 rue Laurent Fries, 67404 Illkirch, France; University of Strasbourg, 1 rue Laurent Fries, 67404 Illkirch, France
| | | | - Jean-Christophe Andrau
- Institut de Génétique Moléculaire de Montpellier, Université de Montpellier, CNRS, 1919 Route de Mende, Montpellier 34293, France.
| |
Collapse
|
19
|
Cao S, Luo X, Xie L, Gao C, Wang D, Holt BF, Lin H, Chu C, Xia X. The florigen interactor BdES43 represses flowering in the model temperate grass Brachypodium distachyon. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 102:262-275. [PMID: 31782581 DOI: 10.1111/tpj.14622] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 10/25/2019] [Accepted: 11/05/2019] [Indexed: 06/10/2023]
Abstract
FLOWERING LOCUS T (FT) protein, physiologically florigen, has been identified as a system integrator of numerous flowering time pathways in many studies, and its homologs are found throughout the plant lineage. It is important to uncover how precisely florigenic homologs contribute to flowering initiation and how these factors interact genetically. Here we dissected the function of Brachypodium FT orthologs BdFTL1 and BdFTL2 using overexpression and gene-editing experiments. Transgenic assays showed that both BdFTL1 and BdFTL2 could promote flowering, whereas BdFTL2 was essential for flowering initiation. Notably, BdFTL1 is subject to alternative splicing (AS), and its transcriptional level and AS are significantly affected by BdFTL2. Additionally, BdFTL2 could bind with the PHD-containing protein BdES43, an H3K4me3 reader. Furthermore, BdES43 was antagonistic to BdFTL2 in flowering initiation in a transcription-dependent manner and significantly affected BdFTL1 expression. BdFTL2, BdES43 and H3K4me3 also had highly similar distribution patterns within the BdFTL1 locus, indicating their interplay in regulating target genes. Taken together, florigen BdFTL2 functions as a potential epigenetic effector of BdFTL1 by interacting with a BdES43-H3K4me3 complex. This finding provides an additional insight for the regulatory mechanism underlying the multifaceted roles of florigen.
Collapse
Affiliation(s)
- Shuanghe Cao
- National Wheat Improvement Center, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), 12 Zhongguancun South Street, Beijing, 100081, China
| | - Xumei Luo
- National Wheat Improvement Center, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), 12 Zhongguancun South Street, Beijing, 100081, China
| | - Li Xie
- National Wheat Improvement Center, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), 12 Zhongguancun South Street, Beijing, 100081, China
| | - Caixia Gao
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences (CAS), No. 1 West Beichen Road, Chaoyang District, Beijing, 100101, China
| | - Daowen Wang
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences (CAS), No. 1 West Beichen Road, Chaoyang District, Beijing, 100101, China
| | - Ben F Holt
- Department of Microbiology and Plant Biology, University of Oklahoma, 770 Van Vleet Oval, Norman, OK, 73019, USA
| | - Hao Lin
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences (CAAS), 12 Zhongguancun South Street, Haidian District, Beijing, 100081, China
| | - Chengcai Chu
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences (CAS), No. 1 West Beichen Road, Chaoyang District, Beijing, 100101, China
| | - Xianchun Xia
- National Wheat Improvement Center, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), 12 Zhongguancun South Street, Beijing, 100081, China
| |
Collapse
|
20
|
Sun X, Thorne RF, Zhang XD, He M, Li J, Feng S, Liu X, Wu M. LncRNA GUARDIN suppresses cellular senescence through a LRP130-PGC1α-FOXO4-p21-dependent signaling axis. EMBO Rep 2020; 21:e48796. [PMID: 32149459 DOI: 10.15252/embr.201948796] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Revised: 12/29/2019] [Accepted: 02/14/2020] [Indexed: 12/14/2022] Open
Abstract
The long noncoding RNA GUARDIN functions to protect genome stability. Inhibiting GUARDIN expression can alter cell fate decisions toward senescence or apoptosis, but the underlying molecular signals are unknown. Here, we show that GUARDIN is an essential component of a transcriptional repressor complex involving LRP130 and PGC1α. GUARDIN acts as a scaffold to stabilize LRP130/PGC1α heterodimers and their occupancy at the FOXO4 promotor. Destabilizing this complex by silencing of GUARDIN, LRP130, or PGC1α leads to increased expression of FOXO4 and upregulation of its target gene p21, thereby driving cells into senescence. We also found that GUARDIN expression was induced by rapamycin, an agent that suppresses cell senescence. FOS-like antigen 2 (FOSL2) acts as a transcriptional repressor of GUARDIN, and lower FOSL2 levels in response to rapamycin correlate with increased levels of GUARDIN. Together, these results demonstrate that GUARDIN inhibits p21-dependent senescence through a LRP130-PGC1α-FOXO4 signaling axis, and moreover, GUARDIN contributes to the anti-aging activities of rapamycin.
Collapse
Affiliation(s)
- Xuedan Sun
- CAS Key Laboratory of Innate Immunity and Chronic Disease, CAS Centre for Excellence in Molecular Cell Science, School of Life Sciences, University of Science and Technology of China and The First Affiliated Hospital of University of Science and Technology of China, Hefei, China
| | - Rick Francis Thorne
- Translational Research Institute, Henan Provincial People's Hospital, Academy of Medical Science, Zhengzhou University, Zhengzhou, China.,School of Environmental & Life Sciences, University of Newcastle, Newcastle, NSW, Australia
| | - Xu Dong Zhang
- Translational Research Institute, Henan Provincial People's Hospital, Academy of Medical Science, Zhengzhou University, Zhengzhou, China.,School of Biomedical Sciences & Pharmacy, University of Newcastle, Newcastle, NSW, Australia
| | - Miao He
- High Magnetic Field Laboratory, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China
| | - Jinming Li
- Translational Research Institute, Henan Provincial People's Hospital, Academy of Medical Science, Zhengzhou University, Zhengzhou, China
| | - Shanshan Feng
- Key Laboratory of Regenerative Medicine, Ministry of Education, Department of Developmental & Regenerative Biology, Jinan University, Guangzhou, China
| | - Xiaoying Liu
- Translational Research Institute, Henan Provincial People's Hospital, Academy of Medical Science, Zhengzhou University, Zhengzhou, China.,School of Life Sciences, Anhui Medical University, Hefei, China
| | - Mian Wu
- CAS Key Laboratory of Innate Immunity and Chronic Disease, CAS Centre for Excellence in Molecular Cell Science, School of Life Sciences, University of Science and Technology of China and The First Affiliated Hospital of University of Science and Technology of China, Hefei, China.,Translational Research Institute, Henan Provincial People's Hospital, Academy of Medical Science, Zhengzhou University, Zhengzhou, China.,Key Laboratory of Stem Cell Differentiation & Modification, School of Clinical Medicine, Henan University, Zhengzhou, China
| |
Collapse
|
21
|
Schmidt F, Kern F, Schulz MH. Integrative prediction of gene expression with chromatin accessibility and conformation data. Epigenetics Chromatin 2020; 13:4. [PMID: 32029002 PMCID: PMC7003490 DOI: 10.1186/s13072-020-0327-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Accepted: 01/06/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Enhancers play a fundamental role in orchestrating cell state and development. Although several methods have been developed to identify enhancers, linking them to their target genes is still an open problem. Several theories have been proposed on the functional mechanisms of enhancers, which triggered the development of various methods to infer promoter-enhancer interactions (PEIs). The advancement of high-throughput techniques describing the three-dimensional organization of the chromatin, paved the way to pinpoint long-range PEIs. Here we investigated whether including PEIs in computational models for the prediction of gene expression improves performance and interpretability. RESULTS We have extended our [Formula: see text] framework to include DNA contacts deduced from chromatin conformation capture experiments and compared various methods to determine PEIs using predictive modelling of gene expression from chromatin accessibility data and predicted transcription factor (TF) motif data. We designed a novel machine learning approach that allows the prioritization of TFs binding to distal loop and promoter regions with respect to their importance for gene expression regulation. Our analysis revealed a set of core TFs that are part of enhancer-promoter loops involving YY1 in different cell lines. CONCLUSION We present a novel approach that can be used to prioritize TFs involved in distal and promoter-proximal regulatory events by integrating chromatin accessibility, conformation, and gene expression data. We show that the integration of chromatin conformation data can improve gene expression prediction and aids model interpretability.
Collapse
Affiliation(s)
- Florian Schmidt
- High-throughput Genomics & Systems Biology, Cluster of Excellence on Multimodal Computing and Interaction, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Computational Biology & Applied Algorithmics, Max-Planck Institute for Informatics, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Center for Bioinformatics, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Genome Institute of Singapore, A*STAR, 60 Biopolis Street, Singapore, 138672 Singapore
| | - Fabian Kern
- High-throughput Genomics & Systems Biology, Cluster of Excellence on Multimodal Computing and Interaction, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Center for Bioinformatics, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Chair for Clinical Bioinformatics, Saarland Informatics Campus, 66123 Saarbrücken, Germany
| | - Marcel H. Schulz
- High-throughput Genomics & Systems Biology, Cluster of Excellence on Multimodal Computing and Interaction, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Computational Biology & Applied Algorithmics, Max-Planck Institute for Informatics, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Center for Bioinformatics, Saarland Informatics Campus, 66123 Saarbrücken, Germany
- Institute of Cardiovascular Regeneration, Goethe-University, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- German Center for Cardiovascular Research, Partner Site Rhein-Main, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| |
Collapse
|
22
|
Fiszbein A, Krick KS, Begg BE, Burge CB. Exon-Mediated Activation of Transcription Starts. Cell 2019; 179:1551-1565.e17. [PMID: 31787377 DOI: 10.1016/j.cell.2019.11.002] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 08/20/2019] [Accepted: 10/30/2019] [Indexed: 10/25/2022]
Abstract
The processing of RNA transcripts from mammalian genes occurs in proximity to their transcription. Here, we describe a phenomenon affecting thousands of genes that we call exon-mediated activation of transcription starts (EMATS), in which the splicing of internal exons impacts promoter choice and the expression level of the gene. We observed that evolutionary gain of internal exons is associated with gain of new transcription start sites (TSSs) nearby and increased gene expression. Inhibiting exon splicing reduced transcription from nearby promoters, and creation of new spliced exons activated transcription from cryptic promoters. The strongest effects occurred for weak promoters located proximal and upstream of efficiently spliced exons. Together, our findings support a model in which splicing recruits transcription machinery locally to influence TSS choice and identify exon gain, loss, and regulatory change as major contributors to the evolution of alternative promoters and gene expression in mammals.
Collapse
Affiliation(s)
- Ana Fiszbein
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02138, USA
| | - Keegan S Krick
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02138, USA
| | - Bridget E Begg
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02138, USA
| | - Christopher B Burge
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02138, USA.
| |
Collapse
|
23
|
Org T, Hensen K, Kreevan R, Mark E, Sarv O, Andreson R, Jaakma Ü, Salumets A, Kurg A. Genome-wide histone modification profiling of inner cell mass and trophectoderm of bovine blastocysts by RAT-ChIP. PLoS One 2019; 14:e0225801. [PMID: 31765427 PMCID: PMC6876874 DOI: 10.1371/journal.pone.0225801] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Accepted: 11/12/2019] [Indexed: 12/29/2022] Open
Abstract
Chromatin immunoprecipitation coupled with next-generation sequencing (ChIP-seq) has revolutionized our understanding of chromatin-related biological processes. The method, however, requires thousands of cells and has therefore limited applications in situations where cell numbers are limited. Here we describe a novel method called Restriction Assisted Tagmentation Chromatin Immunoprecipitation (RAT-ChIP) that enables global histone modification profiling from as few as 100 cells. The method is simple, cost-effective and takes a single day to complete. We demonstrate the sensitivity of the method by deriving the first genome-wide maps of histone H3K4me3 and H3K27me3 modifications of inner cell mass and trophectoderm of bovine blastocyst stage embryos.
Collapse
Affiliation(s)
- Tõnis Org
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
- * E-mail:
| | - Kati Hensen
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Rita Kreevan
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Elina Mark
- Chair of Animal Breeding and Biotechnology, Estonian University of Life Sciences, Tartu, Estonia
| | - Olav Sarv
- Competence Centre on Health Technologies, Tartu, Estonia
| | - Reidar Andreson
- Department of Bioinformatics, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
- Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Ülle Jaakma
- Chair of Animal Breeding and Biotechnology, Estonian University of Life Sciences, Tartu, Estonia
| | - Andres Salumets
- Competence Centre on Health Technologies, Tartu, Estonia
- Department of Obstetrics and Gynaecology, Institute of Clinical Medicine, University of Tartu, Tartu, Estonia
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia
- Department of Obstetrics and Gynaecology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Ants Kurg
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| |
Collapse
|
24
|
Malik R, Dichgans M. Challenges and opportunities in stroke genetics. Cardiovasc Res 2019; 114:1226-1240. [PMID: 29554300 DOI: 10.1093/cvr/cvy068] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 03/14/2018] [Indexed: 12/13/2022] Open
Abstract
Stroke, ischaemic stroke and subtypes of ischaemic stroke display substantial heritability. When compared with related vascular conditions, the number of established risk loci reaching genome-wide significance for association with stroke is still in the lower range, particularly for aetiological stroke subtypes such as large artery atherosclerotic stroke or small vessel stroke. Nevertheless, for individual loci substantial progress has been made in determining the specific mechanisms mediating stroke risk. In this review, we present a roadmap for functional follow-up of common risk variants associated with stroke. First, we discuss in silico strategies for characterizing signals in non-coding regions and highlight databases providing information on quantitative trait loci for mRNA and protein expression, as well as methylation, focussing on those with presumed relevance for stroke. Next, we discuss experimental strategies for following up on non-coding risk variants and regions such as massively parallel reporter assays, proteome-wide association studies, and chromatin conformation capture (3C) assays. These and other approaches are relevant for gaining insight into the specific variants and mechanisms mediating genetic stroke risk. Finally, we discuss how genetic findings could influence clinical practice by adding to diagnostic algorithms and eventually improve treatment options for stroke.
Collapse
Affiliation(s)
- Rainer Malik
- Institute for Stroke and Dementia Research (ISD), University Hospital, Ludwig-Maximilians-Universität (LMU) München, Feodor-Lynen-Straße 17, Munich, Germany
| | - Martin Dichgans
- Institute for Stroke and Dementia Research (ISD), University Hospital, Ludwig-Maximilians-Universität (LMU) München, Feodor-Lynen-Straße 17, Munich, Germany.,Munich Cluster for Systems Neurology (SyNergy), Feodor-Lynen-Straße 17, Munich, Germany
| |
Collapse
|
25
|
Ben Zouari Y, Molitor AM, Sikorska N, Pancaldi V, Sexton T. ChiCMaxima: a robust and simple pipeline for detection and visualization of chromatin looping in Capture Hi-C. Genome Biol 2019; 20:102. [PMID: 31118054 PMCID: PMC6532271 DOI: 10.1186/s13059-019-1706-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Accepted: 05/03/2019] [Indexed: 12/19/2022] Open
Abstract
Capture Hi-C (CHi-C) is a new technique for assessing genome organization based on chromosome conformation capture coupled to oligonucleotide capture of regions of interest, such as gene promoters. Chromatin loop detection is challenging because existing Hi-C/4C-like tools, which make different assumptions about the technical biases presented, are often unsuitable. We describe a new approach, ChiCMaxima, which uses local maxima combined with limited filtering to detect DNA looping interactions, integrating information from biological replicates. ChiCMaxima shows more stringency and robustness compared to previously developed tools. The tool includes a GUI browser for flexible visualization of CHi-C profiles alongside epigenomic tracks.
Collapse
Affiliation(s)
- Yousra Ben Zouari
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), Illkirch, France
- CNRS UMR7104, Illkirch, France
- INSERM U1258, Illkirch, France
- University of Strasbourg, Illkirch, France
| | - Anne M Molitor
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), Illkirch, France
- CNRS UMR7104, Illkirch, France
- INSERM U1258, Illkirch, France
- University of Strasbourg, Illkirch, France
| | - Natalia Sikorska
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), Illkirch, France
- CNRS UMR7104, Illkirch, France
- INSERM U1258, Illkirch, France
- University of Strasbourg, Illkirch, France
| | - Vera Pancaldi
- Centre de Recherches en Cancérologie de Toulouse (CRCT), INSERM U1037, Toulouse, France
- University Paul Sabatier III, Toulouse, France
- Barcelona Supercomputing Center, Barcelona, Spain
| | - Tom Sexton
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), Illkirch, France.
- CNRS UMR7104, Illkirch, France.
- INSERM U1258, Illkirch, France.
- University of Strasbourg, Illkirch, France.
| |
Collapse
|
26
|
Rothenberg EV. Causal Gene Regulatory Network Modeling and Genomics: Second-Generation Challenges. J Comput Biol 2019; 26:703-718. [PMID: 31063008 DOI: 10.1089/cmb.2019.0098] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Gene regulatory network modeling has played a major role in advancing the understanding of developmental systems, by crystallizing structures of relevant extant information, by formally posing hypothetical functional relationships between network elements, and by offering clear predictive tests to improve understanding of the mechanisms driving developmental progression. Both ordinary differential equation (ODE)-based and Boolean models have also been highly successful in explaining dynamics within subcircuits of more complex processes. In a very small number of cases, gene regulatory network models of much more global scope have been proposed that successfully predict the dynamics of the processes establishing most of an embryonic body plan. Can such successes be expanded to very different developmental systems, including post-embryonic mammalian systems? This perspective discusses several problems that must be solved in more quantitative and predictive theoretical terms, to make this possible. These problems include: the effects of cellular history on chromatin state and how these affect gene accessibility; the dose dependence of activities of many transcription factors (a problem for Boolean models); stochasticity of some transcriptional outputs (a problem for simple ODE models); response timing delays due to epigenetic remodeling requirements; functionally different kinds of repression; and the regulatory syntax that governs responses of genes with multiple enhancers.
Collapse
Affiliation(s)
- Ellen V Rothenberg
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
| |
Collapse
|
27
|
Kanduri C, Bock C, Gundersen S, Hovig E, Sandve GK. Colocalization analyses of genomic elements: approaches, recommendations and challenges. Bioinformatics 2019; 35:1615-1624. [PMID: 30307532 PMCID: PMC6499241 DOI: 10.1093/bioinformatics/bty835] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 09/03/2018] [Accepted: 10/10/2018] [Indexed: 12/23/2022] Open
Abstract
MOTIVATION Many high-throughput methods produce sets of genomic regions as one of their main outputs. Scientists often use genomic colocalization analysis to interpret such region sets, for example to identify interesting enrichments and to understand the interplay between the underlying biological processes. Although widely used, there is little standardization in how these analyses are performed. Different practices can substantially affect the conclusions of colocalization analyses. RESULTS Here, we describe the different approaches and provide recommendations for performing genomic colocalization analysis, while also discussing common methodological challenges that may influence the conclusions. As illustrated by concrete example cases, careful attention to analysis details is needed in order to meet these challenges and to obtain a robust and biologically meaningful interpretation of genomic region set data. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Chakravarthi Kanduri
- Department of Informatics, University of Oslo, Oslo, Norway
- K. G. Jebsen Coeliac Disease Research Centre, Oslo, Norway
| | - Christoph Bock
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
- Department of Laboratory Medicine, Medical University of Vienna, Vienna, Austria
- Max Planck Institute for Informatics, Saarbrücken, Germany
| | - Sveinung Gundersen
- Department of Informatics, University of Oslo, Oslo, Norway
- Elixir Norway, Oslo Node, University of Oslo, Oslo, Norway
| | - Eivind Hovig
- Department of Informatics, University of Oslo, Oslo, Norway
- Elixir Norway, Oslo Node, University of Oslo, Oslo, Norway
- Department of Tumor Biology, Institute for Cancer Research, Oslo, Norway
- Institute for Cancer Genetics and Informatics, The Norwegian Radium Hospital, Oslo, Norway, UK
| | - Geir Kjetil Sandve
- Department of Informatics, University of Oslo, Oslo, Norway
- K. G. Jebsen Coeliac Disease Research Centre, Oslo, Norway
| |
Collapse
|
28
|
Giraud G, Terrone S, Bourgeois CF. Functions of DEAD box RNA helicases DDX5 and DDX17 in chromatin organization and transcriptional regulation. BMB Rep 2019. [PMID: 30293550 PMCID: PMC6330936 DOI: 10.5483/bmbrep.2018.51.12.234] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
RNA helicases DDX5 and DDX17 are multitasking proteins that regulate gene expression in different biological contexts through diverse activities. Special attention has long been paid to their function as coregulators of transcription factors, providing insight about their functional association with a number of chromatin modifiers and remodelers. However, to date, the variety of described mechanisms has made it difficult to understand precisely how these proteins work at the molecular level, and the contribution of their ATPase domain to these mechanisms remains unclear as well. In light of their association with long noncoding RNAs that are key epigenetic regulators, an emerging view is that DDX5 and DDX17 may act through modulating the activity of various ribonucleoprotein complexes that could ensure their targeting to specific chromatin loci. This review will comprehensively describe the current knowledge on these different mechanisms. We will also discuss the potential roles of DDX5 and DDX17 on the 3D chromatin organization and how these could impact gene expression at the transcriptional and post-transcriptional levels.
Collapse
Affiliation(s)
- Guillaume Giraud
- Laboratoire de Biologie et Modelisation de la Cellule, Universite de Lyon, CNRS UMR 5239, INSERM U1210, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, F-69007 Lyon, France
| | - Sophie Terrone
- Laboratoire de Biologie et Modelisation de la Cellule, Universite de Lyon, CNRS UMR 5239, INSERM U1210, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, F-69007 Lyon, France
| | - Cyril F Bourgeois
- Laboratoire de Biologie et Modelisation de la Cellule, Universite de Lyon, CNRS UMR 5239, INSERM U1210, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, F-69007 Lyon, France
| |
Collapse
|
29
|
Cremer M, Cremer T. Nuclear compartmentalization, dynamics, and function of regulatory DNA sequences. Genes Chromosomes Cancer 2019; 58:427-436. [DOI: 10.1002/gcc.22714] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2018] [Revised: 11/23/2018] [Accepted: 11/27/2018] [Indexed: 12/15/2022] Open
Affiliation(s)
- Marion Cremer
- Biocenter, Department Biology II; Ludwig Maximilians-Universität (LMU Munich); Munich Germany
| | - Thomas Cremer
- Biocenter, Department Biology II; Ludwig Maximilians-Universität (LMU Munich); Munich Germany
| |
Collapse
|
30
|
Thompson JJ, Kaur R, Sosa CP, Lee JH, Kashiwagi K, Zhou D, Robertson KD. ZBTB24 is a transcriptional regulator that coordinates with DNMT3B to control DNA methylation. Nucleic Acids Res 2018; 46:10034-10051. [PMID: 30085123 PMCID: PMC6212772 DOI: 10.1093/nar/gky682] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 06/29/2018] [Accepted: 07/17/2018] [Indexed: 12/12/2022] Open
Abstract
The interplay between transcription factors and epigenetic writers like the DNA methyltransferases (DNMTs), and the role of this interplay in gene expression, is being increasingly appreciated. ZBTB24, a poorly characterized zinc-finger protein, or the de novo methyltransferase DNMT3B, when mutated, cause Immunodeficiency, Centromere Instability, and Facial anomalies (ICF) syndrome, suggesting an underlying mechanistic link. Chromatin immunoprecipitation coupled with loss-of-function approaches in model systems revealed common loci bound by ZBTB24 and DNMT3B, where they function to regulate gene body methylation. Genes coordinately regulated by ZBTB24 and DNMT3B are enriched for molecular mechanisms essential for cellular homeostasis, highlighting the importance of the ZBTB24-DNMT3B interplay in maintaining epigenetic patterns required for normal cellular function. We identify a ZBTB24 DNA binding motif, which is contained within the promoters of most of its transcriptional targets, including CDCA7, AXIN2, and OSTC. Direct binding of ZBTB24 at the promoters of these genes targets them for transcriptional activation. ZBTB24 binding at the promoters of RNF169 and CAMKMT, however, targets them for transcriptional repression. The involvement of ZBTB24 targets in diverse cellular programs, including the VDR/RXR and interferon regulatory pathways, suggest that ZBTB24's role as a transcriptional regulator is not restricted to immune cells.
Collapse
Affiliation(s)
- Joyce J Thompson
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, 200 First Street SW, Stabile 12-58, Rochester, MN 55905, USA
| | - Rupinder Kaur
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, 200 First Street SW, Stabile 12-58, Rochester, MN 55905, USA
| | - Carlos P Sosa
- Clinical Genome Sequencing Laboratory, Department of Laboratory Medicine and Pathology, Mayo Clinic, 200 First Street SW, Stabile12-58, Rochester, MN 55905, USA
| | - Jeong-Heon Lee
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN 55905, USA
- Epigenomics Translational Program, Mayo Clinic, Rochester, MN 55905, USA
| | - Katsunobu Kashiwagi
- Department of Physiology II, Nara Medical University, Kashihara, Nara 634-8521, Japan
| | - Dan Zhou
- Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai, China
| | - Keith D Robertson
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, 200 First Street SW, Stabile 12-58, Rochester, MN 55905, USA
- Epigenomics Translational Program, Mayo Clinic, Rochester, MN 55905, USA
| |
Collapse
|
31
|
Radhakrishna U, Vishweswaraiah S, Veerappa AM, Zafra R, Albayrak S, Sitharam PH, Saiyed NM, Mishra NK, Guda C, Bahado-Singh R. Newborn blood DNA epigenetic variations and signaling pathway genes associated with Tetralogy of Fallot (TOF). PLoS One 2018; 13:e0203893. [PMID: 30212560 PMCID: PMC6136787 DOI: 10.1371/journal.pone.0203893] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 08/29/2018] [Indexed: 12/31/2022] Open
Abstract
Tetralogy of Fallot (TOF) is the most common Critical Congenital Heart Defect (CCHD). The etiology of TOF is unknown in most cases. Preliminary data from our group and others suggest that epigenetic changes may play an important role in CHD. Epidemiologically, a significant percentage of CHD including TOF fail to be diagnosed in the prenatal and early newborn period which can negatively affect health outcomes. We performed genome-wide methylation assay in newborn blood in 24 non-syndromic TOF cases and 24 unaffected matched controls using Illumina Infinium HumanMethylation450 BeadChips. We identified 64 significantly differentially methylated CpG sites in TOF cases, of which 25 CpG sites had high predictive accuracy for TOF, based on the area under the receiver operating characteristics curve (AUC ROC) ≥ 0.90). The CpG methylation difference between TOF and controls was ≥10% in 51 CpG targets suggesting biological significance. Gene ontology analysis identified significant biological processes and functions related to these differentially methylated genes, including: CHD development, cardiomyopathy, diabetes, immunological, inflammation and other plausible pathways in CHD development. Multiple genes known or plausibly linked to heart development and post-natal heart disease were found to be differentially methylated in the blood DNA of newborns with TOF including: ABCB1, PPP2R5C, TLR1, SELL, SCN3A, CREM, RUNX and LHX9. We generated novel and highly accurate putative molecular markers for TOF detection using leucocyte DNA and thus provided information on pathogenesis of TOF.
Collapse
Affiliation(s)
- Uppala Radhakrishna
- Department of Obstetrics and Gynecology, Oakland University William Beaumont School of Medicine, Royal Oak, Michigan, United States of America
- * E-mail:
| | - Sangeetha Vishweswaraiah
- Department of Obstetrics and Gynecology, Oakland University William Beaumont School of Medicine, Royal Oak, Michigan, United States of America
| | - Avinash M. Veerappa
- Department of Studies in Genetics and Genomics, Laboratory of Genomic Sciences, University of Mysore, Mysore, Karnataka, India
| | - Rita Zafra
- Department of Obstetrics and Gynecology, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Samet Albayrak
- Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America
| | - Prajna H. Sitharam
- Department of Studies in Genetics and Genomics, Laboratory of Genomic Sciences, University of Mysore, Mysore, Karnataka, India
| | - Nazia M. Saiyed
- Biotechnology, Nirma Institute of Science, Nirma University, Ahmedabad, Gujarat, India
| | - Nitish K. Mishra
- Department of Genetics, Cell Biology & Anatomy College of Medicine, University of Nebraska Medical Center Omaha, Omaha, Nebraska, United States of America
| | - Chittibabu Guda
- Department of Genetics, Cell Biology & Anatomy College of Medicine, University of Nebraska Medical Center Omaha, Omaha, Nebraska, United States of America
| | - Ray Bahado-Singh
- Department of Obstetrics and Gynecology, Oakland University William Beaumont School of Medicine, Royal Oak, Michigan, United States of America
| |
Collapse
|
32
|
Iancu OD, Colville AM, Wilmot B, Searles R, Darakjian P, Zheng C, McWeeney S, Kawane S, Crabbe JC, Metten P, Oberbeck D, Hitzemann R. Gender-Specific Effects of Selection for Drinking in the Dark on the Network Roles of Coding and Noncoding RNAs. Alcohol Clin Exp Res 2018; 42:1454-1465. [PMID: 29786871 DOI: 10.1111/acer.13777] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2017] [Accepted: 05/10/2018] [Indexed: 12/11/2022]
Abstract
BACKGROUND Transcriptional differences between heterogeneous stock mice and high drinking-in-the-dark selected mouse lines have previously been described based on microarray technology coupled with network-based analysis. The network changes were reproducible in 2 independent selections and largely confined to 2 distinct network modules; in contrast, differential expression appeared more specific to each selected line. This study extends these results by utilizing RNA-Seq technology, allowing evaluation of the relationship between genetic risk and transcription of noncoding RNA (ncRNA); we additionally evaluate sex-specific transcriptional effects of selection. METHODS Naïve mice (N = 24/group and sex) were utilized for gene expression analysis in the ventral striatum; the transcriptome was sequenced with the Illumina HiSeq platform. Differential gene expression and the weighted gene co-expression network analysis were implemented largely as described elsewhere, resulting in the identification of genes that change expression level or (co)variance structure. RESULTS Across both sexes, we detect selection effects on the extracellular matrix and synaptic signaling, although the identity of individual genes varies. A majority of nc RNAs cluster in a single module of relatively low density in both the male and female network. The most strongly differentially expressed transcript in both sexes was Gm22513, a small nuclear RNA with unknown function. Associated with selection, we also found a number of network hubs that change edge strength and connectivity. At the individual gene level, there are many sex-specific effects; however, at the annotation level, results are more concordant. CONCLUSIONS In addition to demonstrating sex-specific effects of selection on the transcriptome, the data point to the involvement of extracellular matrix genes as being associated with the binge drinking phenotype.
Collapse
Affiliation(s)
- Ovidiu Dan Iancu
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon
| | - Alex M Colville
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon
| | - Beth Wilmot
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon
| | - Robert Searles
- Integrated Genomics Laboratory, Oregon Health & Science University, Portland, Oregon
| | - Priscila Darakjian
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon
| | - Christina Zheng
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon.,Knight Cancer Institute, Oregon Health & Science University, Portland, Oregon
| | - Shannon McWeeney
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon
| | - Sunita Kawane
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon
| | - John C Crabbe
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon.,VA Portland Health Care System , Portland, Oregon
| | - Pamela Metten
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon.,VA Portland Health Care System , Portland, Oregon
| | - Denesa Oberbeck
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon
| | - Robert Hitzemann
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon
| |
Collapse
|
33
|
Stielow B, Finkernagel F, Stiewe T, Nist A, Suske G. MGA, L3MBTL2 and E2F6 determine genomic binding of the non-canonical Polycomb repressive complex PRC1.6. PLoS Genet 2018; 14:e1007193. [PMID: 29381691 PMCID: PMC5806899 DOI: 10.1371/journal.pgen.1007193] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Revised: 02/09/2018] [Accepted: 01/09/2018] [Indexed: 02/02/2023] Open
Abstract
Diverse Polycomb repressive complexes 1 (PRC1) play essential roles in gene regulation, differentiation and development. Six major groups of PRC1 complexes that differ in their subunit composition have been identified in mammals. How the different PRC1 complexes are recruited to specific genomic sites is poorly understood. The Polycomb Ring finger protein PCGF6, the transcription factors MGA and E2F6, and the histone-binding protein L3MBTL2 are specific components of the non-canonical PRC1.6 complex. In this study, we have investigated their role in genomic targeting of PRC1.6. ChIP-seq analysis revealed colocalization of MGA, L3MBTL2, E2F6 and PCGF6 genome-wide. Ablation of MGA in a human cell line by CRISPR/Cas resulted in complete loss of PRC1.6 binding. Rescue experiments revealed that MGA recruits PRC1.6 to specific loci both by DNA binding-dependent and by DNA binding-independent mechanisms. Depletion of L3MBTL2 and E2F6 but not of PCGF6 resulted in differential, locus-specific loss of PRC1.6 binding illustrating that different subunits mediate PRC1.6 loading to distinct sets of promoters. Mga, L3mbtl2 and Pcgf6 colocalize also in mouse embryonic stem cells, where PRC1.6 has been linked to repression of germ cell-related genes. Our findings unveil strikingly different genomic recruitment mechanisms of the non-canonical PRC1.6 complex, which specify its cell type- and context-specific regulatory functions. Polycomb group proteins assemble in two major repressive multi-subunit complexes (PRC1 and PRC2), which play important roles in many physiological processes, including stem cell maintenance, differentiation, cell cycle control and cancer. In mammals, six different groups of PRC1 complexes exist (PRC1.1 to PRC1.6), which differ in their subunit composition. The mechanisms that target the different PRC1 complexes to specific genomic sites appear diverse and are poorly understood. In this study, we have investigated the genomic targeting mechanisms of the non-canonical PRC1.6 complex. In PRC1.6, the defining subunit PCGF6 is specifically associated with several proteins including the transcription factors MGA and E2F6, and the histone-binding protein L3MBTL2. We found that MGA is absolutely essential for targeting PRC1.6. MGA executes recruitment of PRC1.6 to its target sites through two distinct functions. On the one hand it acts as a sequence-specific DNA-binding factor; on the other hand it has a scaffolding function, which is independent of its DNA binding capacity. E2F6 and L3MBTL2 are also important in genomic targeting of PRC1.6 as they promote binding of PRC1.6 to different sets of genes associated with distinct functions. Our finding that different components specify loading of PRC1.6 to distinct sets of genes could establish a paradigm for other chromatin-associated complexes.
Collapse
Affiliation(s)
- Bastian Stielow
- Institute of Molecular Biology and Tumor Research (IMT), Philipps-University of Marburg, Marburg, Germany
| | - Florian Finkernagel
- Institute of Molecular Biology and Tumor Research (IMT), Philipps-University of Marburg, Marburg, Germany
| | - Thorsten Stiewe
- Genomics Core Facility, Center for Tumor Biology and Immunology (ZTI), Philipps-University of Marburg, Marburg, Germany
| | - Andrea Nist
- Genomics Core Facility, Center for Tumor Biology and Immunology (ZTI), Philipps-University of Marburg, Marburg, Germany
| | - Guntram Suske
- Institute of Molecular Biology and Tumor Research (IMT), Philipps-University of Marburg, Marburg, Germany
- * E-mail:
| |
Collapse
|
34
|
Ullah F, Hamilton M, Reddy ASN, Ben-Hur A. Exploring the relationship between intron retention and chromatin accessibility in plants. BMC Genomics 2018; 19:21. [PMID: 29304739 PMCID: PMC5756433 DOI: 10.1186/s12864-017-4393-z] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 12/15/2017] [Indexed: 11/29/2022] Open
Abstract
Background Intron retention (IR) is the most prevalent form of alternative splicing in plants. IR, like other forms of alternative splicing, has an important role in increasing gene product diversity and regulating transcript functionality. Splicing is known to occur co-transcriptionally and is influenced by the speed of transcription which in turn, is affected by chromatin structure. It follows that chromatin structure may have an important role in the regulation of splicing, and there is preliminary evidence in metazoans to suggest that this is indeed the case; however, nothing is known about the role of chromatin structure in regulating IR in plants. DNase I-seq is a useful experimental tool for genome-wide interrogation of chromatin accessibility, providing information on regions of chromatin with very high likelihood of cleavage by the enzyme DNase I, known as DNase I Hypersensitive Sites (DHSs). While it is well-established that promoter regions are highly accessible and are over-represented with DHSs, not much is known about DHSs in the bodies of genes, and their relationship to splicing in general, and IR in particular. Results In this study we use publicly available DNase I-seq data in arabidopsis and rice to investigate the relationship between IR and chromatin structure. We find that IR events are highly enriched in DHSs in both species. This implies that chromatin is more open in retained introns, which is consistent with a kinetic model of the process whereby higher speeds of transcription in those regions give less time for the spliceosomal machinery to recognize and splice out those introns co-transcriptionally. The more open chromatin in IR can also be the result of regulation mediated by DNA-binding proteins. To test this, we performed an exhaustive search for footprints left by DNA-binding proteins that are associated with IR. We identified several hundred short sequence elements that exhibit footprints in their DNase I-seq coverage, the telltale sign for binding events of a regulatory protein, protecting its binding site from cleavage by DNase I. A highly significant fraction of those sequence elements are conserved between arabidopsis and rice, a strong indication of their functional importance. Conclusions In this study we have established an association between IR and chromatin accessibility, and presented a mechanistic hypothesis that explains the observed association from the perspective of the co-transcriptional nature of splicing. Furthermore, we identified conserved sequence elements for DNA-binding proteins that affect splicing. Electronic supplementary material The online version of this article (10.1186/s12864-017-4393-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Fahad Ullah
- Computer Science Department, Colorado State University, 1873 Campus Delivery, Fort Collins, 80523, CO, USA
| | - Michael Hamilton
- Computer Science Department, Colorado State University, 1873 Campus Delivery, Fort Collins, 80523, CO, USA
| | - Anireddy S N Reddy
- Department of Biology, Colorado State University, 1878 Campus Delivery, Fort Collins, 80523, CO, USA
| | - Asa Ben-Hur
- Computer Science Department, Colorado State University, 1873 Campus Delivery, Fort Collins, 80523, CO, USA.
| |
Collapse
|
35
|
Rambout X, Dequiedt F, Maquat LE. Beyond Transcription: Roles of Transcription Factors in Pre-mRNA Splicing. Chem Rev 2017; 118:4339-4364. [PMID: 29251915 DOI: 10.1021/acs.chemrev.7b00470] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Whereas individual steps of protein-coding gene expression in eukaryotes can be studied in isolation in vitro, it has become clear that these steps are intimately connected within cells. Connections not only ensure quality control but also fine-tune the gene expression process, which must adapt to environmental changes while remaining robust. In this review, we systematically present proven and potential mechanisms by which sequence-specific DNA-binding transcription factors can alter gene expression beyond transcription initiation and regulate pre-mRNA splicing, and thereby mRNA isoform production, by (i) influencing transcription elongation rates, (ii) binding to pre-mRNA to recruit splicing factors, and/or (iii) blocking the association of splicing factors with pre-mRNA. We propose various mechanistic models throughout the review, in some cases without explicit supportive evidence, in hopes of providing fertile ground for future studies.
Collapse
|
36
|
Tilgner H, Jahanbani F, Gupta I, Collier P, Wei E, Rasmussen M, Snyder M. Microfluidic isoform sequencing shows widespread splicing coordination in the human transcriptome. Genome Res 2017; 28:231-242. [PMID: 29196558 PMCID: PMC5793787 DOI: 10.1101/gr.230516.117] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2017] [Accepted: 11/30/2017] [Indexed: 12/21/2022]
Abstract
Understanding transcriptome complexity is crucial for understanding human biology and disease. Technologies such as Synthetic long-read RNA sequencing (SLR-RNA-seq) delivered 5 million isoforms and allowed assessing splicing coordination. Pacific Biosciences and Oxford Nanopore increase throughput also but require high input amounts or amplification. Our new droplet-based method, sparse isoform sequencing (spISO-seq), sequences 100k–200k partitions of 10–200 molecules at a time, enabling analysis of 10–100 million RNA molecules. SpISO-seq requires less than 1 ng of input cDNA, limiting or removing the need for prior amplification with its associated biases. Adjusting the number of reads devoted to each molecule reduces sequencing lanes and cost, with little loss in detection power. The increased number of molecules expands our understanding of isoform complexity. In addition to confirming our previously published cases of splicing coordination (e.g., BIN1), the greater depth reveals many new cases, such as MAPT. Coordination of internal exons is found to be extensive among protein coding genes: 23.5%–59.3% (95% confidence interval) of highly expressed genes with distant alternative exons exhibit coordination, showcasing the need for long-read transcriptomics. However, coordination is less frequent for noncoding sequences, suggesting a larger role of splicing coordination in shaping proteins. Groups of genes with coordination are involved in protein–protein interactions with each other, raising the possibility that coordination facilitates complex formation and/or function. We also find new splicing coordination types, involving initial and terminal exons. Our results provide a more comprehensive understanding of the human transcriptome and a general, cost-effective method to analyze it.
Collapse
Affiliation(s)
- Hagen Tilgner
- Brain and Mind Research Institute, Weill Cornell Medicine, New York, New York 10021, USA
| | - Fereshteh Jahanbani
- Department of Genetics, Stanford University, Stanford, California 94304, USA
| | - Ishaan Gupta
- Brain and Mind Research Institute, Weill Cornell Medicine, New York, New York 10021, USA
| | - Paul Collier
- Brain and Mind Research Institute, Weill Cornell Medicine, New York, New York 10021, USA
| | - Eric Wei
- Department of Genetics, Stanford University, Stanford, California 94304, USA
| | | | - Michael Snyder
- Department of Genetics, Stanford University, Stanford, California 94304, USA
| |
Collapse
|
37
|
Ruiz-Velasco M, Kumar M, Lai MC, Bhat P, Solis-Pinson AB, Reyes A, Kleinsorg S, Noh KM, Gibson TJ, Zaugg JB. CTCF-Mediated Chromatin Loops between Promoter and Gene Body Regulate Alternative Splicing across Individuals. Cell Syst 2017; 5:628-637.e6. [DOI: 10.1016/j.cels.2017.10.018] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 08/02/2017] [Accepted: 10/27/2017] [Indexed: 12/20/2022]
|
38
|
Zhao D, Lin M, Pedrosa E, Lachman HM, Zheng D. Characteristics of allelic gene expression in human brain cells from single-cell RNA-seq data analysis. BMC Genomics 2017; 18:860. [PMID: 29126398 PMCID: PMC5681780 DOI: 10.1186/s12864-017-4261-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2017] [Accepted: 11/01/2017] [Indexed: 12/24/2022] Open
Abstract
Background Monoallelic expression of autosomal genes has been implicated in human psychiatric disorders. However, there is a paucity of allelic expression studies in human brain cells at the single cell and genome wide levels. Results In this report, we reanalyzed a previously published single-cell RNA-seq dataset from several postmortem human brains and observed pervasive monoallelic expression in individual cells, largely in a random manner. Examining single nucleotide variants with a predicted functional disruption, we found that the “damaged” alleles were overall expressed in fewer brain cells than their counterparts, and at a lower level in cells where their expression was detected. We also identified many brain cell type-specific monoallelically expressed genes. Interestingly, many of these cell type-specific monoallelically expressed genes were enriched for functions important for those brain cell types. In addition, function analysis showed that genes displaying monoallelic expression and correlated expression across neuronal cells from different individual brains were implicated in the regulation of synaptic function. Conclusions Our findings suggest that monoallelic gene expression is prevalent in human brain cells, which may play a role in generating cellular identity and neuronal diversity and thus increasing the complexity and diversity of brain cell functions. Electronic supplementary material The online version of this article (10.1186/s12864-017-4261-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Dejian Zhao
- Department of Neurology, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA.,Department of Genetics, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA
| | - Mingyan Lin
- Department of Genetics, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA.,Present address: Department of Neuroscience, School of Basic Medical Science, Nanjing Medical University, Nanjing, Jiangsu, 21166, China
| | - Erika Pedrosa
- Department of Psychiatry and Behavioral Sciences, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA
| | - Herbert M Lachman
- Department of Genetics, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA.,Department of Psychiatry and Behavioral Sciences, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA.,Department of Neuroscience, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA.,Department of Medicine, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA
| | - Deyou Zheng
- Department of Neurology, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA. .,Department of Genetics, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA. .,Department of Neuroscience, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY, USA.
| |
Collapse
|
39
|
Sud A, Kinnersley B, Houlston RS. Genome-wide association studies of cancer: current insights and future perspectives. Nat Rev Cancer 2017; 17:692-704. [PMID: 29026206 DOI: 10.1038/nrc.2017.82] [Citation(s) in RCA: 244] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Genome-wide association studies (GWAS) provide an agnostic approach for investigating the genetic basis of complex diseases. In oncology, GWAS of nearly all common malignancies have been performed, and over 450 genetic variants associated with increased risks have been identified. As well as revealing novel pathways important in carcinogenesis, these studies have shown that common genetic variation contributes substantially to the heritable risk of many common cancers. The clinical application of GWAS is starting to provide opportunities for drug discovery and repositioning as well as for cancer prevention. However, deciphering the functional and biological basis of associations is challenging and is in part a barrier to fully unlocking the potential of GWAS.
Collapse
Affiliation(s)
- Amit Sud
- Division of Genetics and Epidemiology, The Institute of Cancer Research
| | - Ben Kinnersley
- Division of Genetics and Epidemiology, The Institute of Cancer Research
| | - Richard S Houlston
- Division of Genetics and Epidemiology, The Institute of Cancer Research
- Division of Molecular Pathology, The Institute of Cancer Research, 15 Cotswold Road, Sutton, London SM2 5NG, UK
| |
Collapse
|
40
|
Ho MCW, Quintero-Cadena P, Sternberg PW. Genome-wide discovery of active regulatory elements and transcription factor footprints in Caenorhabditis elegans using DNase-seq. Genome Res 2017; 27:2108-2119. [PMID: 29074739 PMCID: PMC5741056 DOI: 10.1101/gr.223735.117] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2017] [Accepted: 10/18/2017] [Indexed: 12/23/2022]
Abstract
Deep sequencing of size-selected DNase I–treated chromatin (DNase-seq) allows high-resolution measurement of chromatin accessibility to DNase I cleavage, permitting identification of de novo active cis-regulatory modules (CRMs) and individual transcription factor (TF) binding sites. We adapted DNase-seq to nuclei isolated from C. elegans embryos and L1 arrest larvae to generate high-resolution maps of TF binding. Over half of embryonic DNase I hypersensitive sites (DHSs) were annotated as noncoding, with 24% in intergenic, 12% in promoters, and 28% in introns, with similar statistics observed in L1 arrest larvae. Noncoding DHSs are highly conserved and enriched in marks of enhancer activity and transcription. We validated noncoding DHSs against known enhancers from myo-2, myo-3, hlh-1, elt-2, and lin-26/lir-1 and recapitulated 15 of 17 known enhancers. We then mined DNase-seq data to identify putative active CRMs and TF footprints. Using DNase-seq data improved predictions of tissue-specific expression compared with motifs alone. In a pilot functional test, 10 of 15 DHSs from pha-4, icl-1, and ceh-13 drove reporter gene expression in transgenic C. elegans. Overall, we provide experimental annotation of 26,644 putative CRMs in the embryo containing 55,890 TF footprints, as well as 15,841 putative CRMs in the L1 arrest larvae containing 32,685 TF footprints.
Collapse
Affiliation(s)
- Margaret C W Ho
- Division of Biology and Bioengineering, Howard Hughes Medical Institute, California Institute of Technology, Pasadena, California 91125, USA
| | - Porfirio Quintero-Cadena
- Division of Biology and Bioengineering, Howard Hughes Medical Institute, California Institute of Technology, Pasadena, California 91125, USA
| | - Paul W Sternberg
- Division of Biology and Bioengineering, Howard Hughes Medical Institute, California Institute of Technology, Pasadena, California 91125, USA
| |
Collapse
|
41
|
Initial high-resolution microscopic mapping of active and inactive regulatory sequences proves non-random 3D arrangements in chromatin domain clusters. Epigenetics Chromatin 2017; 10:39. [PMID: 28784182 PMCID: PMC5547466 DOI: 10.1186/s13072-017-0146-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Accepted: 07/31/2017] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND The association of active transcription regulatory elements (TREs) with DNAse I hypersensitivity (DHS[+]) and an 'open' local chromatin configuration has long been known. However, the 3D topography of TREs within the nuclear landscape of individual cells in relation to their active or inactive status has remained elusive. Here, we explored the 3D nuclear topography of active and inactive TREs in the context of a recently proposed model for a functionally defined nuclear architecture, where an active and an inactive nuclear compartment (ANC-INC) form two spatially co-aligned and functionally interacting networks. RESULTS Using 3D structured illumination microscopy, we performed 3D FISH with differently labeled DNA probe sets targeting either sites with DHS[+], apparently active TREs, or DHS[-] sites harboring inactive TREs. Using an in-house image analysis tool, DNA targets were quantitatively mapped on chromatin compaction shaped 3D nuclear landscapes. Our analyses present evidence for a radial 3D organization of chromatin domain clusters (CDCs) with layers of increasing chromatin compaction from the periphery to the CDC core. Segments harboring active TREs are significantly enriched at the decondensed periphery of CDCs with loops penetrating into interchromatin compartment channels, constituting the ANC. In contrast, segments lacking active TREs (DHS[-]) are enriched toward the compacted interior of CDCs (INC). CONCLUSIONS Our results add further evidence in support of the ANC-INC network model. The different 3D topographies of DHS[+] and DHS[-] sites suggest positional changes of TREs between the ANC and INC depending on their functional state, which might provide additional protection against an inappropriate activation. Our finding of a structural organization of CDCs based on radially arranged layers of different chromatin compaction levels indicates a complex higher-order chromatin organization beyond a dichotomic classification of chromatin into an 'open,' active and 'closed,' inactive state.
Collapse
|
42
|
Fu H, Zhang X. Noncoding Variants Functional Prioritization Methods Based on Predicted Regulatory Factor Binding Sites. Curr Genomics 2017; 18:322-331. [PMID: 29081688 PMCID: PMC5635616 DOI: 10.2174/1389202918666170228143619] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Revised: 10/16/2016] [Accepted: 11/02/2016] [Indexed: 12/31/2022] Open
Abstract
BACKGROUNDS With the advent of the post genomic era, the research for the genetic mechanism of the diseases has found to be increasingly depended on the studies of the genes, the gene-networks and gene-protein interaction networks. To explore gene expression and regulation, the researchers have carried out many studies on transcription factors and their binding sites (TFBSs). Based on the large amount of transcription factor binding sites predicting values in the deep learning models, further computation and analysis have been done to reveal the relationship between the gene mutation and the occurrence of the disease. It has been demonstrated that based on the deep learning methods, the performances of the prediction for the functions of the noncoding variants are outperforming than those of the conventional methods. The research on the prediction for functions of Single Nucleotide Polymorphisms (SNPs) is expected to uncover the mechanism of the gene mutation affection on traits and diseases of human beings. RESULTS We reviewed the conventional TFBSs identification methods from different perspectives. As for the deep learning methods to predict the TFBSs, we discussed the related problems, such as the raw data preprocessing, the structure design of the deep convolution neural network (CNN) and the model performance measure et al. And then we summarized the techniques that usually used in finding out the functional noncoding variants from de novo sequence. CONCLUSION Along with the rapid development of the high-throughout assays, more and more sample data and chromatin features would be conducive to improve the prediction accuracy of the deep convolution neural network for TFBSs identification. Meanwhile, getting more insights into the deep CNN framework itself has been proved useful for both the promotion on model performance and the development for more suitable design to sample data. Based on the feature values predicted by the deep CNN model, the prioritization model for functional noncoding variants would contribute to reveal the affection of gene mutation on the diseases.
Collapse
Affiliation(s)
- Haoyue Fu
- College of Sciences, Northeastern University, Shenyang, China
| | - LianpingYang
- College of Sciences, Northeastern University, Shenyang, China
- University of Southern California, Dept. Biol. Sci., Program Mol & Computat Biol, USA
| | - Xiangde Zhang
- College of Sciences, Northeastern University, Shenyang, China
| |
Collapse
|
43
|
Oka R, Zicola J, Weber B, Anderson SN, Hodgman C, Gent JI, Wesselink JJ, Springer NM, Hoefsloot HCJ, Turck F, Stam M. Genome-wide mapping of transcriptional enhancer candidates using DNA and chromatin features in maize. Genome Biol 2017; 18:137. [PMID: 28732548 PMCID: PMC5522596 DOI: 10.1186/s13059-017-1273-4] [Citation(s) in RCA: 107] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2017] [Accepted: 07/05/2017] [Indexed: 11/10/2022] Open
Abstract
Background While most cells in multicellular organisms carry the same genetic information, in each cell type only a subset of genes is being transcribed. Such differentiation in gene expression depends, for a large part, on the activation and repression of regulatory sequences, including transcriptional enhancers. Transcriptional enhancers can be located tens of kilobases from their target genes, but display characteristic chromatin and DNA features, allowing their identification by genome-wide profiling. Here we show that integration of chromatin characteristics can be applied to predict distal enhancer candidates in Zea mays, thereby providing a basis for a better understanding of gene regulation in this important crop plant. Result To predict transcriptional enhancers in the crop plant maize (Zea mays L. ssp. mays), we integrated available genome-wide DNA methylation data with newly generated maps for chromatin accessibility and histone 3 lysine 9 acetylation (H3K9ac) enrichment in young seedling and husk tissue. Approximately 1500 intergenic regions, displaying low DNA methylation, high chromatin accessibility and H3K9ac enrichment, were classified as enhancer candidates. Based on their chromatin profiles, candidate sequences can be classified into four subcategories. Tissue-specificity of enhancer candidates is defined based on the tissues in which they are identified and putative target genes are assigned based on tissue-specific expression patterns of flanking genes. Conclusions Our method identifies three previously identified distal enhancers in maize, validating the new set of enhancer candidates and enlarging the toolbox for the functional characterization of gene regulation in the highly repetitive maize genome. Electronic supplementary material The online version of this article (doi:10.1186/s13059-017-1273-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Rurika Oka
- Swammerdam Institute for Life Sciences, Universiteit van Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
| | - Johan Zicola
- Department Plant Developmental Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Köln, Germany
| | - Blaise Weber
- Swammerdam Institute for Life Sciences, Universiteit van Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
| | - Sarah N Anderson
- Department of Plant Biology, University of Minnesota, 40 Gortner Laboratory, 1479 Gortner Avenue, St. Paul, MN, 55108, USA
| | - Charlie Hodgman
- Centre for Plant Integrative Biology, School of Biosciences, University of Nottingham, Sutton Bonington, LE12 5RD, UK
| | - Jonathan I Gent
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| | | | - Nathan M Springer
- Department of Plant Biology, University of Minnesota, 40 Gortner Laboratory, 1479 Gortner Avenue, St. Paul, MN, 55108, USA
| | - Huub C J Hoefsloot
- Swammerdam Institute for Life Sciences, Universiteit van Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
| | - Franziska Turck
- Department Plant Developmental Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Köln, Germany.
| | - Maike Stam
- Swammerdam Institute for Life Sciences, Universiteit van Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands.
| |
Collapse
|
44
|
Jung S, Angarica VE, Andrade-Navarro MA, Buckley NJ, Del Sol A. Prediction of Chromatin Accessibility in Gene-Regulatory Regions from Transcriptomics Data. Sci Rep 2017; 7:4660. [PMID: 28680085 PMCID: PMC5498635 DOI: 10.1038/s41598-017-04929-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2017] [Accepted: 05/22/2017] [Indexed: 01/01/2023] Open
Abstract
The epigenetics landscape of cells plays a key role in the establishment of cell-type specific gene expression programs characteristic of different cellular phenotypes. Different experimental procedures have been developed to obtain insights into the accessible chromatin landscape including DNase-seq, FAIRE-seq and ATAC-seq. However, current downstream computational tools fail to reliably determine regulatory region accessibility from the analysis of these experimental data. In particular, currently available peak calling algorithms are very sensitive to their parameter settings and show highly heterogeneous results, which hampers a trustworthy identification of accessible chromatin regions. Here, we present a novel method that predicts accessible and, more importantly, inaccessible gene-regulatory chromatin regions solely relying on transcriptomics data, which complements and improves the results of currently available computational methods for chromatin accessibility assays. We trained a hierarchical classification tree model on publicly available transcriptomics and DNase-seq data and assessed the predictive power of the model in six gold standard datasets. Our method increases precision and recall compared to traditional peak calling algorithms, while its usage is not limited to the prediction of accessible and inaccessible gene-regulatory chromatin regions, but constitutes a helpful tool for optimizing the parameter settings of peak calling methods in a cell type specific manner.
Collapse
Affiliation(s)
- Sascha Jung
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Belvaux, Luxembourg
| | | | - Miguel A Andrade-Navarro
- Faculty of Biology, Johannes-Gutenberg University of Mainz, Mainz, Germany.,Institute of Molecular Biology, Mainz, Germany
| | - Noel J Buckley
- Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, United Kingdom
| | - Antonio Del Sol
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Belvaux, Luxembourg.
| |
Collapse
|
45
|
Jadhav U, Saxena M, O'Neill NK, Saadatpour A, Yuan GC, Herbert Z, Murata K, Shivdasani RA. Dynamic Reorganization of Chromatin Accessibility Signatures during Dedifferentiation of Secretory Precursors into Lgr5+ Intestinal Stem Cells. Cell Stem Cell 2017. [PMID: 28648363 DOI: 10.1016/j.stem.2017.05.001] [Citation(s) in RCA: 187] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Replicating Lgr5+ stem cells and quiescent Bmi1+ cells behave as intestinal stem cells (ISCs) in vivo. Disrupting Lgr5+ ISCs triggers epithelial renewal from Bmi1+ cells, from secretory or absorptive progenitors, and from Paneth cell precursors, revealing a high degree of plasticity within intestinal crypts. Here, we show that GFP+ cells from Bmi1GFP mice are preterminal enteroendocrine cells and we identify CD69+CD274+ cells as related goblet cell precursors. Upon loss of native Lgr5+ ISCs, both populations revert toward an Lgr5+ cell identity. While active histone marks are distributed similarly between Lgr5+ ISCs and progenitors of both major lineages, thousands of cis elements that control expression of lineage-restricted genes are selectively open in secretory cells. This accessibility signature dynamically converts to that of Lgr5+ ISCs during crypt regeneration. Beyond establishing the nature of Bmi1GFP+ cells, these findings reveal how chromatin status underlies intestinal cell diversity and dedifferentiation to restore ISC function and intestinal homeostasis.
Collapse
Affiliation(s)
- Unmesh Jadhav
- Department of Medical Oncology and Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Department of Medicine, Harvard Medical School, Boston, MA 02215, USA
| | - Madhurima Saxena
- Department of Medical Oncology and Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Department of Medicine, Harvard Medical School, Boston, MA 02215, USA
| | - Nicholas K O'Neill
- Department of Medical Oncology and Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Assieh Saadatpour
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Harvard TH Chan School of Public Health, Boston, MA 02215, USA
| | - Guo-Cheng Yuan
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Harvard TH Chan School of Public Health, Boston, MA 02215, USA; Harvard Stem Cell Institute, Cambridge, MA 02138, USA
| | - Zachary Herbert
- Molecular Biology Core Facility, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Kazutaka Murata
- Department of Medical Oncology and Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Department of Medicine, Harvard Medical School, Boston, MA 02215, USA
| | - Ramesh A Shivdasani
- Department of Medical Oncology and Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Department of Medicine, Harvard Medical School, Boston, MA 02215, USA; Harvard Stem Cell Institute, Cambridge, MA 02138, USA; Department of Medicine, Brigham and Women's Hospital, Boston, MA 02215, USA.
| |
Collapse
|
46
|
Herman-Izycka J, Wlasnowolski M, Wilczynski B. Taking promoters out of enhancers in sequence based predictions of tissue-specific mammalian enhancers. BMC Med Genomics 2017; 10:34. [PMID: 28589862 PMCID: PMC5461523 DOI: 10.1186/s12920-017-0264-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Many genetic diseases are caused by mutations in non-coding regions of the genome. These mutations are frequently found in enhancer sequences, causing disruption to the regulatory program of the cell. Enhancers are short regulatory sequences in the non-coding part of the genome that are essential for the proper regulation of transcription. While the experimental methods for identification of such sequences are improving every year, our understanding of the rules behind the enhancer activity has not progressed much in the last decade. This is especially true in case of tissue-specific enhancers, where there are clear problems in predicting specificity of enhancer activity. RESULTS We show a random-forest based machine learning approach capable of matching the performance of the current state-of-the-art methods for enhancer prediction. Then we show that it is, similarly to other published methods, frequently cross-predicting enhancers as active in different tissues, making it less useful for predicting tissue specific activity. Then we proceed to show that the problem is related to the fact that the enhancer predicting models exhibit a bias towards predicting gene promoters as active enhancers. Then we show that using a two-step classifier can lead to lower cross-prediction between tissues. CONCLUSIONS We provide whole-genome predictions of human heart and brain enhancers obtained with two-step classifier.
Collapse
Affiliation(s)
- Julia Herman-Izycka
- Institute of Informatics, University of Warsaw, Banacha 2, Warsaw, 02-097, Poland
| | - Michal Wlasnowolski
- Institute of Informatics, University of Warsaw, Banacha 2, Warsaw, 02-097, Poland
| | - Bartek Wilczynski
- Institute of Informatics, University of Warsaw, Banacha 2, Warsaw, 02-097, Poland.
| |
Collapse
|
47
|
Ruiz-Velasco M, Zaugg JB. Structure meets function: How chromatin organisation conveys functionality. ACTA ACUST UNITED AC 2017. [DOI: 10.1016/j.coisb.2017.01.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
48
|
De Luca M, Pels K, Moleirinho S, Curtale G. The epigenetic landscape of innate immunity. AIMS MOLECULAR SCIENCE 2017. [DOI: 10.3934/molsci.2017.1.110] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
|
49
|
Babu D, Fullwood MJ. 3D genome organization in health and disease: emerging opportunities in cancer translational medicine. Nucleus 2016; 6:382-93. [PMID: 26553406 PMCID: PMC4915485 DOI: 10.1080/19491034.2015.1106676] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Organizing the DNA to fit inside a spatially constrained nucleus is a challenging problem that has attracted the attention of scientists across all disciplines of science. Increasing evidence has demonstrated the importance of genome geometry in several cellular contexts that affect human health. Among several approaches, the application of sequencing technologies has substantially increased our understanding of this intricate organization, also known as chromatin interactions. These structures are involved in transcriptional control of gene expression by connecting distal regulatory elements with their target genes and regulating co-transcriptional splicing. In addition, chromatin interactions play pivotal roles in the organization of the genome, the formation of structural variants, recombination, DNA replication and cell division. Mutations in factors that regulate chromatin interactions lead to the development of pathological conditions, for example, cancer. In this review, we discuss key findings that have shed light on the importance of these structures in the context of cancers, and highlight the applicability of chromatin interactions as potential biomarkers in molecular medicine as well as therapeutic implications of chromatin interactions.
Collapse
Affiliation(s)
- Deepak Babu
- a Cancer Science Institute of Singapore: Singapore; National University of Singapore ; Singapore
| | - Melissa J Fullwood
- a Cancer Science Institute of Singapore: Singapore; National University of Singapore ; Singapore.,b School of Biological Sciences; Nanyang Technological University ; Singapore.,c Institute of Molecular and Cell Biology; Agency for Science; Technology and Research (A*STAR) ; Singapore.,d Yale-NUS Liberal Arts College ; Singapore
| |
Collapse
|
50
|
Abstract
Several studies propose an influence of chromatin on pre-mRNA splicing, but it is still unclear how widespread and how direct this phenomenon is. We find here that when assembled in vivo, the U2 snRNP co-purifies with a subset of chromatin-proteins, including histones and remodeling complexes like SWI/SNF. Yet, an unbiased RNAi screen revealed that the outcome of splicing is influenced by a much larger variety of chromatin factors not all associating with the spliceosome. The availability of this broad range of chromatin factors impacting splicing further unveiled their very context specific effect, resulting in either inclusion or skipping, depending on the exon under scrutiny. Finally, a direct assessment of the impact of chromatin on splicing using an in vitro co-transcriptional splicing assay with pre-mRNAs transcribed from a nucleosomal template, demonstrated that chromatin impacts nascent pre-mRNP in their competence for splicing. Altogether, our data show that numerous chromatin factors associated or not with the spliceosome can affect the outcome of splicing, possibly as a function of the local chromatin environment that by default interferes with the efficiency of splicing. Splicing is an RNA editing step allowing to produce multiple transcripts from a single gene. The gene itself is organized in chromatin, associating DNA and multiple proteins. Some proteins regulating the compaction of the chromatin also affect RNA splicing. Yet, it was unclear whether these chromatin proteins were exceptions or whether chromatin very generally affected the outcome of splicing. Here, we show that a subset of chromatin proteins is physically in interaction with the enzyme responsible for RNA splicing. In addition, several chromatin proteins not found directly associated with the splicing machinery were also able to influence RNA splicing, suggesting that chromatin compaction very globally plays a role in splicing. This finding was confirmed using the first in vitro assay combining transcription and splicing in the context of chromatin; this assay showed that assembling DNA with chromatin proteins influences the efficiency of splicing.
Collapse
|