Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Crawford GE, Holt IE, Whittle J, Webb BD, Tai D, Davis S, Margulies EH, Chen Y, Bernat JA, Ginsburg D, Zhou D, Luo S, Vasicek TJ, Daly MJ, Wolfsberg TG, Collins FS. Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). Genome Res 2005;16:123-31. [PMID: 16344561 PMCID: PMC1356136 DOI: 10.1101/gr.4074106] [Citation(s) in RCA: 353] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

For:	Crawford GE, Holt IE, Whittle J, Webb BD, Tai D, Davis S, Margulies EH, Chen Y, Bernat JA, Ginsburg D, Zhou D, Luo S, Vasicek TJ, Daly MJ, Wolfsberg TG, Collins FS. Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). Genome Res 2005;16:123-31. [PMID: 16344561 PMCID: PMC1356136 DOI: 10.1101/gr.4074106] [Citation(s) in RCA: 353] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Number

Cited by Other Article(s)

Hou TY, Kraus WL. Spirits in the Material World: Enhancer RNAs in Transcriptional Regulation. Trends Biochem Sci 2020;46:138-153. [PMID: 32888773 DOI: 10.1016/j.tibs.2020.08.007] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2020] [Revised: 08/04/2020] [Accepted: 08/07/2020] [Indexed: 12/15/2022]

Han J, Wang P, Wang Q, Lin Q, Chen Z, Yu G, Miao C, Dao Y, Wu R, Schnable JC, Tang H, Wang K. Genome-Wide Characterization of DNase I-Hypersensitive Sites and Cold Response Regulatory Landscapes in Grasses. THE PLANT CELL 2020;32:2457-2473. [PMID: 32471863 PMCID: PMC7401015 DOI: 10.1105/tpc.19.00716] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Revised: 05/11/2020] [Accepted: 05/23/2020] [Indexed: 05/05/2023]

Affiliation(s)

Jinlei Han Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China
Pengxi Wang Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China
Qiongli Wang Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China
Qingfang Lin Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China
Zhiyong Chen Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China
Guangrun Yu Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China
Chenyong Miao Center for Plant Science Innovation, University of Nebraska, Lincoln, Nebraska 68588
Yihang Dao Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China
Ruoxi Wu Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China
James C Schnable Center for Plant Science Innovation, University of Nebraska, Lincoln, Nebraska 68588
Haibao Tang Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China
Kai Wang Key Laboratory of Genetics, Breeding, and Multiple Utilization of Crops, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, 350002 Fuzhou, China

Collapse

Use Chou’s 5-steps rule to identify DNase I hypersensitive sites via dinucleotide property matrix and extreme gradient boosting. Mol Genet Genomics 2020;295:1431-1442. [DOI: 10.1007/s00438-020-01711-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 07/11/2020] [Indexed: 01/08/2023]

Malladi VS, Nagari A, Franco HL, Kraus WL. Total Functional Score of Enhancer Elements Identifies Lineage-Specific Enhancers That Drive Differentiation of Pancreatic Cells. Bioinform Biol Insights 2020;14:1177932220938063. [PMID: 32655276 PMCID: PMC7331761 DOI: 10.1177/1177932220938063] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Accepted: 06/02/2020] [Indexed: 01/10/2023] Open

Zhou J, Lu Q, Gui L, Xu R, Long Y, Wang H. MTTFsite: cross-cell type TF binding site prediction by using multi-task learning. Bioinformatics 2020;35:5067-5077. [PMID: 31161194 PMCID: PMC6954652 DOI: 10.1093/bioinformatics/btz451] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 05/19/2019] [Accepted: 05/30/2019] [Indexed: 12/30/2022] Open

Abstract

Motivation

The prediction of transcription factor binding sites (TFBSs) is crucial for gene expression analysis. Supervised learning approaches for TFBS predictions require large amounts of labeled data. However, many TFs of certain cell types either do not have sufficient labeled data or do not have any labeled data.

Results

In this paper, a multi-task learning framework (called MTTFsite) is proposed to address the lack of labeled data problem by leveraging on labeled data available in cross-cell types. The proposed MTTFsite contains a shared CNN to learn common features for all cell types and a private CNN for each cell type to learn private features. The common features are aimed to help predicting TFBSs for all cell types especially those cell types that lack labeled data. MTTFsite is evaluated on 241 cell type TF pairs and compared with a baseline method without using any multi-task learning model and a fully shared multi-task model that uses only a shared CNN and do not use private CNNs. For cell types with insufficient labeled data, results show that MTTFsite performs better than the baseline method and the fully shared model on more than 89% pairs. For cell types without any labeled data, MTTFsite outperforms the baseline method and the fully shared model by more than 80 and 93% pairs, respectively. A novel gene expression prediction method (called TFChrome) using both MTTFsite and histone modification features is also presented. Results show that TFBSs predicted by MTTFsite alone can achieve good performance. When MTTFsite is combined with histone modification features, a significant 5.7% performance improvement is obtained.

Availability and implementation

The resource and executable code are freely available at http://hlt.hitsz.edu.cn/MTTFsite/ and http://www.hitsz-hlt.com:8080/MTTFsite/.

Supplementary information

Supplementary data are available at Bioinformatics online.

Collapse

Markodimitraki CM, Rang FJ, Rooijers K, de Vries SS, Chialastri A, de Luca KL, Lochs SJA, Mooijman D, Dey SS, Kind J. Simultaneous quantification of protein-DNA interactions and transcriptomes in single cells with scDam&T-seq. Nat Protoc 2020;15:1922-1953. [PMID: 32350457 PMCID: PMC7779467 DOI: 10.1038/s41596-020-0314-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2019] [Accepted: 02/17/2020] [Indexed: 12/31/2022]

Lekschas F, Peterson B, Haehn D, Ma E, Gehlenborg N, Pfister H. Peax: Interactive Visual Pattern Search in Sequential Data Using Unsupervised Deep Representation Learning. COMPUTER GRAPHICS FORUM : JOURNAL OF THE EUROPEAN ASSOCIATION FOR COMPUTER GRAPHICS 2020;39:167-179. [PMID: 34334852 PMCID: PMC8323802 DOI: 10.1111/cgf.13971] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]

Attentive gated neural networks for identifying chromatin accessibility. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-04879-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Berrio A, Haygood R, Wray GA. Identifying branch-specific positive selection throughout the regulatory genome using an appropriate proxy neutral. BMC Genomics 2020;21:359. [PMID: 32404186 PMCID: PMC7222330 DOI: 10.1186/s12864-020-6752-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 04/21/2020] [Indexed: 01/09/2023] Open

Wang J, Wang Y, Duan Z, Hu W. Hypoxia‐induced alterations of transcriptome and chromatin accessibility in HL ‐1 cells. IUBMB Life 2020;72:1737-1746. [PMID: 32351020 DOI: 10.1002/iub.2297] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 04/13/2020] [Accepted: 04/13/2020] [Indexed: 12/22/2022]

Robinson EK, Covarrubias S, Carpenter S. The how and why of lncRNA function: An innate immune perspective. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2020;1863:194419. [PMID: 31487549 PMCID: PMC7185634 DOI: 10.1016/j.bbagrm.2019.194419] [Citation(s) in RCA: 181] [Impact Index Per Article: 45.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Accepted: 08/21/2019] [Indexed: 02/06/2023]

iDHS-DSAMS: Identifying DNase I hypersensitive sites based on the dinucleotide property matrix and ensemble bagged tree. Genomics 2020;112:1282-1289. [DOI: 10.1016/j.ygeno.2019.07.017] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2019] [Revised: 07/14/2019] [Accepted: 07/30/2019] [Indexed: 11/21/2022]

Guo Y, Zhou D, Nie R, Ruan X, Li W. DeepANF: A deep attentive neural framework with distributed representation for chromatin accessibility prediction. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2019.10.091] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]

Wong D, Turner AW, Miller CL. Genetic Insights Into Smooth Muscle Cell Contributions to Coronary Artery Disease. Arterioscler Thromb Vasc Biol 2020;39:1006-1017. [PMID: 31043074 DOI: 10.1161/atvbaha.119.312141] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Tao X, Feng S, Zhao T, Guan X. Efficient chromatin profiling of H3K4me3 modification in cotton using CUT&Tag. PLANT METHODS 2020;16:120. [PMID: 32884577 PMCID: PMC7460760 DOI: 10.1186/s13007-020-00664-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 08/25/2020] [Indexed: 05/02/2023]

Klein DC, Hainer SJ. Genomic methods in profiling DNA accessibility and factor localization. Chromosome Res 2019;28:69-85. [PMID: 31776829 PMCID: PMC7125251 DOI: 10.1007/s10577-019-09619-9] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 10/10/2019] [Accepted: 10/15/2019] [Indexed: 12/24/2022]

Zhu C, Yu M, Huang H, Juric I, Abnousi A, Hu R, Lucero J, Behrens MM, Hu M, Ren B. An ultra high-throughput method for single-cell joint analysis of open chromatin and transcriptome. Nat Struct Mol Biol 2019;26:1063-1070. [PMID: 31695190 DOI: 10.1038/s41594-019-0323-x] [Citation(s) in RCA: 178] [Impact Index Per Article: 35.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 09/30/2019] [Indexed: 12/21/2022]

Zhou W, Ji Z, Fang W, Ji H. Global prediction of chromatin accessibility using small-cell-number and single-cell RNA-seq. Nucleic Acids Res 2019;47:e121. [PMID: 31428792 PMCID: PMC6821224 DOI: 10.1093/nar/gkz716] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Revised: 07/20/2019] [Accepted: 08/11/2019] [Indexed: 11/13/2022] Open

Identifying DNase I hypersensitive sites using multi-features fusion and F-score features selection via Chou's 5-steps rule. Biophys Chem 2019;253:106227. [DOI: 10.1016/j.bpc.2019.106227] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Revised: 07/04/2019] [Accepted: 07/10/2019] [Indexed: 01/12/2023]

Fosslie M, Manaf A, Lerdrup M, Hansen K, Gilfillan GD, Dahl JA. Going low to reach high: Small-scale ChIP-seq maps new terrain. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2019;12:e1465. [PMID: 31478357 DOI: 10.1002/wsbm.1465] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Revised: 07/02/2019] [Accepted: 07/25/2019] [Indexed: 12/20/2022]

Deng C, Naler LB, Lu C. Microfluidic epigenomic mapping technologies for precision medicine. LAB ON A CHIP 2019;19:2630-2650. [PMID: 31338502 PMCID: PMC6697104 DOI: 10.1039/c9lc00407f] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Quang D, Xie X. FactorNet: A deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data. Methods 2019;166:40-47. [PMID: 30922998 PMCID: PMC6708499 DOI: 10.1016/j.ymeth.2019.03.020] [Citation(s) in RCA: 87] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Revised: 03/05/2019] [Accepted: 03/20/2019] [Indexed: 01/08/2023] Open

Garcia-Alonso L, Holland CH, Ibrahim MM, Turei D, Saez-Rodriguez J. Benchmark and integration of resources for the estimation of human transcription factor activities. Genome Res 2019;29:1363-1375. [PMID: 31340985 PMCID: PMC6673718 DOI: 10.1101/gr.240663.118] [Citation(s) in RCA: 435] [Impact Index Per Article: 87.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Accepted: 05/28/2019] [Indexed: 12/25/2022]

Chen Y, Chen A. Unveiling the gene regulatory landscape in diseases through the identification of DNase I-hypersensitive sites. Biomed Rep 2019;11:87-97. [PMID: 31423302 PMCID: PMC6684942 DOI: 10.3892/br.2019.1233] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Accepted: 07/03/2019] [Indexed: 01/18/2023] Open

Benton ML, Talipineni SC, Kostka D, Capra JA. Genome-wide enhancer annotations differ significantly in genomic distribution, evolution, and function. BMC Genomics 2019;20:511. [PMID: 31221079 PMCID: PMC6585034 DOI: 10.1186/s12864-019-5779-x] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Accepted: 05/07/2019] [Indexed: 12/28/2022] Open

Abstract

Background

Non-coding gene regulatory enhancers are essential to transcription in mammalian cells. As a result, a large variety of experimental and computational strategies have been developed to identify cis-regulatory enhancer sequences. Given the differences in the biological signals assayed, some variation in the enhancers identified by different methods is expected; however, the concordance of enhancers identified by different methods has not been comprehensively evaluated. This is critically needed, since in practice, most studies consider enhancers identified by only a single method. Here, we compare enhancer sets from eleven representative strategies in four biological contexts.

Results

All sets we evaluated overlap significantly more than expected by chance; however, there is significant dissimilarity in their genomic, evolutionary, and functional characteristics, both at the element and base-pair level, within each context. The disagreement is sufficient to influence interpretation of candidate SNPs from GWAS studies, and to lead to disparate conclusions about enhancer and disease mechanisms. Most regions identified as enhancers are supported by only one method, and we find limited evidence that regions identified by multiple methods are better candidates than those identified by a single method. As a result, we cannot recommend the use of any single enhancer identification strategy in all settings.

Conclusions

Our results highlight the inherent complexity of enhancer biology and identify an important challenge to mapping the genetic architecture of complex disease. Greater appreciation of how the diverse enhancer identification strategies in use today relate to the dynamic activity of gene regulatory regions is needed to enable robust and reproducible results.

Electronic supplementary material

The online version of this article (10.1186/s12864-019-5779-x) contains supplementary material, which is available to authorized users.

Collapse

Zeng Z, Zhang W, Marand AP, Zhu B, Buell CR, Jiang J. Cold stress induces enhanced chromatin accessibility and bivalent histone modifications H3K4me3 and H3K27me3 of active genes in potato. Genome Biol 2019;20:123. [PMID: 31208436 PMCID: PMC6580510 DOI: 10.1186/s13059-019-1731-2] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Accepted: 06/05/2019] [Indexed: 02/08/2023] Open

Osgood JA, Knight JC. Translating GWAS in rheumatic disease: approaches to establishing mechanism and function for genetic associations with ankylosing spondylitis. Brief Funct Genomics 2019;17:308-318. [PMID: 29741584 PMCID: PMC6158798 DOI: 10.1093/bfgp/ely015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

Liang Y, Zhang S. iDHS-DMCAC: identifying DNase I hypersensitive sites with balanced dinucleotide-based detrending moving-average cross-correlation coefficient. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2019;30:429-445. [PMID: 31117818 DOI: 10.1080/1062936x.2019.1615546] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]

Zehnder T, Benner P, Vingron M. Predicting enhancers in mammalian genomes using supervised hidden Markov models. BMC Bioinformatics 2019;20:157. [PMID: 30917778 PMCID: PMC6437899 DOI: 10.1186/s12859-019-2708-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Accepted: 02/27/2019] [Indexed: 12/24/2022] Open

Abstract

BACKGROUND

Eukaryotic gene regulation is a complex process comprising the dynamic interaction of enhancers and promoters in order to activate gene expression. In recent years, research in regulatory genomics has contributed to a better understanding of the characteristics of promoter elements and for most sequenced model organism genomes there exist comprehensive and reliable promoter annotations. For enhancers, however, a reliable description of their characteristics and location has so far proven to be elusive. With the development of high-throughput methods such as ChIP-seq, large amounts of data about epigenetic conditions have become available, and many existing methods use the information on chromatin accessibility or histone modifications to train classifiers in order to segment the genome into functional groups such as enhancers and promoters. However, these methods often do not consider prior biological knowledge about enhancers such as their diverse lengths or molecular structure.

RESULTS

We developed enhancer HMM (eHMM), a supervised hidden Markov model designed to learn the molecular structure of promoters and enhancers. Both consist of a central stretch of accessible DNA flanked by nucleosomes with distinct histone modification patterns. We evaluated the performance of eHMM within and across cell types and developmental stages and found that eHMM successfully predicts enhancers with high precision and recall comparable to state-of-the-art methods, and consistently outperforms those in terms of accuracy and resolution.

CONCLUSIONS

eHMM predicts active enhancers based on data from chromatin accessibility assays and a minimal set of histone modification ChIP-seq experiments. In comparison to other 'black box' methods its parameters are easy to interpret. eHMM can be used as a stand-alone tool for enhancer prediction without the need for additional training or a tuning of parameters. The high spatial precision of enhancer predictions gives valuable targets for potential knockout experiments or downstream analyses such as motif search.

Collapse

Li Z, Schulz MH, Look T, Begemann M, Zenke M, Costa IG. Identification of transcription factor binding sites using ATAC-seq. Genome Biol 2019;20:45. [PMID: 30808370 PMCID: PMC6391789 DOI: 10.1186/s13059-019-1642-2] [Citation(s) in RCA: 233] [Impact Index Per Article: 46.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2018] [Accepted: 01/25/2019] [Indexed: 01/07/2023] Open

Li H, Quang D, Guan Y. Anchor: trans-cell type prediction of transcription factor binding sites. Genome Res 2019;29:281-292. [PMID: 30567711 PMCID: PMC6360811 DOI: 10.1101/gr.237156.118] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 12/13/2018] [Indexed: 12/16/2022]

Cremer M, Cremer T. Nuclear compartmentalization, dynamics, and function of regulatory DNA sequences. Genes Chromosomes Cancer 2019;58:427-436. [DOI: 10.1002/gcc.22714] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2018] [Revised: 11/23/2018] [Accepted: 11/27/2018] [Indexed: 12/15/2022] Open

Chowdhury IH, Narra HP, Sahni A, Khanipov K, Fofanov Y, Sahni SK. Enhancer Associated Long Non-coding RNA Transcription and Gene Regulation in Experimental Models of Rickettsial Infection. Front Immunol 2019;9:3014. [PMID: 30687302 PMCID: PMC6333757 DOI: 10.3389/fimmu.2018.03014] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2018] [Accepted: 12/05/2018] [Indexed: 12/20/2022] Open

Abstract

Recent discovery that much of the mammalian genome does not encode protein-coding genes (PCGs) has brought widespread attention to long noncoding RNAs (lncRNAs) as a novel layer of biological regulation. Enhancer lnc (elnc) RNAs from the enhancer regions of the genome carry the capacity to regulate PCGs in cis or in trans. Spotted fever rickettsioses represent the consequence of host infection with Gram-negative, obligate intracellular bacteria in the Genus Rickettsia. Despite being implicated in the pathways of infection and inflammation, the roles of lncRNAs in host response to Rickettsia species have remained a mystery. We have profiled the expression of host lncRNAs during infection of susceptible mice with R. conorii as a model closely mimicking the pathogenesis of human spotted fever rickettsioses. RNA sequencing on the lungs of infected hosts yielded reads mapping to 74,964 non-coding RNAs, 206 and 277 of which were determined to be significantly up- and down-regulated, respectively, in comparison to uninfected controls. Following removal of short non-coding RNAs and ambiguous transcripts, remaining transcripts underwent in-depth analysis of mouse lung epigenetic signatures H3K4Me1 and H3K4Me3, active transcript markers (POLR2A, p300, CTCF), and DNaseI hypersensitivity sites to identify two potentially active and highly up-regulated elncRNAs NONMMUT013718 and NONMMUT024103. Using Hi-3C sequencing resource, we further determined that genomic loci of NONMMUT013718 and NONMMUT024103 might interact with and regulate the expression of nearby PCGs, namely Id2 (inhibitor of DNA binding 2) and Apol10b (apolipoprotein 10b), respectively. Heterologous reporter assays confirmed the activity of elncRNAs as the inducers of their predicted PCGs. In the lungs of infected mice, expression of both elncRNAs and their targets was significantly higher than mock-infected controls. Induced expression of NONMMUT013718/Id2 in murine macrophages and NONMMUT024103/Apol10b in endothelial cells was also clearly evident during R. conorii infection in vitro. Finally, shRNA mediated knock-down of NONMMUT013718 and NONMMUT024103 elncRNAs resulted in reduced expression of endogenous Id2 and Apl10b, demonstrating the regulatory roles of these elncRNAs on their target PCGs. Our results provide very first experimental evidence suggesting altered expression of pulmonary lncRNAs and elncRNA-mediated regulation of PCGs involved in immunity and during host interactions with pathogenic rickettsiae.

Collapse

Gündert M, Edelmann D, Benner A, Jansen L, Jia M, Walter V, Knebel P, Herpel E, Chang-Claude J, Hoffmeister M, Brenner H, Burwinkel B. Genome-wide DNA methylation analysis reveals a prognostic classifier for non-metastatic colorectal cancer (ProMCol classifier). Gut 2019;68:101-110. [PMID: 29101262 DOI: 10.1136/gutjnl-2017-314711] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/22/2017] [Revised: 09/21/2017] [Accepted: 09/30/2017] [Indexed: 12/13/2022]

Affiliation(s)

Melanie Gündert Division of Molecular Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.,Molecular Biology of Breast Cancer, Department of Gynecology and Obstetrics, University of Heidelberg, Heidelberg, Germany
Dominic Edelmann Division of Biostatistics, German Cancer Research Center (DKFZ), Heidelberg, Germany
Axel Benner Division of Biostatistics, German Cancer Research Center (DKFZ), Heidelberg, Germany
Lina Jansen Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
Min Jia Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
Viola Walter Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
Phillip Knebel Department of General, Visceral and Transplantation Surgery, University of Heidelberg, Heidelberg, Germany
Esther Herpel Department of General Pathology, Institute of Pathology, University of Heidelberg, Heidelberg, Germany.,NCT Tissue Bank, National Center for Tumor Diseases (NCT), Heidelberg, Germany
Jenny Chang-Claude Division of Cancer Epidemiology, Unit of Genetic Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.,Genetic Tumour Epidemiology Group, University Cancer Center Hamburg (UCCH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany
Michael Hoffmeister Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
Hermann Brenner Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany.,Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg, Germany.,German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
Barbara Burwinkel Division of Molecular Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.,Molecular Biology of Breast Cancer, Department of Gynecology and Obstetrics, University of Heidelberg, Heidelberg, Germany

Collapse

Lyu C, Wang L, Zhang J. Deep learning for DNase I hypersensitive sites identification. BMC Genomics 2018;19:905. [PMID: 30598079 PMCID: PMC6311923 DOI: 10.1186/s12864-018-5283-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open

Shashikant T, Ettensohn CA. Genome-wide analysis of chromatin accessibility using ATAC-seq. Methods Cell Biol 2018;151:219-235. [PMID: 30948010 PMCID: PMC7259819 DOI: 10.1016/bs.mcb.2018.11.002] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Oki S, Ohta T, Shioi G, Hatanaka H, Ogasawara O, Okuda Y, Kawaji H, Nakaki R, Sese J, Meno C. ChIP-Atlas: a data-mining suite powered by full integration of public ChIP-seq data. EMBO Rep 2018;19:e46255. [PMID: 30413482 PMCID: PMC6280645 DOI: 10.15252/embr.201846255] [Citation(s) in RCA: 427] [Impact Index Per Article: 71.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2018] [Revised: 10/03/2018] [Accepted: 10/12/2018] [Indexed: 01/21/2023] Open

Huang Z, Du G, Huang X, Han L, Han X, Xu B, Zhang Y, Yu M, Qin Y, Xia Y, Wang X, Lu C. The enhancer RNA lnc-SLC4A1-1 epigenetically regulates unexplained recurrent pregnancy loss (URPL) by activating CXCL8 and NF-kB pathway. EBioMedicine 2018;38:162-170. [PMID: 30448228 PMCID: PMC6306333 DOI: 10.1016/j.ebiom.2018.11.015] [Citation(s) in RCA: 91] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 11/01/2018] [Accepted: 11/08/2018] [Indexed: 12/16/2022] Open

Affiliation(s)

Zhenyao Huang State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China
Guizhen Du State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China
Xiaomin Huang State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China
Li Han Department of Obstetrics, Huai-An First Affiliated Hospital, Nanjing Medical University, Nanjing 210029, China
Xiumei Han State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China
Bo Xu State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China
Yan Zhang State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China
Mingming Yu State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China
Yufeng Qin Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
Yankai Xia State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China
Xinru Wang State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China
Chuncheng Lu State Key Laboratory of Reproductive Medicine, Institute of Toxicology, Nanjing Medical University, Nanjing 210029, China; Key Laboratory of Modern Toxicology of Ministry of Education, School of Public Health, Nanjing Medical University, Nanjing 210029, China.

Collapse

Rahman MS, Aktar U, Jani MR, Shatabda S. iPro70-FMWin: identifying Sigma70 promoters using multiple windowing and minimal features. Mol Genet Genomics 2018;294:69-84. [PMID: 30187132 DOI: 10.1007/s00438-018-1487-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 08/29/2018] [Indexed: 01/16/2023]

Blighe K, DeDionisio L, Christie KA, Chawes B, Shareef S, Kakouli-Duarte T, Chao-Shern C, Harding V, Kelly RS, Castellano L, Stebbing J, Lasky-Su JA, Nesbit MA, Moore CBT. Gene editing in the context of an increasingly complex genome. BMC Genomics 2018;19:595. [PMID: 30086710 PMCID: PMC6081867 DOI: 10.1186/s12864-018-4963-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Accepted: 07/26/2018] [Indexed: 12/15/2022] Open

Abstract

The reporting of the first draft of the human genome in 2000 brought with it much hope for the future in what was felt as a paradigm shift toward improved health outcomes. Indeed, we have now mapped the majority of variation across human populations with landmark projects such as 1000 Genomes; in cancer, we have catalogued mutations across the primary carcinomas; whilst, for other diseases, we have identified the genetic variants with strongest association. Despite this, we are still awaiting the genetic revolution in healthcare to materialise and translate itself into the health benefits for which we had hoped. A major problem we face relates to our underestimation of the complexity of the genome, and that of biological mechanisms, generally. Fixation on DNA sequence alone and a 'rigid' mode of thinking about the genome has meant that the folding and structure of the DNA molecule -and how these relate to regulation- have been underappreciated. Projects like ENCODE have additionally taught us that regulation at the level of RNA is just as important as that at the spatiotemporal level of chromatin.In this review, we chart the course of the major advances in the biomedical sciences in the era pre- and post the release of the first draft sequence of the human genome, taking a focus on technology and how its development has influenced these. We additionally focus on gene editing via CRISPR/Cas9 as a key technique, in particular its use in the context of complex biological mechanisms. Our aim is to shift the mode of thinking about the genome to that which encompasses a greater appreciation of the folding of the DNA molecule, DNA- RNA/protein interactions, and how these regulate expression and elaborate disease mechanisms.Through the composition of our work, we recognise that technological improvement is conducive to a greater understanding of biological processes and life within the cell. We believe we now have the technology at our disposal that permits a better understanding of disease mechanisms, achievable through integrative data analyses. Finally, only with greater understanding of disease mechanisms can techniques such as gene editing be faithfully conducted.

Collapse

Affiliation(s)

K Blighe Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, 181 Longwood Avenue, Boston, MA, USA. Department of Cancer Studies and Molecular Medicine, Robert Kilpatrick Clinical Sciences Building, Leicester Royal Infirmary, Leicester, LE2 7LX, UK. Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, WC1E 6DD, London, UK.
L DeDionisio Avellino Laboratories, Menlo Park, CA, 94025, USA
K A Christie Biomedical Sciences Research Institute, University of Ulster, Coleraine, Northern Ireland, BT52 1SA, UK
B Chawes COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark
S Shareef University of Raparin, Ranya, Kurdistan Region, Iraq
T Kakouli-Duarte Institute of Technology Carlow, Department of Science and Health, Kilkenny Road, Carlow, Ireland
C Chao-Shern Biomedical Sciences Research Institute, University of Ulster, Coleraine, Northern Ireland, BT52 1SA, UK Avellino Laboratories, Menlo Park, CA, 94025, USA
V Harding Imperial College London, Division of Cancer, Department of Surgery and Cancer, Hammersmith Hospital Campus, Du Cane Road, London, W12 0NN, UK
R S Kelly Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, 181 Longwood Avenue, Boston, MA, USA
L Castellano Imperial College London, Division of Cancer, Department of Surgery and Cancer, Hammersmith Hospital Campus, Du Cane Road, London, W12 0NN, UK JMS Building, School of Life Sciences, University of Sussex, Falmer, Brighton, BN1 9QG, UK
J Stebbing Imperial College London, Division of Cancer, Department of Surgery and Cancer, Hammersmith Hospital Campus, Du Cane Road, London, W12 0NN, UK
J A Lasky-Su Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, 181 Longwood Avenue, Boston, MA, USA
M A Nesbit Biomedical Sciences Research Institute, University of Ulster, Coleraine, Northern Ireland, BT52 1SA, UK
C B T Moore Biomedical Sciences Research Institute, University of Ulster, Coleraine, Northern Ireland, BT52 1SA, UK. Avellino Laboratories, Menlo Park, CA, 94025, USA.

Collapse

Pan-Cancer Analysis Reveals Differential Susceptibility of Bidirectional Gene Promoters to DNA Methylation, Somatic Mutations, and Copy Number Alterations. Int J Mol Sci 2018;19:ijms19082296. [PMID: 30081598 PMCID: PMC6121907 DOI: 10.3390/ijms19082296] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2018] [Revised: 07/26/2018] [Accepted: 08/02/2018] [Indexed: 12/12/2022] Open

Basil P, Li Q, Gui H, Hui TCK, Ling VHM, Wong CCY, Mill J, McAlonan GM, Sham PC. Prenatal immune activation alters the adult neural epigenome but can be partly stabilised by a n-3 polyunsaturated fatty acid diet. Transl Psychiatry 2018;8:125. [PMID: 29967385 PMCID: PMC6028639 DOI: 10.1038/s41398-018-0167-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Revised: 04/01/2018] [Accepted: 04/21/2018] [Indexed: 02/08/2023] Open

Abstract

An unstable epigenome is implicated in the pathophysiology of neurodevelopmental disorders such as schizophrenia and autism. This is important because the epigenome is potentially modifiable. We have previously reported that adult offspring exposed to maternal immune activation (MIA) prenatally have significant global DNA hypomethylation in the hypothalamus. However, what genes had altered methylation state, their functional effects on gene expression and whether these changes can be moderated, have not been addressed. In this study, we used next-generation sequencing (NGS) for methylome profiling in a MIA rodent model of neurodevelopmental disorders. We assessed whether differentially methylated regions (DMRs) affected the chromatin state by mapping known DNase I hypersensitivity sites (DHSs), and selected overlapping genes to confirm a functional effect of MIA on gene expression using qPCR. Finally, we tested whether methylation differences elicited by MIA could be limited by post-natal dietary (omega) n-3 polyunsaturated fatty acid (PUFA) supplementation. These experiments were conducted using hypothalamic brain tissue from 12-week-old offspring of mice injected with viral analogue PolyI:C on gestation day 9 of pregnancy or saline on gestation day 9. Half of the animals from each group were fed a diet enriched with n-3 PUFA from weaning (MIA group, n = 12 units, n = 39 mice; Control group, n = 12 units, n = 38 mice). The results confirmed our previous finding that adult offspring exposed to MIA prenatally had significant global DNA hypomethylation. Furthermore, genes linked to synaptic plasticity were over-represented among differentially methylated genes following MIA. More than 80% of MIA-induced hypomethylated sites, including those affecting chromatin state and MECP2 binding, were stabilised by the n-3 PUFA intervention. MIA resulted in increased expression of two of the 'top five' genes identified from an integrated analysis of DMRs, DHSs and MECP2 binding sites, namely Abat (t = 2.46, p < 0.02) and Gnas9 (t = 2.96, p < 0.01), although these changes were not stabilised by dietary intervention. Thus, prenatal MIA exposure impacts upon the epigenomic regulation of gene pathways linked to neurodevelopmental conditions; and many of the changes can be attenuated by a low-cost dietary intervention.

Collapse

Affiliation(s)

Paul Basil Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong SAR China ,20000 0001 2160 926Xgrid.39382.33Department of Molecular & Cellular Biology, Baylor College of Medicine, Houston, TX 77030 USA
Qi Li Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong SAR China ,3State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Pokfulam, Hong Kong SAR China
Hongsheng Gui Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong SAR China
Tomy C. K. Hui Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong SAR China
Vicki H. M. Ling Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong SAR China
Chloe C. Y. Wong 0000 0001 2322 6764grid.13097.3cMRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, King’s College London, De Crespigny Park, Denmark Hill, London, SE5 8AF UK
Jonathan Mill 0000 0001 2322 6764grid.13097.3cMRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, King’s College London, De Crespigny Park, Denmark Hill, London, SE5 8AF UK ,50000 0004 1936 8024grid.8391.3University of Exeter Medical School, Exeter University, St Luke’s Campus, Magdalen Street, Exeter, EX1 2LU UK
Grainne M. McAlonan Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong SAR China ,60000 0001 2322 6764grid.13097.3cDepartment of Forensic and Neurodevelopmental Sciences, Institute of Psychiatry, King’s College London, De Crespigny Park, Denmark Hill, London, SE5 8AF UK
Pak-Chung Sham Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong SAR, China. .,State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Pokfulam, Hong Kong SAR, China. .,Centre for Genomic Sciences, The University of Hong Kong, Pokfulam, Hong Kong SAR, China.

Collapse

Niu M, Tabari E, Ni P, Su Z. Towards a map of cis-regulatory sequences in the human genome. Nucleic Acids Res 2018;46:5395-5409. [PMID: 29733395 PMCID: PMC6009671 DOI: 10.1093/nar/gky338] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2018] [Revised: 04/14/2018] [Accepted: 04/19/2018] [Indexed: 01/10/2023] Open

Ji Z, Zhou W, Ji H. Single-cell regulome data analysis by SCRAT. Bioinformatics 2018;33:2930-2932. [PMID: 28505247 PMCID: PMC5870556 DOI: 10.1093/bioinformatics/btx315] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Accepted: 05/10/2017] [Indexed: 11/15/2022] Open

Min X, Zeng W, Chen N, Chen T, Jiang R. Chromatin accessibility prediction via convolutional long short-term memory networks with k-mer embedding. Bioinformatics 2018;33:i92-i101. [PMID: 28881969 PMCID: PMC5870572 DOI: 10.1093/bioinformatics/btx234] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open

Abstract

Motivation

Experimental techniques for measuring chromatin accessibility are expensive and time consuming, appealing for the development of computational approaches to predict open chromatin regions from DNA sequences. Along this direction, existing methods fall into two classes: one based on handcrafted k-mer features and the other based on convolutional neural networks. Although both categories have shown good performance in specific applications thus far, there still lacks a comprehensive framework to integrate useful k-mer co-occurrence information with recent advances in deep learning.

Results

We fill this gap by addressing the problem of chromatin accessibility prediction with a convolutional Long Short-Term Memory (LSTM) network with k-mer embedding. We first split DNA sequences into k-mers and pre-train k-mer embedding vectors based on the co-occurrence matrix of k-mers by using an unsupervised representation learning approach. We then construct a supervised deep learning architecture comprised of an embedding layer, three convolutional layers and a Bidirectional LSTM (BLSTM) layer for feature learning and classification. We demonstrate that our method gains high-quality fixed-length features from variable-length sequences and consistently outperforms baseline methods. We show that k-mer embedding can effectively enhance model performance by exploring different embedding strategies. We also prove the efficacy of both the convolution and the BLSTM layers by comparing two variations of the network architecture. We confirm the robustness of our model to hyper-parameters by performing sensitivity analysis. We hope our method can eventually reinforce our understanding of employing deep learning in genomic studies and shed light on research regarding mechanisms of chromatin accessibility.

Availability and implementation

The source code can be downloaded from https://github.com/minxueric/ismb2017_lstm.

Supplementary information

Supplementary materials are available at Bioinformatics online.

Collapse

Koh PW, Pierson E, Kundaje A. Denoising genome-wide histone ChIP-seq with convolutional neural networks. Bioinformatics 2018;33:i225-i233. [PMID: 28881977 PMCID: PMC5870713 DOI: 10.1093/bioinformatics/btx243] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open

Henning AN, Roychoudhuri R, Restifo NP. Epigenetic control of CD8⁺ T cell differentiation. Nat Rev Immunol 2018;18:340-356. [PMID: 29379213 PMCID: PMC6327307 DOI: 10.1038/nri.2017.146] [Citation(s) in RCA: 304] [Impact Index Per Article: 50.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Skene PJ, Henikoff JG, Henikoff S. Targeted in situ genome-wide profiling with high efficiency for low cell numbers. Nat Protoc 2018;13:1006-1019. [PMID: 29651053 DOI: 10.1038/nprot.2018.015] [Citation(s) in RCA: 456] [Impact Index Per Article: 76.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Shashikant T, Khor JM, Ettensohn CA. Global analysis of primary mesenchyme cell cis-regulatory modules by chromatin accessibility profiling. BMC Genomics 2018;19:206. [PMID: 29558892 PMCID: PMC5859501 DOI: 10.1186/s12864-018-4542-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Accepted: 02/13/2018] [Indexed: 12/11/2022] Open

Abstract

Background

The developmental gene regulatory network (GRN) that underlies skeletogenesis in sea urchins and other echinoderms is a paradigm of GRN structure, function, and evolution. This transcriptional network is deployed selectively in skeleton-forming primary mesenchyme cells (PMCs) of the early embryo. To advance our understanding of this model developmental GRN, we used genome-wide chromatin accessibility profiling to identify and characterize PMC cis-regulatory modules (CRMs).

Results

ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing) analysis of purified PMCs provided a global picture of chromatin accessibility in these cells. We used both ATAC-seq and DNase-seq (DNase I hypersensitive site sequencing) to identify > 3000 sites that exhibited increased accessibility in PMCs relative to other embryonic cell lineages, and provide both computational and experimental evidence that a large fraction of these sites represent bona fide skeletogenic CRMs. Putative PMC CRMs were preferentially located near genes differentially expressed by PMCs and consensus binding sites for two key transcription factors in the PMC GRN, Alx1 and Ets1, were enriched in these CRMs. Moreover, a high proportion of candidate CRMs drove reporter gene expression specifically in PMCs in transgenic embryos. Surprisingly, we found that PMC CRMs were partially open in other embryonic lineages and exhibited hyperaccessibility as early as the 128-cell stage.

Conclusions

Our work provides a comprehensive picture of chromatin accessibility in an early embryonic cell lineage. By identifying thousands of candidate PMC CRMs, we significantly enhance the utility of the sea urchin skeletogenic network as a general model of GRN architecture and evolution. Our work also shows that differential chromatin accessibility, which has been used for the high-throughput identification of enhancers in differentiated cell types, is a powerful approach for the identification of CRMs in early embryonic cells. Lastly, we conclude that in the sea urchin embryo, CRMs that control the cell type-specific expression of effector genes are hyperaccessible several hours in advance of gene activation.

Electronic supplementary material

The online version of this article (10.1186/s12864-018-4542-z) contains supplementary material, which is available to authorized users.

Collapse

100

Aughey GN, Estacio Gomez A, Thomson J, Yin H, Southall TD. CATaDa reveals global remodelling of chromatin accessibility during stem cell differentiation in vivo. eLife 2018;7:32341. [PMID: 29481322 PMCID: PMC5826290 DOI: 10.7554/elife.32341] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Accepted: 01/30/2018] [Indexed: 01/09/2023] Open

Abstract

During development eukaryotic gene expression is coordinated by dynamic changes in chromatin structure. Measurements of accessible chromatin are used extensively to identify genomic regulatory elements. Whilst chromatin landscapes of pluripotent stem cells are well characterised, chromatin accessibility changes in the development of somatic lineages are not well defined. Here we show that cell-specific chromatin accessibility data can be produced via ectopic expression of E. coli Dam methylase in vivo, without the requirement for cell-sorting (CATaDa). We have profiled chromatin accessibility in individual cell-types of Drosophila neural and midgut lineages. Functional cell-type-specific enhancers were identified, as well as novel motifs enriched at different stages of development. Finally, we show global changes in the accessibility of chromatin between stem-cells and their differentiated progeny. Our results demonstrate the dynamic nature of chromatin accessibility in somatic tissues during stem cell differentiation and provide a novel approach to understanding gene regulatory mechanisms underlying development.

For an embryo to successfully develop into an adult animal, specific genes must act in different types of cells. Though all the cells have the same genes encoded within their DNA, looking at the way that the DNA is packaged can indicate which parts of the DNA are important for that particular cell type. If regions of DNA are “open” one can infer that those regions are actively involved in gene regulation, whereas “closed” regions are considered less important.

It is currently difficult to determine which parts of the DNA are open within an individual cell type in a complex organ, such as the brain. Existing methods require the cells to be physically isolated from the tissue, which is technically challenging.

To overcome this issue, Aughey et al. have now developed a method that does not require isolation of the cells. The new technique involves using genetic engineering to introduce an enzyme called Dam into specific cell types in living fruit flies. This enzyme adds a chemical label on regions of open DNA, which can then be detected. Aughey et al. tested this technique on various cells of the developing brain and gut, and were able to see differences in the openness of DNA that corresponded to the action of genes that are important in each cell type. The data also contain trends that help to understand the role of open DNA in development. For example, mature cells were shown to overall have less open DNA than the stem cells that divide to generate them.

Aughey et al. hope their new technique will be of use to other researchers working with either fruit flies or mammalian tissues. The knowledge that scientists will gain from identifying how open DNA contributes to gene regulation, in both healthy and diseased tissues, will further our understanding of human development and the biology of diseases such as cancer.

Collapse