Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wasserman WW, Fickett JW. Identification of regulatory regions which confer muscle-specific gene expression. J Mol Biol 1998;278:167-81. [PMID: 9571041 DOI: 10.1006/jmbi.1998.1700] [Citation(s) in RCA: 306] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

For:	Wasserman WW, Fickett JW. Identification of regulatory regions which confer muscle-specific gene expression. J Mol Biol 1998;278:167-81. [PMID: 9571041 DOI: 10.1006/jmbi.1998.1700] [Citation(s) in RCA: 306] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Number

Cited by Other Article(s)

Novakovsky G, Fornes O, Saraswat M, Mostafavi S, Wasserman WW. ExplaiNN: interpretable and transparent neural networks for genomics. Genome Biol 2023;24:154. [PMID: 37370113 DOI: 10.1186/s13059-023-02985-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Accepted: 06/12/2023] [Indexed: 06/29/2023] Open

Wu H, Liu M, Zhang P, Zhang H. iEnhancer-SKNN: a stacking ensemble learning-based method for enhancer identification and classification using sequence information. Brief Funct Genomics 2023;22:302-311. [PMID: 36715222 DOI: 10.1093/bfgp/elac057] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 12/01/2022] [Accepted: 12/13/2022] [Indexed: 01/31/2023] Open

Abstract

Enhancers, a class of distal cis-regulatory elements located in the non-coding region of DNA, play a key role in gene regulation. It is difficult to identify enhancers from DNA sequence data because enhancers are freely distributed in the non-coding region, with no specific sequence features, and having a long distance with the targeted promoters. Therefore, this study presents a stacking ensemble learning method to accurately identify enhancers and classify enhancers into strong and weak enhancers. Firstly, we obtain the fusion feature matrix by fusing the four features of Kmer, PseDNC, PCPseDNC and Z-Curve9. Secondly, five K-Nearest Neighbor (KNN) models with different parameters are trained as the base model, and the Logistic Regression algorithm is utilized as the meta-model. Thirdly, the stacking ensemble learning strategy is utilized to construct a two-layer model based on the base model and meta-model to train the preprocessed feature sets. The proposed method, named iEnhancer-SKNN, is a two-layer prediction model, in which the function of the first layer is to predict whether the given DNA sequences are enhancers or non-enhancers, and the function of the second layer is to distinguish whether the predicted enhancers are strong enhancers or weak enhancers. The performance of iEnhancer-SKNN is evaluated on the independent testing dataset and the results show that the proposed method has better performance in predicting enhancers and their strength. In enhancer identification, iEnhancer-SKNN achieves an accuracy of 81.75%, an improvement of 1.35% to 8.75% compared with other predictors, and in enhancer classification, iEnhancer-SKNN achieves an accuracy of 80.50%, an improvement of 5.5% to 25.5% compared with other predictors. Moreover, we identify key transcription factor binding site motifs in the enhancer regions and further explore the biological functions of the enhancers and these key motifs. Source code and data can be downloaded from https://github.com/HaoWuLab-Bioinformatics/iEnhancer-SKNN.

Collapse

Liao M, Zhao JP, Tian J, Zheng CH. iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework. BMC Bioinformatics 2022;23:480. [PMCID: PMC9664816 DOI: 10.1186/s12859-022-05033-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 11/02/2022] [Indexed: 11/16/2022] Open

Nair SJ, Suter T, Wang S, Yang L, Yang F, Rosenfeld MG. Transcriptional enhancers at 40: evolution of a viral DNA element to nuclear architectural structures. Trends Genet 2022;38:1019-1047. [PMID: 35811173 PMCID: PMC9474616 DOI: 10.1016/j.tig.2022.05.015] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 05/05/2022] [Accepted: 05/31/2022] [Indexed: 02/08/2023]

Zhang WM, Cheng XZ, Fang D, Cao J. AT-HOOK MOTIF NUCLEAR LOCALIZED (AHL) proteins of ancient origin radiate new functions. Int J Biol Macromol 2022;214:290-300. [PMID: 35716788 DOI: 10.1016/j.ijbiomac.2022.06.100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 04/11/2022] [Accepted: 06/12/2022] [Indexed: 11/05/2022]

Qian Y, Zhang Y, Zhang J. Alignment-Free Sequence Comparison With Multiple k Values. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:1841-1849. [PMID: 31765317 DOI: 10.1109/tcbb.2019.2955081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

Ullah F, Ben-Hur A. A self-attention model for inferring cooperativity between regulatory features. Nucleic Acids Res 2021;49:e77. [PMID: 33950192 PMCID: PMC8287919 DOI: 10.1093/nar/gkab349] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 04/15/2021] [Accepted: 04/20/2021] [Indexed: 11/14/2022] Open

Chen G, Yin Y, Lin Z, Wen H, Chen J, Luo W. Transcriptome profile analysis reveals KLHL30 as an essential regulator for myoblast differentiation. Biochem Biophys Res Commun 2021;559:84-91. [PMID: 33933993 DOI: 10.1016/j.bbrc.2021.04.086] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2021] [Accepted: 04/20/2021] [Indexed: 11/29/2022]

Affiliation(s)

Genghua Chen Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642, Guangdong Province, China; Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affair, South China Agricultural University, Guangzhou, 510642, China
Yunqian Yin Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642, Guangdong Province, China; Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affair, South China Agricultural University, Guangzhou, 510642, China
Zetong Lin Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642, Guangdong Province, China; Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affair, South China Agricultural University, Guangzhou, 510642, China
Huaqiang Wen Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642, Guangdong Province, China; Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affair, South China Agricultural University, Guangzhou, 510642, China
Jiahui Chen Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642, Guangdong Province, China; Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affair, South China Agricultural University, Guangzhou, 510642, China
Wen Luo Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642, Guangdong Province, China; Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affair, South China Agricultural University, Guangzhou, 510642, China.

Collapse

Tobias IC, Abatti LE, Moorthy SD, Mullany S, Taylor T, Khader N, Filice MA, Mitchell JA. Transcriptional enhancers: from prediction to functional assessment on a genome-wide scale. Genome 2020;64:426-448. [PMID: 32961076 DOI: 10.1139/gen-2020-0104] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

A New Algorithm for Identifying Cis-Regulatory Modules Based on Hidden Markov Model. BIOMED RESEARCH INTERNATIONAL 2018;2017:6274513. [PMID: 28497059 PMCID: PMC5405574 DOI: 10.1155/2017/6274513] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/23/2016] [Revised: 03/06/2017] [Accepted: 03/23/2017] [Indexed: 11/24/2022]

Herman-Izycka J, Wlasnowolski M, Wilczynski B. Taking promoters out of enhancers in sequence based predictions of tissue-specific mammalian enhancers. BMC Med Genomics 2017;10:34. [PMID: 28589862 PMCID: PMC5461523 DOI: 10.1186/s12920-017-0264-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Li L, Wunderlich Z. An Enhancer's Length and Composition Are Shaped by Its Regulatory Task. Front Genet 2017;8:63. [PMID: 28588608 PMCID: PMC5440464 DOI: 10.3389/fgene.2017.00063] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Accepted: 05/08/2017] [Indexed: 12/02/2022] Open

Wilczynski B, Tiuryn J. FastBill: An Improved Tool for Prediction of Cis-Regulatory Modules. J Comput Biol 2016;24:193-199. [PMID: 27710048 DOI: 10.1089/cmb.2016.0108] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open

Guo H, Huo H, Yu Q. SMCis: An Effective Algorithm for Discovery of Cis-Regulatory Modules. PLoS One 2016;11:e0162968. [PMID: 27637070 PMCID: PMC5026350 DOI: 10.1371/journal.pone.0162968] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2016] [Accepted: 08/31/2016] [Indexed: 12/02/2022] Open

Santolini M, Sakakibara I, Gauthier M, Ribas-Aulinas F, Takahashi H, Sawasaki T, Mouly V, Concordet JP, Defossez PA, Hakim V, Maire P. MyoD reprogramming requires Six1 and Six4 homeoproteins: genome-wide cis-regulatory module analysis. Nucleic Acids Res 2016;44:8621-8640. [PMID: 27302134 PMCID: PMC5062961 DOI: 10.1093/nar/gkw512] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 05/26/2016] [Indexed: 11/12/2022] Open

Affiliation(s)

Marc Santolini Institut Cochin, Université Paris-Descartes, Centre National de la Recherche Scientifique (CNRS), UMR 8104, Paris, France Institut National de la Santé et de la Recherche Médicale (INSERM) U1016, Paris, France Ecole Normale Supérieure, CNRS, Laboratoire de Physique Statistique, PSL Research University, Université Pierre-et-Marie Curie, Paris, France
Iori Sakakibara Institut Cochin, Université Paris-Descartes, Centre National de la Recherche Scientifique (CNRS), UMR 8104, Paris, France Institut National de la Santé et de la Recherche Médicale (INSERM) U1016, Paris, France Division of Integrative Pathophysiology, Proteo-Science Center, Graduate School of Medicine, Ehime University, Ehime, Japan
Morgane Gauthier Institut Cochin, Université Paris-Descartes, Centre National de la Recherche Scientifique (CNRS), UMR 8104, Paris, France Institut National de la Santé et de la Recherche Médicale (INSERM) U1016, Paris, France
Francesc Ribas-Aulinas Institut Cochin, Université Paris-Descartes, Centre National de la Recherche Scientifique (CNRS), UMR 8104, Paris, France Institut National de la Santé et de la Recherche Médicale (INSERM) U1016, Paris, France
Hirotaka Takahashi Proteo-Science Center, Ehime University, Ehime 791-8577, Japan
Tatsuya Sawasaki Proteo-Science Center, Ehime University, Ehime 791-8577, Japan
Vincent Mouly Sorbonne Universités, UPMC Univ Paris 06, INSERM UMRS974, CNRS FRE3617, Center for Research in Myology, 75013 Paris, France
Jean-Paul Concordet Institut Cochin, Université Paris-Descartes, Centre National de la Recherche Scientifique (CNRS), UMR 8104, Paris, France Institut National de la Santé et de la Recherche Médicale (INSERM) U1016, Paris, France
Pierre-Antoine Defossez University Paris Diderot, Sorbonne Paris Cité, UMR 7216 CNRS, 75013 Paris, France
Vincent Hakim Ecole Normale Supérieure, CNRS, Laboratoire de Physique Statistique, PSL Research University, Université Pierre-et-Marie Curie, Paris, France
Pascal Maire Institut Cochin, Université Paris-Descartes, Centre National de la Recherche Scientifique (CNRS), UMR 8104, Paris, France Institut National de la Santé et de la Recherche Médicale (INSERM) U1016, Paris, France

Collapse

Murakawa Y, Yoshihara M, Kawaji H, Nishikawa M, Zayed H, Suzuki H, FANTOM Consortium, Hayashizaki Y. Enhanced Identification of Transcriptional Enhancers Provides Mechanistic Insights into Diseases. Trends Genet 2016;32:76-88. [DOI: 10.1016/j.tig.2015.11.004] [Citation(s) in RCA: 73] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Revised: 11/25/2015] [Accepted: 11/30/2015] [Indexed: 12/24/2022]

McGettigan PA, Browne JA, Carrington SD, Crowe MA, Fair T, Forde N, Loftus BJ, Lohan A, Lonergan P, Pluta K, Mamo S, Murphy A, Roche J, Walsh SW, Creevey CJ, Earley B, Keady S, Kenny DA, Matthews D, McCabe M, Morris D, O'Loughlin A, Waters S, Diskin MG, Evans ACO. Fertility and genomics: comparison of gene expression in contrasting reproductive tissues of female cattle. Reprod Fertil Dev 2016;28:11-24. [DOI: 10.1071/rd15354] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Grant CE, Johnson J, Bailey TL, Noble WS. MCAST: scanning for cis-regulatory motif clusters. Bioinformatics 2015;32:1217-9. [PMID: 26704599 DOI: 10.1093/bioinformatics/btv750] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2015] [Accepted: 12/15/2015] [Indexed: 11/13/2022] Open

Payne JL, Wagner A. Mechanisms of mutational robustness in transcriptional regulation. Front Genet 2015;6:322. [PMID: 26579194 PMCID: PMC4621482 DOI: 10.3389/fgene.2015.00322] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Accepted: 10/10/2015] [Indexed: 12/17/2022] Open

Leoncini M, Montangero M, Pellegrini M, Tillan KP. CMStalker: A Combinatorial Tool for Composite Motif Discovery. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015;12:1123-1136. [PMID: 26451824 DOI: 10.1109/tcbb.2014.2359444] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Suryamohan K, Halfon MS. Identifying transcriptional cis-regulatory modules in animal genomes. WILEY INTERDISCIPLINARY REVIEWS. DEVELOPMENTAL BIOLOGY 2015;4:59-84. [PMID: 25704908 PMCID: PMC4339228 DOI: 10.1002/wdev.168] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 11/04/2014] [Accepted: 11/16/2014] [Indexed: 11/08/2022]

Abstract

UNLABELLED

Gene expression is regulated through the activity of transcription factors (TFs) and chromatin-modifying proteins acting on specific DNA sequences, referred to as cis-regulatory elements. These include promoters, located at the transcription initiation sites of genes, and a variety of distal cis-regulatory modules (CRMs), the most common of which are transcriptional enhancers. Because regulated gene expression is fundamental to cell differentiation and acquisition of new cell fates, identifying, characterizing, and understanding the mechanisms of action of CRMs is critical for understanding development. CRM discovery has historically been challenging, as CRMs can be located far from the genes they regulate, have few readily identifiable sequence characteristics, and for many years were not amenable to high-throughput discovery methods. However, the recent availability of complete genome sequences and the development of next-generation sequencing methods have led to an explosion of both computational and empirical methods for CRM discovery in model and nonmodel organisms alike. Experimentally, CRMs can be identified through chromatin immunoprecipitation directed against TFs or histone post-translational modifications, identification of nucleosome-depleted 'open' chromatin regions, or sequencing-based high-throughput functional screening. Computational methods include comparative genomics, clustering of known or predicted TF-binding sites, and supervised machine-learning approaches trained on known CRMs. All of these methods have proven effective for CRM discovery, but each has its own considerations and limitations, and each is subject to a greater or lesser number of false-positive identifications. Experimental confirmation of predictions is essential, although shortcomings in current methods suggest that additional means of validation need to be developed. For further resources related to this article, please visit the WIREs website.

CONFLICT OF INTEREST

The authors have declared no conflicts of interest for this article.

Collapse

Starick SR, Ibn-Salem J, Jurk M, Hernandez C, Love MI, Chung HR, Vingron M, Thomas-Chollier M, Meijsing SH. ChIP-exo signal associated with DNA-binding motifs provides insight into the genomic binding of the glucocorticoid receptor and cooperating transcription factors. Genome Res 2015;25:825-35. [PMID: 25720775 PMCID: PMC4448679 DOI: 10.1101/gr.185157.114] [Citation(s) in RCA: 102] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2014] [Accepted: 02/23/2015] [Indexed: 12/22/2022]

Taher L, Narlikar L, Ovcharenko I. Identification and computational analysis of gene regulatory elements. Cold Spring Harb Protoc 2015;2015:pdb.top083642. [PMID: 25561628 PMCID: PMC5885252 DOI: 10.1101/pdb.top083642] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]

A gene regulatory network controls the binary fate decision of rod and bipolar cells in the vertebrate retina. Dev Cell 2014;30:513-27. [PMID: 25155555 PMCID: PMC4304698 DOI: 10.1016/j.devcel.2014.07.018] [Citation(s) in RCA: 134] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2014] [Revised: 06/16/2014] [Accepted: 07/21/2014] [Indexed: 12/12/2022]

Sohn I, Shim J, Hwang C, Kim S, Lee JW. Transcription factor-binding site identification and gene classification via fusion of the supervised-weighted discrete kernel clustering and support vector machine. J Appl Stat 2014. [DOI: 10.1080/02664763.2013.845143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

Comai G, Tajbakhsh S. Molecular and cellular regulation of skeletal myogenesis. Curr Top Dev Biol 2014;110:1-73. [PMID: 25248473 DOI: 10.1016/b978-0-12-405943-6.00001-4] [Citation(s) in RCA: 120] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Jiang P, Singh M. CCAT: Combinatorial Code Analysis Tool for transcriptional regulation. Nucleic Acids Res 2013;42:2833-47. [PMID: 24366875 PMCID: PMC3950699 DOI: 10.1093/nar/gkt1302] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Deyneko IV, Kel AE, Kel-Margoulis OV, Deineko EV, Wingender E, Weiss S. MatrixCatch--a novel tool for the recognition of composite regulatory elements in promoters. BMC Bioinformatics 2013;14:241. [PMID: 23924163 PMCID: PMC3754795 DOI: 10.1186/1471-2105-14-241] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2012] [Accepted: 08/05/2013] [Indexed: 01/28/2023] Open

Abstract

BACKGROUND

Accurate recognition of regulatory elements in promoters is an essential prerequisite for understanding the mechanisms of gene regulation at the level of transcription. Composite regulatory elements represent a particular type of such transcriptional regulatory elements consisting of pairs of individual DNA motifs. In contrast to the present approach, most available recognition techniques are based purely on statistical evaluation of the occurrence of single motifs. Such methods are limited in application, since the accuracy of recognition is greatly dependent on the size and quality of the sequence dataset. Methods that exploit available knowledge and have broad applicability are evidently needed.

RESULTS

We developed a novel method to identify composite regulatory elements in promoters using a library of known examples. In depth investigation of regularities encoded in known composite elements allowed us to introduce a new characteristic measure and to improve the specificity compared with other methods. Tests on an established benchmark and real genomic data show that our method outperforms other available methods based either on known examples or statistical evaluations. In addition to better recognition, a practical advantage of this method is first the ability to detect a high number of different types of composite elements, and second direct biological interpretation of the identified results. The program is available at http://gnaweb.helmholtz-hzi.de/cgi-bin/MCatch/MatrixCatch.pl and includes an option to extend the provided library by user supplied data.

CONCLUSIONS

The novel algorithm for the identification of composite regulatory elements presented in this paper was proved to be superior to existing methods. Its application to tissue specific promoters identified several highly specific composite elements with relevance to their biological function. This approach together with other methods will further advance the understanding of transcriptional regulation of genes.

Collapse

Nandi S, Blais A, Ioshikhes I. Identification of cis-regulatory modules in promoters of human genes exploiting mutual positioning of transcription factors. Nucleic Acids Res 2013;41:8822-41. [PMID: 23913413 PMCID: PMC3799424 DOI: 10.1093/nar/gkt578] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open

Malin J, Aniba MR, Hannenhalli S. Enhancer networks revealed by correlated DNAse hypersensitivity states of enhancers. Nucleic Acids Res 2013;41:6828-38. [PMID: 23700312 PMCID: PMC3737527 DOI: 10.1093/nar/gkt374] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Revised: 03/23/2013] [Accepted: 04/15/2013] [Indexed: 12/14/2022] Open

Loots GG, Bergmann A, Hum NR, Oldenburg CE, Wills AE, Hu N, Ovcharenko I, Harland RM. Interrogating transcriptional regulatory sequences in Tol2-mediated Xenopus transgenics. PLoS One 2013;8:e68548. [PMID: 23874664 PMCID: PMC3713029 DOI: 10.1371/journal.pone.0068548] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2013] [Accepted: 05/30/2013] [Indexed: 12/13/2022] Open

Abstract

Identifying gene regulatory elements and their target genes in vertebrates remains a significant challenge. It is now recognized that transcriptional regulatory sequences are critical in orchestrating dynamic controls of tissue-specific gene expression during vertebrate development and in adult tissues, and that these elements can be positioned at great distances in relation to the promoters of the genes they control. While significant progress has been made in mapping DNA binding regions by combining chromatin immunoprecipitation and next generation sequencing, functional validation remains a limiting step in improving our ability to correlate in silico predictions with biological function. We recently developed a computational method that synergistically combines genome-wide gene-expression profiling, vertebrate genome comparisons, and transcription factor binding-site analysis to predict tissue-specific enhancers in the human genome. We applied this method to 270 genes highly expressed in skeletal muscle and predicted 190 putative cis-regulatory modules. Furthermore, we optimized Tol2 transgenic constructs in Xenopus laevis to interrogate 20 of these elements for their ability to function as skeletal muscle-specific transcriptional enhancers during embryonic development. We found 45% of these elements expressed only in the fast muscle fibers that are oriented in highly organized chevrons in the Xenopus laevis tadpole. Transcription factor binding site analysis identified >2 Mef2/MyoD sites within ∼200 bp regions in 6 of the validated enhancers, and systematic mutagenesis of these sites revealed that they are critical for the enhancer function. The data described herein introduces a new reporter system suitable for interrogating tissue-specific cis-regulatory elements which allows monitoring of enhancer activity in real time, throughout early stages of embryonic development, in Xenopus.

Collapse

Stanley D, Watson-Haigh NS, Cowled CJE, Moore RJ. Genetic architecture of gene expression in the chicken. BMC Genomics 2013;14:13. [PMID: 23324119 PMCID: PMC3575264 DOI: 10.1186/1471-2164-14-13] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2012] [Accepted: 12/26/2012] [Indexed: 12/05/2022] Open

Jiwaji M, Daly R, Gibriel A, Barkess G, McLean P, Yang J, Pansare K, Cumming S, McLauchlan A, Kamola PJ, Bhutta MS, West AG, West KL, Kolch W, Girolami MA, Pitt AR. Unique reporter-based sensor platforms to monitor signalling in cells. PLoS One 2012;7:e50521. [PMID: 23209767 PMCID: PMC3510088 DOI: 10.1371/journal.pone.0050521] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2012] [Accepted: 10/23/2012] [Indexed: 11/30/2022] Open

Affiliation(s)

Meesbah Jiwaji Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom School of Life and Health Science, Aston University, Birmingham, United Kingdom
Rónán Daly School of Computing Science, University of Glasgow, Glasgow, United Kingdom
Abdullah Gibriel Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Gráinne Barkess Institute of Cancer Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Pauline McLean Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Jingli Yang Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Kshama Pansare Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom School of Life and Health Science, Aston University, Birmingham, United Kingdom
Sarah Cumming Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Alisha McLauchlan Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Piotr J. Kamola Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Musab S. Bhutta Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Adam G. West Institute of Cancer Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Katherine L. West Institute of Cancer Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
Walter Kolch Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom Systems Biology Ireland and the Conway Institute, University College Dublin, Dublin, Ireland
Mark A. Girolami School of Computing Science, University of Glasgow, Glasgow, United Kingdom Department of Statistical Science, University College London, London, United Kingdom
Andrew R. Pitt Institute of Molecular, Cell and Systems Biology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom School of Life and Health Science, Aston University, Birmingham, United Kingdom * E-mail:

Collapse

oPOSSUM-3: advanced analysis of regulatory motif over-representation across genes or ChIP-Seq datasets. G3-GENES GENOMES GENETICS 2012;2:987-1002. [PMID: 22973536 PMCID: PMC3429929 DOI: 10.1534/g3.112.003202] [Citation(s) in RCA: 230] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2012] [Accepted: 06/11/2012] [Indexed: 01/12/2023]

Genomic approaches towards finding cis-regulatory modules in animals. Nat Rev Genet 2012;13:469-83. [PMID: 22705667 DOI: 10.1038/nrg3242] [Citation(s) in RCA: 156] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Nikulova AA, Favorov AV, Sutormin RA, Makeev VJ, Mironov AA. CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation. Nucleic Acids Res 2012;40:e93. [PMID: 22422836 PMCID: PMC3384346 DOI: 10.1093/nar/gks235] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Girgis HZ, Ovcharenko I. Predicting tissue specific cis-regulatory modules in the human genome using pairs of co-occurring motifs. BMC Bioinformatics 2012;13:25. [PMID: 22313678 PMCID: PMC3359238 DOI: 10.1186/1471-2105-13-25] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2011] [Accepted: 02/07/2012] [Indexed: 12/26/2022] Open

Abstract

Background

Researchers seeking to unlock the genetic basis of human physiology and diseases have been studying gene transcription regulation. The temporal and spatial patterns of gene expression are controlled by mainly non-coding elements known as cis-regulatory modules (CRMs) and epigenetic factors. CRMs modulating related genes share the regulatory signature which consists of transcription factor (TF) binding sites (TFBSs). Identifying such CRMs is a challenging problem due to the prohibitive number of sequence sets that need to be analyzed.

Results

We formulated the challenge as a supervised classification problem even though experimentally validated CRMs were not required. Our efforts resulted in a software system named CrmMiner. The system mines for CRMs in the vicinity of related genes. CrmMiner requires two sets of sequences: a mixed set and a control set. Sequences in the vicinity of the related genes comprise the mixed set, whereas the control set includes random genomic sequences. CrmMiner assumes that a large percentage of the mixed set is made of background sequences that do not include CRMs. The system identifies pairs of closely located motifs representing vertebrate TFBSs that are enriched in the training mixed set consisting of 50% of the gene loci. In addition, CrmMiner selects a group of the enriched pairs to represent the tissue-specific regulatory signature. The mixed and the control sets are searched for candidate sequences that include any of the selected pairs. Next, an optimal Bayesian classifier is used to distinguish candidates found in the mixed set from their control counterparts. Our study proposes 62 tissue-specific regulatory signatures and putative CRMs for different human tissues and cell types. These signatures consist of assortments of ubiquitously expressed TFs and tissue-specific TFs. Under controlled settings, CrmMiner identified known CRMs in noisy sets up to 1:25 signal-to-noise ratio. CrmMiner was 21-75% more precise than a related CRM predictor. The sensitivity of the system to locate known human heart enhancers reached up to 83%. CrmMiner precision reached 82% while mining for CRMs specific to the human CD4⁺T cells. On several data sets, the system achieved 99% specificity.

Conclusion

These results suggest that CrmMiner predictions are accurate and likely to be tissue-specific CRMs. We expect that the predicted tissue-specific CRMs and the regulatory signatures broaden our knowledge of gene transcription regulation.

Collapse

Jha A, Mehra M, Shankar R. The regulatory epicenter of miRNAs. J Biosci 2012;36:621-38. [PMID: 21857109 DOI: 10.1007/s12038-011-9109-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Abstract

miRNAs are small non-coding RNAs with average length of ~21 bp. miRNA formation seems to be dependent upon multiple factors besides Drosha and Dicer, in a tissue/stage-specific manner, with interplay of several specific binding factors. In the present study, we have investigated transcription factor binding sites in and around the genomic sequences of precursor miRNAs and RNA-binding protein (RBP) sites in miRNA precursor sequences, analysed and tested in comprehensive manner. Here, we report that miRNA precursor regions are positionally enriched for binding of transcription factors as well as RBPs around the 3' end of mature miRNA region in 5' arm. The pattern and distribution of such regulatory sites appears to be a characteristic of precursor miRNA sequences when compared with non-miRNA sequences as negative dataset and tested statistically.When compared with 1 kb upstreamregions, a sudden sharp peak for binding sites arises in the enriched zone near the mature miRNA region. An expression-data-based correlation analysis was performed between such miRNAs and their corresponding transcription factors and RBPs for this region. Some specific groups of binding factors and associated miRNAs were identified. We also identified some of the overrepresented transcription factors and associated miRNAs with high expression correlation values which could be useful in cancer-related studies. The highly correlated groups were found to host experimentally validated composite regulatory modules, in which Lmo2-GATA1 appeared as the predominant one. For many of RBP-miRNAs associations, coexpression similarity was also evident among the associated miRNA common to given RBPs, supporting the Regulon model, suggesting a common role and common control of these miRNAs by the associated RBPs. Based on our findings, we propose that the observed characteristic distribution of regulatory sites in precursor miRNA sequence regions could be critical inmiRNA transcription, processing, stability and formation and are important for therapeutic studies. Our findings also support the recently proposed theory of self-sufficient mode of transcription by miRNAs, which states that miRNA transcription can be carried out in host-independent mode too.

Collapse

Aerts S. Computational strategies for the genome-wide identification of cis-regulatory elements and transcriptional targets. Curr Top Dev Biol 2012;98:121-45. [PMID: 22305161 DOI: 10.1016/b978-0-12-386499-4.00005-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Kwon AT, Chou AY, Arenillas DJ, Wasserman WW. Validation of skeletal muscle cis-regulatory module predictions reveals nucleotide composition bias in functional enhancers. PLoS Comput Biol 2011;7:e1002256. [PMID: 22144875 PMCID: PMC3228787 DOI: 10.1371/journal.pcbi.1002256] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2011] [Accepted: 09/16/2011] [Indexed: 11/19/2022] Open

Abstract

We performed a genome-wide scan for muscle-specific cis-regulatory modules (CRMs) using three computational prediction programs. Based on the predictions, 339 candidate CRMs were tested in cell culture with NIH3T3 fibroblasts and C2C12 myoblasts for capacity to direct selective reporter gene expression to differentiated C2C12 myotubes. A subset of 19 CRMs validated as functional in the assay. The rate of predictive success reveals striking limitations of computational regulatory sequence analysis methods for CRM discovery. Motif-based methods performed no better than predictions based only on sequence conservation. Analysis of the properties of the functional sequences relative to inactive sequences identifies nucleotide sequence composition can be an important characteristic to incorporate in future methods for improved predictive specificity. Muscle-related TFBSs predicted within the functional sequences display greater sequence conservation than non-TFBS flanking regions. Comparison with recent MyoD and histone modification ChIP-Seq data supports the validity of the functional regions.

For efficient identification of genomic sequences responsible for regulating gene expression, a number of computer programs have been developed for automatic annotation of these regulatory regions. We searched for potential regulatory regions responsible for controlling the expression of skeletal muscle-specific genes using these programs, and validated the predictions in a popular cell culture model for muscle. We were able to identify 19 previously uncharacterized regulatory regions for muscle genes. The accuracy of the predictions made by these programs leaves much to be desired, leading us to conclude that other signals in addition to the sequence information will be required to achieve sufficient predictive power for genome annotation. Genomic regions with confirmed regulatory function were compared against non-functional sequences, revealing sequence conservation, composition and chromatin modification properties as important signals in determining regulatory region functionality.

Collapse

Starr MO, Ho MCW, Gunther EJM, Tu YK, Shur AS, Goetz SE, Borok MJ, Kang V, Drewell RA. Molecular dissection of cis-regulatory modules at the Drosophila bithorax complex reveals critical transcription factor signature motifs. Dev Biol 2011;359:290-302. [PMID: 21821017 PMCID: PMC3202680 DOI: 10.1016/j.ydbio.2011.07.028] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2011] [Revised: 07/17/2011] [Accepted: 07/19/2011] [Indexed: 11/17/2022]

Yan R, Boutros PC, Jurisica I. A tree-based approach for motif discovery and sequence classification. Bioinformatics 2011;27:2054-61. [PMID: 21685048 DOI: 10.1093/bioinformatics/btr353] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Fulton DL, Denarier E, Friedman HC, Wasserman WW, Peterson AC. Towards resolving the transcription factor network controlling myelin gene expression. Nucleic Acids Res 2011;39:7974-91. [PMID: 21729871 PMCID: PMC3185407 DOI: 10.1093/nar/gkr326] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open

Hemberg M, Kreiman G. Conservation of transcription factor binding events predicts gene expression across species. Nucleic Acids Res 2011;39:7092-102. [PMID: 21622661 PMCID: PMC3167604 DOI: 10.1093/nar/gkr404] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open

Zhang Z, Zhang MQ. Histone modification profiles are predictive for tissue/cell-type specific expression of both protein-coding and microRNA genes. BMC Bioinformatics 2011;12:155. [PMID: 21569556 PMCID: PMC3120700 DOI: 10.1186/1471-2105-12-155] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2010] [Accepted: 05/14/2011] [Indexed: 02/04/2023] Open

Dojer N, Biecek P, Tiuryn J. Bi-billboard: symmetrization and careful choice of informant species results in higher accuracy of regulatory element prediction. J Comput Biol 2011;18:809-19. [PMID: 21563976 DOI: 10.1089/cmb.2010.0299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Brohée S, Janky R, Abdel-Sater F, Vanderstocken G, André B, van Helden J. Unraveling networks of co-regulated genes on the sole basis of genome sequences. Nucleic Acids Res 2011;39:6340-58. [PMID: 21572103 PMCID: PMC3159452 DOI: 10.1093/nar/gkr264] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Li XY, Thomas S, Sabo PJ, Eisen MB, Stamatoyannopoulos JA, Biggin MD. The role of chromatin accessibility in directing the widespread, overlapping patterns of Drosophila transcription factor binding. Genome Biol 2011;12:R34. [PMID: 21473766 PMCID: PMC3218860 DOI: 10.1186/gb-2011-12-4-r34] [Citation(s) in RCA: 156] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2011] [Accepted: 04/07/2011] [Indexed: 12/11/2022] Open

Kim TM, Park PJ. Advances in analysis of transcriptional regulatory networks. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2011;3:21-35. [PMID: 21069662 DOI: 10.1002/wsbm.105] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]

PCR DNA-array profiling of DNA-binding transcription factor activities in adult mouse tissues. Methods Mol Biol 2011;687:319-31. [PMID: 20967619 DOI: 10.1007/978-1-60761-944-4_23] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/19/2023]