Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Cheng C, Alexander R, Min R, Leng J, Yip KY, Rozowsky J, Yan KK, Dong X, Djebali S, Ruan Y, Davis CA, Carninci P, Lassman T, Gingeras TR, Guigó R, Birney E, Weng Z, Snyder M, Gerstein M. Understanding transcriptional regulation by integrative analysis of transcription factor binding data. Genome Res 2013;22:1658-67. [PMID: 22955978 PMCID: PMC3431483 DOI: 10.1101/gr.136838.111] [Citation(s) in RCA: 138] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

For:	Cheng C, Alexander R, Min R, Leng J, Yip KY, Rozowsky J, Yan KK, Dong X, Djebali S, Ruan Y, Davis CA, Carninci P, Lassman T, Gingeras TR, Guigó R, Birney E, Weng Z, Snyder M, Gerstein M. Understanding transcriptional regulation by integrative analysis of transcription factor binding data. Genome Res 2013;22:1658-67. [PMID: 22955978 PMCID: PMC3431483 DOI: 10.1101/gr.136838.111] [Citation(s) in RCA: 138] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Number

Cited by Other Article(s)

Gonzalez-Avalos E, Onodera A, Samaniego-Castruita D, Rao A, Ay F. Predicting gene expression state and prioritizing putative enhancers using 5hmC signal. Genome Biol 2024;25:142. [PMID: 38825692 PMCID: PMC11145787 DOI: 10.1186/s13059-024-03273-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 05/11/2024] [Indexed: 06/04/2024] Open

Vishnevsky OV, Bocharnikov AV, Ignatieva EV. Peak Scores Significantly Depend on the Relationships between Contextual Signals in ChIP-Seq Peaks. Int J Mol Sci 2024;25:1011. [PMID: 38256085 PMCID: PMC10816497 DOI: 10.3390/ijms25021011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/13/2023] [Accepted: 01/09/2024] [Indexed: 01/24/2024] Open

Neikes HK, Kliza KW, Gräwe C, Wester RA, Jansen PWTC, Lamers LA, Baltissen MP, van Heeringen SJ, Logie C, Teichmann SA, Lindeboom RGH, Vermeulen M. Quantification of absolute transcription factor binding affinities in the native chromatin context using BANC-seq. Nat Biotechnol 2023;41:1801-1809. [PMID: 36973556 DOI: 10.1038/s41587-023-01715-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 02/16/2023] [Indexed: 03/29/2023]

Affiliation(s)

Hannah K Neikes Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, the Netherlands
Katarzyna W Kliza Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, the Netherlands
Cathrin Gräwe Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, the Netherlands
Roelof A Wester Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, the Netherlands
Pascal W T C Jansen Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, the Netherlands
Lieke A Lamers Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, the Netherlands
Marijke P Baltissen Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, the Netherlands
Simon J van Heeringen Department of Molecular Developmental Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Radboud University Nijmegen, Nijmegen, the Netherlands
Colin Logie Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Radboud University Nijmegen, Nijmegen, the Netherlands
Sarah A Teichmann Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
Rik G H Lindeboom Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK. The Netherlands Cancer Institute, Amsterdam, the Netherlands.
Michiel Vermeulen Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, the Netherlands. The Netherlands Cancer Institute, Amsterdam, the Netherlands.

Collapse

Pianfetti E, Lovino M, Ficarra E, Martignetti L. MiREx: mRNA levels prediction from gene sequence and miRNA target knowledge. BMC Bioinformatics 2023;24:443. [PMID: 37993778 PMCID: PMC10666312 DOI: 10.1186/s12859-023-05560-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 11/06/2023] [Indexed: 11/24/2023] Open

Li L, Bao H, Xu Y, Yang W, Zhang Z, Ma K, Zhang K, Zhou J, Gong Y, Ci W, Gong K. Preliminary Study of Whole-Genome Bisulfite Sequencing and Transcriptome Sequencing in VHL Disease-Associated ccRCC. Mol Diagn Ther 2023;27:741-752. [PMID: 37587253 DOI: 10.1007/s40291-023-00663-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/02/2023] [Indexed: 08/18/2023]

Abstract

BACKGROUND

Von Hippel-Lindau (VHL) disease is an autosomal dominant hereditary tumor syndrome with an incidence of approximately 1/36,000. VHL disease-associated clear cell renal cell carcinoma (ccRCC) is the most common congenital RCC. Although recent advances in treating RCC have improved the long-term prognosis of patients with VHL disease, kidney cancer is still the leading cause of death in these patients. Therefore, finding new targets for diagnosing and treating VHL disease-associated ccRCC is still essential.

METHODS

In this study, we collected matched tumor tissues and normal samples from 25 patients with VHL disease-associated ccRCC, diagnosed and surgically treated in the Department of Urology, Peking University First Hospital. After screening, we performed whole genome bisulfite sequencing (WGBS) on 23 pairs of tissues and RNA-seq on 6 pairs of tissues. And we also compared the VHL disease-associated ccRCC transcriptome data with the sporadic ccRCC transcriptome data from the The Cancer Genome Atlas (TCGA) public database RESULTS: We found that the methylation level of VHL disease-associated ccRCC tumor tissues was significantly lower than that of normal tissues. The tumor tissues showed a difference in the copy number of 3p loss and 5q and 7q gain compared with normal tissues. We integrated RNA-seq and WGBS data to reveal methylation candidate genes associated with VHL disease-associated ccRCC; our results showed 124 hypermethylated and downregulated genes, and 245 hypomethylated and upregulated genes. By comparing the VHL disease-associated ccRCC transcriptome data with the sporadic ccRCC transcriptome data from the TCGA public database, we found that the major pathways of differential gene enrichment differed between them.

CONCLUSIONS

Our study mapped the multiomics of copy number variation, methylation and mRNA level changes in tumor and normal tissues of clear cell renal cell carcinoma with VHL syndrome, which provides a solid foundation for the mechanistic study, biomarker screening, and therapeutic target discovery of clear cell renal cell carcinoma.

Collapse

Affiliation(s)

Lei Li Department of Urology, Peking University First Hospital, Beijing, 100034, China Institution of Urology, Peking University, Beijing, 100034, China Beijing Key Laboratory of Urogenital Diseases (Male) Molecular Diagnosis and Treatment Center, Beijing, 100034, China National Urological Cancer Center, Beijing, 100034, China
Hainan Bao Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, and China National Center for Bioinformation, Chinese Academy of Sciences, Beijing, 100101, China University of Chinese Academy of Sciences, Beijing, 100049, China
Yawei Xu Department of Urology, Peking University First Hospital, Beijing, 100034, China Institution of Urology, Peking University, Beijing, 100034, China Beijing Key Laboratory of Urogenital Diseases (Male) Molecular Diagnosis and Treatment Center, Beijing, 100034, China National Urological Cancer Center, Beijing, 100034, China
Wuping Yang Department of Urology, Peking University First Hospital, Beijing, 100034, China Institution of Urology, Peking University, Beijing, 100034, China Beijing Key Laboratory of Urogenital Diseases (Male) Molecular Diagnosis and Treatment Center, Beijing, 100034, China National Urological Cancer Center, Beijing, 100034, China
Zedan Zhang Department of Urology, Peking University First Hospital, Beijing, 100034, China Institution of Urology, Peking University, Beijing, 100034, China Beijing Key Laboratory of Urogenital Diseases (Male) Molecular Diagnosis and Treatment Center, Beijing, 100034, China National Urological Cancer Center, Beijing, 100034, China
Kaifang Ma Department of Urology, Beijing Tongren Hospital, Capital Medical University, No. 1 Dongjiaomingxiang Street, Dongcheng District, Beijing, 100730, China
Kenan Zhang Department of Urology, Peking University First Hospital, Beijing, 100034, China Institution of Urology, Peking University, Beijing, 100034, China Beijing Key Laboratory of Urogenital Diseases (Male) Molecular Diagnosis and Treatment Center, Beijing, 100034, China National Urological Cancer Center, Beijing, 100034, China
Jingcheng Zhou Department of Urology, Peking University First Hospital, Beijing, 100034, China Institution of Urology, Peking University, Beijing, 100034, China Beijing Key Laboratory of Urogenital Diseases (Male) Molecular Diagnosis and Treatment Center, Beijing, 100034, China National Urological Cancer Center, Beijing, 100034, China
Yanqing Gong Department of Urology, Peking University First Hospital, Beijing, 100034, China Institution of Urology, Peking University, Beijing, 100034, China Beijing Key Laboratory of Urogenital Diseases (Male) Molecular Diagnosis and Treatment Center, Beijing, 100034, China National Urological Cancer Center, Beijing, 100034, China
Weimin Ci Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, and China National Center for Bioinformation, Chinese Academy of Sciences, Beijing, 100101, China. University of Chinese Academy of Sciences, Beijing, 100049, China. Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing, China.
Kan Gong Department of Urology, Peking University First Hospital, Beijing, 100034, China. Institution of Urology, Peking University, Beijing, 100034, China. Beijing Key Laboratory of Urogenital Diseases (Male) Molecular Diagnosis and Treatment Center, Beijing, 100034, China. National Urological Cancer Center, Beijing, 100034, China.

Collapse

Hasib RA, Ali MC, Rahman MH, Ahmed S, Sultana S, Summa SZ, Shimu MSS, Afrin Z, Jamal MAHM. Integrated gene expression profiling and functional enrichment analyses to discover biomarkers and pathways associated with Guillain-Barré syndrome and autism spectrum disorder to identify new therapeutic targets. J Biomol Struct Dyn 2023:1-23. [PMID: 37776011 DOI: 10.1080/07391102.2023.2262586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 09/17/2023] [Indexed: 10/01/2023]

Ochoa S, Hernández-Lemus E. Molecular mechanisms of multi-omic regulation in breast cancer. Front Oncol 2023;13:1148861. [PMID: 37564937 PMCID: PMC10411627 DOI: 10.3389/fonc.2023.1148861] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 07/05/2023] [Indexed: 08/12/2023] Open

Cai X, Shi W, Lian J, Zhang G, Cai Y, Zhu L. Characterization of immune landscape and development of a novel N7-methylguanine-related gene signature to aid therapy in recurrent aphthous stomatitis. Inflamm Res 2023;72:133-148. [PMID: 36352034 DOI: 10.1007/s00011-022-01665-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 10/24/2022] [Accepted: 10/26/2022] [Indexed: 11/11/2022] Open

Nikolenko JV, Fursova NA, Mazina MY, Vorobyeva NE, Krasnov AN. The Drosophila CG9890 Protein is Involved in the Regulation of Ecdysone-Dependent Transcription. Mol Biol 2022. [DOI: 10.1134/s0026893322040082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Pan-cancer identification of the relationship of metabolism-related differentially expressed transcription regulation with non-differentially expressed target genes via a gated recurrent unit network. Comput Biol Med 2022;148:105883. [PMID: 35878490 DOI: 10.1016/j.compbiomed.2022.105883] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 07/10/2022] [Accepted: 07/16/2022] [Indexed: 11/20/2022]

Ripon Rouf ASM, Amin MA, Islam MK, Haque F, Ahmed KR, Rahman MA, Islam MZ, Kim B. Statistical Bioinformatics to Uncover the Underlying Biological Mechanisms That Linked Smoking with Type 2 Diabetes Patients Using Transcritpomic and GWAS Analysis. Molecules 2022;27:molecules27144390. [PMID: 35889263 PMCID: PMC9323276 DOI: 10.3390/molecules27144390] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 06/30/2022] [Accepted: 07/04/2022] [Indexed: 12/14/2022] Open

Tian H, He Y, Xue Y, Gao YQ. Expression regulation of genes is linked to their CpG density distributions around transcription start sites. Life Sci Alliance 2022;5:5/9/e202101302. [PMID: 35580989 PMCID: PMC9113945 DOI: 10.26508/lsa.202101302] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Revised: 05/07/2022] [Accepted: 05/09/2022] [Indexed: 11/24/2022] Open

Hanson HE, Wang C, Schrey AW, Liebl AL, Ravinet M, Jiang RH, Martin LB. Epigenetic Potential and DNA Methylation in an Ongoing House Sparrow (Passer domesticus) Range Expansion. Am Nat 2022;200:662-674. [DOI: 10.1086/720950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

Hasan I, Hossain A, Bhuiyan P, Miah S, Rahman H. A system biology approach to determine therapeutic targets by identifying molecular mechanisms and key pathways for type 2 diabetes that are linked to the development of tuberculosis and rheumatoid arthritis. Life Sci 2022;297:120483. [DOI: 10.1016/j.lfs.2022.120483] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 03/07/2022] [Accepted: 03/09/2022] [Indexed: 12/17/2022]

Girgis CM, Brennan-Speranza TC. Vitamin D and Skeletal Muscle: Current Concepts From Preclinical Studies. JBMR Plus 2021;5:e10575. [PMID: 34950830 PMCID: PMC8674777 DOI: 10.1002/jbm4.10575] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 10/07/2021] [Accepted: 10/24/2021] [Indexed: 12/12/2022] Open

Han Z, Yang T, Guo Y, Cui WH, Yao LJ, Li G, Wu AM, Li JH, Liu LJ. The transcription factor PagLBD3 contributes to the regulation of secondary growth in Populus. JOURNAL OF EXPERIMENTAL BOTANY 2021;72:7092-7106. [PMID: 34313722 DOI: 10.1093/jxb/erab351] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 07/24/2021] [Indexed: 06/13/2023]

Mutual dependency between lncRNA LETN and protein NPM1 in controlling the nucleolar structure and functions sustaining cell proliferation. Cell Res 2021;31:664-683. [PMID: 33432115 PMCID: PMC8169757 DOI: 10.1038/s41422-020-00458-6] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 11/30/2020] [Indexed: 02/06/2023] Open

Marand AP, Chen Z, Gallavotti A, Schmitz RJ. A cis-regulatory atlas in maize at single-cell resolution. Cell 2021;184:3041-3055.e21. [PMID: 33964211 DOI: 10.1101/2020.09.27.315499] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 03/04/2021] [Accepted: 04/07/2021] [Indexed: 05/22/2023]

Agarwal V, Shendure J. Predicting mRNA Abundance Directly from Genomic Sequence Using Deep Convolutional Neural Networks. Cell Rep 2021;31:107663. [PMID: 32433972 DOI: 10.1016/j.celrep.2020.107663] [Citation(s) in RCA: 87] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2018] [Revised: 06/11/2019] [Accepted: 04/28/2020] [Indexed: 01/06/2023] Open

Muley VY. Mathematical Programming for Modeling Expression of a Gene Using Gurobi Optimizer to Identify Its Transcriptional Regulators. Methods Mol Biol 2021;2328:99-113. [PMID: 34251621 DOI: 10.1007/978-1-0716-1534-8_6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Sharipov RN, Kondrakhin YV, Ryabova AS, Yevshin IS, Kolpakov FA. Assessment of transcriptional importance of cell line-specific features based on GTRD and FANTOM5 data. PLoS One 2020;15:e0243332. [PMID: 33347457 PMCID: PMC7751965 DOI: 10.1371/journal.pone.0243332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 11/19/2020] [Indexed: 11/18/2022] Open

Wang H, Liu Y, Guan H, Fan GL. The Regulation of Target Genes by Co-occupancy of Transcription Factors, c-Myc and Mxi1 with Max in the Mouse Cell Line. Curr Bioinform 2020. [DOI: 10.2174/1574893614666191106103633] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]

Singh R, Sophiarani Y. A report on DNA sequence determinants in gene expression. Bioinformation 2020;16:422-431. [PMID: 32831525 PMCID: PMC7434957 DOI: 10.6026/97320630016422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Accepted: 04/24/2020] [Indexed: 11/26/2022] Open

Höllbacher B, Balázs K, Heinig M, Uhlenhaut NH. Seq-ing answers: Current data integration approaches to uncover mechanisms of transcriptional regulation. Comput Struct Biotechnol J 2020;18:1330-1341. [PMID: 32612756 PMCID: PMC7306512 DOI: 10.1016/j.csbj.2020.05.018] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 05/21/2020] [Accepted: 05/23/2020] [Indexed: 02/06/2023] Open

Ochoa S, de Anda-Jáuregui G, Hernández-Lemus E. Multi-Omic Regulation of the PAM50 Gene Signature in Breast Cancer Molecular Subtypes. Front Oncol 2020;10:845. [PMID: 32528899 PMCID: PMC7259379 DOI: 10.3389/fonc.2020.00845] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Accepted: 04/29/2020] [Indexed: 12/24/2022] Open

Abstract

Breast cancer is a disease that exhibits heterogeneity that goes from the genomic to the clinical levels. This heterogeneity is thought to be captured (at least partially) by the so-called breast cancer molecular subtypes. These molecular subtypes were initially defined based on the unsupervised clustering of gene expression and its correlate with histological, morphological, phenotypic and clinical features already known. Later, a 50-gene signature, PAM50, was defined in order to identify the biological subtype of a given sample within the clinical setting. The PAM50 signature was obtained by the use of unsupervised statistical methods, and therefore no limitation was set on the biological relevance (or lack of) of the selected genes beyond its predictive capacity. An open question that remains is what are the regulatory elements that drive the various expression behaviors of this set of genes in the different molecular subtypes. This question becomes more relevant as the measurement of more biological layers of regulation becomes accessible. In this work, we analyzed the gene expression regulation of the 50 genes in the PAM50 signature, in terms of (a) gene co-expression, (b) transcription factors, (c) micro-RNAs, and (d) methylation. Using data from the Cancer Genome Atlas (TCGA) for the Luminal A and B, Basal, and HER2-enriched molecular subtypes as well as normal tumor adjacent tissue, we identified predictors for gene expression through the use of an elastic net model. We compare and contrast the sets of identified regulators for the gene signature in each molecular subtype, and systematically compare them to current literature. We also identified a unique set of predictors for the expression of genes in the PAM50 signature associated with each of the molecular subtypes. Most selected predictors are exclusive for a PAM50 gene and predictors are not shared across subtypes. There are only 13 coding transcripts and 2 miRNAs selected for the four subtypes. MiR-21 and miR-10b connect almost all the PAM50 genes in all the subtypes and normal tissue, but do it in an exclusive manner, suggesting a cancer switch from miR-10b coordination in normal tissue to miR-21. The PAM50 gene sets of selected predictors that enrich for a function across subtypes, support that different regulatory molecular mechanisms are taking place. With this study we aim to a wider understanding of the regulatory mechanisms that differentiate the expression of the PAM50 signature, which in turn could perhaps help understand the molecular basis of the differences between the molecular subtypes.

Collapse

Klein HU, Schäfer M, Bennett DA, Schwender H, De Jager PL. Bayesian integrative analysis of epigenomic and transcriptomic data identifies Alzheimer's disease candidate genes and networks. PLoS Comput Biol 2020;16:e1007771. [PMID: 32255787 PMCID: PMC7138305 DOI: 10.1371/journal.pcbi.1007771] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Accepted: 03/03/2020] [Indexed: 12/28/2022] Open

Abstract

Biomedical research studies have generated large multi-omic datasets to study complex diseases like Alzheimer’s disease (AD). An important aim of these studies is the identification of candidate genes that demonstrate congruent disease-related alterations across the different data types measured by the study. We developed a new method to detect such candidate genes in large multi-omic case-control studies that measure multiple data types in the same set of samples. The method is based on a gene-centric integrative coefficient quantifying to what degree consistent differences are observed in the different data types. For statistical inference, a Bayesian hierarchical model is used to study the distribution of the integrative coefficient. The model employs a conditional autoregressive prior to integrate a functional gene network and to share information between genes known to be functionally related. We applied the method to an AD dataset consisting of histone acetylation, DNA methylation, and RNA transcription data from human cortical tissue samples of 233 subjects, and we detected 816 genes with consistent differences between persons with AD and controls. The findings were validated in protein data and in RNA transcription data from two independent AD studies. Finally, we found three subnetworks of jointly dysregulated genes within the functional gene network which capture three distinct biological processes: myeloid cell differentiation, protein phosphorylation and synaptic signaling. Further investigation of the myeloid network indicated an upregulation of this network in early stages of AD prior to accumulation of hyperphosphorylated tau and suggested that increased CSF1 transcription in astrocytes may contribute to microglial activation in AD. Thus, we developed a method that integrates multiple data types and external knowledge of gene function to detect candidate genes, applied the method to an AD dataset, and identified several disease-related genes and processes demonstrating the usefulness of the integrative approach.

Recent technological advances have led to a new generation of studies that interrogate multiple molecular levels in the same target tissue of a set of subjects, generating complex multi-omic datasets with which to study disease mechanism. These datasets of genetic, epigenomic, transcriptomic, and other data have the potential to reveal novel biological insights; however, integrative analyses remain challenging and require new computational methods. We developed an integrative Bayesian approach to detect genes with consistent differences between case and control samples across multiple data types. The method further integrates prior knowledge about gene function in the form of a gene functional similarity network to improve statistical inference by sharing information between related genes. We applied our method to an Alzheimer’s disease dataset of epigenomic and transcriptomic data and detected and then validated several novel and known candidate genes as well as three major disease-related biological processes. One of these processes reflected microglial activation and included the cytokine CSF1. Single-nucleus data revealed that CSF1 was primarily upregulated in astrocytes, implicating the involvement of this cell type in microglial activation. Hence, we demonstrated that integrative analysis approaches to multi-omic datasets can improve candidate gene detection and thereby generate new insights into complex diseases.

Collapse

do Amaral MCF, Frisbie J, Crum RJ, Goldstein DL, Krane CM. Hepatic transcriptome of the freeze-tolerant Cope's gray treefrog, Dryophytes chrysoscelis: responses to cold acclimation and freezing. BMC Genomics 2020;21:226. [PMID: 32164545 PMCID: PMC7069055 DOI: 10.1186/s12864-020-6602-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Accepted: 02/20/2020] [Indexed: 11/10/2022] Open

Abstract

Background

Cope’s gray treefrog, Dryophytes chrysoscelis, withstands the physiological challenges of corporeal freezing, partly by accumulating cryoprotective compounds of hepatic origin, including glycerol, urea, and glucose. We hypothesized that expression of genes related to cryoprotectant mobilization and stress tolerance would be differentially regulated in response to cold. Using high-throughput RNA sequencing (RNA-Seq), a hepatic transcriptome was generated for D. chrysoscelis, and gene expression was compared among frogs that were warm-acclimated, cold-acclimated, and frozen.

Results

A total of 159,556 transcripts were generated; 39% showed homology with known transcripts, and 34% of all transcripts were annotated. Gene-level analyses identified 34,936 genes, 85% of which were annotated. Cold acclimation induced differential expression both of genes and non-coding transcripts; freezing induced few additional changes. Transcript-level analysis followed by gene-level aggregation revealed 3582 differentially expressed genes, whereas analysis at the gene level revealed 1324 differentially regulated genes. Approximately 3.6% of differentially expressed sequences were non-coding and of no identifiable homology. Expression of several genes associated with cryoprotectant accumulation was altered during cold acclimation. Of note, glycerol kinase expression decreased with cold exposure, possibly promoting accumulation of glycerol, whereas glucose export was transcriptionally promoted by upregulation of glucose-6-phosphatase and downregulation of genes of various glycolytic enzymes. Several genes related to heat shock protein response, DNA repair, and the ubiquitin proteasome pathway were upregulated in cold and frozen frogs, whereas genes involved in responses to oxidative stress and anoxia, both potential sources of cellular damage during freezing, were downregulated or unchanged.

Conclusion

Our study is the first to report transcriptomic responses to low temperature exposure in a freeze-tolerant vertebrate. The hepatic transcriptome of Dryophytes chrysoscelis is responsive to cold and freezing. Transcriptomic regulation of genes related to particular pathways, such as glycerol biosynthesis, were not all regulated in parallel. The physiological demands associated with cold and freezing, as well as the transcriptomic responses observed in this study, are shared with several organisms that face similar ecophysiological challenges, suggesting common regulatory mechanisms. The role of transcriptional regulation relative to other cellular processes, and of non-coding transcripts as elements of those responses, deserve further study.

Collapse

Rahman MH, Peng S, Hu X, Chen C, Rahman MR, Uddin S, Quinn JM, Moni MA. A Network-Based Bioinformatics Approach to Identify Molecular Biomarkers for Type 2 Diabetes that Are Linked to the Progression of Neurological Diseases. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:ijerph17031035. [PMID: 32041280 PMCID: PMC7037290 DOI: 10.3390/ijerph17031035] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 02/02/2020] [Accepted: 02/02/2020] [Indexed: 12/21/2022]

Abstract

Neurological diseases (NDs) are progressive disorders, the progression of which can be significantly affected by a range of common diseases that present as comorbidities. Clinical studies, including epidemiological and neuropathological analyses, indicate that patients with type 2 diabetes (T2D) have worse progression of NDs, suggesting pathogenic links between NDs and T2D. However, finding causal or predisposing factors that link T2D and NDs remains challenging. To address these problems, we developed a high-throughput network-based quantitative pipeline using agnostic approaches to identify genes expressed abnormally in both T2D and NDs, to identify some of the shared molecular pathways that may underpin T2D and ND interaction. We employed gene expression transcriptomic datasets from control and disease-affected individuals and identified differentially expressed genes (DEGs) in tissues of patients with T2D and ND when compared to unaffected control individuals. One hundred and ninety seven DEGs (99 up-regulated and 98 down-regulated in affected individuals) that were common to both the T2D and the ND datasets were identified. Functional annotation of these identified DEGs revealed the involvement of significant cell signaling associated molecular pathways. The overlapping DEGs (i.e., seen in both T2D and ND datasets) were then used to extract the most significant GO terms. We performed validation of these results with gold benchmark databases and literature searching, which identified which genes and pathways had been previously linked to NDs or T2D and which are novel. Hub proteins in the pathways were identified (including DNM2, DNM1, MYH14, PACSIN2, TFRC, PDE4D, ENTPD1, PLK4, CDC20B, and CDC14A) using protein-protein interaction analysis which have not previously been described as playing a role in these diseases. To reveal the transcriptional and post-transcriptional regulators of the DEGs we used transcription factor (TF) interactions analysis and DEG-microRNAs (miRNAs) interaction analysis, respectively. We thus identified the following TFs as important in driving expression of our T2D/ND common genes: FOXC1, GATA2, FOXL1, YY1, E2F1, NFIC, NFYA, USF2, HINFP, MEF2A, SRF, NFKB1, USF2, HINFP, MEF2A, SRF, NFKB1, PDE4D, CREB1, SP1, HOXA5, SREBF1, TFAP2A, STAT3, POU2F2, TP53, PPARG, and JUN. MicroRNAs that affect expression of these genes include mir-335-5p, mir-16-5p, mir-93-5p, mir-17-5p, mir-124-3p. Thus, our transcriptomic data analysis identifies novel potential links between NDs and T2D pathologies that may underlie comorbidity interactions, links that may include potential targets for therapeutic intervention. In sum, our neighborhood-based benchmarking and multilayer network topology methods identified novel putative biomarkers that indicate how type 2 diabetes (T2D) and these neurological diseases interact and pathways that, in the future, may be targeted for treatment.

Collapse

Xu T, Zheng X, Li B, Jin P, Qin Z, Wu H. A comprehensive review of computational prediction of genome-wide features. Brief Bioinform 2020;21:120-134. [PMID: 30462144 PMCID: PMC10233247 DOI: 10.1093/bib/bby110] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2018] [Revised: 10/15/2018] [Accepted: 10/16/2018] [Indexed: 12/15/2022] Open

Girgis CM. Vitamin D and Skeletal Muscle: Emerging Roles in Development, Anabolism and Repair. Calcif Tissue Int 2020;106:47-57. [PMID: 31312865 DOI: 10.1007/s00223-019-00583-4] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/08/2018] [Accepted: 04/29/2019] [Indexed: 12/17/2022]

Yu Q, He Z, Zubkov D, Huang S, Kurochkin I, Yang X, Halene T, Willmitzer L, Giavalisco P, Akbarian S, Khaitovich P. Lipidome alterations in human prefrontal cortex during development, aging, and cognitive disorders. Mol Psychiatry 2020;25:2952-2969. [PMID: 30089790 PMCID: PMC7577858 DOI: 10.1038/s41380-018-0200-8] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Revised: 04/26/2018] [Accepted: 06/11/2018] [Indexed: 12/27/2022]

Affiliation(s)

Qianhui Yu grid.9227.e0000000119573309Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031 China ,2grid.419092.70000 0004 0467 2285CAS Key Laboratory of Compstudy has been deposited in the National Omics Datautational Biology, CAS-MPG Partner Institute for Computational Biology, SIBS, CAS, Shanghai, 200031 China
Zhisong He grid.419092.70000 0004 0467 2285CAS Key Laboratory of Compstudy has been deposited in the National Omics Datautational Biology, CAS-MPG Partner Institute for Computational Biology, SIBS, CAS, Shanghai, 200031 China ,3grid.454320.40000 0004 0555 3608Skolkovo Institute of Science and Technology, Moscow, 143028 Russia
Dmitry Zubkov grid.454320.40000 0004 0555 3608Skolkovo Institute of Science and Technology, Moscow, 143028 Russia
Shuyun Huang grid.419092.70000 0004 0467 2285CAS Key Laboratory of Compstudy has been deposited in the National Omics Datautational Biology, CAS-MPG Partner Institute for Computational Biology, SIBS, CAS, Shanghai, 200031 China ,4grid.440637.20000 0004 4657 8879ShanghaiTech University, Shanghai, 200031 China
Ilia Kurochkin grid.454320.40000 0004 0555 3608Skolkovo Institute of Science and Technology, Moscow, 143028 Russia
Xiaode Yang grid.9227.e0000000119573309Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031 China ,2grid.419092.70000 0004 0467 2285CAS Key Laboratory of Compstudy has been deposited in the National Omics Datautational Biology, CAS-MPG Partner Institute for Computational Biology, SIBS, CAS, Shanghai, 200031 China
Tobias Halene grid.59734.3c0000 0001 0670 2351Department of Psychiatry and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029 USA
Lothar Willmitzer grid.418390.70000 0004 0491 976XMax Planck Institute for Molecular Plant Physiology, Am Mühlenberg 1, Potsdam, 14476 Germany
Patrick Giavalisco Max Planck Institute for Molecular Plant Physiology, Am Mühlenberg 1, Potsdam, 14476, Germany.
Schahram Akbarian Department of Psychiatry and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.
Philipp Khaitovich Skolkovo Institute of Science and Technology, Moscow, 143028, Russia. .,ShanghaiTech University, Shanghai, 200031, China. .,Max Planck Institute for Evolutionary Anthropology, Leipzig, 04103, Germany. .,Comparative Biology Group, CAS-MPG Partner Institute for Computational Biology, SIBS, CAS, Shanghai, 200031, China.

Collapse

Schmidt F, Schulz MH. On the problem of confounders in modeling gene expression. Bioinformatics 2019;35:711-719. [PMID: 30084962 PMCID: PMC6530814 DOI: 10.1093/bioinformatics/bty674] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Revised: 06/21/2018] [Accepted: 08/02/2018] [Indexed: 01/01/2023] Open

Yu R, Nielsen J. Big data in yeast systems biology. FEMS Yeast Res 2019;19:5585886. [DOI: 10.1093/femsyr/foz070] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Accepted: 10/09/2019] [Indexed: 12/16/2022] Open

Thormann V, Rothkegel MC, Schöpflin R, Glaser LV, Djuric P, Li N, Chung HR, Schwahn K, Vingron M, Meijsing SH. Genomic dissection of enhancers uncovers principles of combinatorial regulation and cell type-specific wiring of enhancer-promoter contacts. Nucleic Acids Res 2019;46:2868-2882. [PMID: 29385519 PMCID: PMC5888794 DOI: 10.1093/nar/gky051] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Accepted: 01/19/2018] [Indexed: 12/19/2022] Open

Soleimani VD, Nguyen D, Ramachandran P, Palidwor GA, Porter CJ, Yin H, Perkins TJ, Rudnicki MA. Cis-regulatory determinants of MyoD function. Nucleic Acids Res 2019;46:7221-7235. [PMID: 30016497 PMCID: PMC6101602 DOI: 10.1093/nar/gky388] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Accepted: 04/30/2018] [Indexed: 01/06/2023] Open

Holland P, Bergenholm D, Börlin CS, Liu G, Nielsen J. Predictive models of eukaryotic transcriptional regulation reveals changes in transcription factor roles and promoter usage between metabolic conditions. Nucleic Acids Res 2019;47:4986-5000. [PMID: 30976803 PMCID: PMC6547448 DOI: 10.1093/nar/gkz253] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Revised: 03/26/2019] [Accepted: 04/04/2019] [Indexed: 01/08/2023] Open

Zhao Y, Schaafsma E, Cheng C. Applications of ENCODE data to Systematic Analyses via Data Integration. ACTA ACUST UNITED AC 2019;11:57-64. [PMID: 31011690 DOI: 10.1016/j.coisb.2018.08.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Liu W, Rajapakse JC. Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks. BMC SYSTEMS BIOLOGY 2019;13:37. [PMID: 30953534 PMCID: PMC6449891 DOI: 10.1186/s12918-019-0695-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]

Pantera H, Moran JJ, Hung HA, Pak E, Dutra A, Svaren J. Regulation of the neuropathy-associated Pmp22 gene by a distal super-enhancer. Hum Mol Genet 2019;27:2830-2839. [PMID: 29771329 DOI: 10.1093/hmg/ddy191] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Accepted: 05/09/2018] [Indexed: 12/27/2022] Open

Ma S, Jiang T, Jiang R. Constructing tissue-specific transcriptional regulatory networks via a Markov random field. BMC Genomics 2018;19:884. [PMID: 30598101 PMCID: PMC6311931 DOI: 10.1186/s12864-018-5277-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

Abstract

BACKGROUND

Recent advances in sequencing technologies have enabled parallel assays of chromatin accessibility and gene expression for major human cell lines. Such innovation provides a great opportunity to decode phenotypic consequences of genetic variation via the construction of predictive gene regulatory network models. However, there still lacks a computational method to systematically integrate chromatin accessibility information with gene expression data to recover complicated regulatory relationships between genes in a tissue-specific manner.

RESULTS

We propose a Markov random field (MRF) model for constructing tissue-specific transcriptional regulatory networks via integrative analysis of DNase-seq and RNA-seq data. Our method, named CSNets (cell-line specific regulatory networks), first infers regulatory networks for individual cell lines using chromatin accessibility information, and then fine-tunes these networks using the MRF based on pairwise similarity between cell lines derived from gene expression data. Using this method, we constructed regulatory networks specific to 110 human cell lines and 13 major tissues with the use of ENCODE data. We demonstrated the high quality of these networks via comprehensive statistical analysis based on ChIP-seq profiles, functional annotations, taxonomic analysis, and literature surveys. We further applied these networks to analyze GWAS data of Crohn's disease and prostate cancer. Results were either consistent with the literature or provided biological insights into regulatory mechanisms of these two complex diseases. The website of CSNets is freely available at http://bioinfo.au.tsinghua.edu.cn/jianglab/CSNETS/ .

CONCLUSIONS

CSNets demonstrated the power of joint analysis on epigenomic and transcriptomic data towards the accurate construction of gene regulatory network. Our work provides not only a useful resource of regulatory networks to the community, but also valuable experiences in methodology development for multi-omics data integration.

Collapse

Lu R, Rogan PK. Transcription factor binding site clusters identify target genes with similar tissue-wide expression and buffer against mutations. F1000Res 2018;7:1933. [PMID: 31001412 PMCID: PMC6464064 DOI: 10.12688/f1000research.17363.1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 12/05/2018] [Indexed: 10/12/2023] Open

Abstract

Background: The distribution and composition of cis-regulatory modules composed of transcription factor (TF) binding site (TFBS) clusters in promoters substantially determine gene expression patterns and TF targets. TF knockdown experiments have revealed that TF binding profiles and gene expression levels are correlated. We use TFBS features within accessible promoter intervals to predict genes with similar tissue-wide expression patterns and TF targets. Methods: Genes with correlated expression patterns across 53 tissues and TF targets were respectively identified from Bray-Curtis Similarity and TF knockdown experiments. Corresponding promoter sequences were reduced to DNase I-accessible intervals; TFBSs were then identified within these intervals using information theory-based position weight matrices for each TF (iPWMs) and clustered. Features from information-dense TFBS clusters predicted these genes with machine learning classifiers, which were evaluated for accuracy, specificity and sensitivity. Mutations in TFBSs were analyzed to in silico examine their impact on cluster densities and the regulatory states of target genes. Results: We initially chose the glucocorticoid receptor gene ( NR3C1), whose regulation has been extensively studied, to test this approach. SLC25A32 and TANK were found to exhibit the most similar expression patterns to NR3C1. A Decision Tree classifier exhibited the largest area under the Receiver Operating Characteristic (ROC) curve in detecting such genes. Target gene prediction was confirmed using siRNA knockdown of TFs, which was found to be more accurate than those predicted after CRISPR/CAS9 inactivation. In-silico mutation analyses of TFBSs also revealed that one or more information-dense TFBS clusters in promoters are required for accurate target gene prediction. Conclusions: Machine learning based on TFBS information density, organization, and chromatin accessibility accurately identifies gene targets with comparable tissue-wide expression patterns. Multiple information-dense TFBS clusters in promoters appear to protect promoters from effects of deleterious binding site mutations in a single TFBS that would otherwise alter regulation of these genes.

Collapse

Lu R, Rogan PK. Transcription factor binding site clusters identify target genes with similar tissue-wide expression and buffer against mutations. F1000Res 2018;7:1933. [PMID: 31001412 PMCID: PMC6464064 DOI: 10.12688/f1000research.17363.2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/28/2019] [Indexed: 12/20/2022] Open

Abstract

Background: The distribution and composition of cis-regulatory modules composed of transcription factor (TF) binding site (TFBS) clusters in promoters substantially determine gene expression patterns and TF targets. TF knockdown experiments have revealed that TF binding profiles and gene expression levels are correlated. We use TFBS features within accessible promoter intervals to predict genes with similar tissue-wide expression patterns and TF targets using Machine Learning (ML). Methods: Bray-Curtis Similarity was used to identify genes with correlated expression patterns across 53 tissues. TF targets from knockdown experiments were also analyzed by this approach to set up the ML framework. TFBSs were selected within DNase I-accessible intervals of corresponding promoter sequences using information theory-based position weight matrices (iPWMs) for each TF. Features from information-dense clusters of TFBSs were input to ML classifiers which predict these gene targets along with their accuracy, specificity and sensitivity. Mutations in TFBSs were analyzed in silico to examine their impact on TFBS clustering and predict changes in gene regulation. Results: The glucocorticoid receptor gene ( NR3C1), whose regulation has been extensively studied, was selected to test this approach. SLC25A32 and TANK exhibited the most similar expression patterns to NR3C1. A Decision Tree classifier exhibited the best performance in detecting such genes, based on Area Under the Receiver Operating Characteristic curve (ROC). TF target gene prediction was confirmed using siRNA knockdown, which was more accurate than CRISPR/CAS9 inactivation. TFBS mutation analyses revealed that accurate target gene prediction required at least 1 information-dense TFBS cluster. Conclusions: ML based on TFBS information density, organization, and chromatin accessibility accurately identifies gene targets with comparable tissue-wide expression patterns. Multiple information-dense TFBS clusters in promoters appear to protect promoters from effects of deleterious binding site mutations in a single TFBS that would otherwise alter regulation of these genes.

Collapse

Ng FSL, Ruau D, Wernisch L, Göttgens B. A graphical model approach visualizes regulatory relationships between genome-wide transcription factor binding profiles. Brief Bioinform 2018;19:162-173. [PMID: 27780826 PMCID: PMC5496675 DOI: 10.1093/bib/bbw102] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2015] [Indexed: 11/16/2022] Open

Niu B, Coslo DM, Bataille AR, Albert I, Pugh BF, Omiecinski CJ. In vivo genome-wide binding interactions of mouse and human constitutive androstane receptors reveal novel gene targets. Nucleic Acids Res 2018;46:8385-8403. [PMID: 30102401 PMCID: PMC6144799 DOI: 10.1093/nar/gky692] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 07/17/2018] [Accepted: 07/20/2018] [Indexed: 12/13/2022] Open

Luo X, Wei Y. Nonparametric Bayesian learning of heterogeneous dynamic transcription factor networks. Ann Appl Stat 2018. [DOI: 10.1214/17-aoas1129] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Chen D, Fu LY, Hu D, Klukas C, Chen M, Kaufmann K. The HTPmod Shiny application enables modeling and visualization of large-scale biological data. Commun Biol 2018;1:89. [PMID: 30271970 PMCID: PMC6123733 DOI: 10.1038/s42003-018-0091-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 06/03/2018] [Indexed: 01/20/2023] Open

Chen XW, Gao JX. Big Data Bioinformatics. Methods 2018;111:1-2. [PMID: 27908398 DOI: 10.1016/j.ymeth.2016.11.017] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Zhang LQ, Li QZ. Estimating the effects of transcription factors binding and histone modifications on gene expression levels in human cells. Oncotarget 2018;8:40090-40103. [PMID: 28454114 PMCID: PMC5522221 DOI: 10.18632/oncotarget.16988] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Accepted: 03/11/2017] [Indexed: 12/22/2022] Open

Kelley DR, Reshef YA, Bileschi M, Belanger D, McLean CY, Snoek J. Sequential regulatory activity prediction across chromosomes with convolutional neural networks. Genome Res 2018;28:739-750. [PMID: 29588361 PMCID: PMC5932613 DOI: 10.1101/gr.227819.117] [Citation(s) in RCA: 216] [Impact Index Per Article: 36.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Accepted: 03/23/2018] [Indexed: 01/10/2023]

Guo WL, Huang DS. An efficient method to transcription factor binding sites imputation via simultaneous completion of multiple matrices with positional consistency. MOLECULAR BIOSYSTEMS 2018;13:1827-1837. [PMID: 28718849 DOI: 10.1039/c7mb00155j] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]