1
|
Boyer K, Li L, Li T, Zhang B, Zhao G. MORA and EnsembleTFpredictor: An ensemble approach to reveal functional transcription factor regulatory networks. PLoS One 2023; 18:e0294724. [PMID: 38032891 PMCID: PMC10688744 DOI: 10.1371/journal.pone.0294724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 11/06/2023] [Indexed: 12/02/2023] Open
Abstract
MOTIVATION Our study aimed to identify biologically relevant transcription factors (TFs) that control the expression of a set of co-expressed or co-regulated genes. RESULTS We developed a fully automated pipeline, Motif Over Representation Analysis (MORA), to detect enrichment of known TF binding motifs in any query sequences. MORA performed better than or comparable to five other TF-prediction tools as evaluated using hundreds of differentially expressed gene sets and ChIP-seq datasets derived from known TFs. Additionally, we developed EnsembleTFpredictor to harness the power of multiple TF-prediction tools to provide a list of functional TFs ranked by prediction confidence. When applied to the test datasets, EnsembleTFpredictor not only identified the target TF but also revealed many TFs known to cooperate with the target TF in the corresponding biological systems. MORA and EnsembleTFpredictor have been used in two publications, demonstrating their power in guiding experimental design and in revealing novel biological insights.
Collapse
Affiliation(s)
- Kevin Boyer
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, United States of America
| | - Louis Li
- Brown University, Providence, RI, United States of America
| | - Tiandao Li
- Department of Developmental Biology, Washington University School of Medicine, St. Louis, MO, United States of America
| | - Bo Zhang
- Department of Developmental Biology, Washington University School of Medicine, St. Louis, MO, United States of America
| | - Guoyan Zhao
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, United States of America
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO, United States of America
| |
Collapse
|
2
|
Lu D, Pan R, Wu W, Zhang Y, Li S, Xu H, Huang J, Xia J, Wang Q, Luan X, Lv C, Zhang W, Meng G. FL-DTD: an integrated pipeline to predict the drug interacting targets by feedback loop-based network analysis. Brief Bioinform 2022; 23:6632928. [PMID: 35794722 DOI: 10.1093/bib/bbac263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 06/01/2022] [Accepted: 06/06/2022] [Indexed: 11/12/2022] Open
Abstract
Drug target discovery is an essential step to reveal the mechanism of action (MoA) underlying drug therapeutic effects and/or side effects. Most of the approaches are usually labor-intensive while unable to identify the tissue-specific interacting targets, especially the targets with weaker drug binding affinity. In this work, we proposed an integrated pipeline, FL-DTD, to predict the drug interacting targets of novel compounds in a tissue-specific manner. This method was built based on a hypothesis that cells under a status of homeostasis would take responses to drug perturbation by activating feedback loops. Therefore, the drug interacting targets can be predicted by analyzing the network responses after drug perturbation. We evaluated this method using the expression data of estrogen stimulation, gene manipulation and drug perturbation and validated its good performance to identify the annotated drug targets. Using STAT3 as a target protein, we applied this method to drug perturbation data of 500 natural compounds and predicted five compounds with STAT3 interacting activities. Experimental assay validated the STAT3-interacting activities of four compounds. Overall, our evaluation suggests that FL-DTD predicts the drug interacting targets with good accuracy and can be used for drug target discovery.
Collapse
Affiliation(s)
- Dong Lu
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Rongrong Pan
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Wenxuan Wu
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Yanyan Zhang
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Shensuo Li
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Hong Xu
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Jialan Huang
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Jianhua Xia
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Qun Wang
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Xin Luan
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Chao Lv
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Weidong Zhang
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| | - Guofeng Meng
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
| |
Collapse
|
3
|
Lu Y, Zhou X, Nardini C. Dissection of the module network implementation "LemonTree": enhancements towards applications in metagenomics and translation in autoimmune maladies. MOLECULAR BIOSYSTEMS 2018; 13:2083-2091. [PMID: 28809429 DOI: 10.1039/c7mb00248c] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Under the current deluge of omics, module networks distinctively emerge as methods capable of not only identifying inherently coherent groups (modules), thus reducing dimensionality, but also hypothesizing cause-effect relationships between modules and their regulators. Module networks were first designed in the transcriptomic era and further exploited in the multi-omic context to assess (for example) miRNA regulation of gene expression. Despite a number of available implementations, expansion of module networks to other omics is constrained by a limited characterization of the solutions' (modules plus regulators) accuracy and stability - an immediate need for the better characterization of molecular biology complexity in silico. We hence carefully assessed for LemonTree - a popular and open source module network implementation - the dependency of the software performances (sensitivity, specificity, false discovery rate, solutions' stability) on the input parameters and on the data quality (sample size, expression noise) based on synthetic and real data. In the process, we uncovered and fixed an issue in the code for the regulator assignment procedure. We concluded this evaluation with a table of recommended parameter settings. Finally, we applied these recommended settings to gut-intestinal metagenomic data from rheumatoid arthritis patients, to characterize the evolution of the gut-intestinal microbiome under different pharmaceutical regimens (methotrexate and prednisone) and we inferred innovative clinical recommendations with therapeutic potential, based on the computed module network.
Collapse
Affiliation(s)
- Youtao Lu
- CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, P. R. China
| | | | | |
Collapse
|
4
|
Meng G, Zhong X, Mei H. A Systematic Investigation into Aging Related Genes in Brain and Their Relationship with Alzheimer's Disease. PLoS One 2016; 11:e0150624. [PMID: 26937969 PMCID: PMC4777381 DOI: 10.1371/journal.pone.0150624] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2015] [Accepted: 02/17/2016] [Indexed: 01/08/2023] Open
Abstract
Aging, as a complex biological process, is accompanied by the accumulation of functional loses at different levels, which makes age to be the biggest risk factor to many neurological diseases. Even following decades of investigation, the process of aging is still far from being fully understood, especially at a systematic level. In this study, we identified aging related genes in brain by collecting the ones with sustained and consistent gene expression or DNA methylation changes in the aging process. Functional analysis with Gene Ontology to these genes suggested transcriptional regulators to be the most affected genes in the aging process. Transcription regulation analysis found some transcription factors, especially Specificity Protein 1 (SP1), to play important roles in regulating aging related gene expression. Module-based functional analysis indicated these genes to be associated with many well-known aging related pathways, supporting the validity of our approach to select aging related genes. Finally, we investigated the roles of aging related genes on Alzheimer's Disease (AD). We found that aging and AD related genes both involved some common pathways, which provided a possible explanation why aging made the brain more vulnerable to Alzheimer's Disease.
Collapse
Affiliation(s)
- Guofeng Meng
- Computational Modeling Sciences, Platform Technologies and Science, GlaxoSmithKline Research & Development, Shanghai, China
| | - Xiaoyan Zhong
- Neurodegeneration DPU, GlaxoSmithKline Research & Development, Shanghai, China
| | - Hongkang Mei
- Computational Modeling Sciences, Platform Technologies and Science, GlaxoSmithKline Research & Development, Shanghai, China
| |
Collapse
|
5
|
Meng G, Vingron M. Condition-specific target prediction from motifs and expression. ACTA ACUST UNITED AC 2014; 30:1643-50. [PMID: 24532727 DOI: 10.1093/bioinformatics/btu066] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
MOTIVATION It is commonplace to predict targets of transcription factors (TFs) by sequence matching with their binding motifs. However, this ignores the particular condition of the cells. Gene expression data can provide condition-specific information, as is, e.g. exploited in Motif Enrichment Analysis. RESULTS Here, we introduce a novel tool named condition-specific target prediction (CSTP) to predict condition-specific targets for TFs from expression data measured by either microarray or RNA-seq. Based on the philosophy of guilt by association, CSTP infers the regulators of each studied gene by recovering the regulators of its co-expressed genes. In contrast to the currently used methods, CSTP does not insist on binding sites of TFs in the promoter of the target genes. CSTP was applied to three independent biological processes for evaluation purposes. By analyzing the predictions for the same TF in three biological processes, we confirm that predictions with CSTP are condition-specific. Predictions were further compared with true TF binding sites as determined by ChIP-seq/chip. We find that CSTP predictions overlap with true binding sites to a degree comparable with motif-based predictions, although the two target sets do not coincide. AVAILABILITY AND IMPLEMENTATION CSTP is available via a web-based interface at http://cstp.molgen.mpg.de.
Collapse
Affiliation(s)
- Guofeng Meng
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin, Germany
| | - Martin Vingron
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin, Germany
| |
Collapse
|
6
|
oPOSSUM-3: advanced analysis of regulatory motif over-representation across genes or ChIP-Seq datasets. G3-GENES GENOMES GENETICS 2012; 2:987-1002. [PMID: 22973536 PMCID: PMC3429929 DOI: 10.1534/g3.112.003202] [Citation(s) in RCA: 230] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2012] [Accepted: 06/11/2012] [Indexed: 01/12/2023]
Abstract
oPOSSUM-3 is a web-accessible software system for identification of over-represented transcription factor binding sites (TFBS) and TFBS families in either DNA sequences of co-expressed genes or sequences generated from high-throughput methods, such as ChIP-Seq. Validation of the system with known sets of co-regulated genes and published ChIP-Seq data demonstrates the capacity for oPOSSUM-3 to identify mediating transcription factors (TF) for co-regulated genes or co-recovered sequences. oPOSSUM-3 is available at http://opossum.cisreg.ca.
Collapse
|
7
|
Gautier A, Le Gac F, Lareyre JJ. The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes. Gene 2010; 472:7-17. [PMID: 21047546 DOI: 10.1016/j.gene.2010.10.014] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2010] [Revised: 10/26/2010] [Accepted: 10/27/2010] [Indexed: 11/26/2022]
Abstract
The gonadal soma-derived factor (GSDF) belongs to the transforming growth factor-β superfamily and is conserved in teleostean fish species. Gsdf is specifically expressed in the gonads, and gene expression is restricted to the granulosa and Sertoli cells in trout and medaka. The gsdf gene expression is correlated to early testis differentiation in medaka and was shown to stimulate primordial germ cell and spermatogonia proliferation in trout. In the present study, we show that the gsdf gene localizes to a syntenic chromosomal fragment conserved among vertebrates although no gsdf-related gene is detected on the corresponding genomic region in tetrapods. We demonstrate using quantitative RT-PCR that most of the genes localized in the synteny are specifically expressed in medaka gonads. Gsdf is the only gene of the synteny with a much higher expression in the testis compared to the ovary. In contrast, gene expression pattern analysis of the gsdf surrounding genes (nup54, aff1, klhl8, sdad1, and ptpn13) indicates that these genes are preferentially expressed in the female gonads. The tissue distribution of these genes is highly similar in medaka and zebrafish, two teleostean species that have diverged more than 110 million years ago. The cellular localization of these genes was determined in medaka gonads using the whole-mount in situ hybridization technique. We confirm that gsdf gene expression is restricted to Sertoli and granulosa cells in contact with the premeiotic and meiotic cells. The nup54 gene is expressed in spermatocytes and previtellogenic oocytes. Transcripts corresponding to the ovary-specific genes (aff1, klhl8, and sdad1) are detected only in previtellogenic oocytes. No expression was detected in the gonocytes in 10 dpf embryos. In conclusion, we show that the gsdf gene localizes to a syntenic chromosomal fragment harboring evolutionary conserved genes in vertebrates. These genes are preferentially expressed in previtelloogenic oocytes, and thus, they display a different cellular localization compared to that of the gsdf gene indicating that the later gene is not co-regulated. Interestingly, our study identifies new clustered genes that are specifically expressed in previtellogenic oocytes (nup54, aff1, klhl8, sdad1).
Collapse
Affiliation(s)
- Aude Gautier
- INRA, UR SCRIBE, IFR, BioGenOuest, Rennes, France
| | | | | |
Collapse
|