1
|
Gao L, Wang Y, Gao Q, Chen Y, Zhang Z. Transcriptional control of CCAAT/enhancer binding protein zeta gene in chicken adipose tissue. Poult Sci 2024; 103:103540. [PMID: 38417330 PMCID: PMC10907851 DOI: 10.1016/j.psj.2024.103540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 01/25/2024] [Accepted: 02/05/2024] [Indexed: 03/01/2024] Open
Abstract
CCAAT/enhancer binding protein zeta (C/EBPZ) was differentially expressed in abdominal adipose tissues of fat and lean broilers and regulated adipogenesis in chicken. The objective of this study was to elucidate the transcriptional regulation of C/EBPZ gene in chicken adipose tissue. A 2,031-base pair (bp) chicken C/EBPZ sequence (2,025 nucleotides upstream to 6 nucleotides downstream from the initiator codon, -2,025/+6) was studied. The sequence exhibited a significant promoter activity (P < 0.05) and had some cis-acting elements, notably, a core promoter was identified in nucleotides -94 to +6. Additionally, DNA pull-down assay showed that proteins interacted with chicken C/EBPZ promoter (-173/+6) in preadipocytes were implicated in transcription, post-transcriptional regulation and translation. In addition, KLF2 facilitated the activities of chicken C/EBPZ promoter (-2,025/+6, -1,409/+6, -793/+6, -485/+6, -173/+6, and -94/+6) in preadipocytes (P < 0.05). The expression levels of KLF2 and C/EBPZ in chicken abdominal adipose tissue were substantially associated (r = 0.5978278, P < 0.0001), and KLF2 increased C/EBPZ expression in vitro (P < 0.05). Additionally, chromatin immunoprecipitation (ChIP)-PCR analysis revealed that KLF2 has the ability to interact with the chicken C/EBPZ promoter regions at least at the positions -1,245/-1,048 and -571/-397. Mutation analysis showed that the CGCAGCGCCCG motif located in the chicken C/EBPZ promoter at positions -45 to -35 is involved in regulating transcription and facilitates trans activation by KLF2. These results provided some information of transcription control of C/EBPZ in chicken adipose tissue.
Collapse
Affiliation(s)
- Lingyu Gao
- Department of Histology and Embryology, Shihezi University School of Medicine, Shihezi, Xinjiang, 832000, PR China; Key Medical Laboratory of Stem Cell Transformation and Application, The First People's Hospital of Zhengzhou, Zhengzhou, Henan, 450000, PR China
| | - Yingjun Wang
- Department of Histology and Embryology, Shihezi University School of Medicine, Shihezi, Xinjiang, 832000, PR China
| | - Qin Gao
- Department of Histology and Embryology, Shihezi University School of Medicine, Shihezi, Xinjiang, 832000, PR China
| | - Yuechan Chen
- Department of Reproductive Medicine, The First Affiliated Hospital of Shihezi University, Shihezi, Xinjiang, 832000, PR China
| | - Zhiwei Zhang
- Department of Histology and Embryology, Shihezi University School of Medicine, Shihezi, Xinjiang, 832000, PR China.
| |
Collapse
|
2
|
Guo Y, Gifford DK. Modular combinatorial binding among human trans-acting factors reveals direct and indirect factor binding. BMC Genomics 2017; 18:45. [PMID: 28061806 PMCID: PMC5219757 DOI: 10.1186/s12864-016-3434-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2016] [Accepted: 12/19/2016] [Indexed: 11/25/2022] Open
Abstract
Background The combinatorial binding of trans-acting factors (TFs) to the DNA is critical to the spatial and temporal specificity of gene regulation. For certain regulatory regions, more than one regulatory module (set of TFs that bind together) are combined to achieve context-specific gene regulation. However, previous approaches are limited to either pairwise TF co-association analysis or assuming that only one module is used in each regulatory region. Results We present a new computational approach that models the modular organization of TF combinatorial binding. Our method learns compact and coherent regulatory modules from in vivo binding data using a topic model. We found that the binding of 115 TFs in K562 cells can be organized into 49 interpretable modules. Furthermore, we found that tens of thousands of regulatory regions use multiple modules, a structure that cannot be observed with previous hard clustering based methods. The modules discovered recapitulate many published protein-protein physical interactions, have consistent functional annotations of chromatin states, and uncover context specific co-binding such as gene proximal binding of NFY + FOS + SP and distal binding of NFY + FOS + USF. For certain TFs, the co-binding partners of direct binding (motif present) differs from those of indirect binding (motif absent); the distinct set of co-binding partners can predict whether the TF binds directly or indirectly with up to 95% accuracy. Joint analysis across two cell types reveals both cell-type-specific and shared regulatory modules. Conclusions Our results provide comprehensive cell-type-specific combinatorial binding maps and suggest a modular organization of combinatorial binding. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3434-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yuchun Guo
- MIT, Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, 02139, USA
| | - David K Gifford
- MIT, Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, 02139, USA.
| |
Collapse
|
3
|
Navarro C, Lopez FJ, Cano C, Garcia-Alcalde F, Blanco A. CisMiner: genome-wide in-silico cis-regulatory module prediction by fuzzy itemset mining. PLoS One 2014; 9:e108065. [PMID: 25268582 PMCID: PMC4182448 DOI: 10.1371/journal.pone.0108065] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2014] [Accepted: 08/25/2014] [Indexed: 01/18/2023] Open
Abstract
Eukaryotic gene control regions are known to be spread throughout non-coding DNA sequences which may appear distant from the gene promoter. Transcription factors are proteins that coordinately bind to these regions at transcription factor binding sites to regulate gene expression. Several tools allow to detect significant co-occurrences of closely located binding sites (cis-regulatory modules, CRMs). However, these tools present at least one of the following limitations: 1) scope limited to promoter or conserved regions of the genome; 2) do not allow to identify combinations involving more than two motifs; 3) require prior information about target motifs. In this work we present CisMiner, a novel methodology to detect putative CRMs by means of a fuzzy itemset mining approach able to operate at genome-wide scale. CisMiner allows to perform a blind search of CRMs without any prior information about target CRMs nor limitation in the number of motifs. CisMiner tackles the combinatorial complexity of genome-wide cis-regulatory module extraction using a natural representation of motif combinations as itemsets and applying the Top-Down Fuzzy Frequent- Pattern Tree algorithm to identify significant itemsets. Fuzzy technology allows CisMiner to better handle the imprecision and noise inherent to regulatory processes. Results obtained for a set of well-known binding sites in the S. cerevisiae genome show that our method yields highly reliable predictions. Furthermore, CisMiner was also applied to putative in-silico predicted transcription factor binding sites to identify significant combinations in S. cerevisiae and D. melanogaster, proving that our approach can be further applied genome-wide to more complex genomes. CisMiner is freely accesible at: http://genome2.ugr.es/cisminer. CisMiner can be queried for the results presented in this work and can also perform a customized cis-regulatory module prediction on a query set of transcription factor binding sites provided by the user.
Collapse
Affiliation(s)
- Carmen Navarro
- Department of Computer Science and AI, University of Granada, Granada, Spain
| | - Francisco J. Lopez
- Andalusian Human Genome Sequencing Centre (CASEGH), Medical Genome Project (MGP), Sevilla, Spain
| | - Carlos Cano
- Department of Computer Science and AI, University of Granada, Granada, Spain
| | | | - Armando Blanco
- Department of Computer Science and AI, University of Granada, Granada, Spain
| |
Collapse
|
4
|
Ginsberg JS, Zhan M, Diamantidis CJ, Woods C, Chen J, Fink JC. Patient-reported and actionable safety events in CKD. J Am Soc Nephrol 2014; 25:1564-73. [PMID: 24556352 DOI: 10.1681/asn.2013090921] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
Patients with CKD are at high risk for adverse safety events because of the complexity of their care and impaired renal function. Using data from our observational study of predialysis patients with CKD enrolled in the Safe Kidney Care study, we estimated the baseline frequency of adverse safety events and determined to what extent these events co-occur. We examined patient-reported adverse safety incidents (class I) and actionable safety findings (class II), conditioned on participant use of drugs that might cause such an event, and we used association analysis as a data-mining technique to identify co-occurrences of these events. Of 267 participants, 185 (69.3%) had at least one class I or II event, 102 (38.2%) had more than one event, and 48 (18.0%) had at least one event from both classes. The adjusted conditional rates of class I and class II events ranged from 2.9 to 57.6 per 100 patients and from 2.2 to 8.3 per 100 patients, respectively. The most common conditional class I and II events were patient-reported hypoglycemia and hyperkalemia (serum potassium>5.5 mEq/L), respectively. Reporting of hypoglycemia (in patients with diabetes) and falling or severe dizziness (in patients without diabetes) were most frequently paired with other adverse safety events. We conclude that adverse safety events are common and varied in CKD, with frequent association between disparate events. Further work is needed to define the CKD "safety phenotype" and identify patients at highest risk for adverse safety events.
Collapse
Affiliation(s)
| | - Min Zhan
- Department of Epidemiology and Public Health, School of Medicine, and
| | | | - Corinne Woods
- Pharmaceutical Health Services Research, School of Pharmacy, University of Maryland, Baltimore, Maryland
| | | | - Jeffrey C Fink
- Department of Medicine and Department of Epidemiology and Public Health, School of Medicine, and
| |
Collapse
|
5
|
Chen H, Lonardi S, Zheng J. Deciphering histone code of transcriptional regulation in malaria parasites by large-scale data mining. Comput Biol Chem 2014; 50:3-10. [PMID: 24581698 DOI: 10.1016/j.compbiolchem.2014.01.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/23/2013] [Indexed: 10/25/2022]
Abstract
Histone modifications play a major role in the regulation of gene expression. Accumulated evidence has shown that histone modifications mediate biological processes such as transcription cooperatively. This has led to the hypothesis of 'histone code' which suggests that combinations of different histone modifications correspond to unique chromatin states and have distinct functions. In this paper, we propose a framework based on association rule mining to discover the potential regulatory relations between histone modifications and gene expression in Plasmodium falciparum. Our approach can output rules with statistical significance. Some of the discovered rules are supported by literature of experimental results. Moreover, we have also discovered de novo rules which can guide further research in epigenetic regulation of transcription. Based on our association rules we build a model to predict gene expression, which outperforms a published Bayesian network model for gene expression prediction by histone modifications. The results of our study reveal mechanisms for histone modifications to regulate transcription in large-scale. Among our findings, the cooperation among histone modifications provides new evidence for the hypothesis of histone code. Furthermore, the rules output by our method can be used to predict the change of gene expression.
Collapse
Affiliation(s)
- Haifen Chen
- School of Computer Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Singapore.
| | - Stefano Lonardi
- Department of Computer Science and Engineering, University of California Riverside, 900 University Avenue, Riverside, CA 92521, USA.
| | - Jie Zheng
- School of Computer Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Singapore; Genome Institute of Singapore, A*STAR (Agency for Science, Technology, and Research), Biopolis, Singapore 138672, Singapore.
| |
Collapse
|
6
|
Teng L, He B, Gao P, Gao L, Tan K. Discover context-specific combinatorial transcription factor interactions by integrating diverse ChIP-Seq data sets. Nucleic Acids Res 2013; 42:e24. [PMID: 24217919 PMCID: PMC3936738 DOI: 10.1093/nar/gkt1105] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Combinatorial interactions among transcription factors (TFs) are critical for integrating diverse intrinsic and extrinsic signals, fine-tuning regulatory output and increasing the robustness and plasticity of regulatory systems. Current knowledge about combinatorial regulation is rather limited due to the lack of suitable experimental technologies and bioinformatics tools. The rapid accumulation of ChIP-Seq data has provided genome-wide occupancy maps for a large number of TFs and chromatin modification marks for identifying enhancers without knowing individual TF binding sites. Integration of the two data types has not been researched extensively, resulting in underused data and missed opportunities. We describe a novel method for discovering frequent combinatorial occupancy patterns by multiple TFs at enhancers. Our method is based on probabilistic item set mining and takes into account uncertainty in both types of ChIP-Seq data. By joint analysis of 108 TFs in four human cell types, we found that cell–type-specific interactions among TFs are abundant and that the majority of enhancers have flexible architecture. We show that several families of transposable elements disproportionally overlap with enhancers with combinatorial patterns, suggesting that these transposable element families play an important role in the evolution of combinatorial regulation.
Collapse
Affiliation(s)
- Li Teng
- Department of Internal Medicine, University of Iowa, Iowa City, IA 52242, USA, Interdisciplinary Graduate Program in Genetics, University of Iowa, Iowa City, IA 52242, USA and Department of Biomedical Engineering, University of Iowa, Iowa City, IA 52242, USA
| | | | | | | | | |
Collapse
|
7
|
Abstract
IMPORTANCE OF THE FIELD The ultimate goal of discovery screening is to have a fast and cost-effective strategy to meet the demands of producing high-content lead series with improved prospects for clinical success. While high-throughput screening (HTS) dominates the drug discovery landscape, other processes and technologies have emerged, including high-content screening and fragment-based design to provide alternatives that may be more suitable for certain targets. There has been a growing interest in reducing the number of compounds to be screened to prevent the escalation in the costs, time and resources associated with HTS campaigns. Library design plays a central role in these efforts. AREAS COVERED IN THIS REVIEW This opinion provides a survey of some recent developments in the diversity based library design process, but within a historical context. In particular, the importance of chemotyping and substructure analysis and the challenges presented by novel lead discovery technologies that require the design of libraries for screening are discussed. WHAT THE READER WILL GAIN Readers will gain an appreciation of some developments in the field of library design and the factors that are driving the development of new library design technologies; specifically, challenges presented for chemoinformatics with the novel screening technologies in diversity based screening and compound filtering. TAKE HOME MESSAGE Chemotyping and substrutural analysis are techniques that have been underutilized in the process of library design. However, they offer a direct way to evaluate libraries and have been successfully used to develop predictive methodologies. Tools are available to this end, but the full power of the approach has not been realized yet.
Collapse
Affiliation(s)
- Hugo O Villar
- Altoris, Inc., 7660-H Fay Ave #347, La Jolla, CA 92037, USA +1 858 259 8161 ; +1 858 764 5483 ;
| | | |
Collapse
|
8
|
Biomedical application of fuzzy association rules for identifying breast cancer biomarkers. Med Biol Eng Comput 2012; 50:981-90. [DOI: 10.1007/s11517-012-0914-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2011] [Accepted: 05/03/2012] [Indexed: 01/26/2023]
|
9
|
Hur J, Xiang Z, Feldman EL, He Y. Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network. BMC Immunol 2011; 12:49. [PMID: 21871085 PMCID: PMC3180695 DOI: 10.1186/1471-2172-12-49] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2011] [Accepted: 08/26/2011] [Indexed: 11/22/2022] Open
Abstract
Background Vaccine literature indexing is poorly performed in PubMed due to limited hierarchy of Medical Subject Headings (MeSH) annotation in the vaccine field. Vaccine Ontology (VO) is a community-based biomedical ontology that represents various vaccines and their relations. SciMiner is an in-house literature mining system that supports literature indexing and gene name tagging. We hypothesize that application of VO in SciMiner will aid vaccine literature indexing and mining of vaccine-gene interaction networks. As a test case, we have examined vaccines for Brucella, the causative agent of brucellosis in humans and animals. Results The VO-based SciMiner (VO-SciMiner) was developed to incorporate a total of 67 Brucella vaccine terms. A set of rules for term expansion of VO terms were learned from training data, consisting of 90 biomedical articles related to Brucella vaccine terms. VO-SciMiner demonstrated high recall (91%) and precision (99%) from testing a separate set of 100 manually selected biomedical articles. VO-SciMiner indexing exhibited superior performance in retrieving Brucella vaccine-related papers over that obtained with MeSH-based PubMed literature search. For example, a VO-SciMiner search of "live attenuated Brucella vaccine" returned 922 hits as of April 20, 2011, while a PubMed search of the same query resulted in only 74 hits. Using the abstracts of 14,947 Brucella-related papers, VO-SciMiner identified 140 Brucella genes associated with Brucella vaccines. These genes included known protective antigens, virulence factors, and genes closely related to Brucella vaccines. These VO-interacting Brucella genes were significantly over-represented in biological functional categories, including metabolite transport and metabolism, replication and repair, cell wall biogenesis, intracellular trafficking and secretion, posttranslational modification, and chaperones. Furthermore, a comprehensive interaction network of Brucella vaccines and genes were identified. The asserted and inferred VO hierarchies provide semantic support for inferring novel knowledge of association of vaccines and genes from the retrieved data. New hypotheses were generated based on this analysis approach. Conclusion VO-SciMiner can be used to improve the efficiency for PubMed searching in the vaccine domain.
Collapse
Affiliation(s)
- Junguk Hur
- Bioinformatics Program, University of Michigan, Ann Arbor, MI 48109, USA
| | | | | | | |
Collapse
|
10
|
Meng G, Mosig A, Vingron M. A computational evaluation of over-representation of regulatory motifs in the promoter regions of differentially expressed genes. BMC Bioinformatics 2010; 11:267. [PMID: 20487530 PMCID: PMC3098066 DOI: 10.1186/1471-2105-11-267] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2009] [Accepted: 05/20/2010] [Indexed: 12/28/2022] Open
Abstract
Background Observed co-expression of a group of genes is frequently attributed to co-regulation by shared transcription factors. This assumption has led to the hypothesis that promoters of co-expressed genes should share common regulatory motifs, which forms the basis for numerous computational tools that search for these motifs. While frequently explored for yeast, the validity of the underlying hypothesis has not been assessed systematically in mammals. This demonstrates the need for a systematic and quantitative evaluation to what degree co-expressed genes share over-represented motifs for mammals. Results We identified 33 experiments for human and mouse in the ArrayExpress Database where transcription factors were manipulated and which exhibited a significant number of differentially expressed genes. We checked for over-representation of transcription factor binding sites in up- or down-regulated genes using the over-representation analysis tool oPOSSUM. In 25 out of 33 experiments, this procedure identified the binding matrices of the affected transcription factors. We also carried out de novo prediction of regulatory motifs shared by differentially expressed genes. Again, the detected motifs shared significant similarity with the matrices of the affected transcription factors. Conclusions Our results support the claim that functional regulatory motifs are over-represented in sets of differentially expressed genes and that they can be detected with computational methods.
Collapse
Affiliation(s)
- Guofeng Meng
- CAS-MPG Partner Institute and Key Laboratory for Computational Biology, Shanghai Institutes for Biological Sciences, 320 Yue Yang Road, 200031, Shanghai, China.
| | | | | |
Collapse
|
11
|
Abstract
A crucial question in the field of gene regulation is whether the location at which a transcription factor binds influences its effectiveness or the mechanism by which it regulates transcription. Comprehensive transcription factor binding maps are needed to address these issues, and genome-wide mapping is now possible thanks to the technological advances of ChIP-chip and ChIP-seq. This Review discusses how recent genomic profiling of transcription factors gives insight into how binding specificity is achieved and what features of chromatin influence the ability of transcription factors to interact with the genome. It also suggests future experiments that may further our understanding of the causes and consequences of transcription factor-genome interactions.
Collapse
|
12
|
Nam H, Lee K, Lee D. Identification of temporal association rules from time-series microarray data sets. BMC Bioinformatics 2009; 10 Suppl 3:S6. [PMID: 19344482 PMCID: PMC2665054 DOI: 10.1186/1471-2105-10-s3-s6] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Background One of the most challenging problems in mining gene expression data is to identify how the expression of any particular gene affects the expression of other genes. To elucidate the relationships between genes, an association rule mining (ARM) method has been applied to microarray gene expression data. However, a conventional ARM method has a limit on extracting temporal dependencies between gene expressions, though the temporal information is indispensable to discover underlying regulation mechanisms in biological pathways. In this paper, we propose a novel method, referred to as temporal association rule mining (TARM), which can extract temporal dependencies among related genes. A temporal association rule has the form [gene A↑, gene B↓] → (7 min) [gene C↑], which represents that high expression level of gene A and significant repression of gene B followed by significant expression of gene C after 7 minutes. The proposed TARM method is tested with Saccharomyces cerevisiae cell cycle time-series microarray gene expression data set. Results In the parameter fitting phase of TARM, the fitted parameter set [threshold = ± 0.8, support ≥ 3 transactions, confidence ≥ 90%] with the best precision score for KEGG cell cycle pathway has been chosen for rule mining phase. With the fitted parameter set, numbers of temporal association rules with five transcriptional time delays (0, 7, 14, 21, 28 minutes) are extracted from gene expression data of 799 genes, which are pre-identified cell cycle relevant genes. From the extracted temporal association rules, associated genes, which play same role of biological processes within short transcriptional time delay and some temporal dependencies between genes with specific biological processes are identified. Conclusion In this work, we proposed TARM, which is an applied form of conventional ARM. TARM showed higher precision score than Dynamic Bayesian network and Bayesian network. Advantages of TARM are that it tells us the size of transcriptional time delay between associated genes, activation and inhibition relationship between genes, and sets of co-regulators.
Collapse
Affiliation(s)
- Hojung Nam
- Department of Bio and Brain Engineering, KAIST, 373-1 Guseong-dong, Yuseong-gu, Daejeon, Korea.
| | | | | |
Collapse
|