51
|
Shen X, Walsh B, Li JJ, Pang HX, Wang WJ, Tao SH. The correlations of the function and positional distribution of the cis-elements CArG around the TSS in the genes of Mus musculus. Genome 2009; 52:217-21. [PMID: 19234549 DOI: 10.1139/g08-117] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
While many studies of cis-elements CArG bound by serum response factor (SRF) are in progress, little is known about the positional distribution of the functional CArG elements around the transcription start site (TSS) of genes that they influence. We use a validated CArG data set to calculate the distance distribution of functional CArG elements around the TSS. Distances between adjacent CArGs were also analyzed. We compare these distributions with those derived using a control set of randomly selected CArGs (that were not experimentally validated for function). Our results show that most functional CArG elements (108 of 152, 71%) exist upstream of the annotated TSS, with copy number increasing as one moves closer to the TSS. Moreover, the average number of the CArG elements in the CArG-containing genes is significantly more than that in the control genes. Our study extends earlier bioinformatic analyses of functional CArG elements and provides an application of comparative sequence data to the identification of transcription factor binding sites.
Collapse
Affiliation(s)
- Xia Shen
- Bioinformatics Center, Northwest A&F University, 712100 Yangling, Shaanxi, China
| | | | | | | | | | | |
Collapse
|
52
|
Yokoyama KD, Ohler U, Wray GA. Measuring spatial preferences at fine-scale resolution identifies known and novel cis-regulatory element candidates and functional motif-pair relationships. Nucleic Acids Res 2009; 37:e92. [PMID: 19483094 PMCID: PMC2715254 DOI: 10.1093/nar/gkp423] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Transcriptional regulation is mediated by the collective binding of proteins called transcription factors to cis-regulatory elements. A handful of factors are known to function at particular distances from the transcription start site, although the extent to which this occurs is not well understood. Spatial dependencies can also exist between pairs of binding motifs, facilitating factor-pair interactions. We sought to determine to what extent spatial preferences measured at high-scale resolution could be utilized to predict cis-regulatory elements as well as motif-pairs binding interacting proteins. We introduce the ‘motif positional function’ model which predicts spatial biases using regression analysis, differentiating noise from true position-specific overrepresentation at single-nucleotide resolution. Our method predicts 48 consensus motifs exhibiting positional enrichment within human promoters, including fourteen motifs without known binding partners. We then extend the model to analyze distance preferences between pairs of motifs. We find that motif-pairs binding interacting factors often co-occur preferentially at multiple distances, with intervals between preferred distances often corresponding to the turn of the DNA double-helix. This offers a novel means by which to predict sequence elements with a collective role in gene regulation.
Collapse
Affiliation(s)
- Ken Daigoro Yokoyama
- Biology Department, Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708, USA
| | | | | |
Collapse
|
53
|
The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line. Nat Genet 2009; 41:553-62. [PMID: 19377474 DOI: 10.1038/ng.375] [Citation(s) in RCA: 352] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2008] [Accepted: 03/25/2009] [Indexed: 12/24/2022]
Abstract
Using deep sequencing (deepCAGE), the FANTOM4 study measured the genome-wide dynamics of transcription-start-site usage in the human monocytic cell line THP-1 throughout a time course of growth arrest and differentiation. Modeling the expression dynamics in terms of predicted cis-regulatory sites, we identified the key transcription regulators, their time-dependent activities and target genes. Systematic siRNA knockdown of 52 transcription factors confirmed the roles of individual factors in the regulatory network. Our results indicate that cellular states are constrained by complex networks involving both positive and negative regulatory interactions among substantial numbers of transcription factors and that no single transcription factor is both necessary and sufficient to drive the differentiation process.
Collapse
|
54
|
Severin J, Waterhouse AM, Kawaji H, Lassmann T, van Nimwegen E, Balwierz PJ, de Hoon MJ, Hume DA, Carninci P, Hayashizaki Y, Suzuki H, Daub CO, Forrest AR. FANTOM4 EdgeExpressDB: an integrated database of promoters, genes, microRNAs, expression dynamics and regulatory interactions. Genome Biol 2009; 10:R39. [PMID: 19374773 PMCID: PMC2688930 DOI: 10.1186/gb-2009-10-4-r39] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2009] [Revised: 03/09/2009] [Accepted: 04/19/2009] [Indexed: 11/28/2022] Open
Abstract
EdgeExpressDB is a novel database and set of interfaces for interpreting biological networks and comparing large high-throughput expression datasets. EdgeExpressDB is a novel database and set of interfaces for interpreting biological networks and comparing large high-throughput expression datasets that requires minimal development for new data types and search patterns. The FANTOM4 EdgeExpress database summarizes gene expression patterns in the context of alternative promoter structures and regulatory transcription factors and microRNAs using intuitive gene-centric and sub-network views. This is an important resource for gene regulation in acute myeloid leukemia, monocyte/macrophage differentiation and human transcriptional networks.
Collapse
Affiliation(s)
- Jessica Severin
- RIKEN Omics Science Center, RIKEN Yokohama Institute, 1-7-22 Suehiro-cho Tsurumi-ku Yokohama, Kanagawa, 230-0045 Japan.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
55
|
A census of human transcription factors: function, expression and evolution. Nat Rev Genet 2009; 10:252-63. [PMID: 19274049 DOI: 10.1038/nrg2538] [Citation(s) in RCA: 1095] [Impact Index Per Article: 73.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Transcription factors are key cellular components that control gene expression: their activities determine how cells function and respond to the environment. Currently, there is great interest in research into human transcriptional regulation. However, surprisingly little is known about these regulators themselves. For example, how many transcription factors does the human genome contain? How are they expressed in different tissues? Are they evolutionarily conserved? Here, we present an analysis of 1,391 manually curated sequence-specific DNA-binding transcription factors, their functions, genomic organization and evolutionary conservation. Much remains to be explored, but this study provides a solid foundation for future investigations to elucidate regulatory mechanisms underlying diverse mammalian biological processes.
Collapse
|
56
|
Valouev A, Johnson DS, Sundquist A, Medina C, Anton E, Batzoglou S, Myers RM, Sidow A. Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data. Nat Methods 2009; 5:829-34. [PMID: 19160518 PMCID: PMC2917543 DOI: 10.1038/nmeth.1246] [Citation(s) in RCA: 521] [Impact Index Per Article: 34.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Molecular interactions between protein complexes and DNA carry out essential gene regulatory functions. Uncovering such interactions by means of chromatin-immunoprecipitation coupled with massively parallel sequencing (ChIP-Seq) has recently become the focus of intense interest. We here introduce QuEST (Quantitative Enrichment of Sequence Tags), a powerful statistical framework based on the Kernel Density Estimation approach, which utilizes ChIP-Seq data to determine positions where protein complexes come into contact with DNA. Using QuEST, we discovered several thousand binding sites for the human transcription factors SRF, GABP and NRSF at an average resolution of about 20 base-pairs. MEME-based motif analyses on the QuEST-identified sequences revealed DNA binding by cofactors of SRF, providing evidence that cofactor binding specificity can be obtained from ChIP-Seq data. By combining QuEST analyses with gene ontology (GO) annotations and expression data, we illustrate how general functions of transcription factors can be inferred.
Collapse
Affiliation(s)
- Anton Valouev
- Department of Pathology, Stanford University Medical Center, 300 Pasteur Drive, Stanford, California 94305, USA
| | | | | | | | | | | | | | | |
Collapse
|
57
|
Pandey DP, Lappano R, Albanito L, Madeo A, Maggiolini M, Picard D. Estrogenic GPR30 signalling induces proliferation and migration of breast cancer cells through CTGF. EMBO J 2009; 28:523-32. [PMID: 19153601 DOI: 10.1038/emboj.2008.304] [Citation(s) in RCA: 257] [Impact Index Per Article: 17.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2008] [Accepted: 12/22/2008] [Indexed: 11/09/2022] Open
Abstract
The steroid hormone oestrogen can signal through several receptors and pathways. Although the transcriptional responses mediated by the nuclear oestrogen receptors (ER) have been extensively characterized, the changes in gene expression elicited by signalling through the membrane-associated ER GPR30 have not been studied. We show here for ER-negative human breast cancer cells that the activation of GPR30 signalling by oestrogen or by hydroxytamoxifen (OHT), an ER antagonist but GPR30 agonist, induces a transcription factor network, which resembles that induced by serum in fibroblasts. The most strongly induced gene, CTGF, appears to be a target of these transcription factors. We found that the secreted factor connective tissue growth factor (CTGF) not only contributes to promote proliferation but also mediates the GPR30-induced stimulation of cell migration. These results provide a framework for understanding the physiological and pathological functions of GPR30. As the activation of GPR30 by OHT also induces CTGF in fibroblasts from breast tumour biopsies, these pathways may be involved in promoting aggressive behaviour of breast tumours in response to endogenous oestrogens or to OHT being used for endocrine therapy.
Collapse
Affiliation(s)
- Deo Prakash Pandey
- Département de Biologie Cellulaire, Sciences III, Université de Genève, Genève, Switzerland
| | | | | | | | | | | |
Collapse
|
58
|
Ettwiller L, Budd A, Spitz F, Wittbrodt J. Analysis of mammalian gene batteries reveals both stable ancestral cores and highly dynamic regulatory sequences. Genome Biol 2008; 9:R172. [PMID: 19087242 PMCID: PMC2646276 DOI: 10.1186/gb-2008-9-12-r172] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2008] [Revised: 12/01/2008] [Accepted: 12/16/2008] [Indexed: 12/18/2022] Open
Abstract
Analysis of the evolutionary dynamics of target gene batteries controlled by 16 different transcription factors reveals stable ancestral cores and highly dynamic regulatory sequences Background Changes in gene regulation are suspected to comprise one of the driving forces for evolution. To address the extent of cis-regulatory changes and how they impact on gene regulatory networks across eukaryotes, we systematically analyzed the evolutionary dynamics of target gene batteries controlled by 16 different transcription factors. Results We found that gene batteries show variable conservation within vertebrates, with slow and fast evolving modules. Hence, while a key gene battery associated with the cell cycle is conserved throughout metazoans, the POU5F1 (Oct4) and SOX2 batteries in embryonic stem cells show strong conservation within mammals, with the striking exception of rodents. Within the genes composing a given gene battery, we could identify a conserved core that likely reflects the ancestral function of the corresponding transcription factor. Interestingly, we show that the association between a transcription factor and its target genes is conserved even when we exclude conserved sequence similarities of their promoter regions from our analysis. This supports the idea that turnover, either of the transcription factor binding site or its direct neighboring sequence, is a pervasive feature of proximal regulatory sequences. Conclusions Our study reveals the dynamics of evolutionary changes within metazoan gene networks, including both the composition of gene batteries and the architecture of target gene promoters. This variation provides the playground required for evolutionary innovation around conserved ancestral core functions.
Collapse
Affiliation(s)
- Laurence Ettwiller
- Developmental Biology Unit, EMBL-Heidelberg, Meyerhofstrasse 1, Heidelberg, 69117, Germany.
| | | | | | | |
Collapse
|
59
|
Páez-Valencia J, Valencia-Mayoral P, Sánchez-Gómez C, Contreras-Ramos A, Hernández-Lucas I, Martínez-Barajas E, Gamboa-deBuen A. Identification of Fructose-1,6-bisphosphate aldolase cytosolic class I as an NMH7 MADS domain associated protein. Biochem Biophys Res Commun 2008; 376:700-5. [DOI: 10.1016/j.bbrc.2008.09.064] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2008] [Accepted: 09/11/2008] [Indexed: 11/26/2022]
|
60
|
Holloway DT, Kon M, DeLisi C. In silico regulatory analysis for exploring human disease progression. Biol Direct 2008; 3:24. [PMID: 18564415 PMCID: PMC2464594 DOI: 10.1186/1745-6150-3-24] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2008] [Accepted: 06/18/2008] [Indexed: 12/24/2022] Open
Abstract
Background An important goal in bioinformatics is to unravel the network of transcription factors (TFs) and their targets. This is important in the human genome, where many TFs are involved in disease progression. Here, classification methods are applied to identify new targets for 152 transcriptional regulators using publicly-available targets as training examples. Three types of sequence information are used: composition, conservation, and overrepresentation. Results Starting with 8817 TF-target interactions we predict an additional 9333 targets for 152 TFs. Randomized classifiers make few predictions (~2/18660) indicating that our predictions for many TFs are significantly enriched for true targets. An enrichment score is calculated and used to filter new predictions. Two case-studies for the TFs OCT4 and WT1 illustrate the usefulness of our predictions: • Many predicted OCT4 targets fall into the Wnt-pathway. This is consistent with known biology as OCT4 is developmentally related and Wnt pathway plays a role in early development. • Beginning with 15 known targets, 354 predictions are made for WT1. WT1 has a role in formation of Wilms' tumor. Chromosomal regions previously implicated in Wilms' tumor by cytological evidence are statistically enriched in predicted WT1 targets. These findings may shed light on Wilms' tumor progression, suggesting that the tumor progresses either by loss of WT1 or by loss of regions harbouring its targets. • Targets of WT1 are statistically enriched for cancer related functions including metastasis and apoptosis. Among new targets are BAX and PDE4B, which may help mediate the established anti-apoptotic effects of WT1. • Of the thirteen TFs found which co-regulate genes with WT1 (p ≤ 0.02), 8 have been previously implicated in cancer. The regulatory-network for WT1 targets in genomic regions relevant to Wilms' tumor is provided. Conclusion We have assembled a set of features for the targets of human TFs and used them to develop classifiers for the determination of new regulatory targets. Many predicted targets are consistent with the known biology of their regulators, and new targets for the Wilms' tumor regulator, WT1, are proposed. We speculate that Wilms' tumor development is mediated by chromosomal rearrangements in the location of WT1 targets. Reviewers This article was reviewed by Trey Ideker, Vladimir A. Kuznetsov(nominated by Frank Eisenhaber), and Tzachi Pilpel.
Collapse
Affiliation(s)
- Dustin T Holloway
- Molecular Biology Cell Biology and Biochemistry Department, Boston University, 5 Cummington Street, Boston, USA
| | | | | |
Collapse
|
61
|
Affiliation(s)
- Joseph M. Miano
- Aab Cardiovascular Research Institute, University of Rochester School of Medicine & Dentistry, Rochester, New York
| |
Collapse
|
62
|
Lemberger T, Parkitna JR, Chai M, Schütz G, Engblom D. CREB has a context-dependent role in activity-regulated transcription and maintains neuronal cholesterol homeostasis. FASEB J 2008; 22:2872-9. [PMID: 18424767 DOI: 10.1096/fj.08-107888] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Induction of specific gene expression patterns in response to activity confers functional plasticity to neurons. A principal role in the regulation of these processes has been ascribed to the cAMP responsive element binding protein (CREB). Using genome-wide expression profiling in mice lacking CREB in the forebrain, accompanied by deletion of the cAMP responsive element modulator gene (CREM), we here show that the role of these proteins in activity-induced gene expression is surprisingly selective and highly context dependent. Thus, only a very restricted subset of activity-induced genes (i.e., Gadd45b or Nr4a2) requires these proteins for their induction in the hippocampus after kainic acid administration, while they are required for most of the cocaine-induced expression changes in the striatum. Interestingly, in the absence of CREB, CREM is able to rescue activity-regulated transcription, which strengthens the notion of overlapping functions of the two proteins. In addition, we show that cholesterol metabolism is dysregulated in the brains of mutant mice, as reflected coordinated expression changes in genes involved in cholesterol synthesis and neuronal accumulation of cholesterol. These findings provide novel insights into the role of CREB and CREM in stimulus-dependent transcription and neuronal homeostasis.
Collapse
|
63
|
Flanagin S, Nelson JD, Castner DG, Denisenko O, Bomsztyk K. Microplate-based chromatin immunoprecipitation method, Matrix ChIP: a platform to study signaling of complex genomic events. Nucleic Acids Res 2008; 36:e17. [PMID: 18203739 PMCID: PMC2241906 DOI: 10.1093/nar/gkn001] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The chromatin immunoprecipitation (ChIP) assay is a major tool in the study of genomic processes in vivo. This and other methods are revealing that control of gene expression, cell division and DNA repair involves multiple proteins and great number of their modifications. ChIP assay is traditionally done in test tubes limiting the ability to study signaling of the complex genomic events. To increase the throughput and to simplify the assay we have developed a microplate-based ChIP (Matrix ChIP) method, where all steps from immunoprecipitation to DNA purification are done in microplate wells without sample transfers. This platform has several important advantages over the tube-based assay including very simple sample handling, high throughput, improved sensitivity and reproducibility, and potential for automation. 96 ChIP measurements including PCR can be done by one researcher in one day. We illustrate the power of Matrix ChIP by parallel profiling 80 different chromatin and transcription time-course events along an inducible gene including transient recruitment of kinases.
Collapse
Affiliation(s)
- Steve Flanagin
- UW Medicine Lake Union, Department of Medicine, University of Washington, Seattle, WA 98109, USA
| | | | | | | | | |
Collapse
|
64
|
Eilon T, Groner B, Barash I. Tumors caused by overexpression and forced activation of Stat5 in mammary epithelial cells of transgenic mice are parity-dependent and developed in aged, postestropausal females. Int J Cancer 2007; 121:1892-1902. [PMID: 17640063 DOI: 10.1002/ijc.22954] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
In transgenic mice overexpressing Stat5 or a constitutively activated Stat5 variant (STAT5ca), we show for the first time that parity is required for the development of tumors in postestropausal females. Tumors were detected in glands of multiparous transgenic female mice after latency period of 14 months, but rarely in their age-matched virgin (AMV) counterparts. This period was not affected by distinguishable tumor pathologies and was not dependent upon transgenic Stat5 variant. To associate Stat5 deregulation, parity and the postestropausal tumor occurrence with mammary cancer formation, the activities of endogenous and transgenic Stat5 were measured in the glands of aged multiparous and AMV females. No differences in phosphorylated Stat5 (pStat5) levels were found between the 2 cohorts. However, promoter sequences comprising the Stat5 binding sites from the cyclin D1 or the bcl-x genes associate differentially with acetylated histone H4 in aged multiparous and AMV STAT5ca transgenic females. Individual epithelial cells varied greatly with respect to the presence of nuclear pStat5. A small subset of epithelial cells, in which pStat5 and cyclin D1 were co-expressed, was exclusively present in the multiparous glands. Changes in chromatin structure might persist past the reproductive life time of the multiparous mice and contribute to the transcription of the cyclin D1 gene by activated Stat5. This may cause the detectable expression of cyclin D1 and add to the process of tumorigenesis.
Collapse
Affiliation(s)
- Tali Eilon
- Institute of Animal Science, ARO, The Volcani Center, Bet-Dagan, Israel
| | - Bernd Groner
- Georg Speyer Haus, Institute for Biomedical Research, Frankfurt/M, Germany
| | - Itamar Barash
- Institute of Animal Science, ARO, The Volcani Center, Bet-Dagan, Israel
| |
Collapse
|
65
|
Niu Z, Li A, Zhang SX, Schwartz RJ. Serum response factor micromanaging cardiogenesis. Curr Opin Cell Biol 2007; 19:618-27. [PMID: 18023168 DOI: 10.1016/j.ceb.2007.09.013] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2007] [Accepted: 09/27/2007] [Indexed: 12/19/2022]
Abstract
Serum response factor (SRF), a cardiac-enriched transcription factor, is required for the appearance of beating sarcomeres in the heart. SRF may also direct the expression of microRNAs (miRs) that inhibit the expression of cardiac regulatory factors. The recent knockout of miR-1-2, an SRF gene target, showed defective heart development, caused in part by the induction of GATA6, Irx4/5, and Hand2, that may alter cardiac morphogenesis, channel activity, and cell cycling. SRF and cofactors play an obligatory role in cardiogenesis, as major determinants of myocyte differentiation not only by regulating the biogenesis of muscle contractile proteins but also by driving the expression of silencer miRNA.
Collapse
Affiliation(s)
- Zhivy Niu
- Program in Cardiovascular Sciences, Baylor College of Medicine, Houston, TX 77030, USA
| | | | | | | |
Collapse
|
66
|
Abstract
Although the number of protein-encoding human genes is more limited than many had estimated, the human transcript repertoire is much more diverse than anticipated. In part, transcript diversity is generated through the use of alternative promoters and alternate splicing. In addition, based on discoveries using technologies such as full-length cDNA libraries and whole genome tiling microarrays, it is now likely that non-protein-encoding transcripts comprise a substantial fraction of the human RNA population. Much attention is currently focused on understanding the role of alternative promoters in generating transcript diversity, both for non-protein-encoding (ncRNAs) and protein-encoding RNAs.
Collapse
|