401
|
Gross KL, Oakley RH, Scoltock AB, Jewell CM, Cidlowski JA. Glucocorticoid receptor alpha isoform-selective regulation of antiapoptotic genes in osteosarcoma cells: a new mechanism for glucocorticoid resistance. Mol Endocrinol 2011; 25:1087-99. [PMID: 21527497 DOI: 10.1210/me.2010-0051] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Glucocorticoids regulate a variety of physiological processes and are commonly used to treat disorders of inflammation, autoimmune diseases, and cancer. Glucocorticoid action is predominantly mediated through the classic glucocorticoid receptor (GR)α isoform. Recent data suggest that the mature GRα mRNA is translated into multiple N-terminal isoforms that have distinct biochemical properties and gene regulatory profiles. Interestingly, osteosarcoma cells stably expressing the GRα-D translational isoform are unique in that they are resistant to glucocorticoid-induced apoptosis. In this study, we investigate whether GRα isoform-specific differences in the regulation of antiapoptotic genes contribute to this resistant phenotype. We now show that GRα-D, unlike the other receptor isoforms, does not inhibit the activity of a nuclear factor κB (NF-κB)-responsive reporter gene and does not efficiently repress either the transcription or protein production of the antiapoptotic genes Bcl-xL, cellular inhibitor of apoptosis protein 1, and survivin. The inability of GRα-D to down-regulate the expression of these genes appears to be associated with a diminished interaction between GRα-D and NF-κB that is observed in cells, but not in vitro, and likely reflects the sequestration of GRα-D in the nucleus. Deletion of the GRα N-terminal amino acids 98-335 also results in a nuclear resident GR, which fails to interact with NF-κB in cells and promote apoptosis in response to glucocorticoids. These data suggest that the N-terminal translational isoforms of GRα selectively regulate antiapoptotic genes and that the GRα-D isoform may contribute to the resistance of certain cancer cells to glucocorticoid-induced apoptosis.
Collapse
Affiliation(s)
- Katherine L Gross
- Molecular Endocrinology Group, Laboratory of Signal Transduction, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina 27709, USA
| | | | | | | | | |
Collapse
|
402
|
A ChIP-Seq benchmark shows that sequence conservation mainly improves detection of strong transcription factor binding sites. PLoS One 2011; 6:e18430. [PMID: 21533218 PMCID: PMC3077367 DOI: 10.1371/journal.pone.0018430] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2010] [Accepted: 03/03/2011] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Transcription factors are important controllers of gene expression and mapping transcription factor binding sites (TFBS) is key to inferring transcription factor regulatory networks. Several methods for predicting TFBS exist, but there are no standard genome-wide datasets on which to assess the performance of these prediction methods. Also, it is believed that information about sequence conservation across different genomes can generally improve accuracy of motif-based predictors, but it is not clear under what circumstances use of conservation is most beneficial. RESULTS Here we use published ChIP-seq data and an improved peak detection method to create comprehensive benchmark datasets for prediction methods which use known descriptors or binding motifs to detect TFBS in genomic sequences. We use this benchmark to assess the performance of five different prediction methods and find that the methods that use information about sequence conservation generally perform better than simpler motif-scanning methods. The difference is greater on high-affinity peaks and when using short and information-poor motifs. However, if the motifs are specific and information-rich, we find that simple motif-scanning methods can perform better than conservation-based methods. CONCLUSIONS Our benchmark provides a comprehensive test that can be used to rank the relative performance of transcription factor binding site prediction methods. Moreover, our results show that, contrary to previous reports, sequence conservation is better suited for predicting strong than weak transcription factor binding sites.
Collapse
|
403
|
Irie T, Park SJ, Yamashita R, Seki M, Yada T, Sugano S, Nakai K, Suzuki Y. Predicting promoter activities of primary human DNA sequences. Nucleic Acids Res 2011; 39:e75. [PMID: 21486745 PMCID: PMC3113590 DOI: 10.1093/nar/gkr173] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
We developed a computer program that can predict the intrinsic promoter activities of primary human DNA sequences. We observed promoter activity using a quantitative luciferase assay and generated a prediction model using multiple linear regression. Our program achieved a prediction accuracy correlation coefficient of 0.87 between the predicted and observed promoter activities. We evaluated the prediction accuracy of the program using massive sequencing analysis of transcriptional start sites in vivo. We found that it is still difficult to predict transcript levels in a strictly quantitative manner in vivo; however, it was possible to select active promoters in a given cell from the other silent promoters. Using this program, we analyzed the transcriptional landscape of the entire human genome. We demonstrate that many human genomic regions have potential promoter activity, and the expression of some previously uncharacterized putatively non-protein-coding transcripts can be explained by our prediction model. Furthermore, we found that nucleosomes occasionally formed open chromatin structures with RNA polymerase II recruitment where the program predicted significant promoter activities, although no transcripts were observed.
Collapse
Affiliation(s)
- Takuma Irie
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, the University of Tokyo, 5-1-5 Kashiwanoha, Kashiwashi, Chiba 277-8562, Japan
| | | | | | | | | | | | | | | |
Collapse
|
404
|
Slattery M, Ma L, Négre N, White KP, Mann RS. Genome-wide tissue-specific occupancy of the Hox protein Ultrabithorax and Hox cofactor Homothorax in Drosophila. PLoS One 2011; 6:e14686. [PMID: 21483663 PMCID: PMC3071676 DOI: 10.1371/journal.pone.0014686] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2011] [Accepted: 01/21/2011] [Indexed: 11/18/2022] Open
Abstract
The Hox genes are responsible for generating morphological diversity along the
anterior-posterior axis during animal development. The
Drosophila Hox gene Ultrabithorax
(Ubx), for example, is required for specifying the identity
of the third thoracic (T3) segment of the adult, which includes the dorsal
haltere, an appendage required for flight, and the ventral T3 leg.
Ubx mutants show homeotic transformations of the T3 leg
towards the identity of the T2 leg and the haltere towards the wing. All Hox
genes, including Ubx, encode homeodomain containing
transcription factors, raising the question of what target genes
Ubx regulates to generate these adult structures. To
address this question, we carried out whole genome ChIP-chip studies to identify
all of the Ubx bound regions in the haltere and T3 leg imaginal discs, which are
the precursors to these adult structures. In addition, we used ChIP-chip to
identify the sites bound by the Hox cofactor, Homothorax (Hth). In contrast to
previous ChIP-chip studies carried out in Drosophila embryos,
these binding studies reveal that there is a remarkable amount of tissue- and
transcription factor-specific binding. Analyses of the putative target genes
bound and regulated by these factors suggest that Ubx regulates many downstream
transcription factors and developmental pathways in the haltere and T3 leg.
Finally, we discovered additional DNA sequence motifs that in some cases are
specific for individual data sets, arguing that Ubx and/or Hth work together
with many regionally expressed transcription factors to execute their functions.
Together, these data provide the first whole-genome analysis of the binding
sites and target genes regulated by Ubx to specify the morphologies of the adult
T3 segment of the fly.
Collapse
Affiliation(s)
- Matthew Slattery
- Department of Biochemistry and Molecular
Biophysics, Columbia University, New York, New York, United States of
America
| | - Lijia Ma
- Department of Human Genetics, Department of
Ecology and Evolution, Institute for Genomics and Systems Biology, University of
Chicago, Chicago, Illinois, United States of America
| | - Nicolas Négre
- Department of Human Genetics, Department of
Ecology and Evolution, Institute for Genomics and Systems Biology, University of
Chicago, Chicago, Illinois, United States of America
| | - Kevin P. White
- Department of Human Genetics, Department of
Ecology and Evolution, Institute for Genomics and Systems Biology, University of
Chicago, Chicago, Illinois, United States of America
| | - Richard S. Mann
- Department of Biochemistry and Molecular
Biophysics, Columbia University, New York, New York, United States of
America
- * E-mail:
| |
Collapse
|
405
|
Ito Y, Ahmad A, Kewley E, Mason RJ. Hypoxia-inducible factor regulates expression of surfactant protein in alveolar type II cells in vitro. Am J Respir Cell Mol Biol 2011; 45:938-45. [PMID: 21454802 DOI: 10.1165/rcmb.2011-0052oc] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Alveolar type II (ATII) cells cultured at an air-liquid (A/L) interface maintain differentiation, but they lose these properties when they are submerged. Others showed that an oxygen tension gradient develops in the culture medium as ATII cells consume oxygen. Therefore, we wondered whether hypoxia inducible factor (HIF) signaling could explain differences in the phenotypes of ATII cells cultured under A/L interface or submerged conditions. ATII cells were isolated from male Sprague-Dawley rats and cultured on inserts coated with a mixture of rat-tail collagen and Matrigel, in medium including 5% rat serum and 10 ng/ml keratinocyte growth factor, with their apical surfaces either exposed to air or submerged. The A/L interface condition maintained the expression of surfactant proteins, whereas that expression was down-regulated under the submerged condition, and the effect was rapid and reversible. Under submerged conditions, there was an increase in HIF1α and HIF2α in nuclear extracts, mRNA levels of HIF inducible genes, vascular endothelial growth factor, glucose transporter-1 (GLUT1), and the protein level of pyruvate dehydrogenase kinase isozyme-1. The expression of surfactant proteins was suppressed and GLUT1 mRNA levels were induced when cells were cultured with 1 mM dimethyloxalyl glycine. The expression of surfactant proteins was restored under submerged conditions with supplemented 60% oxygen. HIF signaling and oxygen tension at the surface of cells appears to be important in regulating the phenotype of rat ATII cells.
Collapse
Affiliation(s)
- Yoko Ito
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA.
| | | | | | | |
Collapse
|
406
|
Genome-wide promoter DNA methylation dynamics of human hematopoietic progenitor cells during differentiation and aging. Blood 2011; 117:e182-9. [PMID: 21427290 DOI: 10.1182/blood-2011-01-331926] [Citation(s) in RCA: 158] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
DNA methylation plays an important role in the self-renewal of hematopoietic stem cells and in the commitment to the lymphoid or myeloid lineages. Using purified CD34⁺ hematopoietic progenitor cells and differentiated myeloid cell populations from the same human samples, we obtained detailed methylation profiles at distinct stages of hematopoiesis. We identified a defined set of differentiation-related genes that are methylated in CD34⁺ hematopoietic progenitor cells but show pronounced DNA hypomethylation in monocytes and in granulocytes. In addition, by comparing hematopoietic progenitor cells from umbilical cord blood to hematopoietic progenitor cells from peripheral blood of adult donors we were also able to analyze age-related methylation changes in CD34⁺ cells. Interestingly, the methylation changes observed in older progenitor cells showed a bimodal pattern with hypomethylation of differentiation-associated genes and de novo methylation events resembling epigenetic mutations. Our results thus provide detailed insight into the methylation dynamics during differentiation and suggest that epigenetic changes contribute to hematopoietic progenitor cell aging.
Collapse
|
407
|
Hagemann S, Heil O, Lyko F, Brueckner B. Azacytidine and decitabine induce gene-specific and non-random DNA demethylation in human cancer cell lines. PLoS One 2011; 6:e17388. [PMID: 21408221 PMCID: PMC3049766 DOI: 10.1371/journal.pone.0017388] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2010] [Accepted: 02/01/2011] [Indexed: 12/31/2022] Open
Abstract
The DNA methyltransferase inhibitors azacytidine and decitabine represent archetypal drugs for epigenetic cancer therapy. To characterize the demethylating activity of azacytidine and decitabine we treated colon cancer and leukemic cells with both drugs and used array-based DNA methylation analysis of more than 14,000 gene promoters. Additionally, drug-induced demethylation was compared to methylation patterns of isogenic colon cancer cells lacking both DNA methyltransferase 1 (DNMT1) and DNMT3B. We show that drug-induced demethylation patterns are highly specific, non-random and reproducible, indicating targeted remethylation of specific loci after replication. Correspondingly, we found that CG dinucleotides within CG islands became preferentially remethylated, indicating a role for DNA sequence context. We also identified a subset of genes that were never demethylated by drug treatment, either in colon cancer or in leukemic cell lines. These demethylation-resistant genes were enriched for Polycomb Repressive Complex 2 components in embryonic stem cells and for transcription factor binding motifs not present in demethylated genes. Our results provide detailed insights into the DNA methylation patterns induced by azacytidine and decitabine and suggest the involvement of complex regulatory mechanisms in drug-induced DNA demethylation.
Collapse
Affiliation(s)
- Sabine Hagemann
- Division of Epigenetics, Deutsches Krebsforschungszentrum, Heidelberg, Germany
| | - Oliver Heil
- Genomics and Proteomics Core Facility, Deutsches Krebsforschungszentrum, Heidelberg, Germany
| | - Frank Lyko
- Division of Epigenetics, Deutsches Krebsforschungszentrum, Heidelberg, Germany
| | - Bodo Brueckner
- Division of Epigenetics, Deutsches Krebsforschungszentrum, Heidelberg, Germany
| |
Collapse
|
408
|
Perumal NB, Kaplan MH. Regulating Il9 transcription in T helper cells. Trends Immunol 2011; 32:146-50. [PMID: 21371941 DOI: 10.1016/j.it.2011.01.006] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2010] [Revised: 01/25/2011] [Accepted: 01/27/2011] [Indexed: 12/22/2022]
Abstract
T helper (Th) cells are crucial for the development of immunity to infections and inflammatory disease. The acquisition of specific cytokine-secreting profiles, primed by the cytokine microenvironment, is required for effector function of Th cells. The most recent addition to the growing list of effector subsets are Th9 cells that secrete IL-9. In this review, we propose a model for the transcriptional regulation of the Il9 gene in IL-9-expressing T cells and the relatedness of this subset to other Th phenotypes. We suggest that transcription factors restricted to certain Th subsets and common among several subsets might play a role in the plasticity of Th9 cells.
Collapse
Affiliation(s)
- Narayanan B Perumal
- School of Informatics, Indiana University-Purdue University Indianapolis, IN 46202, USA
| | | |
Collapse
|
409
|
De Masi F, Grove CA, Vedenko A, Alibés A, Gisselbrecht SS, Serrano L, Bulyk ML, Walhout AJM. Using a structural and logics systems approach to infer bHLH-DNA binding specificity determinants. Nucleic Acids Res 2011; 39:4553-63. [PMID: 21335608 PMCID: PMC3113581 DOI: 10.1093/nar/gkr070] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Numerous efforts are underway to determine gene regulatory networks that describe physical relationships between transcription factors (TFs) and their target DNA sequences. Members of paralogous TF families typically recognize similar DNA sequences. Knowledge of the molecular determinants of protein–DNA recognition by paralogous TFs is of central importance for understanding how small differences in DNA specificities can dictate target gene selection. Previously, we determined the in vitro DNA binding specificities of 19 Caenorhabditis elegans basic helix-loop-helix (bHLH) dimers using protein binding microarrays. These TFs bind E-box (CANNTG) and E-box-like sequences. Here, we combine these data with logics, bHLH–DNA co-crystal structures and computational modeling to infer which bHLH monomer can interact with which CAN E-box half-site and we identify a critical residue in the protein that dictates this specificity. Validation experiments using mutant bHLH proteins provide support for our inferences. Our study provides insights into the mechanisms of DNA recognition by bHLH dimers as well as a blueprint for system-level studies of the DNA binding determinants of other TF families in different model organisms and humans.
Collapse
Affiliation(s)
- Federico De Masi
- Department of Medicine, Division of Genetics, Brigham & Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | | | | | | | | | | | | | | |
Collapse
|
410
|
Molineris I, Grassi E, Ala U, Di Cunto F, Provero P. Evolution of promoter affinity for transcription factors in the human lineage. Mol Biol Evol 2011; 28:2173-83. [PMID: 21335606 DOI: 10.1093/molbev/msr027] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Changes in gene regulation are believed to play an important role in the evolution of animals. It has been suggested that changes in cis-regulatory regions are responsible for many or most of the anatomical and behavioral differences between humans and apes. However, the study of the evolution of cis-regulatory regions is made problematic by the degeneracy of transcription factor (TF) binding sites and the shuffling of their positions. In this work, we use the predicted total affinity of a promoter for a large collection of TFs as the basis for studying the evolution of cis-regulatory regions in mammals. We introduce the human specificity of a promoter, measuring the divergence between the affinity profile of a human promoter and its orthologous promoters in other mammals. The promoters of genes involved in functional categories such as neural processes and signal transduction, among others, have higher human specificity compared with the rest of the genome. Clustering of the human-specific affinities (HSAs) of neural genes reveals patterns of promoter evolution associated with functional categories such as synaptic transmission and brain development and to diseases such as bipolar disorder and autism.
Collapse
Affiliation(s)
- Ivan Molineris
- Department of Genetics, Biology and Biochemistry, Molecular Biotechnology Center, University of Turin, Turin, Italy
| | | | | | | | | |
Collapse
|
411
|
An integrated pipeline for the genome-wide analysis of transcription factor binding sites from ChIP-Seq. PLoS One 2011; 6:e16432. [PMID: 21358819 PMCID: PMC3040171 DOI: 10.1371/journal.pone.0016432] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2010] [Accepted: 12/21/2010] [Indexed: 11/19/2022] Open
Abstract
ChIP-Seq has become the standard method for genome-wide profiling DNA association of transcription factors. To simplify analyzing and interpreting ChIP-Seq data, which typically involves using multiple applications, we describe an integrated, open source, R-based analysis pipeline. The pipeline addresses data input, peak detection, sequence and motif analysis, visualization, and data export, and can readily be extended via other R and Bioconductor packages. Using a standard multicore computer, it can be used with datasets consisting of tens of thousands of enriched regions. We demonstrate its effectiveness on published human ChIP-Seq datasets for FOXA1, ER, CTCF and STAT1, where it detected co-occurring motifs that were consistent with the literature but not detected by other methods. Our pipeline provides the first complete set of Bioconductor tools for sequence and motif analysis of ChIP-Seq and ChIP-chip data.
Collapse
|
412
|
Sex hormones in autism: androgens and estrogens differentially and reciprocally regulate RORA, a novel candidate gene for autism. PLoS One 2011; 6:e17116. [PMID: 21359227 PMCID: PMC3040206 DOI: 10.1371/journal.pone.0017116] [Citation(s) in RCA: 147] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2010] [Accepted: 01/21/2011] [Indexed: 12/25/2022] Open
Abstract
Autism, a pervasive neurodevelopmental disorder manifested by deficits in social behavior and interpersonal communication, and by stereotyped, repetitive behaviors, is inexplicably biased towards males by a ratio of ∼4∶1, with no clear understanding of whether or how the sex hormones may play a role in autism susceptibility. Here, we show that male and female hormones differentially regulate the expression of a novel autism candidate gene, retinoic acid-related orphan receptor-alpha (RORA) in a neuronal cell line, SH-SY5Y. In addition, we demonstrate that RORA transcriptionally regulates aromatase, an enzyme that converts testosterone to estrogen. We further show that aromatase protein is significantly reduced in the frontal cortex of autistic subjects relative to sex- and age-matched controls, and is strongly correlated with RORA protein levels in the brain. These results indicate that RORA has the potential to be under both negative and positive feedback regulation by male and female hormones, respectively, through one of its transcriptional targets, aromatase, and further suggest a mechanism for introducing sex bias in autism.
Collapse
|
413
|
Niittymäki I, Tuupanen S, Li Y, Järvinen H, Mecklin JP, Tomlinson IPM, Houlston RS, Karhu A, Aaltonen LA. Systematic search for enhancer elements and somatic allelic imbalance at seven low-penetrance colorectal cancer predisposition loci. BMC MEDICAL GENETICS 2011; 12:23. [PMID: 21314996 PMCID: PMC3045878 DOI: 10.1186/1471-2350-12-23] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/26/2010] [Accepted: 02/14/2011] [Indexed: 01/20/2023]
Abstract
BACKGROUND Common single-nucleotide polymorphisms (SNPs) in ten chromosomal loci have been shown to predispose to colorectal cancer (CRC) in genome-wide association studies. A plausible biological mechanism of CRC susceptibility associated with genetic variation has so far only been proposed for three loci, each pointing to variants that affect gene expression through distant regulatory elements. In this study, we aimed to gain insight into the molecular basis of seven low-penetrance CRC loci tagged by rs4779584 at 15q13, rs10795668 at 10p14, rs3802842 at 11q23, rs4444235 at 14q22, rs9929218 at 16q22, rs10411210 at 19q13, and rs961253 at 20p12. METHODS Possible somatic gain of the risk allele or loss of the protective allele was studied by analyzing allelic imbalance in tumour and corresponding normal tissue samples of heterozygous patients. Functional variants were searched from in silico predicted enhancer elements locating inside the CRC-associating linkage-disequilibrium regions. RESULTS No allelic imbalance targeting the SNPs was observed at any of the seven loci. Altogether, 12 SNPs that were predicted to disrupt potential transcription factor binding sequences were genotyped in the same population-based case-control series as the seven tagging SNPs originally. None showed association with CRC. CONCLUSIONS The results of the allelic imbalance analysis suggest that the seven CRC risk variants are not somatically selected for in the neoplastic progression. The bioinformatic approach was unable to pinpoint cancer-causing variants at any of the seven loci. While it is possible that many of the predisposition loci for CRC are involved in control of gene expression by targeting transcription factor binding sites, also other possibilities, such as regulatory RNAs, should be considered.
Collapse
Affiliation(s)
- Iina Niittymäki
- Department of Medical Genetics, Genome-Scale Biology Research Program, Biomedicum Helsinki, University of Helsinki, Helsinki, Finland
| | | | | | | | | | | | | | | | | |
Collapse
|
414
|
Delay in feedback repression by cryptochrome 1 is required for circadian clock function. Cell 2011; 144:268-81. [PMID: 21236481 DOI: 10.1016/j.cell.2010.12.019] [Citation(s) in RCA: 187] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2010] [Revised: 08/21/2010] [Accepted: 12/10/2010] [Indexed: 12/31/2022]
Abstract
Direct evidence for the requirement of delay in feedback repression in the mammalian circadian clock has been elusive. Cryptochrome 1 (Cry1), an essential clock component, displays evening-time expression and serves as a strong repressor at morning-time elements (E box/E' box). In this study, we reveal that a combination of day-time elements (D box) within the Cry1-proximal promoter and night-time elements (RREs) within its intronic enhancer gives rise to evening-time expression. A synthetic composite promoter produced evening-time expression, which was further recapitulated by a simple phase-vector model. Of note, coordination of day-time with night-time elements can modulate the extent of phase delay. A genetic complementation assay in Cry1(-/-):Cry2(-/-) cells revealed that substantial delay of Cry1 expression is required to restore circadian rhythmicity, and its prolonged delay slows circadian oscillation. Taken together, our data suggest that phase delay in Cry1 transcription is required for mammalian clock function.
Collapse
|
415
|
Keilwagen J, Grau J, Paponov IA, Posch S, Strickert M, Grosse I. De-novo discovery of differentially abundant transcription factor binding sites including their positional preference. PLoS Comput Biol 2011; 7:e1001070. [PMID: 21347314 PMCID: PMC3037384 DOI: 10.1371/journal.pcbi.1001070] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2010] [Accepted: 12/28/2010] [Indexed: 11/18/2022] Open
Abstract
Transcription factors are a main component of gene regulation as they activate or repress gene expression by binding to specific binding sites in promoters. The de-novo discovery of transcription factor binding sites in target regions obtained by wet-lab experiments is a challenging problem in computational biology, which has not been fully solved yet. Here, we present a de-novo motif discovery tool called Dispom for finding differentially abundant transcription factor binding sites that models existing positional preferences of binding sites and adjusts the length of the motif in the learning process. Evaluating Dispom, we find that its prediction performance is superior to existing tools for de-novo motif discovery for 18 benchmark data sets with planted binding sites, and for a metazoan compendium based on experimental data from micro-array, ChIP-chip, ChIP-DSL, and DamID as well as Gene Ontology data. Finally, we apply Dispom to find binding sites differentially abundant in promoters of auxin-responsive genes extracted from Arabidopsis thaliana microarray data, and we find a motif that can be interpreted as a refined auxin responsive element predominately positioned in the 250-bp region upstream of the transcription start site. Using an independent data set of auxin-responsive genes, we find in genome-wide predictions that the refined motif is more specific for auxin-responsive genes than the canonical auxin-responsive element. In general, Dispom can be used to find differentially abundant motifs in sequences of any origin. However, the positional distribution learned by Dispom is especially beneficial if all sequences are aligned to some anchor point like the transcription start site in case of promoter sequences. We demonstrate that the combination of searching for differentially abundant motifs and inferring a position distribution from the data is beneficial for de-novo motif discovery. Hence, we make the tool freely available as a component of the open-source Java framework Jstacs and as a stand-alone application at http://www.jstacs.de/index.php/Dispom. Binding of transcription factors to promoters of genes, and subsequent enhancement or repression of transcription, is one of the main steps of transcriptional gene regulation. Direct or indirect wet-lab experiments allow the identification of approximate regions potentially bound or regulated by a transcription factor. Subsequently, de-novo motif discovery tools can be used for detecting the precise positions of binding sites. Many traditional tools focus on motifs over-represented in the target regions, which often turn out to be similarly over-represented in the entire genome. In contrast, several recent tools focus on differentially abundant motifs in target regions compared to a control set. As binding sites are often located at some preferred distance to the transcription start site, it is favorable to include this information into de-novo motif discovery. Here, we present Dispom a novel approach for learning differentially abundant motifs and their positional preferences simultaneously, which predicts binding sites with increased accuracy compared to many popular de-novo motif discovery tools. When applying Dispom to promoters of auxin-responsive genes of Arabidopsis thaliana, we find a binding motif slightly different from the canonical auxin-response element, which exhibits a strong positional preference and which is considerably more specific to auxin-responsive genes.
Collapse
Affiliation(s)
- Jens Keilwagen
- Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany.
| | | | | | | | | | | |
Collapse
|
416
|
A systems approach to analyze transcription factors in mammalian cells. Methods 2011; 53:151-62. [DOI: 10.1016/j.ymeth.2010.08.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2010] [Revised: 08/06/2010] [Accepted: 08/06/2010] [Indexed: 01/09/2023] Open
|
417
|
Tretyakov K, Laur S, Vilo J. G = MAT: linking transcription factor expression and DNA binding data. PLoS One 2011; 6:e14559. [PMID: 21297945 PMCID: PMC3031503 DOI: 10.1371/journal.pone.0014559] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2010] [Accepted: 12/03/2010] [Indexed: 12/02/2022] Open
Abstract
Transcription factors are proteins that bind to motifs on the DNA and thus affect gene expression regulation. The qualitative description of the corresponding processes is therefore important for a better understanding of essential biological mechanisms. However, wet lab experiments targeted at the discovery of the regulatory interplay between transcription factors and binding sites are expensive. We propose a new, purely computational method for finding putative associations between transcription factors and motifs. This method is based on a linear model that combines sequence information with expression data. We present various methods for model parameter estimation and show, via experiments on simulated data, that these methods are reliable. Finally, we examine the performance of this model on biological data and conclude that it can indeed be used to discover meaningful associations. The developed software is available as a web tool and Scilab source code at http://biit.cs.ut.ee/gmat/.
Collapse
Affiliation(s)
| | - Sven Laur
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Jaak Vilo
- Institute of Computer Science, University of Tartu, Tartu, Estonia
- Quretec, Tartu, Estonia
- * E-mail:
| |
Collapse
|
418
|
Motti D, Le Duigou C, Eugène E, Chemaly N, Wittner L, Lazarevic D, Krmac H, Marstrand T, Valen E, Sanges R, Stupka E, Sandelin A, Cherubini E, Gustincich S, Miles R. Gene expression analysis of the emergence of epileptiform activity after focal injection of kainic acid into mouse hippocampus. Eur J Neurosci 2011; 32:1364-79. [PMID: 20950280 DOI: 10.1111/j.1460-9568.2010.07403.x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
We report gene profiling data on genomic processes underlying the progression towards recurrent seizures after injection of kainic acid (KA) into the mouse hippocampus. Focal injection enabled us to separate the effects of proepileptic stimuli initiated by KA injection. Both the injected and contralateral hippocampus participated in the status epilepticus. However, neuronal death induced by KA treatment was restricted to the injected hippocampus, although there was some contralateral axonal degeneration. We profiled gene expression changes in dorsal and ventral regions of both the injected and contralateral hippocampus. Changes were detected in the expression of 1526 transcripts in samples from three time-points: (i) during the KA-induced status epilepticus, (ii) at 2 weeks, before recurrent seizures emerged, and (iii) at 6 months after seizures emerged. Grouping genes with similar spatio-temporal changes revealed an early transcriptional response, strong immune, cell death and growth responses at 2 weeks and an activation of immune and extracellular matrix genes persisting at 6 months. Immunostaining for proteins coded by genes identified from array studies provided evidence for gliogenesis and suggested that the proteoglycan biglycan is synthesized by astrocytes and contributes to a glial scar. Gene changes at 6 months after KA injection were largely restricted to tissue from the injection site. This suggests that either recurrent seizures might depend on maintained processes including immune responses and changes in extracellular matrix proteins near the injection site or alternatively might result from processes, such as growth, distant from the injection site and terminated while seizures are maintained.
Collapse
Affiliation(s)
- Dario Motti
- SISSA/ISAS International School for Advanced Studies, Neurobiology Sector, Trieste, Italy
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
419
|
Reyon D, Kirkpatrick JR, Sander JD, Zhang F, Voytas DF, Joung JK, Dobbs D, Coffman CR. ZFNGenome: a comprehensive resource for locating zinc finger nuclease target sites in model organisms. BMC Genomics 2011; 12:83. [PMID: 21276248 PMCID: PMC3042413 DOI: 10.1186/1471-2164-12-83] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2010] [Accepted: 01/28/2011] [Indexed: 02/04/2023] Open
Abstract
Background Zinc Finger Nucleases (ZFNs) have tremendous potential as tools to facilitate genomic modifications, such as precise gene knockouts or gene replacements by homologous recombination. ZFNs can be used to advance both basic research and clinical applications, including gene therapy. Recently, the ability to engineer ZFNs that target any desired genomic DNA sequence with high fidelity has improved significantly with the introduction of rapid, robust, and publicly available techniques for ZFN design such as the Oligomerized Pool ENgineering (OPEN) method. The motivation for this study is to make resources for genome modifications using OPEN-generated ZFNs more accessible to researchers by creating a user-friendly interface that identifies and provides quality scores for all potential ZFN target sites in the complete genomes of several model organisms. Description ZFNGenome is a GBrowse-based tool for identifying and visualizing potential target sites for OPEN-generated ZFNs. ZFNGenome currently includes a total of more than 11.6 million potential ZFN target sites, mapped within the fully sequenced genomes of seven model organisms; S. cerevisiae, C. reinhardtii, A. thaliana, D. melanogaster, D. rerio, C. elegans, and H. sapiens and can be visualized within the flexible GBrowse environment. Additional model organisms will be included in future updates. ZFNGenome provides information about each potential ZFN target site, including its chromosomal location and position relative to transcription initiation site(s). Users can query ZFNGenome using several different criteria (e.g., gene ID, transcript ID, target site sequence). Tracks in ZFNGenome also provide "uniqueness" and ZiFOpT (Zinc Finger OPEN Targeter) "confidence" scores that estimate the likelihood that a chosen ZFN target site will function in vivo. ZFNGenome is dynamically linked to ZiFDB, allowing users access to all available information about zinc finger reagents, such as the effectiveness of a given ZFN in creating double-stranded breaks. Conclusions ZFNGenome provides a user-friendly interface that allows researchers to access resources and information regarding genomic target sites for engineered ZFNs in seven model organisms. This genome-wide database of potential ZFN target sites should greatly facilitate the utilization of ZFNs in both basic and clinical research. ZFNGenome is freely available at: http://bindr.gdcb.iastate.edu/ZFNGenome or at the Zinc Finger Consortium website: http://www.zincfingers.org/.
Collapse
Affiliation(s)
- Deepak Reyon
- Department of Genetics, Iowa State University, Ames, IA 50011, USA.
| | | | | | | | | | | | | | | |
Collapse
|
420
|
Street NR, Jansson S, Hvidsten TR. A systems biology model of the regulatory network in Populus leaves reveals interacting regulators and conserved regulation. BMC PLANT BIOLOGY 2011; 11:13. [PMID: 21232107 PMCID: PMC3030533 DOI: 10.1186/1471-2229-11-13] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2010] [Accepted: 01/13/2011] [Indexed: 05/23/2023]
Abstract
BACKGROUND Green plant leaves have always fascinated biologists as hosts for photosynthesis and providers of basic energy to many food webs. Today, comprehensive databases of gene expression data enable us to apply increasingly more advanced computational methods for reverse-engineering the regulatory network of leaves, and to begin to understand the gene interactions underlying complex emergent properties related to stress-response and development. These new systems biology methods are now also being applied to organisms such as Populus, a woody perennial tree, in order to understand the specific characteristics of these species. RESULTS We present a systems biology model of the regulatory network of Populus leaves. The network is reverse-engineered from promoter information and expression profiles of leaf-specific genes measured over a large set of conditions related to stress and developmental. The network model incorporates interactions between regulators, such as synergistic and competitive relationships, by evaluating increasingly more complex regulatory mechanisms, and is therefore able to identify new regulators of leaf development not found by traditional genomics methods based on pair-wise expression similarity. The approach is shown to explain available gene function information and to provide robust prediction of expression levels in new data. We also use the predictive capability of the model to identify condition-specific regulation as well as conserved regulation between Populus and Arabidopsis. CONCLUSIONS We outline a computationally inferred model of the regulatory network of Populus leaves, and show how treating genes as interacting, rather than individual, entities identifies new regulators compared to traditional genomics analysis. Although systems biology models should be used with care considering the complexity of regulatory programs and the limitations of current genomics data, methods describing interactions can provide hypotheses about the underlying cause of emergent properties and are needed if we are to identify target genes other than those constituting the "low hanging fruit" of genomic analysis.
Collapse
Affiliation(s)
- Nathaniel Robert Street
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
| | - Stefan Jansson
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
| | - Torgeir R Hvidsten
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
- Computational Life Science Cluster (CLiC), Umeå University, 901 87 Umeå, Sweden
| |
Collapse
|
421
|
Jolma A, Taipale J. Methods for Analysis of Transcription Factor DNA-Binding Specificity In Vitro. Subcell Biochem 2011; 52:155-173. [PMID: 21557082 DOI: 10.1007/978-90-481-9069-0_7] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Transcription of genes during development and in response to environmental stimuli is determined by genomic DNA sequence. The DNA sequences regulating transcription are read by sequence-specific transcription factors (TFs) that recognize relatively short sequences, generally between four and twenty base pairs in length. Transcriptional regulation generally requires binding of multiple TFs in close proximity to each other. Mechanistic understanding of transcription in an organism thus requires detailed knowledge of binding affinities of all its TFs to all possible DNA sequences, and the co-operative interactions between the TFs. However, very little is known about such co-operative binding interactions, and even the simple TF-DNA binding information exists only for a very small proportion of all TFs - for example, mammals have approximately 1,300-2,000 TFs [1, 2], yet the largest public databases for TF binding specificity, Jaspar and Uniprobe [3, 4] currently list only approximately 500 moderate to high resolution profiles for human or mouse. This lack of knowledge is in part due to the fact that analysis of TF DNA binding has been laborious and expensive. In this chapter, we review methods that can be used to determine binding specificity of TFs to DNA, mainly focusing on recently developed assays that allow high-resolution analysis of TF binding specificity in relatively high throughput.
Collapse
Affiliation(s)
- Arttu Jolma
- Department of Biosciences and Nutrition, SE-171 77, Stockholm, Sweden,
| | | |
Collapse
|
422
|
Characterization of the core elements of the NF-κB signaling pathway of the sea anemone Nematostella vectensis. Mol Cell Biol 2010; 31:1076-87. [PMID: 21189285 DOI: 10.1128/mcb.00927-10] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
The sea anemone Nematostella vectensis is the leading developmental and genomic model for the phylum Cnidaria, which includes anemones, hydras, jellyfish, and corals. In insects and vertebrates, the NF-κB pathway is required for cellular and organismal responses to various stresses, including pathogens and chemicals, as well as for several developmental processes. Herein, we have characterized proteins that comprise the core NF-κB pathway in Nematostella, including homologs of NF-κB, IκB, Bcl-3, and IκB kinase (IKK). We show that N. vectensis NF-κB (Nv-NF-κB) can bind to κB sites and activate transcription of reporter genes containing multimeric κB sites or the Nv-IκB promoter. Both Nv-IκB and Nv-Bcl-3 interact with Nv-NF-κB and block its ability to activate reporter gene expression. Nv-IKK is most similar to human IKKε/TBK kinases and, in vitro, can phosphorylate Ser47 of Nv-IκB. Nv-NF-κB is expressed in a subset of ectodermal cells in juvenile and adult Nematostella anemones. A bioinformatic analysis suggests that homologs of many mammalian NF-κB target genes are targets for Nv-NF-κB, including genes involved in apoptosis and responses to organic compounds and endogenous stimuli. These results indicate that NF-κB pathway proteins in Nematostella are similar to their vertebrate homologs, and these results also provide a framework for understanding the evolutionary origins of NF-κB signaling.
Collapse
|
423
|
Kuttippurathu L, Hsing M, Liu Y, Schmidt B, Maskell DL, Lee K, He A, Pu WT, Kong SW. CompleteMOTIFs: DNA motif discovery platform for transcription factor binding experiments. ACTA ACUST UNITED AC 2010; 27:715-7. [PMID: 21183585 DOI: 10.1093/bioinformatics/btq707] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
UNLABELLED CompleteMOTIFs (cMOTIFs) is an integrated web tool developed to facilitate systematic discovery of overrepresented transcription factor binding motifs from high-throughput chromatin immunoprecipitation experiments. Comprehensive annotations and Boolean logic operations on multiple peak locations enable users to focus on genomic regions of interest for de novo motif discovery using tools such as MEME, Weeder and ChIPMunk. The pipeline incorporates a scanning tool for known motifs from TRANSFAC and JASPAR databases, and performs an enrichment test using local or precalculated background models that significantly improve the motif scanning result. Furthermore, using the cMOTIFs pipeline, we demonstrated that multiple transcription factors could cooperatively bind to the upstream of important stem cell differentiation regulators. AVAILABILITY http://cmotifs.tchlab.org.
Collapse
Affiliation(s)
- Lakshmi Kuttippurathu
- Children's Hospital Informatics Program at the Harvard-MIT, Division of Health Sciences and Technology, Department of Medicine, Children's Hospital Boston, Boston, MA 02115, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
424
|
Majoros WH, Ohler U. Modeling the evolution of regulatory elements by simultaneous detection and alignment with phylogenetic pair HMMs. PLoS Comput Biol 2010; 6:e1001037. [PMID: 21187896 PMCID: PMC3002982 DOI: 10.1371/journal.pcbi.1001037] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2010] [Accepted: 11/17/2010] [Indexed: 11/18/2022] Open
Abstract
The computational detection of regulatory elements in DNA is a difficult but important problem impacting our progress in understanding the complex nature of eukaryotic gene regulation. Attempts to utilize cross-species conservation for this task have been hampered both by evolutionary changes of functional sites and poor performance of general-purpose alignment programs when applied to non-coding sequence. We describe a new and flexible framework for modeling binding site evolution in multiple related genomes, based on phylogenetic pair hidden Markov models which explicitly model the gain and loss of binding sites along a phylogeny. We demonstrate the value of this framework for both the alignment of regulatory regions and the inference of precise binding-site locations within those regions. As the underlying formalism is a stochastic, generative model, it can also be used to simulate the evolution of regulatory elements. Our implementation is scalable in terms of numbers of species and sequence lengths and can produce alignments and binding-site predictions with accuracy rivaling or exceeding current systems that specialize in only alignment or only binding-site prediction. We demonstrate the validity and power of various model components on extensive simulations of realistic sequence data and apply a specific model to study Drosophila enhancers in as many as ten related genomes and in the presence of gain and loss of binding sites. Different models and modeling assumptions can be easily specified, thus providing an invaluable tool for the exploration of biological hypotheses that can drive improvements in our understanding of the mechanisms and evolution of gene regulation.
Collapse
Affiliation(s)
- William H Majoros
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America.
| | | |
Collapse
|
425
|
Bogner-Strauss JG, Prokesch A, Sanchez-Cabo F, Rieder D, Hackl H, Duszka K, Krogsdam A, Di Camillo B, Walenta E, Klatzer A, Lass A, Pinent M, Wong WC, Eisenhaber F, Trajanoski Z. Reconstruction of gene association network reveals a transmembrane protein required for adipogenesis and targeted by PPARγ. Cell Mol Life Sci 2010; 67:4049-64. [PMID: 20552250 PMCID: PMC11115701 DOI: 10.1007/s00018-010-0424-5] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2010] [Revised: 05/18/2010] [Accepted: 05/27/2010] [Indexed: 01/24/2023]
Abstract
We have developed a method for reconstructing gene association networks and have applied this method to gene profiles from 3T3-L1 cells. Priorization of the candidate genes pinpointed a transcript annotated as APMAP (adipocyte plasma membrane-associated protein). Functional studies showed that APMAP is upregulated in murine and human adipogenic cell models as well as in a genetic mouse model of obesity. Silencing APMAP in 3T3-L1 cells strongly impaired the differentiation into adipocytes. Moreover, APMAP expression was strongly induced by the PPARγ ligand rosiglitazone in adipocytes in vitro and in vivo in adipose tissue. Using ChIP-qPCR and luciferase reporter assays, we show a functional PPARγ binding site. In addition, we provide evidence that the extracellular C-terminal domain of APMAP is required for the function of APMAP in adipocyte differentiation. Finally, we demonstrate that APMAP translocates from the endoplasmatic reticulum to the plasma membrane during adipocyte differentiation.
Collapse
Affiliation(s)
- Juliane G. Bogner-Strauss
- Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria
| | - Andreas Prokesch
- Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria
| | - Fatima Sanchez-Cabo
- Genomics Unit, Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid, Spain
| | - Dietmar Rieder
- Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria
| | - Hubert Hackl
- Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria
| | - Kalina Duszka
- Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria
| | - Anne Krogsdam
- Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria
| | | | - Evelyn Walenta
- Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria
| | - Ariane Klatzer
- Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria
| | - Achim Lass
- Institute for Molecular Biosciences, Karl-Franzens University Graz, Graz, Austria
| | - Montserrat Pinent
- Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria
| | - Wing-Cheong Wong
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- Department of Biological Sciences (DBS), National University of Singapore (NUS), Singapore, Singapore
- School of Computer Engineering (SCE), Nanyang Technological University (NTU), Singapore, Singapore
| | - Zlatko Trajanoski
- Biocenter, Section for Bioinformatics, Innsbruck Medical University, Schöpfstrasse 45, 6020 Innsbruck, Austria
| |
Collapse
|
426
|
Schröder A, Eichner J, Supper J, Eichner J, Wanke D, Henneges C, Zell A. Predicting DNA-binding specificities of eukaryotic transcription factors. PLoS One 2010; 5:e13876. [PMID: 21152420 PMCID: PMC2994704 DOI: 10.1371/journal.pone.0013876] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2010] [Accepted: 10/14/2010] [Indexed: 11/18/2022] Open
Abstract
Today, annotated amino acid sequences of more and more transcription factors (TFs) are readily available. Quantitative information about their DNA-binding specificities, however, are hard to obtain. Position frequency matrices (PFMs), the most widely used models to represent binding specificities, are experimentally characterized only for a small fraction of all TFs. Even for some of the most intensively studied eukaryotic organisms (i.e., human, rat and mouse), roughly one-sixth of all proteins with annotated DNA-binding domain have been characterized experimentally. Here, we present a new method based on support vector regression for predicting quantitative DNA-binding specificities of TFs in different eukaryotic species. This approach estimates a quantitative measure for the PFM similarity of two proteins, based on various features derived from their protein sequences. The method is trained and tested on a dataset containing 1 239 TFs with known DNA-binding specificity, and used to predict specific DNA target motifs for 645 TFs with high accuracy.
Collapse
Affiliation(s)
- Adrian Schröder
- Center for Bioinformatics Tübingen (ZBIT), University of Tübingen, Tübingen, Germany.
| | | | | | | | | | | | | |
Collapse
|
427
|
Boyle AP, Song L, Lee BK, London D, Keefe D, Birney E, Iyer VR, Crawford GE, Furey TS. High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells. Genome Res 2010; 21:456-64. [PMID: 21106903 DOI: 10.1101/gr.112656.110] [Citation(s) in RCA: 235] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Regulation of gene transcription in diverse cell types is determined largely by varied sets of cis-elements where transcription factors bind. Here we demonstrate that data from a single high-throughput DNase I hypersensitivity assay can delineate hundreds of thousands of base-pair resolution in vivo footprints in human cells that precisely mark individual transcription factor-DNA interactions. These annotations provide a unique resource for the investigation of cis-regulatory elements. We find that footprints for specific transcription factors correlate with ChIP-seq enrichment and can accurately identify functional versus nonfunctional transcription factor motifs. We also find that footprints reveal a unique evolutionary conservation pattern that differentiates functional footprinted bases from surrounding DNA. Finally, detailed analysis of CTCF footprints suggests multiple modes of binding and a novel DNA binding motif upstream of the primary binding site.
Collapse
Affiliation(s)
- Alan P Boyle
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
428
|
Lang M, Hadzhiev Y, Siegel N, Amemiya CT, Parada C, Strähle U, Becker MB, Müller F, Meyer A. Conservation of shh cis-regulatory architecture of the coelacanth is consistent with its ancestral phylogenetic position. EvoDevo 2010; 1:11. [PMID: 21047394 PMCID: PMC2992049 DOI: 10.1186/2041-9139-1-11] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2010] [Accepted: 11/03/2010] [Indexed: 12/17/2022] Open
Abstract
Background The modern coelacanth (Latimeria) is the extant taxon of a basal sarcopterygian lineage and sister group to tetrapods. Apart from certain apomorphic traits, its morphology is characterized by a high degree of retention of ancestral vertebrate structures and little morphological change. An insight into the molecular evolution that may explain the unchanged character of Latimeria morphology requires the analysis of the expression patterns of developmental regulator genes and their cis-regulatory modules (CRMs). Results We describe the comparative and functional analysis of the sonic hedgehog (shh) genomic region of Latimeria menadoensis. Several putative enhancers in the Latimeria shh locus have been identified by comparisons to sarcopterygian and actinopterygian extant species. Specific sequence conservation with all known actinopterygian enhancer elements has been detected. However, these elements are selectively missing in more recently diverged actinopterygian and sarcopterygian species. The functionality of the putative Latimeria enhancers was confirmed by reporter gene expression analysis in transient transgenic zebrafish and chick embryos. Conclusions Latimeria shh CRMs represent the ancestral set of enhancers that have emerged before the split of lobe-finned and ray-finned fishes. In contrast to lineage-specific losses and differentiations in more derived lineages, Latimeria shh enhancers reveal low levels of sequence diversification. High overall sequence conservation of shh conserved noncoding elements (CNE) is consistent with the general trend of high levels of conservation of noncoding DNA in the slowly evolving Latimeria genome.
Collapse
Affiliation(s)
- Michael Lang
- Department of Biology, University of Konstanz, 78457 Konstanz, Germany.,Development and Neurobiology Program, Jacques Monod Institute, 75013 Paris, France
| | - Yavor Hadzhiev
- Department of Medical and Molecular Genetics, University of Birmingham, Birmingham B15 2TT, UK
| | - Nicol Siegel
- Department of Biology, University of Konstanz, 78457 Konstanz, Germany.,Medical University of Vienna, Medical Genetics, 1090 Vienna, Austria
| | - Chris T Amemiya
- Benaroya Research Institute at Virginia Mason, Seattle, WA 98101, USA
| | - Carolina Parada
- Developmental Biology Group, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, 08003 Barcelona, Spain.,Center for Craniofacial Molecular Biology, University of Southern California, Los Angeles, CA 90030, USA
| | - Uwe Strähle
- Karlsruhe Institute of Technology, Institute for Toxicology and Genetics, 76021 Karlsruhe, Germany
| | - May-Britt Becker
- Department of Biology, University of Konstanz, 78457 Konstanz, Germany.,Exzellenzcluster CellNetworks, INF 267, 69120 Heidelberg, Germany
| | - Ferenc Müller
- Department of Medical and Molecular Genetics, University of Birmingham, Birmingham B15 2TT, UK.,Karlsruhe Institute of Technology, Institute for Toxicology and Genetics, 76021 Karlsruhe, Germany
| | - Axel Meyer
- Department of Biology, University of Konstanz, 78457 Konstanz, Germany
| |
Collapse
|
429
|
Unbiased, genome-wide in vivo mapping of transcriptional regulatory elements reveals sex differences in chromatin structure associated with sex-specific liver gene expression. Mol Cell Biol 2010; 30:5531-44. [PMID: 20876297 DOI: 10.1128/mcb.00601-10] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
We have used a simple and efficient method to identify condition-specific transcriptional regulatory sites in vivo to help elucidate the molecular basis of sex-related differences in transcription, which are widespread in mammalian tissues and affect normal physiology, drug response, inflammation, and disease. To systematically uncover transcriptional regulators responsible for these differences, we used DNase hypersensitivity analysis coupled with high-throughput sequencing to produce condition-specific maps of regulatory sites in male and female mouse livers and in livers of male mice feminized by continuous infusion of growth hormone (GH). We identified 71,264 hypersensitive sites, with 1,284 showing robust sex-related differences. Continuous GH infusion suppressed the vast majority of male-specific sites and induced a subset of female-specific sites in male livers. We also identified broad genomic regions (up to ∼100 kb) showing sex-dependent hypersensitivity and similar patterns of GH responses. We found a strong association of sex-specific sites with sex-specific transcription; however, a majority of sex-specific sites were >100 kb from sex-specific genes. By analyzing sequence motifs within regulatory regions, we identified two known regulators of liver sexual dimorphism and several new candidates for further investigation. This approach can readily be applied to mapping condition-specific regulatory sites in mammalian tissues under a wide variety of physiological conditions.
Collapse
|
430
|
Leleu M, Lefebvre G, Rougemont J. Processing and analyzing ChIP-seq data: from short reads to regulatory interactions. Brief Funct Genomics 2010; 9:466-76. [PMID: 20861161 DOI: 10.1093/bfgp/elq022] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Chromatin-immunoprecipitation and sequencing (ChIP-seq) is a rapidly maturing technology that draws on the power of high-throughput short-read sequencing to decipher chromatin states with unprecedented precision and breadth. Although some aspects of the experimental protocol require careful tuning, the bottleneck currently firmly lies with the downstream data analysis. We give an overview of the better-established aspects of genome mapping and data normalization and we describe the more recent progress in peak calling and their statistical analysis and provide a brief overview of popular follow-up analyses such as genomic feature categorization and motif search.
Collapse
Affiliation(s)
- Marion Leleu
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | | |
Collapse
|
431
|
Poly(ADP-ribose) polymerase 1 participates in the phase entrainment of circadian clocks to feeding. Cell 2010; 142:943-53. [PMID: 20832105 DOI: 10.1016/j.cell.2010.08.016] [Citation(s) in RCA: 269] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2010] [Revised: 06/14/2010] [Accepted: 07/23/2010] [Indexed: 11/21/2022]
Abstract
Circadian clocks in peripheral organs are tightly coupled to cellular metabolism and are readily entrained by feeding-fasting cycles. However, the molecular mechanisms involved are largely unknown. Here we show that in liver the activity of PARP-1, an NAD(+)-dependent ADP-ribosyltransferase, oscillates in a daily manner and is regulated by feeding. We provide biochemical evidence that PARP-1 binds and poly(ADP-ribosyl)ates CLOCK at the beginning of the light phase. The loss of PARP-1 enhances the binding of CLOCK-BMAL1 to DNA and leads to a phase-shift of the interaction of CLOCK-BMAL1 with PER and CRY repressor proteins. As a consequence, CLOCK-BMAL1-dependent gene expression is altered in PARP-1-deficient mice, in particular in response to changes in feeding times. Our results show that Parp-1 knockout mice exhibit impaired food entrainment of peripheral circadian clocks and support a role for PARP-1 in connecting feeding with the mammalian timing system.
Collapse
|
432
|
Piechota M, Korostynski M, Przewlocki R. Identification of cis-regulatory elements in the mammalian genome: the cREMaG database. PLoS One 2010; 5:e12465. [PMID: 20824209 PMCID: PMC2930848 DOI: 10.1371/journal.pone.0012465] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2010] [Accepted: 08/02/2010] [Indexed: 12/20/2022] Open
Abstract
Background A growing number of gene expression-profiling datasets provides a reliable source of information about gene co-expression. In silico analyses of the properties shared among the promoters of co-expressed genes facilitates the identification of transcription factors (TFs) involved in the co-regulation of those genes. Our previous experience with microarray data led to the development of a database suitable for the examination of regulatory motifs in the promoters of co-expressed genes. Methodology We introduce the cREMaG (cis-Regulatory Elements in the Mammalian Genome) system designed for in silico studies of the promoter properties of co-regulated mammalian genes. The cREMaG system offers an analysis of data obtained from human, mouse, rat, bovine and canine gene expression-profiling studies. More than eight analysis parameters can be utilized in user-defined combinations. The selection of alternative transcription start sites and information about CpG islands are also available. Conclusions Using the cREMaG system, we successfully identified TFs mediating transcriptional responses in reference gene sets. The cREMaG system facilitates in silico studies of mammalian transcriptional gene regulation. The resource is freely available at http://www.cremag.org.
Collapse
Affiliation(s)
- Marcin Piechota
- Department of Molecular Neuropharmacology, Institute of Pharmacology, Polish Academy of Sciences, Kraków, Poland.
| | | | | |
Collapse
|
433
|
Blow MJ, McCulley DJ, Li Z, Zhang T, Akiyama JA, Holt A, Plajzer-Frick I, Shoukry M, Wright C, Chen F, Afzal V, Bristow J, Ren B, Black BL, Rubin EM, Visel A, Pennacchio LA. ChIP-Seq identification of weakly conserved heart enhancers. Nat Genet 2010; 42:806-10. [PMID: 20729851 DOI: 10.1038/ng.650] [Citation(s) in RCA: 345] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2010] [Accepted: 07/22/2010] [Indexed: 01/29/2023]
Abstract
Accurate control of tissue-specific gene expression plays a pivotal role in heart development, but few cardiac transcriptional enhancers have thus far been identified. Extreme noncoding-sequence conservation has successfully predicted enhancers that are active in many tissues but has failed to identify substantial numbers of heart-specific enhancers. Here, we used ChIP-Seq with the enhancer-associated protein p300 from mouse embryonic day 11.5 heart tissue to identify over 3,000 candidate heart enhancers genome wide. Compared to enhancers active in other tissues we studied at this time point, most candidate heart enhancers were less deeply conserved in vertebrate evolution. Nevertheless, transgenic mouse assays of 130 candidate regions revealed that most function reproducibly as enhancers active in the heart, irrespective of their degree of evolutionary constraint. These results provide evidence for a large population of poorly conserved heart enhancers and suggest that the evolutionary conservation of embryonic enhancers can vary depending on tissue type.
Collapse
Affiliation(s)
- Matthew J Blow
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
434
|
Biological reprogramming in acquired resistance to endocrine therapy of breast cancer. Oncogene 2010; 29:6071-83. [PMID: 20711236 DOI: 10.1038/onc.2010.333] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Endocrine therapies targeting the proliferative effect of 17β-estradiol through estrogen receptor α (ERα) are the most effective systemic treatment of ERα-positive breast cancer. However, most breast tumors initially responsive to these therapies develop resistance through molecular mechanisms that are not yet fully understood. The long-term estrogen-deprived (LTED) MCF7 cell model has been proposed to recapitulate acquired resistance to aromatase inhibitors in postmenopausal women. To elucidate this resistance, genomic, transcriptomic and molecular data were integrated into the time course of MCF7-LTED adaptation. Dynamic and widespread genomic changes were observed, including amplification of the ESR1 locus consequently linked to an increase in ERα. Dynamic transcriptomic profiles were also observed that correlated significantly with genomic changes and were predicted to be influenced by transcription factors known to be involved in acquired resistance or cell proliferation (for example, interferon regulatory transcription factor 1 and E2F1, respectively) but, notably, not by canonical ERα transcriptional function. Consistently, at the molecular level, activation of growth factor signaling pathways by EGFR/ERBB/AKT and a switch from phospho-Ser118 (pS118)- to pS167-ERα were observed during MCF7-LTED adaptation. Evaluation of relevant clinical settings identified significant associations between MCF7-LTED and breast tumor transcriptome profiles that characterize ERα-negative status, early response to letrozole and tamoxifen, and recurrence after tamoxifen treatment. In accordance with these profiles, MCF7-LTED cells showed increased sensitivity to inhibition of FGFR-mediated signaling with PD173074. This study provides mechanistic insight into acquired resistance to endocrine therapies of breast cancer and highlights a potential therapeutic strategy.
Collapse
|
435
|
O'Seaghdha CM, Yang Q, Glazer NL, Leak TS, Dehghan A, Smith AV, Kao WHL, Lohman K, Hwang SJ, Johnson AD, Hofman A, Uitterlinden AG, Chen YDI, Brown EM, Siscovick DS, Harris TB, Psaty BM, Coresh J, Gudnason V, Witteman JC, Liu YM, Kestenbaum BR, Fox CS, Köttgen A. Common variants in the calcium-sensing receptor gene are associated with total serum calcium levels. Hum Mol Genet 2010; 19:4296-303. [PMID: 20705733 DOI: 10.1093/hmg/ddq342] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Serum calcium levels are tightly regulated. We performed genome-wide association studies (GWAS) in population-based studies participating in the CHARGE Consortium to uncover common genetic variations associated with total serum calcium levels. GWAS of serum calcium concentrations was performed in 20 611 individuals of European ancestry for ∼2.5 million genotyped and imputed single-nucleotide polymorphisms (SNPs). The SNP with the lowest P-value was rs17251221 (P = 2.4 * 10(-22), minor allele frequency 14%) in the calcium-sensing receptor gene (CASR). This lead SNP was associated with higher serum calcium levels [0.06 mg/dl (0.015 mmol/l) per copy of the minor G allele] and accounted for 0.54% of the variance in serum calcium concentrations. The identification of variation in CASR that influences serum calcium concentration confirms the results of earlier candidate gene studies. The G allele of rs17251221 was also associated with higher serum magnesium levels (P = 1.2 * 10(-3)), lower serum phosphate levels (P = 2.8 * 10(-7)) and lower bone mineral density at the lumbar spine (P = 0.038), but not the femoral neck. No additional genomic loci contained SNPs associated at genome-wide significance (P < 5 * 10(-8)). These associations resemble clinical characteristics of patients with familial hypocalciuric hypercalcemia, an autosomal-dominant disease arising from rare inactivating mutations in the CASR gene. We conclude that common genetic variation in the CASR gene is associated with similar but milder features in the general population.
Collapse
Affiliation(s)
- Conall M O'Seaghdha
- Division of Nephrology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
436
|
Global Egr1-miRNAs binding analysis in PMA-induced K562 cells using ChIP-Seq. J Biomed Biotechnol 2010; 2010. [PMID: 20811575 PMCID: PMC2929687 DOI: 10.1155/2010/867517] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2010] [Revised: 05/17/2010] [Accepted: 06/28/2010] [Indexed: 12/01/2022] Open
Abstract
Although much is known about microRNAs' regulation in gene expression and their contributions in cell fate, to date, globally lineage-(cell-) specific identification of the binding events between a transcription factor and its targeting microRNA genes is still waiting for elucidation. In this paper, we performed a ChIP-Seq experiment to find the targeting microRNA genes of a transcription factor, Egr1, in human erythroleukemia cell line K562. We found Egr1 binding sites near the promoters of 124 distinct microRNA genes, accounting for about 42% of the miRNAs which have high-confidence predicted promoters (294). We also found EGR1 bind to another 63 pre-miRNAs. We chose 12 of the 187 microRNAs with Egr1 binding sites to perform ChIP-PCR assays and the positive binding signal from ChIP-PCR confirmed the ChIP-Seq results. Our experiments provide the first global binding profile between Egr1 and its targeting microRNA genes in PMA-treated K562 cells, which may facilitate the understanding of pathways controlling microRNA biology in this specific cell line.
Collapse
|
437
|
Genome-wide identification of cis-regulatory motifs and modules underlying gene coregulation using statistics and phylogeny. Proc Natl Acad Sci U S A 2010; 107:14615-20. [PMID: 20671200 DOI: 10.1073/pnas.1002876107] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Cell fate determination depends in part on the establishment of specific transcriptional programs of gene expression. These programs result from the interpretation of the genomic cis-regulatory information by sequence-specific factors. Decoding this information in sequenced genomes is an important issue. Here, we developed statistical analysis tools to computationally identify the cis-regulatory elements that control gene expression in a set of coregulated genes. Starting with a small number of validated and/or predicted cis-regulatory modules (CRMs) in a reference species as a training set, but with no a priori knowledge of the factors acting in trans, we computationally predicted transcription factor binding sites (TFBSs) and genomic CRMs underlying coregulation. This method was applied to the gene expression program active in Drosophila melanogaster sensory organ precursor cells (SOPs), a specific type of neural progenitor cells. Mutational analysis showed that four, including one newly characterized, out of the five top-ranked families of predicted TFBSs were required for SOP-specific gene expression. Additionaly, 19 out of the 29 top-ranked predicted CRMs directed gene expression in neural progenitor cells, i.e., SOPs or larval brain neuroblasts, with a notable fraction active in SOPs (11/29). We further identified the lola gene as the target of two SOP-specific CRMs and found that the lola gene contributed to SOP specification. The statistics and phylogeny-based tools described here can be more generally applied to identify the cis-regulatory elements of specific gene regulatory networks in any family of related species with sequenced genomes.
Collapse
|
438
|
Kumar GR, Sakthivel K, Sundaram R, Neeraja C, Balachandran S, Rani NS, Viraktamath B, Madhav M. Allele mining in crops: Prospects and potentials. Biotechnol Adv 2010; 28:451-61. [DOI: 10.1016/j.biotechadv.2010.02.007] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2009] [Revised: 09/21/2009] [Accepted: 09/25/2009] [Indexed: 12/26/2022]
|
439
|
Piipari M, Down TA, Hubbard TJ. Metamotifs--a generative model for building families of nucleotide position weight matrices. BMC Bioinformatics 2010; 11:348. [PMID: 20579334 PMCID: PMC2906491 DOI: 10.1186/1471-2105-11-348] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2010] [Accepted: 06/25/2010] [Indexed: 11/25/2022] Open
Abstract
Background Development of high-throughput methods for measuring DNA interactions of transcription factors together with computational advances in short motif inference algorithms is expanding our understanding of transcription factor binding site motifs. The consequential growth of sequence motif data sets makes it important to systematically group and categorise regulatory motifs. It has been shown that there are familial tendencies in DNA sequence motifs that are predictive of the family of factors that binds them. Further development of methods that detect and describe familial motif trends has the potential to help in measuring the similarity of novel computational motif predictions to previously known data and sensitively detecting regulatory motifs similar to previously known ones from novel sequence. Results We propose a probabilistic model for position weight matrix (PWM) sequence motif families. The model, which we call the 'metamotif' describes recurring familial patterns in a set of motifs. The metamotif framework models variation within a family of sequence motifs. It allows for simultaneous estimation of a series of independent metamotifs from input position weight matrix (PWM) motif data and does not assume that all input motif columns contribute to a familial pattern. We describe an algorithm for inferring metamotifs from weight matrix data. We then demonstrate the use of the model in two practical tasks: in the Bayesian NestedMICA model inference algorithm as a PWM prior to enhance motif inference sensitivity, and in a motif classification task where motifs are labelled according to their interacting DNA binding domain. Conclusions We show that metamotifs can be used as PWM priors in the NestedMICA motif inference algorithm to dramatically increase the sensitivity to infer motifs. Metamotifs were also successfully applied to a motif classification problem where sequence motif features were used to predict the family of protein DNA binding domains that would interact with it. The metamotif based classifier is shown to compare favourably to previous related methods. The metamotif has great potential for further use in machine learning tasks related to especially de novo computational sequence motif inference. The metamotif methods presented have been incorporated into the NestedMICA suite.
Collapse
Affiliation(s)
- Matias Piipari
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK.
| | | | | |
Collapse
|
440
|
Identification of E74-like factor 1 (ELF1) as a transcriptional regulator of the Hox cofactor MEIS1. Exp Hematol 2010; 38:798-8, 808.e1-2. [PMID: 20600580 DOI: 10.1016/j.exphem.2010.06.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Revised: 06/01/2010] [Accepted: 06/04/2010] [Indexed: 11/22/2022]
Abstract
OBJECTIVE Myeloid ectropic viral integration site 1 (MEIS1) is a Hox cofactor known for its role in development and is strongly linked to normal and leukemic hematopoiesis. Although previous studies have focused on identifying protein partners of MEIS1 and its transcriptionally regulated targets, little is known about the upstream transcriptional regulators of this tightly regulated gene. Understanding the regulation of MEIS1 is important to understanding normal hematopoiesis and leukemogenesis. MATERIALS AND METHODS Here we describe our studies focusing on the evolutionary conserved putative MEIS1 promoter region. Phylogenetic sequence analysis and reporter assays in MEIS1-expressing (K562) and nonexpressing (HL60) leukemic cell line models were used to identify key regulatory regions and potential transcription factor binding sites within the candidate promoter region followed by functional and expression studies of one identified regulator in both cell lines and primary human cord blood and leukemia samples. RESULTS Chromatin status of MEIS1 promoter region is associated with MEIS1 expression. Truncation and mutation studies coupled with reporter assays revealed that a conserved ETS family member binding site located 289 bp upstream of the annotated human MEIS1 transcription start site is required for promoter activity. Of the three ETS family members tested, only ELF1 was enriched on the MEIS1 promoter as assessed by both electrophoretic mobility shift assay and chromatin immunoprecipitation experiments in K562. This finding was confirmed in MEIS1-expressing primary human samples. Moreover, small interfering RNA-mediated knockdown of ELF1 in K562 cells was associated with a decreased MEIS1 expression. CONCLUSIONS We conclude that the ETS transcription factor ELF1 is an important positive regulator of MEIS1 expression.
Collapse
|
441
|
Sadlon TJ, Wilkinson BG, Pederson S, Brown CY, Bresatz S, Gargett T, Melville EL, Peng K, D'Andrea RJ, Glonek GG, Goodall GJ, Zola H, Shannon MF, Barry SC. Genome-wide identification of human FOXP3 target genes in natural regulatory T cells. THE JOURNAL OF IMMUNOLOGY 2010; 185:1071-81. [PMID: 20554955 DOI: 10.4049/jimmunol.1000082] [Citation(s) in RCA: 115] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
The transcription factor FOXP3 is essential for the formation and function of regulatory T cells (Tregs), and Tregs are essential for maintaining immune homeostasis and tolerance. This is demonstrated by a lethal autoimmune defect in mice lacking Foxp3 and in immunodysregulation polyendocrinopathy enteropathy X-linked syndrome patients. However, little is known about the molecular basis of human FOXP3 function or the relationship between direct and indirect targets of FOXP3 in human Tregs. To investigate this, we have performed a comprehensive genome-wide analysis for human FOXP3 target genes from cord blood Tregs using chromatin immunoprecipitation array profiling and expression profiling. We have identified 5579 human FOXP3 target genes and derived a core Treg gene signature conserved across species using mouse chromatin immunoprecipitation data sets. A total of 739 of the 5579 FOXP3 target genes were differentially regulated in Tregs compared with Th cells, thus allowing the identification of a number of pathways and biological functions overrepresented in Tregs. We have identified gene families including cell surface molecules and microRNAs that are differentially expressed in FOXP3(+) Tregs. In particular, we have identified a novel role for peptidase inhibitor 16, which is expressed on the cell surface of >80% of resting human CD25(+)FOXP3(+) Tregs, suggesting that in conjunction with CD25 peptidase inhibitor 16 may be a surrogate surface marker for Tregs with potential clinical application.
Collapse
Affiliation(s)
- Timothy J Sadlon
- Molecular Immunology Laboratory, Women's and Children's Health Research Institute, Women's and Children's Hospital, North Adelaide, South Australia, Australia
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
442
|
Ryge J, Winther O, Wienecke J, Sandelin A, Westerdahl AC, Hultborn H, Kiehn O. Transcriptional regulation of gene expression clusters in motor neurons following spinal cord injury. BMC Genomics 2010; 11:365. [PMID: 20534130 PMCID: PMC2900267 DOI: 10.1186/1471-2164-11-365] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2010] [Accepted: 06/09/2010] [Indexed: 01/09/2023] Open
Abstract
Background Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence of modulatory inputs from the brain correlates with the development of spasticity. Results Here we examine the dynamic transcriptional response of motor neurons to spinal cord injury as it evolves over time to unravel common gene expression patterns and their underlying regulatory mechanisms. For this we use a rat-tail-model with complete spinal cord transection causing injury-induced spasticity, where gene expression profiles are obtained from labeled motor neurons extracted with laser microdissection 0, 2, 7, 21 and 60 days post injury. Consensus clustering identifies 12 gene clusters with distinct time expression profiles. Analysis of these gene clusters identifies early immunological/inflammatory and late developmental responses as well as a regulation of genes relating to neuron excitability that support the development of motor neuron hyper-excitability and the reappearance of plateau potentials in the late phase of the injury response. Transcription factor motif analysis identifies differentially expressed transcription factors involved in the regulation of each gene cluster, shaping the expression of the identified biological processes and their associated genes underlying the changes in motor neuron excitability. Conclusions This analysis provides important clues to the underlying mechanisms of transcriptional regulation responsible for the increased excitability observed in motor neurons in the late chronic phase of spinal cord injury suggesting alternative targets for treatment of spinal cord injury. Several transcription factors were identified as potential regulators of gene clusters containing elements related to motor neuron hyper-excitability, the manipulation of which potentially could be used to alter the transcriptional response to prevent the motor neurons from entering a state of hyper-excitability.
Collapse
Affiliation(s)
- Jesper Ryge
- Mammalian Locomotor Laboratory, Department of Neuroscience, Karolinska Institutet, Retzius väg 8, 171 77 Stockholm, Sweden.
| | | | | | | | | | | | | |
Collapse
|
443
|
Conte I, Marco-Ferreres R, Beccari L, Cisneros E, Ruiz JM, Tabanera N, Bovolenta P. Proper differentiation of photoreceptors and amacrine cells depends on a regulatory loop between NeuroD and Six6. Development 2010; 137:2307-17. [PMID: 20534668 DOI: 10.1242/dev.045294] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Timely generation of distinct neural cell types in appropriate numbers is fundamental for the generation of a functional retina. In vertebrates, the transcription factor Six6 is initially expressed in multipotent retina progenitors and then becomes restricted to differentiated retinal ganglion and amacrine cells. How Six6 expression in the retina is controlled and what are its precise functions are still unclear. To address this issue, we used bioinformatic searches and transgenic approaches in medaka fish (Oryzias latipes) to characterise highly conserved regulatory enhancers responsible for Six6 expression. One of the enhancers drove gene expression in the differentiating and adult retina. A search for transcription factor binding sites, together with luciferase, ChIP assays and gain-of-function studies, indicated that NeuroD, a bHLH transcription factor, directly binds an 'E-box' sequence present in this enhancer and specifically regulates Six6 expression in the retina. NeuroD-induced Six6 overexpression in medaka embryos promoted unorganized retinal progenitor proliferation and, most notably, impaired photoreceptor differentiation, with no apparent changes in other retinal cell types. Conversely, Six6 gain- and loss-of-function changed NeuroD expression levels and altered the expression of the photoreceptor differentiation marker Rhodopsin. In addition, knockdown of Six6 interfered with amacrine cell generation. Together, these results indicate that Six6 and NeuroD control the expression of each other and their functions coordinate amacrine cell generation and photoreceptor terminal differentiation.
Collapse
Affiliation(s)
- Ivan Conte
- Instituto Cajal, CSIC and CIBER de Enfermedades Raras, Madrid, Spain
| | | | | | | | | | | | | |
Collapse
|
444
|
Petrovic V, Costa RH, Lau LF, Raychaudhuri P, Tyner AL. Negative regulation of the oncogenic transcription factor FoxM1 by thiazolidinediones and mithramycin. Cancer Biol Ther 2010; 9:1008-16. [PMID: 20372080 DOI: 10.4161/cbt.9.12.11710] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
The Forkhead Box transcription factor FoxM1 regulates expression of genes that promote cell cycle progression, and it plays essential roles in the development of liver, lung, prostate and colorectal tumors. Thiazolidinediones (TZDs) activate the peroxisome proliferator-activated receptor gamma (PPARγ), a ligand-activated nuclear receptor transcription factor. We found that treatment of the human hepatoma cell lines HepG2 and PLC/PRF/5 cells with TZDs leads to inhibition of FoxM1 gene expression. No PPARγ/retinoid X receptor (RXR) consensus DNA binding sites were detected in the FoxM1 promoter extending to -10 kb upstream, and knockdown of PPARγ had no impact on TZD mediated downregulation of FoxM1 expression. Previously, others showed that PPARγ agonists inhibit the expression and DNA-binding activity of the Sp1 transcription factor. Here we show that Sp1 binds to the FoxM1 promoter region and positively regulates FoxM1 transcription, while mithramycin, a chemotherapy drug that specifically binds GC rich sequences in the DNA and inhibits activities of Sp1, inhibits expression of FoxM1. Our data suggest that TZD mediated suppression of Sp1 is responsible for downregulation of FoxM1 gene expression. Inhibition of FoxM1 expression by TZDs provides a new mechanism for TZD mediated negative regulation of cancer cell growth. FoxM1 expression and activity in cancer cells can be targeted using PPARγ agonists or the anti-neoplastic antibiotic mithramycin.
Collapse
Affiliation(s)
- Vladimir Petrovic
- Department of Biochemistry & Molecular Genetics, University of Illinois, College of Medicine, Chicago, IL, USA
| | | | | | | | | |
Collapse
|
445
|
Jung CH, Makunin IV, Mattick JS. Identification of conserved Drosophila-specific euchromatin-restricted non-coding sequence motifs. Genomics 2010; 96:154-66. [PMID: 20595017 DOI: 10.1016/j.ygeno.2010.05.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Revised: 05/25/2010] [Accepted: 05/26/2010] [Indexed: 01/19/2023]
Abstract
Non-protein-coding DNA comprises the majority of animal genomes but its functions are largely unknown. We identified over 17,000 different tetranucleotide pairs in the Drosophila melanogaster genome that are over-represented at distances up to 100nt in conserved non-exonic sequences. Those exhibiting the highest information content in surrounding nucleotides were classified into five groups: tRNAs, motifs associated with histone genes, Suppressor-of-Hairy-wing binding sites, and two sets of previously unrecognized motifs (DLM3 and DLM4). There are hundreds to thousands of copies of DLM3 and DLM4, respectively, in the genome, located almost exclusively in non-coding regions. They have similar copy numbers among drosophilids, but are largely absent in other insects. DLM3 is likely a cis-regulatory element, whereas DLM4 sequences are capable of forming a short hairpin structure and are expressed as approximately 80nt RNAs. This work reports the existence of Drosophila genus-specific sequence motifs, and suggests that many more novel functional elements may be discovered in genomes using the general approach outlined herein.
Collapse
Affiliation(s)
- Chol-Hee Jung
- Institute for Molecular Bioscience, The University of Queensland, St Lucia QLD, Australia
| | | | | |
Collapse
|
446
|
Abstract
DNA-binding transcription factors (TFs) play a central role in transcription regulation, and computational approaches that help in elucidating complex mechanisms governing this basic biological process are of great use. In this perspective, we present the TFM-Explorer web server that is a toolbox to identify putative TF binding sites within a set of upstream regulatory sequences of genes sharing some regulatory mechanisms. TFM-Explorer finds local regions showing overrepresentation of binding sites. Accepted organisms are human, mouse, rat, chicken and drosophila. The server employs a number of features to help users to analyze their data: visualization of selected binding sites on genomic sequences, and selection of cis-regulatory modules. TFM-Explorer is available at http://bioinfo.lifl.fr/TFM.
Collapse
Affiliation(s)
- Laurie Tonon
- INRIA Lille-Nord Europe, 40 av Halley, 59650 Villeneuve d'Ascq, d'Ascq Cedex, France
| | | | | |
Collapse
|
447
|
Hödar C, Assar R, Colombres M, Aravena A, Pavez L, González M, Martínez S, Inestrosa NC, Maass A. Genome-wide identification of new Wnt/beta-catenin target genes in the human genome using CART method. BMC Genomics 2010; 11:348. [PMID: 20515496 PMCID: PMC2996972 DOI: 10.1186/1471-2164-11-348] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2009] [Accepted: 06/01/2010] [Indexed: 11/21/2022] Open
Abstract
Background The importance of in silico predictions for understanding cellular processes is now widely accepted, and a variety of algorithms useful for studying different biological features have been designed. In particular, the prediction of cis regulatory modules in non-coding human genome regions represents a major challenge for understanding gene regulation in several diseases. Recently, studies of the Wnt signaling pathway revealed a connection with neurodegenerative diseases such as Alzheimer's. In this article, we construct a classification tool that uses the transcription factor binding site motifs composition of some gene promoters to identify new Wnt/β-catenin pathway target genes potentially involved in brain diseases. Results In this study, we propose 89 new Wnt/β-catenin pathway target genes predicted in silico by using a method based on multiple Classification and Regression Tree (CART) analysis. We used as decision variables the presence of transcription factor binding site motifs in the upstream region of each gene. This prediction was validated by RT-qPCR in a sample of 9 genes. As expected, LEF1, a member of the T-cell factor/lymphoid enhancer-binding factor family (TCF/LEF1), was relevant for the classification algorithm and, remarkably, other factors related directly or indirectly to the inflammatory response and amyloidogenic processes also appeared to be relevant for the classification. Among the 89 new Wnt/β-catenin pathway targets, we found a group expressed in brain tissue that could be involved in diverse responses to neurodegenerative diseases, like Alzheimer's disease (AD). These genes represent new candidates to protect cells against amyloid β toxicity, in agreement with the proposed neuroprotective role of the Wnt signaling pathway. Conclusions Our multiple CART strategy proved to be an effective tool to identify new Wnt/β-catenin pathway targets based on the study of their regulatory regions in the human genome. In particular, several of these genes represent a new group of transcriptional dependent targets of the canonical Wnt pathway. The functions of these genes indicate that they are involved in pathophysiology related to Alzheimer's disease or other brain disorders.
Collapse
Affiliation(s)
- Christian Hödar
- Laboratorio de Bioinformática y Expresión Génica, INTA, Universidad de Chile, Santiago, Chile.
| | | | | | | | | | | | | | | | | |
Collapse
|
448
|
Jolma A, Kivioja T, Toivonen J, Cheng L, Wei G, Enge M, Taipale M, Vaquerizas JM, Yan J, Sillanpää MJ, Bonke M, Palin K, Talukder S, Hughes TR, Luscombe NM, Ukkonen E, Taipale J. Multiplexed massively parallel SELEX for characterization of human transcription factor binding specificities. Genome Res 2010; 20:861-73. [PMID: 20378718 PMCID: PMC2877582 DOI: 10.1101/gr.100552.109] [Citation(s) in RCA: 312] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2009] [Accepted: 03/22/2010] [Indexed: 01/15/2023]
Abstract
The genetic code-the binding specificity of all transfer-RNAs--defines how protein primary structure is determined by DNA sequence. DNA also dictates when and where proteins are expressed, and this information is encoded in a pattern of specific sequence motifs that are recognized by transcription factors. However, the DNA-binding specificity is only known for a small fraction of the approximately 1400 human transcription factors (TFs). We describe here a high-throughput method for analyzing transcription factor binding specificity that is based on systematic evolution of ligands by exponential enrichment (SELEX) and massively parallel sequencing. The method is optimized for analysis of large numbers of TFs in parallel through the use of affinity-tagged proteins, barcoded selection oligonucleotides, and multiplexed sequencing. Data are analyzed by a new bioinformatic platform that uses the hundreds of thousands of sequencing reads obtained to control the quality of the experiments and to generate binding motifs for the TFs. The described technology allows higher throughput and identification of much longer binding profiles than current microarray-based methods. In addition, as our method is based on proteins expressed in mammalian cells, it can also be used to characterize DNA-binding preferences of full-length proteins or proteins requiring post-translational modifications. We validate the method by determining binding specificities of 14 different classes of TFs and by confirming the specificities for NFATC1 and RFX3 using ChIP-seq. Our results reveal unexpected dimeric modes of binding for several factors that were thought to preferentially bind DNA as monomers.
Collapse
Affiliation(s)
- Arttu Jolma
- Department of Molecular Medicine, National Public Health Institute (KTL) and Genome-Scale Biology Program, Institute of Biomedicine and High Throughput Center, University of Helsinki, Biomedicum, Helsinki, Finland
- Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden
| | - Teemu Kivioja
- Department of Molecular Medicine, National Public Health Institute (KTL) and Genome-Scale Biology Program, Institute of Biomedicine and High Throughput Center, University of Helsinki, Biomedicum, Helsinki, Finland
- Department of Computer Science, FI-00014 University of Helsinki, Helsinki, Finland
| | - Jarkko Toivonen
- Department of Computer Science, FI-00014 University of Helsinki, Helsinki, Finland
| | - Lu Cheng
- Department of Computer Science, FI-00014 University of Helsinki, Helsinki, Finland
| | - Gonghong Wei
- Department of Molecular Medicine, National Public Health Institute (KTL) and Genome-Scale Biology Program, Institute of Biomedicine and High Throughput Center, University of Helsinki, Biomedicum, Helsinki, Finland
| | - Martin Enge
- Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden
| | - Mikko Taipale
- Department of Molecular Medicine, National Public Health Institute (KTL) and Genome-Scale Biology Program, Institute of Biomedicine and High Throughput Center, University of Helsinki, Biomedicum, Helsinki, Finland
| | - Juan M. Vaquerizas
- EMBL–European Bioinformatics Institute, Cambridge CB10 1SD, United Kingdom
| | - Jian Yan
- Department of Molecular Medicine, National Public Health Institute (KTL) and Genome-Scale Biology Program, Institute of Biomedicine and High Throughput Center, University of Helsinki, Biomedicum, Helsinki, Finland
| | - Mikko J. Sillanpää
- Department of Mathematics and Statistics, FI-00014 University of Helsinki, Helsinki, Finland
| | - Martin Bonke
- Department of Molecular Medicine, National Public Health Institute (KTL) and Genome-Scale Biology Program, Institute of Biomedicine and High Throughput Center, University of Helsinki, Biomedicum, Helsinki, Finland
| | - Kimmo Palin
- Department of Computer Science, FI-00014 University of Helsinki, Helsinki, Finland
| | - Shaheynoor Talukder
- Department of Molecular Genetics and Banting and Best Department of Medical Research, University of Toronto, Toronto, ON M4T 2J4, Canada
| | - Timothy R. Hughes
- Department of Molecular Genetics and Banting and Best Department of Medical Research, University of Toronto, Toronto, ON M4T 2J4, Canada
| | | | - Esko Ukkonen
- Department of Computer Science, FI-00014 University of Helsinki, Helsinki, Finland
| | - Jussi Taipale
- Department of Molecular Medicine, National Public Health Institute (KTL) and Genome-Scale Biology Program, Institute of Biomedicine and High Throughput Center, University of Helsinki, Biomedicum, Helsinki, Finland
- Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
449
|
Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo. EMBO J 2010; 29:2147-60. [PMID: 20517297 PMCID: PMC2905244 DOI: 10.1038/emboj.2010.106] [Citation(s) in RCA: 435] [Impact Index Per Article: 31.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2009] [Accepted: 05/04/2010] [Indexed: 12/30/2022] Open
Abstract
Members of the large ETS family of transcription factors (TFs) have highly similar DNA-binding domains (DBDs)—yet they have diverse functions and activities in physiology and oncogenesis. Some differences in DNA-binding preferences within this family have been described, but they have not been analysed systematically, and their contributions to targeting remain largely uncharacterized. We report here the DNA-binding profiles for all human and mouse ETS factors, which we generated using two different methods: a high-throughput microwell-based TF DNA-binding specificity assay, and protein-binding microarrays (PBMs). Both approaches reveal that the ETS-binding profiles cluster into four distinct classes, and that all ETS factors linked to cancer, ERG, ETV1, ETV4 and FLI1, fall into just one of these classes. We identify amino-acid residues that are critical for the differences in specificity between all the classes, and confirm the specificities in vivo using chromatin immunoprecipitation followed by sequencing (ChIP-seq) for a member of each class. The results indicate that even relatively small differences in in vitro binding specificity of a TF contribute to site selectivity in vivo.
Collapse
|
450
|
Jung M, Peterson H, Chavez L, Kahlem P, Lehrach H, Vilo J, Adjaye J. A data integration approach to mapping OCT4 gene regulatory networks operative in embryonic stem cells and embryonal carcinoma cells. PLoS One 2010; 5:e10709. [PMID: 20505756 PMCID: PMC2873957 DOI: 10.1371/journal.pone.0010709] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2009] [Accepted: 04/25/2010] [Indexed: 01/06/2023] Open
Abstract
It is essential to understand the network of transcription factors controlling self-renewal of human embryonic stem cells (ESCs) and human embryonal carcinoma cells (ECs) if we are to exploit these cells in regenerative medicine regimes. Correlating gene expression levels after RNAi-based ablation of OCT4 function with its downstream targets enables a better prediction of motif-specific driven expression modules pertinent for self-renewal and differentiation of embryonic stem cells and induced pluripotent stem cells.We initially identified putative direct downstream targets of OCT4 by employing CHIP-on-chip analysis. A comparison of three peak analysis programs revealed a refined list of OCT4 targets in the human EC cell line NCCIT, this list was then compared to previously published OCT4 CHIP-on-chip datasets derived from both ES and EC cells. We have verified an enriched POU-motif, discovered by a de novo approach, thus enabling us to define six distinct modules of OCT4 binding and regulation of its target genes.A selection of these targets has been validated, like NANOG, which harbours the evolutionarily conserved OCT4-SOX2 binding motif within its proximal promoter. Other validated targets, which do not harbour the classical HMG motif are USP44 and GADD45G, a key regulator of the cell cycle. Over-expression of GADD45G in NCCIT cells resulted in an enrichment and up-regulation of genes associated with the cell cycle (CDKN1B, CDKN1C, CDK6 and MAPK4) and developmental processes (BMP4, HAND1, EOMES, ID2, GATA4, GATA5, ISL1 and MSX1). A comparison of positively regulated OCT4 targets common to EC and ES cells identified genes such as NANOG, PHC1, USP44, SOX2, PHF17 and OCT4, thus further confirming their universal role in maintaining self-renewal in both cell types. Finally we have created a user-friendly database (http://biit.cs.ut.ee/escd/), integrating all OCT4 and stem cell related datasets in both human and mouse ES and EC cells.In the current era of systems biology driven research, we envisage that our integrated embryonic stem cell database will prove beneficial to the booming field of ES, iPS and cancer research.
Collapse
Affiliation(s)
- Marc Jung
- Molecular Embryology and Aging Group, Department of Vertebrate Genomics, Max-Planck Institute for Molecular Genetics, Berlin, Germany
- * E-mail: (JA); (MJ)
| | - Hedi Peterson
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
- Quretec Ltd., Tartu, Estonia
| | - Lukas Chavez
- Molecular Embryology and Aging Group, Department of Vertebrate Genomics, Max-Planck Institute for Molecular Genetics, Berlin, Germany
| | - Pascal Kahlem
- EMBL - European Bioinformatics Institute, Cambridge, United Kingdom
| | - Hans Lehrach
- Molecular Embryology and Aging Group, Department of Vertebrate Genomics, Max-Planck Institute for Molecular Genetics, Berlin, Germany
| | - Jaak Vilo
- Quretec Ltd., Tartu, Estonia
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - James Adjaye
- Molecular Embryology and Aging Group, Department of Vertebrate Genomics, Max-Planck Institute for Molecular Genetics, Berlin, Germany
- * E-mail: (JA); (MJ)
| |
Collapse
|