Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhang ZD, Paccanaro A, Fu Y, Weissman S, Weng Z, Chang J, Snyder M, Gerstein MB. Statistical analysis of the genomic distribution and correlation of regulatory elements in the ENCODE regions. Genome Res 2007;17:787-97. [PMID: 17567997 PMCID: PMC1891338 DOI: 10.1101/gr.5573107] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

For:	Zhang ZD, Paccanaro A, Fu Y, Weissman S, Weng Z, Chang J, Snyder M, Gerstein MB. Statistical analysis of the genomic distribution and correlation of regulatory elements in the ENCODE regions. Genome Res 2007;17:787-97. [PMID: 17567997 PMCID: PMC1891338 DOI: 10.1101/gr.5573107] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Number

Cited by Other Article(s)

Hoang M, Marçais G, Kingsford C. Density and Conservation Optimization of the Generalized Masked-Minimizer Sketching Scheme. J Comput Biol 2024;31:2-20. [PMID: 37975802 PMCID: PMC10794853 DOI: 10.1089/cmb.2023.0212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2023] Open

Bo S, Sun Q, Ning P, Yuan N, Weng Y, Liang Y, Wang H, Lu Z, Li Z, Zhao X. A novel approach to analyze the association characteristics between post-spliced introns and their corresponding mRNA. Front Genet 2023;14:1151172. [PMID: 36923795 PMCID: PMC10008863 DOI: 10.3389/fgene.2023.1151172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Accepted: 02/15/2023] [Indexed: 03/03/2023] Open

Abstract

Studies have shown that post-spliced introns promote cell survival when nutrients are scarce, and intron loss/gain can influence many stages of mRNA metabolism. However, few approaches are currently available to study the correlation between intron sequences and their corresponding mature mRNA sequences. Here, based on the results of the improved Smith-Waterman local alignment-based algorithm method (SW method) and binding free energy weighted local alignment algorithm method (BFE method), the optimal matched segments between introns and their corresponding mature mRNAs in Caenorhabditis elegans (C.elegans) and their relative matching frequency (RF) distributions were obtained. The results showed that although the distributions of relative matching frequencies on mRNAs obtained by the BFE method were similar to the SW method, the interaction intensity in 5'and 3'untranslated regions (UTRs) regions was weaker than the SW method. The RF distributions in the exon-exon junction regions were comparable, the effects of long and short introns on mRNA and on the five functional sites with BFE method were similar to the SW method. However, the interaction intensity in 5'and 3'UTR regions with BFE method was weaker than with SW method. Although the matching rate and length distribution shape of the optimal matched fragment were consistent with the SW method, an increase in length was observed. The matching rates and the length of the optimal matched fragments were mainly in the range of 60%-80% and 20-30bp, respectively. Although we found that there were still matching preferences in the 5'and 3'UTR regions of the mRNAs with BFE, the matching intensities were significantly lower than the matching intensities between introns and their corresponding mRNAs with SW method. Overall, our findings suggest that the interaction between introns and mRNAs results from synergism among different types of sequences during the evolutionary process.

Collapse

Hoang M, Zheng H, Kingsford C. Differentiable Learning of Sequence-Specific Minimizer Schemes with DeepMinimizer. J Comput Biol 2022;29:1288-1304. [PMID: 36095142 PMCID: PMC9807081 DOI: 10.1089/cmb.2022.0275] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open

Du Z, D’Alessandro E, Zheng Y, Wang M, Chen C, Wang X, Song C. Retrotransposon Insertion Polymorphisms (RIPs) in Pig Coat Color Candidate Genes. Animals (Basel) 2022;12:ani12080969. [PMID: 35454216 PMCID: PMC9031378 DOI: 10.3390/ani12080969] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 03/28/2022] [Accepted: 04/05/2022] [Indexed: 12/17/2022] Open

Gu A, Cho HJ, Sheffield NC. Bedshift: perturbation of genomic interval sets. Genome Biol 2021;22:238. [PMID: 34416909 PMCID: PMC8379854 DOI: 10.1186/s13059-021-02440-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 07/26/2021] [Indexed: 12/25/2022] Open

SINE Insertion in the Intron of Pig GHR May Decrease Its Expression by Acting as a Repressor. Animals (Basel) 2021;11:ani11071871. [PMID: 34201672 PMCID: PMC8300111 DOI: 10.3390/ani11071871] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 06/15/2021] [Accepted: 06/19/2021] [Indexed: 11/17/2022] Open

Abstract

Simple Summary

GH/IGF axis genes play a central role in the regulation of skeletal accretion during development and growth, and thus represent candidate genes for growth traits. Retrotransposon insertion polymorphisms are major contributors to structural variations. They tend to generate large effect mutations resulting in variations in target gene activity and phenotype due to the fact that they carry functional elements, such as enhancers, insulators, or promoters. In the present study, RIPs in four GH/IGF axis genes (GH, GHR, IGF1, and IGF1R) were investigated by comparative genomics and PCR. Four RIPs in the GHR gene and one RIP in the IGF1 gene were identified. Further analysis revealed that one RIP in the first intron of GHR might play a role in the regulation of GHR expression by acting as a repressor. These findings contribute to the understanding of the role of RIPs in the genetic variation of GH/IGF axis genes and phenotypic variation in pigs.

Abstract

The genetic diversity of the GH/IGF axis genes and their association with the variation of gene expression and phenotypic traits, principally represented by SNPs, have been extensively reported. Nevertheless, the impact of retrotransposon insertion polymorphisms (RIPs) on the GH/IGF axis gene activity has not been reported. In the present study, bioinformatic prediction and PCR verification were performed to screen RIPs in four GH/IGF axis genes (GH, GHR, IGF1 and IGF1R). In total, five RIPs, including one SINE RIP in intron 3 of IGF1, one L1 RIP in intron 7 of GHR, and three SINE RIPs in intron 1, intron 5 and intron 9 of GHR, were confirmed by PCR, displaying polymorphisms in diverse breeds. Dual luciferase reporter assay revealed that the SINE insertion in intron 1 of GHR significantly repressed the GHR promoter activity in PK15, Hela, C2C12 and 3T3-L1 cells. Furthermore, qPCR results confirmed that this SINE insertion was associated with a decreased expression of GHR in the leg muscle and longissimus dorsi, indicating that it may act as a repressor involved in the regulation of GHR expression. In summary, our data revealed that RIPs contribute to the genetic variation of GH/IGF axis genes, whereby one SINE RIP in the intron 1 of GHR may decrease the expression of GHR by acting as a repressor.

Collapse

Nikitin D, Kolosov N, Murzina A, Pats K, Zamyatin A, Tkachev V, Sorokin M, Kopylov P, Buzdin A. Retroelement-Linked H3K4me1 Histone Tags Uncover Regulatory Evolution Trends of Gene Enhancers and Feature Quickly Evolving Molecular Processes in Human Physiology. Cells 2019;8:cells8101219. [PMID: 31597351 PMCID: PMC6830109 DOI: 10.3390/cells8101219] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 09/25/2019] [Accepted: 10/01/2019] [Indexed: 12/20/2022] Open

Kanduri C, Bock C, Gundersen S, Hovig E, Sandve GK. Colocalization analyses of genomic elements: approaches, recommendations and challenges. Bioinformatics 2019;35:1615-1624. [PMID: 30307532 PMCID: PMC6499241 DOI: 10.1093/bioinformatics/bty835] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 09/03/2018] [Accepted: 10/10/2018] [Indexed: 12/23/2022] Open

Dozmorov MG. Epigenomic annotation-based interpretation of genomic data: from enrichment analysis to machine learning. Bioinformatics 2018;33:3323-3330. [PMID: 29028263 DOI: 10.1093/bioinformatics/btx414] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Accepted: 06/22/2017] [Indexed: 12/12/2022] Open

Naidoo T, Sjödin P, Schlebusch C, Jakobsson M. Patterns of variation in cis-regulatory regions: examining evidence of purifying selection. BMC Genomics 2018;19:95. [PMID: 29373957 PMCID: PMC5787233 DOI: 10.1186/s12864-017-4422-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 12/27/2017] [Indexed: 11/10/2022] Open

Lin Z, Guo H, Cao Y, Zohrabian S, Zhou P, Ma Q, VanDusen N, Guo Y, Zhang J, Stevens SM, Liang F, Quan Q, van Gorp PR, Li A, Dos Remedios C, He A, Bezzerides VJ, Pu WT. Acetylation of VGLL4 Regulates Hippo-YAP Signaling and Postnatal Cardiac Growth. Dev Cell 2016;39:466-479. [PMID: 27720608 DOI: 10.1016/j.devcel.2016.09.005] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2016] [Revised: 07/12/2016] [Accepted: 09/08/2016] [Indexed: 11/28/2022]

Affiliation(s)

Zhiqiang Lin Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA.
Haidong Guo Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA; Department of Anatomy, School of Basic Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China
Yuan Cao Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA; Peking University, Fifth School of Clinical Medicine, Beijing 100730, China
Sylvia Zohrabian Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA
Pingzhu Zhou Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA
Qing Ma Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA
Nathan VanDusen Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA
Yuxuan Guo Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA
Jin Zhang Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA
Sean M Stevens Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA
Feng Liang Rowland Institute at Harvard, Harvard University, Cambridge, MA 02142, USA
Qimin Quan Rowland Institute at Harvard, Harvard University, Cambridge, MA 02142, USA
Pim R van Gorp Department of Cardiology, Leiden University Medical Center, 2300 RC Leiden, the Netherlands
Amy Li Department of Anatomy & Histology, Bosch Institute, University of Sydney, Sydney, NSW 2006, Australia
Cristobal Dos Remedios Department of Anatomy & Histology, Bosch Institute, University of Sydney, Sydney, NSW 2006, Australia
Aibin He Institute of Molecular Medicine, Peking University, PKU-Tsinghua U Joint Center for Life Sciences, Beijing 100871, China
Vassilios J Bezzerides Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA
William T Pu Department of Cardiology, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA; Harvard Stem Cell Institute, Harvard University, Cambridge, MA 02138, USA.

Collapse

Castellanos-Martín A, Castillo-Lluva S, Sáez-Freire MDM, Blanco-Gómez A, Hontecillas-Prieto L, Patino-Alonso C, Galindo-Villardon P, Pérez Del Villar L, Martín-Seisdedos C, Isidoro-Garcia M, Abad-Hernández MDM, Cruz-Hernández JJ, Rodríguez-Sánchez CA, González-Sarmiento R, Alonso-López D, De Las Rivas J, García-Cenador B, García-Criado J, Lee DY, Bowen B, Reindl W, Northen T, Mao JH, Pérez-Losada J. Unraveling heterogeneous susceptibility and the evolution of breast cancer using a systems biology approach. Genome Biol 2015;16:40. [PMID: 25853295 PMCID: PMC4389302 DOI: 10.1186/s13059-015-0599-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2014] [Accepted: 01/27/2015] [Indexed: 12/16/2022] Open

Abstract

Background

An essential question in cancer is why individuals with the same disease have different clinical outcomes. Progress toward a more personalized medicine in cancer patients requires taking into account the underlying heterogeneity at different molecular levels.

Results

Here, we present a model in which there are complex interactions at different cellular and systemic levels that account for the heterogeneity of susceptibility to and evolution of ERBB2-positive breast cancers. Our model is based on our analyses of a cohort of mice that are characterized by heterogeneous susceptibility to ERBB2-positive breast cancers. Our analysis reveals that there are similarities between ERBB2 tumors in humans and those of backcross mice at clinical, genomic, expression, and signaling levels. We also show that mice that have tumors with intrinsically high levels of active AKT and ERK are more resistant to tumor metastasis. Our findings suggest for the first time that a site-specific phosphorylation at the serine 473 residue of AKT1 modifies the capacity for tumors to disseminate. Finally, we present two predictive models that can explain the heterogeneous behavior of the disease in the mouse population when we consider simultaneously certain genetic markers, liver cell signaling and serum biomarkers that are identified before the onset of the disease.

Conclusions

Considering simultaneously tumor pathophenotypes and several molecular levels, we show the heterogeneous behavior of ERBB2-positive breast cancer in terms of disease progression. This and similar studies should help to better understand disease variability in patient populations.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-015-0599-z) contains supplementary material, which is available to authorized users.

Collapse

Kravatsky YV, Chechetkin VR, Tchurikov NA, Kravatskaya GI. Genome-wide study of correlations between genomic features and their relationship with the regulation of gene expression. DNA Res 2015;22:109-19. [PMID: 25627242 PMCID: PMC4379982 DOI: 10.1093/dnares/dsu044] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

Li WC, Zhong ZJ, Zhu PP, Deng EZ, Ding H, Chen W, Lin H. Sequence analysis of origins of replication in the Saccharomyces cerevisiae genomes. Front Microbiol 2014;5:574. [PMID: 25477864 PMCID: PMC4235382 DOI: 10.3389/fmicb.2014.00574] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2014] [Accepted: 10/11/2014] [Indexed: 12/26/2022] Open

Budden DM, Hurley DG, Crampin EJ. Predictive modelling of gene expression from transcriptional regulatory elements. Brief Bioinform 2014;16:616-28. [PMID: 25231769 DOI: 10.1093/bib/bbu034] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Accepted: 08/20/2014] [Indexed: 12/15/2022] Open

Shen L, Choi I, Nestler EJ, Won KJ. Human Transcriptome and Chromatin Modifications: An ENCODE Perspective. Genomics Inform 2013;11:60-7. [PMID: 23843771 PMCID: PMC3704928 DOI: 10.5808/gi.2013.11.2.60] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2013] [Revised: 03/05/2013] [Accepted: 03/13/2013] [Indexed: 11/22/2022] Open

Frazer KA. Decoding the human genome. Genome Res 2013;22:1599-601. [PMID: 22955971 PMCID: PMC3431476 DOI: 10.1101/gr.146175.112] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Zhang J, Parvin J, Huang K. Redistribution of H3K4me2 on neural tissue specific genes during mouse brain development. BMC Genomics 2012;13 Suppl 8:S5. [PMID: 23281639 PMCID: PMC3535709 DOI: 10.1186/1471-2164-13-s8-s5] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open

Abstract

Background

Histone modification plays an important role in cell differentiation and tissue development. A recent study has shown that the dimethylation of lysine 4 residue on histone 3 (H3K4me2) marks the gene body area of tissue specific genes in the human CD4+ T cells and neural cells. However, little is known of the H3k4me2 distribution dynamics through the cell differentiation and tissue development.

Results

We applied several clustering methods including K-means, hierarchical and principle component analysis on H3K4me2 ChIP-seq data from embryonic stem cell, neural progenitor cell and whole brain of mouse, trying to identify genes with the H3K4me2 binding on the gene body region in different cell development stage and study their redistribution in different tissue development stages.

A cluster of 356 genes with heavy H3K4me2 labeling in the gene body region was identified in the mouse whole brain tissue using K-means clustering. They are highly enriched with neural system related functions and pathways, and are involved in several central neural system diseases. The distribution of H3K4me2 on neural function related genes follows three distinctive patterns: a group of genes contain constant heavy H3K4me2 marks in the gene body from embryonic stem cell stage through neural progenitor stage to matured brain tissue stage; another group of gene have little H3K4me2 marks until cells mature into brain cells; the majority of the genes acquired H3K4me2 marks in the neural progenitor cell stage, and gain heavy labeling in the matured brain cell stage. Gene ontology enrichment analysis also revealed corresponding gene ontology terms that fit in the scenario of each cell developmental stages.

Conclusions

We investigated the process of the H3K4me2 mark redistribution during tissue specificity development for mouse brain tissue. Our analysis confirmed the previous report that heavy labeling of H3K4me2 in the downstream of TSS marks tissue specific genes. These genes show remarkable enrichment in central neural system related diseases. Furthermore, we have shown that H3K4me2 labeling can be acquired as early as the embryonic stem cell stage, and its distribution is dynamic and progressive throughout cell differentiation and tissue development.

Collapse

Stojnic R, Fu AQ, Adryan B. A graphical modelling approach to the dissection of highly correlated transcription factor binding site profiles. PLoS Comput Biol 2012;8:e1002725. [PMID: 23144600 PMCID: PMC3493460 DOI: 10.1371/journal.pcbi.1002725] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2012] [Accepted: 08/01/2012] [Indexed: 11/18/2022] Open

Abstract

Inferring the combinatorial regulatory code of transcription factors (TFs) from genome-wide TF binding profiles is challenging. A major reason is that TF binding profiles significantly overlap and are therefore highly correlated. Clustered occurrence of multiple TFs at genomic sites may arise from chromatin accessibility and local cooperation between TFs, or binding sites may simply appear clustered if the profiles are generated from diverse cell populations. Overlaps in TF binding profiles may also result from measurements taken at closely related time intervals. It is thus of great interest to distinguish TFs that directly regulate gene expression from those that are indirectly associated with gene expression. Graphical models, in particular Bayesian networks, provide a powerful mathematical framework to infer different types of dependencies. However, existing methods do not perform well when the features (here: TF binding profiles) are highly correlated, when their association with the biological outcome is weak, and when the sample size is small. Here, we develop a novel computational method, the Neighbourhood Consistent PC (NCPC) algorithms, which deal with these scenarios much more effectively than existing methods do. We further present a novel graphical representation, the Direct Dependence Graph (DDGraph), to better display the complex interactions among variables. NCPC and DDGraph can also be applied to other problems involving highly correlated biological features. Both methods are implemented in the R package ddgraph, available as part of Bioconductor (http://bioconductor.org/packages/2.11/bioc/html/ddgraph.html). Applied to real data, our method identified TFs that specify different classes of cis-regulatory modules (CRMs) in Drosophila mesoderm differentiation. Our analysis also found depletion of the early transcription factor Twist binding at the CRMs regulating expression in visceral and somatic muscle cells at later stages, which suggests a CRM-specific repression mechanism that so far has not been characterised for this class of mesodermal CRMs.

Transcription factors (TFs) are proteins that bind to DNA and regulate gene expression. Recent technological advances make it possible to map TF binding patterns across the whole genome. Multiple single-gene studies showed that combinatorial binding of multiple transcription factors determines the gene transcriptional output. A common naive assumption is that correlated binding profiles may indicate combinatorial binding. However, it has been found that many TFs bind to distinct hotspots whose role is currently unclear. It is thus of great interest to find transcription factor combinations whose correlated binding is causally most immediate to gene expression. Building upon theories of statistical dependence and causality, we develop novel graphical modelbased algorithms that handle highly correlated transcription factor binding profiles more efficiently and reliably than existing algorithms do. These algorithms can also be applied to other biological areas involving highly correlated variables, such as the analysis of high-throughput gene knock-down experiments.

Collapse

Dawson M, Foster S, Bannister A, Robson S, Hannah R, Wang X, Xhemalce B, Wood A, Green A, Göttgens B, Kouzarides T. Three distinct patterns of histone H3Y41 phosphorylation mark active genes. Cell Rep 2012;2:470-7. [PMID: 22999934 PMCID: PMC3607218 DOI: 10.1016/j.celrep.2012.08.016] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2012] [Revised: 07/16/2012] [Accepted: 08/16/2012] [Indexed: 02/02/2023] Open

Affiliation(s)

Mark A. Dawson Gurdon Institute and Department of Pathology, Tennis Court Road, Cambridge, CB2 1QN, UK,Department of Haematology, Cambridge Institute for Medical Research and The Wellcome Trust and MRC Stem Cell Institute, University of Cambridge, Cambridge, CB2 0XY, UK,Addenbrooke’s Hospital, University of Cambridge, Cambridge, CB2 0XY, UK
Samuel D. Foster Department of Haematology, Cambridge Institute for Medical Research and The Wellcome Trust and MRC Stem Cell Institute, University of Cambridge, Cambridge, CB2 0XY, UK
Andrew J. Bannister Gurdon Institute and Department of Pathology, Tennis Court Road, Cambridge, CB2 1QN, UK
Samuel C. Robson Gurdon Institute and Department of Pathology, Tennis Court Road, Cambridge, CB2 1QN, UK
Rebecca Hannah Department of Haematology, Cambridge Institute for Medical Research and The Wellcome Trust and MRC Stem Cell Institute, University of Cambridge, Cambridge, CB2 0XY, UK
Xiaonan Wang Department of Haematology, Cambridge Institute for Medical Research and The Wellcome Trust and MRC Stem Cell Institute, University of Cambridge, Cambridge, CB2 0XY, UK
Blerta Xhemalce Gurdon Institute and Department of Pathology, Tennis Court Road, Cambridge, CB2 1QN, UK
Andrew D. Wood Department of Haematology, Cambridge Institute for Medical Research and The Wellcome Trust and MRC Stem Cell Institute, University of Cambridge, Cambridge, CB2 0XY, UK
Anthony R. Green Department of Haematology, Cambridge Institute for Medical Research and The Wellcome Trust and MRC Stem Cell Institute, University of Cambridge, Cambridge, CB2 0XY, UK,Addenbrooke’s Hospital, University of Cambridge, Cambridge, CB2 0XY, UK
Berthold Göttgens Department of Haematology, Cambridge Institute for Medical Research and The Wellcome Trust and MRC Stem Cell Institute, University of Cambridge, Cambridge, CB2 0XY, UK,Corresponding author
Tony Kouzarides Gurdon Institute and Department of Pathology, Tennis Court Road, Cambridge, CB2 1QN, UK,Corresponding author

Collapse

Classification of human genomic regions based on experimentally determined binding sites of more than 100 transcription-related factors. Genome Biol 2012;13:R48. [PMID: 22950945 PMCID: PMC3491392 DOI: 10.1186/gb-2012-13-9-r48] [Citation(s) in RCA: 187] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2011] [Revised: 05/06/2012] [Accepted: 06/08/2012] [Indexed: 01/22/2023] Open

Manic G, Maurin-Marlin A, Galluzzi L, Subra F, Mouscadet JF, Bury-Moné S. 3' self-inactivating long terminal repeat inserts for the modulation of transgene expression from lentiviral vectors. Hum Gene Ther Methods 2012;23:84-97. [PMID: 22456436 DOI: 10.1089/hgtb.2011.154] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open

Chikina MD, Troyanskaya OG. An effective statistical evaluation of ChIPseq dataset similarity. ACTA ACUST UNITED AC 2012;28:607-13. [PMID: 22262674 DOI: 10.1093/bioinformatics/bts009] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Transposon-mediated BAC transgenesis in zebrafish. Nat Protoc 2011;6:1998-2021. [PMID: 22134125 DOI: 10.1038/nprot.2011.416] [Citation(s) in RCA: 166] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Cheng C, Min R, Gerstein M. TIP: a probabilistic method for identifying transcription factor target genes from ChIP-seq binding profiles. ACTA ACUST UNITED AC 2011;27:3221-7. [PMID: 22039215 DOI: 10.1093/bioinformatics/btr552] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Jee J, Rozowsky J, Yip KY, Lochovsky L, Bjornson R, Zhong G, Zhang Z, Fu Y, Wang J, Weng Z, Gerstein M. ACT: aggregation and correlation toolbox for analyses of genome tracks. Bioinformatics 2011;27:1152-4. [PMID: 21349863 PMCID: PMC3072554 DOI: 10.1093/bioinformatics/btr092] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Sakabe NJ, Nobrega MA. Genome-wide maps of transcription regulatory elements. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2010;2:422-437. [PMID: 20836039 DOI: 10.1002/wsbm.70] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

When needles look like hay: how to find tissue-specific enhancers in model organism genomes. Dev Biol 2010;350:239-54. [PMID: 21130761 DOI: 10.1016/j.ydbio.2010.11.026] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2010] [Revised: 11/11/2010] [Accepted: 11/22/2010] [Indexed: 01/22/2023]

Cooper DN, Chen JM, Ball EV, Howells K, Mort M, Phillips AD, Chuzhanova N, Krawczak M, Kehrer-Sawatzki H, Stenson PD. Genes, mutations, and human inherited disease at the dawn of the age of personalized genomics. Hum Mutat 2010;31:631-55. [PMID: 20506564 DOI: 10.1002/humu.21260] [Citation(s) in RCA: 117] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Carstensen L, Sandelin A, Winther O, Hansen NR. Multivariate Hawkes process models of the occurrence of regulatory elements. BMC Bioinformatics 2010;11:456. [PMID: 20828413 PMCID: PMC2949889 DOI: 10.1186/1471-2105-11-456] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2010] [Accepted: 09/09/2010] [Indexed: 11/10/2022] Open

Alexander RP, Fang G, Rozowsky J, Snyder M, Gerstein MB. Annotating non-coding regions of the genome. Nat Rev Genet 2010;11:559-71. [PMID: 20628352 DOI: 10.1038/nrg2814] [Citation(s) in RCA: 326] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Cai X, Hu H, Li X. A new measurement of sequence conservation. BMC Genomics 2009;10:623. [PMID: 20028539 PMCID: PMC2807881 DOI: 10.1186/1471-2164-10-623] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2009] [Accepted: 12/22/2009] [Indexed: 11/10/2022] Open

Abstract

Background

Understanding sequence conservation is important for the study of sequence evolution and for the identification of functional regions of the genome. Current studies often measure sequence conservation based on every position in contiguous regions. Therefore, a large number of functional regions that contain conserved segments separated by relatively long divergent segments are ignored. Our goal in this paper is to define a new measurement of sequence conservation such that both contiguously conserved regions and discontiguously conserved regions can be detected based on this new measurement. Here and in the following, conserved regions are those regions that share similarity higher than a pre-specified similarity threshold with their homologous regions in other species. That is, conserved regions are good candidates of functional regions and may not be always functional. Moreover, conserved regions may contain long and divergent segments.

Results

To identify both discontiguously and contiguously conserved regions, we proposed a new measurement of sequence conservation, which measures sequence similarity based only on the conserved segments within the regions. By defining conserved segments using the local alignment tool CHAOS, under the new measurement, we analyzed the conservation of 1642 experimentally verified human functional non-coding regions in the mouse genome. We found that the conservation in at least 11% of these functional regions could be missed by the current conservation analysis methods. We also found that 72% of the mouse homologous regions identified based on the new measurement are more similar to the human functional sequences than the aligned mouse sequences from the UCSC genome browser. We further compared BLAST and discontiguous MegaBLAST with our method. We found that our method picks up many more conserved segments than BLAST and discontiguous MegaBLAST in these regions.

Conclusions

It is critical to have a new measurement of sequence conservation that is based only on the conserved segments in one region. Such a new measurement can aid the identification of better local "orthologous" regions. It will also shed light on the identification of new types of conserved functional regions in vertebrate genomes [1].

Collapse

ChIP-Seq of transcription factors predicts absolute and differential gene expression in embryonic stem cells. Proc Natl Acad Sci U S A 2009;106:21521-6. [PMID: 19995984 DOI: 10.1073/pnas.0904863106] [Citation(s) in RCA: 243] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

He X, Chen CC, Hong F, Fang F, Sinha S, Ng HH, Zhong S. A biophysical model for analysis of transcription factor interaction and binding site arrangement from genome-wide binding data. PLoS One 2009;4:e8155. [PMID: 19956545 PMCID: PMC2780727 DOI: 10.1371/journal.pone.0008155] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Accepted: 11/10/2009] [Indexed: 11/19/2022] Open

Abstract

Background

How transcription factors (TFs) interact with cis-regulatory sequences and interact with each other is a fundamental, but not well understood, aspect of gene regulation.

Methodology/Principal Findings

We present a computational method to address this question, relying on the established biophysical principles. This method, STAP (sequence to affinity prediction), takes into account all combinations and configurations of strong and weak binding sites to analyze large scale transcription factor (TF)-DNA binding data to discover cooperative interactions among TFs, infer sequence rules of interaction and predict TF target genes in new conditions with no TF-DNA binding data. The distinctions between STAP and other statistical approaches for analyzing cis-regulatory sequences include the utility of physical principles and the treatment of the DNA binding data as quantitative representation of binding strengths. Applying this method to the ChIP-seq data of 12 TFs in mouse embryonic stem (ES) cells, we found that the strength of TF-DNA binding could be significantly modulated by cooperative interactions among TFs with adjacent binding sites. However, further analysis on five putatively interacting TF pairs suggests that such interactions may be relatively insensitive to the distance and orientation of binding sites. Testing a set of putative Nanog motifs, STAP showed that a novel Nanog motif could better explain the ChIP-seq data than previously published ones. We then experimentally tested and verified the new Nanog motif. A series of comparisons showed that STAP has more predictive power than several state-of-the-art methods for cis-regulatory sequence analysis. We took advantage of this power to study the evolution of TF-target relationship in Drosophila. By learning the TF-DNA interaction models from the ChIP-chip data of D. melanogaster (Mel) and applying them to the genome of D. pseudoobscura (Pse), we found that only about half of the sequences strongly bound by TFs in Mel have high binding affinities in Pse. We show that prediction of functional TF targets from ChIP-chip data can be improved by using the conservation of STAP predicted affinities as an additional filter.

Conclusions/Significance

STAP is an effective method to analyze binding site arrangements, TF cooperativity, and TF target genes from genome-wide TF-DNA binding data.

Collapse

Fu AQ, Adryan B. Scoring overlapping and adjacent signals from genome-wide ChIP and DamID assays. MOLECULAR BIOSYSTEMS 2009;5:1429-38. [PMID: 19763325 PMCID: PMC3475982 DOI: 10.1039/b906880e] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Lister R, Gregory BD, Ecker JR. Next is now: new technologies for sequencing of genomes, transcriptomes, and beyond. CURRENT OPINION IN PLANT BIOLOGY 2009;12:107-18. [PMID: 19157957 PMCID: PMC2723731 DOI: 10.1016/j.pbi.2008.11.004] [Citation(s) in RCA: 138] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2008] [Revised: 11/17/2008] [Accepted: 11/20/2008] [Indexed: 05/18/2023]

Smith JJ, Putta S, Zhu W, Pao GM, Verma IM, Hunter T, Bryant SV, Gardiner DM, Harkins TT, Voss SR. Genic regions of a large salamander genome contain long introns and novel genes. BMC Genomics 2009;10:19. [PMID: 19144141 PMCID: PMC2633012 DOI: 10.1186/1471-2164-10-19] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2008] [Accepted: 01/13/2009] [Indexed: 01/30/2023] Open

McGaughey DM, Stine ZE, Huynh JL, Vinton RM, McCallion AS. Asymmetrical distribution of non-conserved regulatory sequences at PHOX2B is reflected at the ENCODE loci and illuminates a possible genome-wide trend. BMC Genomics 2009;10:8. [PMID: 19128492 PMCID: PMC2630312 DOI: 10.1186/1471-2164-10-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2008] [Accepted: 01/07/2009] [Indexed: 02/04/2023] Open

Abstract

BACKGROUND

Transcriptional regulatory elements are central to development and interspecific phenotypic variation. Current regulatory element prediction tools rely heavily upon conservation for prediction of putative elements. Recent in vitro observations from the ENCODE project combined with in vivo analyses at the zebrafish phox2b locus suggests that a significant fraction of regulatory elements may fall below commonly applied metrics of conservation. We propose to explore these observations in vivo at the human PHOX2B locus, and also evaluate the potential evidence for genome-wide applicability of these observations through a novel analysis of extant data.

RESULTS

Transposon-based transgenic analysis utilizing a tiling path proximal to human PHOX2B in zebrafish recapitulates the observations at the zebrafish phox2b locus of both conserved and non-conserved regulatory elements. Analysis of human sequences conserved with previously identified zebrafish phox2b regulatory elements demonstrates that the orthologous sequences exhibit overlapping regulatory control. Additionally, analysis of non-conserved sequences scattered over 135 kb 5' to PHOX2B, provides evidence of non-conserved regulatory elements positively biased with close proximity to the gene. Furthermore, we provide a novel analysis of data from the ENCODE project, finding a non-uniform distribution of regulatory elements consistent with our in vivo observations at PHOX2B. These observations remain largely unchanged when one accounts for the sequence repeat content of the assayed intervals, when the intervals are sub-classified by biological role (developmental versus non-developmental), or by gene density (gene desert versus non-gene desert).

CONCLUSION

While regulatory elements frequently display evidence of evolutionary conservation, a fraction appears to be undetected by current metrics of conservation. In vivo observations at the PHOX2B locus, supported by our analyses of in vitro data from the ENCODE project, suggest that the risk of excluding non-conserved sequences in a search for regulatory elements may decrease as distance from the gene increases. Our data combined with the ENCODE data suggests that this may represent a genome wide trend.

Collapse

Miele A, Dekker J. Long-range chromosomal interactions and gene regulation. MOLECULAR BIOSYSTEMS 2008;4:1046-57. [PMID: 18931780 PMCID: PMC2653627 DOI: 10.1039/b803580f] [Citation(s) in RCA: 130] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Identification of nuclear and cytoplasmic mRNA targets for the shuttling protein SF2/ASF. PLoS One 2008;3:e3369. [PMID: 18841201 PMCID: PMC2556390 DOI: 10.1371/journal.pone.0003369] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2008] [Accepted: 07/31/2008] [Indexed: 12/15/2022] Open

Pashos EE, Kague E, Fisher S. Evaluation of cis-regulatory function in zebrafish. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2008;7:465-73. [PMID: 18820318 DOI: 10.1093/bfgp/eln045] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Zhou H, Lin K. Excess of microRNAs in large and very 5' biased introns. Biochem Biophys Res Commun 2008;368:709-15. [PMID: 18249189 DOI: 10.1016/j.bbrc.2008.01.117] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2008] [Accepted: 01/27/2008] [Indexed: 11/29/2022]

Bock C, Lengauer T. Computational epigenetics. Bioinformatics 2007;24:1-10. [PMID: 18024971 DOI: 10.1093/bioinformatics/btm546] [Citation(s) in RCA: 150] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

Adams D, Karolak M, Robertson E, Oxburgh L. Control of kidney, eye and limb expression of Bmp7 by an enhancer element highly conserved between species. Dev Biol 2007;311:679-90. [PMID: 17936743 DOI: 10.1016/j.ydbio.2007.08.036] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2007] [Revised: 08/10/2007] [Accepted: 08/20/2007] [Indexed: 01/04/2023]

Abstract

Bmp7 is expressed in numerous tissues throughout development and is required for morphogenesis of the eye, hindlimb and kidney. In this study we show that the majority if not all of the cis-regulatory sequence governing expression at these anatomical sites during development is present in approximately 20 kb surrounding exon 1. In eye, limb and kidney, multiple distinct enhancer elements drive Bmp7 expression within each organ. In the eye, the elements driving expression in the pigmented epithelium and iris are spatially separated. In the kidney, Bmp7 expression in collecting ducts and nephron progenitors is driven by separate enhancer elements. Similarly, limb mesenchyme and apical ectodermal ridge expression are governed by separate elements. Although enhancers for pigmented epithelium, nephrogenic mesenchyme and apical ectodermal ridge are distributed across the approximately 20 kb region, an element of approximately 480 base pairs within intron 1 governs expression within the developing iris, collecting duct system of the kidney and limb mesenchyme. This element is remarkably conserved both in sequence and position in the Bmp7 locus between different vertebrates, ranging from Xenopus tropicalis to Homo sapiens, demonstrating that there is strong selective pressure for Bmp7 expression at these tissue sites. Furthermore, we show that the frog enhancer functions appropriately in transgenic mice. Interestingly, the intron 1 element cannot be found in the Bmp7 genes of vertebrates such as Danio rerio and Takifugu rubripes indicating that this modification of the Bmp7 gene might have arisen during the adaptation from aquatic to terrestrial life. Mutational analysis demonstrates that the enhancer activity of the intron 1 element is entirely dependent on the presence of a 10 base pair site within the intron 1 enhancer containing a predicted binding site for the FOXD3 transcription factor.

Collapse

Weinstock GM. ENCODE: more genomic empowerment. Genome Res 2007;17:667-8. [PMID: 17567987 DOI: 10.1101/gr.6534207] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, Kuehn MS, Taylor CM, Neph S, Koch CM, Asthana S, Malhotra A, Adzhubei I, Greenbaum JA, Andrews RM, Flicek P, Boyle PJ, Cao H, Carter NP, Clelland GK, Davis S, Day N, Dhami P, Dillon SC, Dorschner MO, Fiegler H, Giresi PG, Goldy J, Hawrylycz M, Haydock A, Humbert R, James KD, Johnson BE, Johnson EM, Frum TT, Rosenzweig ER, Karnani N, Lee K, Lefebvre GC, Navas PA, Neri F, Parker SCJ, Sabo PJ, Sandstrom R, Shafer A, Vetrie D, Weaver M, Wilcox S, Yu M, Collins FS, Dekker J, Lieb JD, Tullius TD, Crawford GE, Sunyaev S, Noble WS, Dunham I, Denoeud F, Reymond A, Kapranov P, Rozowsky J, Zheng D, Castelo R, Frankish A, Harrow J, Ghosh S, Sandelin A, Hofacker IL, Baertsch R, Keefe D, Dike S, Cheng J, Hirsch HA, Sekinger EA, Lagarde J, Abril JF, Shahab A, Flamm C, Fried C, Hackermüller J, Hertel J, Lindemeyer M, Missal K, Tanzer A, Washietl S, Korbel J, Emanuelsson O, Pedersen JS, Holroyd N, Taylor R, Swarbreck D, Matthews N, Dickson MC, Thomas DJ, Weirauch MT, Gilbert J, Drenkow J, Bell I, Zhao X, Srinivasan KG, Sung WK, Ooi HS, Chiu KP, Foissac S, Alioto T, Brent M, Pachter L, Tress ML, Valencia A, Choo SW, Choo CY, Ucla C, Manzano C, Wyss C, Cheung E, Clark TG, Brown JB, Ganesh M, Patel S, Tammana H, Chrast J, Henrichsen CN, Kai C, Kawai J, Nagalakshmi U, Wu J, Lian Z, Lian J, Newburger P, Zhang X, Bickel P, Mattick JS, Carninci P, Hayashizaki Y, Weissman S, Hubbard T, Myers RM, Rogers J, Stadler PF, Lowe TM, Wei CL, Ruan Y, Struhl K, Gerstein M, Antonarakis SE, Fu Y, Green ED, Karaöz U, Siepel A, Taylor J, Liefer LA, Wetterstrand KA, Good PJ, Feingold EA, Guyer MS, Cooper GM, Asimenos G, Dewey CN, Hou M, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Huang H, Zhang NR, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Seringhaus M, Church D, Rosenbloom K, Kent WJ, Stone EA, Batzoglou S, Goldman N, Hardison RC, Haussler D, Miller W, Sidow A, Trinklein ND, Zhang ZD, Barrera L, Stuart R, King DC, Ameur A, Enroth S, Bieda MC, Kim J, Bhinge AA, Jiang N, Liu J, Yao F, Vega VB, Lee CWH, Ng P, Shahab A, Yang A, Moqtaderi Z, Zhu Z, Xu X, Squazzo S, Oberley MJ, Inman D, Singer MA, Richmond TA, Munn KJ, Rada-Iglesias A, Wallerman O, Komorowski J, Fowler JC, Couttet P, Bruce AW, Dovey OM, Ellis PD, Langford CF, Nix DA, Euskirchen G, Hartman S, Urban AE, Kraus P, Van Calcar S, Heintzman N, Kim TH, Wang K, Qu C, Hon G, Luna R, Glass CK, Rosenfeld MG, Aldred SF, Cooper SJ, Halees A, Lin JM, Shulha HP, Zhang X, Xu M, Haidar JNS, Yu Y, Ruan Y, Iyer VR, Green RD, Wadelius C, Farnham PJ, Ren B, Harte RA, Hinrichs AS, Trumbower H, Clawson H, Hillman-Jackson J, Zweig AS, Smith K, Thakkapallayil A, Barber G, Kuhn RM, Karolchik D, Armengol L, Bird CP, de Bakker PIW, Kern AD, Lopez-Bigas N, Martin JD, Stranger BE, Woodroffe A, Davydov E, Dimas A, Eyras E, Hallgrímsdóttir IB, Huppert J, Zody MC, Abecasis GR, Estivill X, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VVB, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Koriabine M, Nefedov M, Osoegawa K, Yoshinaga Y, Zhu B, de Jong PJ. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 2007;447:799-816. [PMID: 17571346 PMCID: PMC2212820 DOI: 10.1038/nature05874] [Citation(s) in RCA: 3826] [Impact Index Per Article: 225.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Gerstein MB, Bruce C, Rozowsky JS, Zheng D, Du J, Korbel JO, Emanuelsson O, Zhang ZD, Weissman S, Snyder M. What is a gene, post-ENCODE? History and updated definition. Genome Res 2007;17:669-81. [PMID: 17567988 DOI: 10.1101/gr.6339607] [Citation(s) in RCA: 457] [Impact Index Per Article: 26.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]