51
|
Li G, Ruan X, Auerbach RK, Sandhu KS, Zheng M, Wang P, Poh HM, Goh Y, Lim J, Zhang J, Sim HS, Peh SQ, Mulawadi FH, Ong CT, Orlov YL, Hong S, Zhang Z, Landt S, Raha D, Euskirchen G, Wei CL, Ge W, Wang H, Davis C, Fisher-Aylor KI, Mortazavi A, Gerstein M, Gingeras T, Wold B, Sun Y, Fullwood MJ, Cheung E, Liu E, Sung WK, Snyder M, Ruan Y. Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell 2012; 148:84-98. [PMID: 22265404 DOI: 10.1016/j.cell.2011.12.014] [Citation(s) in RCA: 905] [Impact Index Per Article: 75.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2011] [Revised: 10/21/2011] [Accepted: 12/12/2011] [Indexed: 10/14/2022]
Abstract
Higher-order chromosomal organization for transcription regulation is poorly understood in eukaryotes. Using genome-wide Chromatin Interaction Analysis with Paired-End-Tag sequencing (ChIA-PET), we mapped long-range chromatin interactions associated with RNA polymerase II in human cells and uncovered widespread promoter-centered intragenic, extragenic, and intergenic interactions. These interactions further aggregated into higher-order clusters, wherein proximal and distal genes were engaged through promoter-promoter interactions. Most genes with promoter-promoter interactions were active and transcribed cooperatively, and some interacting promoters could influence each other implying combinatorial complexity of transcriptional controls. Comparative analyses of different cell lines showed that cell-specific chromatin interactions could provide structural frameworks for cell-specific transcription, and suggested significant enrichment of enhancer-promoter interactions for cell-specific functions. Furthermore, genetically-identified disease-associated noncoding elements were found to be spatially engaged with corresponding genes through long-range interactions. Overall, our study provides insights into transcription regulation by three-dimensional chromatin interactions for both housekeeping and cell-specific genes in human cells.
Collapse
|
52
|
Handoko L, Xu H, Li G, Ngan CY, Chew E, Schnapp M, Lee CWH, Ye C, Ping JLH, Mulawadi F, Wong E, Sheng J, Zhang Y, Poh T, Chan CS, Kunarso G, Shahab A, Bourque G, Cacheux-Rataboul V, Sung WK, Ruan Y, Wei CL. Erratum: CTCF-mediated functional chromatin interactome in pluripotent cells. Nat Genet 2011. [DOI: 10.1038/ng0811-815a] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
53
|
Handoko L, Xu H, Li G, Ngan CY, Chew E, Schnapp M, Lee CWH, Ye C, Ping JLH, Mulawadi F, Wong E, Sheng J, Zhang Y, Poh T, Chan CS, Kunarso G, Shahab A, Bourque G, Cacheux-Rataboul V, Sung WK, Ruan Y, Wei CL. CTCF-mediated functional chromatin interactome in pluripotent cells. Nat Genet 2011; 43:630-8. [PMID: 21685913 PMCID: PMC3436933 DOI: 10.1038/ng.857] [Citation(s) in RCA: 492] [Impact Index Per Article: 37.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2011] [Accepted: 05/16/2011] [Indexed: 12/13/2022]
Abstract
Mammalian genomes are viewed as functional organizations that orchestrate spatial and temporal gene regulation. CTCF, the most characterized insulator-binding protein, has been implicated as a key genome organizer. However, little is known about CTCF-associated higher-order chromatin structures at a global scale. Here we applied chromatin interaction analysis by paired-end tag (ChIA-PET) sequencing to elucidate the CTCF-chromatin interactome in pluripotent cells. From this analysis, we identified 1,480 cis- and 336 trans-interacting loci with high reproducibility and precision. Associating these chromatin interaction loci with their underlying epigenetic states, promoter activities, enhancer binding and nuclear lamina occupancy, we uncovered five distinct chromatin domains that suggest potential new models of CTCF function in chromatin organization and transcriptional control. Specifically, CTCF interactions demarcate chromatin-nuclear membrane attachments and influence proper gene expression through extensive cross-talk between promoters and regulatory elements. This highly complex nuclear organization offers insights toward the unifying principles that govern genome plasticity and function.
Collapse
|
54
|
Müller J, Samans B, van Riggelen J, Fagà G, Peh K N R, Wei CL, Müller H, Amati B, Felsher D, Eilers M. TGFβ-dependent gene expression shows that senescence correlates with abortive differentiation along several lineages in Myc-induced lymphomas. Cell Cycle 2011; 9:4622-6. [PMID: 21127397 DOI: 10.4161/cc.9.23.14211] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Deregulated expression of Myc under the control of an immunoglobulin enhancer induces lymphoma formation in mice. The development of lymphomas is limited by TGFβ-dependent senescence and high levels of Myc expression are continuously required to antagonize senescence. The biological processes underlying senescence are not fully resolved. We report here a comprehensive analysis of TGFβ-dependent alterations in gene expression when the Myc transgene is switched off. Our data show that Myc-induced target genes are downregulated in a TGFβ-independent manner. In contrast, TGFβ is required to upregulate a broad spectrum of genes that are characteristic of different T-cell lineages when Myc is turned off. The analysis reveals a significant overlap between these Myc-repressed genes with genes that are targets of polycomb repressive complexes in embryonic stem cells. Therefore, TGFβ-dependent senescence is associated with gene expression patterns indicative of abortive cellular differentiation along several lineages.
Collapse
|
55
|
Hillmer AM, Yao F, Inaki K, Lee WH, Ariyaratne PN, Teo ASM, Woo XY, Zhang Z, Zhao H, Ukil L, Chen JP, Zhu F, So JBY, Salto-Tellez M, Poh WT, Zawack KFB, Nagarajan N, Gao S, Li G, Kumar V, Lim HPJ, Sia YY, Chan CS, Leong ST, Neo SC, Choi PSD, Thoreau H, Tan PBO, Shahab A, Ruan X, Bergh J, Hall P, Cacheux-Rataboul V, Wei CL, Yeoh KG, Sung WK, Bourque G, Liu ET, Ruan Y. Comprehensive long-span paired-end-tag mapping reveals characteristic patterns of structural variations in epithelial cancer genomes. Genome Res 2011; 21:665-75. [PMID: 21467267 DOI: 10.1101/gr.113555.110] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Somatic genome rearrangements are thought to play important roles in cancer development. We optimized a long-span paired-end-tag (PET) sequencing approach using 10-Kb genomic DNA inserts to study human genome structural variations (SVs). The use of a 10-Kb insert size allows the identification of breakpoints within repetitive or homology-containing regions of a few kilobases in size and results in a higher physical coverage compared with small insert libraries with the same sequencing effort. We have applied this approach to comprehensively characterize the SVs of 15 cancer and two noncancer genomes and used a filtering approach to strongly enrich for somatic SVs in the cancer genomes. Our analyses revealed that most inversions, deletions, and insertions are germ-line SVs, whereas tandem duplications, unpaired inversions, interchromosomal translocations, and complex rearrangements are over-represented among somatic rearrangements in cancer genomes. We demonstrate that the quantitative and connective nature of DNA-PET data is precise in delineating the genealogy of complex rearrangement events, we observe signatures that are compatible with breakage-fusion-bridge cycles, and we discover that large duplications are among the initial rearrangements that trigger genome instability for extensive amplification in epithelial cancers.
Collapse
|
56
|
Wong E, Wei CL. Genome-wide distribution of DNA methylation at single-nucleotide resolution. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2011; 101:459-77. [PMID: 21507362 DOI: 10.1016/b978-0-12-387685-0.00015-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
DNA methylation, a well-known epigenetic modification in mammalian genomes, is important for development and health. Dysregulation of DNA methylation can cause abnormal gene regulation, leading to anomalous development and diseases. Until recently, the ability to understand the functions and dynamics of DNA methylation was limited by the availability of technologies for comprehensively characterizing methylation on a genome-wide scale. Rapid advances in high-throughput approaches (particularly next-generation sequencing), coupled with molecular techniques, have enabled unbiased genome-wide profiling of DNA modifications at single-base resolution and helped to elucidate their impact on gene regulation. Here, we discuss the development of genomic approaches to decipher the global methylome at single-base resolution, the challenges faced, and the emerging new insights. Our ability to decipher this important epigenetic modification and how it impacts gene expression will provide a framework for understanding numerous disease mechanisms, and suggest means to treat or prevent them in the future.
Collapse
|
57
|
Schnetz MP, Handoko L, Akhtar-Zaidi B, Bartels CF, Pereira CF, Fisher AG, Adams DJ, Flicek P, Crawford GE, LaFramboise T, Tesar P, Wei CL, Scacheri PC. CHD7 targets active gene enhancer elements to modulate ES cell-specific gene expression. PLoS Genet 2010; 6:e1001023. [PMID: 20657823 PMCID: PMC2904778 DOI: 10.1371/journal.pgen.1001023] [Citation(s) in RCA: 183] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2009] [Accepted: 06/14/2010] [Indexed: 01/22/2023] Open
Abstract
CHD7 is one of nine members of the chromodomain helicase DNA–binding domain family of ATP–dependent chromatin remodeling enzymes found in mammalian cells. De novo mutation of CHD7 is a major cause of CHARGE syndrome, a genetic condition characterized by multiple congenital anomalies. To gain insights to the function of CHD7, we used the technique of chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP–Seq) to map CHD7 sites in mouse ES cells. We identified 10,483 sites on chromatin bound by CHD7 at high confidence. Most of the CHD7 sites show features of gene enhancer elements. Specifically, CHD7 sites are predominantly located distal to transcription start sites, contain high levels of H3K4 mono-methylation, found within open chromatin that is hypersensitive to DNase I digestion, and correlate with ES cell-specific gene expression. Moreover, CHD7 co-localizes with P300, a known enhancer-binding protein and strong predictor of enhancer activity. Correlations with 18 other factors mapped by ChIP–seq in mouse ES cells indicate that CHD7 also co-localizes with ES cell master regulators OCT4, SOX2, and NANOG. Correlations between CHD7 sites and global gene expression profiles obtained from Chd7+/+, Chd7+/−, and Chd7−/− ES cells indicate that CHD7 functions at enhancers as a transcriptional rheostat to modulate, or fine-tune the expression levels of ES–specific genes. CHD7 can modulate genes in either the positive or negative direction, although negative regulation appears to be the more direct effect of CHD7 binding. These data indicate that enhancer-binding proteins can limit gene expression and are not necessarily co-activators. Although ES cells are not likely to be affected in CHARGE syndrome, we propose that enhancer-mediated gene dysregulation contributes to disease pathogenesis and that the critical CHD7 target genes may be subject to positive or negative regulation. The gene encoding chromodomain helicase DNA–binding protein 7 (CHD7) is required for normal mammalian development. In humans, genetic mutations in CHD7 lead to CHARGE syndrome, a disorder characterized by multiple birth defects. In previous studies, CHD7 was shown to localize to the cell nucleus and bind to specific sites on chromatin. However, the genome-wide distribution of CHD7 on chromatin and its function are not known. Here, we identified 10,483 sites on chromatin bound by CHD7 in mouse embryonic stem cells. Many of these sites are gene enhancer elements suspected to be involved in turning on genes. We show CHD7 functions at these loci to fine-tune the levels of genes that are specifically expressed in mouse ES cells. This modulation is mediated through several proteins that bind together with CHD7 at enhancer elements and can occur in either direction. These findings suggest CHARGE syndrome is the result of key genes that are improperly expressed during development. These key genes are currently unknown but are likely to be tissue-specific and may be upregulated or downregulated in response to CHD7 mutation.
Collapse
|
58
|
De Santa F, Barozzi I, Mietton F, Ghisletti S, Polletti S, Tusi BK, Muller H, Ragoussis J, Wei CL, Natoli G. A large fraction of extragenic RNA pol II transcription sites overlap enhancers. PLoS Biol 2010; 8:e1000384. [PMID: 20485488 PMCID: PMC2867938 DOI: 10.1371/journal.pbio.1000384] [Citation(s) in RCA: 631] [Impact Index Per Article: 45.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2009] [Accepted: 04/21/2010] [Indexed: 01/17/2023] Open
Abstract
A substantial fraction of extragenic Pol II transcription sites coincides with transcriptional enhancers, which may be relevant for functional annotation of mammalian genomes. Mammalian genomes are pervasively transcribed outside mapped protein-coding genes. One class of extragenic transcription products is represented by long non-coding RNAs (lncRNAs), some of which result from Pol_II transcription of bona-fide RNA genes. Whether all lncRNAs described insofar are products of RNA genes, however, is still unclear. Here we have characterized transcription sites located outside protein-coding genes in a highly regulated response, macrophage activation by endotoxin. Using chromatin signatures, we could unambiguously classify extragenic Pol_II binding sites as belonging to either canonical RNA genes or transcribed enhancers. Unexpectedly, 70% of extragenic Pol_II peaks were associated with genomic regions with a canonical chromatin signature of enhancers. Enhancer-associated extragenic transcription was frequently adjacent to inducible inflammatory genes, was regulated in response to endotoxin stimulation, and generated very low abundance transcripts. Moreover, transcribed enhancers were under purifying selection and contained binding sites for inflammatory transcription factors, thus suggesting their functionality. These data demonstrate that a large fraction of extragenic Pol_II transcription sites can be ascribed to cis-regulatory genomic regions. Discrimination between lncRNAs generated by canonical RNA genes and products of transcribed enhancers will provide a framework for experimental approaches to lncRNAs and help complete the annotation of mammalian genomes. Mammalian genomes contain vast intergenic regions that are extensively transcribed and generate various types of short and long non-coding RNAs (ncRNAs). Although in some cases specific functions have been assigned to intergenic transcripts, the functional significance of this transcriptional output remains largely unknown, and the possibility exists that part of this transcription reflects noise generated by random collisions of the transcriptional machinery with the genome to generate meaningless transcription. In this study we used chromatin signatures to characterize extragenic transcription sites targeted by RNA Polymerase II (RNA Pol II) in a highly regulated response—endotoxin activation of macrophages. We found that a significant portion of extragenic transcription sites are associated with the chromatin signature characteristic of enhancers. Consistent with their chromatin signature, we found that these extragenic transcription sites are under purifying selection and contain binding sites for inflammatory transcription factors, as well as for PU.1, a hematopoietic transcription factor that marks enhancers in macrophages. Moreover, much of this extragenic transcription is regulated by stimulation. We also identified hundreds of transcribed regions with a signature of canonical RNA genes. Our data indicate that extragenic transcription sites can be efficiently classified using chromatin signatures, which will be relevant for functional annotation of mammalian genomes.
Collapse
|
59
|
Xu H, Handoko L, Wei X, Ye C, Sheng J, Wei CL, Lin F, Sung WK. A signal-noise model for significance analysis of ChIP-seq with negative control. ACTA ACUST UNITED AC 2010; 26:1199-204. [PMID: 20371496 DOI: 10.1093/bioinformatics/btq128] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION ChIP-seq is becoming the main approach to the genome-wide study of protein-DNA interactions and histone modifications. Existing informatics tools perform well to extract strong ChIP-enriched sites. However, two questions remain to be answered: (i) to which extent is a ChIP-seq experiment able to reveal the weak ChIP-enriched sites? (ii) are the weak sites biologically meaningful? To answer these questions, it is necessary to identify the weak ChIP signals from background noise. RESULTS We propose a linear signal-noise model, in which a noise rate was introduced to represent the fraction of noise in a ChIP library. We developed an iterative algorithm to estimate the noise rate using a control library, and derived a library-swapping strategy for the false discovery rate estimation. These approaches were integrated in a general-purpose framework, named CCAT (Control-based ChIP-seq Analysis Tool), for the significance analysis of ChIP-seq. Applications to H3K4me3 and H3K36me3 datasets showed that CCAT predicted significantly more ChIP-enriched sites that the previous methods did. With the high sensitivity of CCAT prediction, we revealed distinct chromatin features associated to the strong and weak H3K4me3 sites. AVAILABILITY http://cmb.gis.a-star.edu.sg/ChIPSeq/tools.htm.
Collapse
|
60
|
Ghisletti S, Barozzi I, Mietton F, Polletti S, De Santa F, Venturini E, Gregory L, Lonie L, Chew A, Wei CL, Ragoussis J, Natoli G. Identification and characterization of enhancers controlling the inflammatory gene expression program in macrophages. Immunity 2010; 32:317-28. [PMID: 20206554 DOI: 10.1016/j.immuni.2010.02.008] [Citation(s) in RCA: 492] [Impact Index Per Article: 35.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2010] [Revised: 02/03/2010] [Accepted: 02/04/2010] [Indexed: 01/29/2023]
Abstract
Enhancers determine tissue-specific gene expression programs. Enhancers are marked by high histone H3 lysine 4 mono-methylation (H3K4me1) and by the acetyl-transferase p300, which has allowed genome-wide enhancer identification. However, the regulatory principles by which subsets of enhancers become active in specific developmental and/or environmental contexts are unknown. We exploited inducible p300 binding to chromatin to identify, and then mechanistically dissect, enhancers controlling endotoxin-stimulated gene expression in macrophages. In these enhancers, binding sites for the lineage-restricted and constitutive Ets protein PU.1 coexisted with those for ubiquitous stress-inducible transcription factors such as NF-kappaB, IRF, and AP-1. PU.1 was required for maintaining H3K4me1 at macrophage-specific enhancers. Reciprocally, ectopic expression of PU.1 reactivated these enhancers in fibroblasts. Thus, the combinatorial assembly of tissue- and signal-specific transcription factors determines the activity of a distinct group of enhancers. We suggest that this may represent a general paradigm in tissue-restricted and stimulus-responsive gene regulation.
Collapse
|
61
|
Li G, Fullwood MJ, Xu H, Mulawadi FH, Velkov S, Vega V, Ariyaratne PN, Mohamed YB, Ooi HS, Tennakoon C, Wei CL, Ruan Y, Sung WK. ChIA-PET tool for comprehensive chromatin interaction analysis with paired-end tag sequencing. Genome Biol 2010; 11:R22. [PMID: 20181287 PMCID: PMC2872882 DOI: 10.1186/gb-2010-11-2-r22] [Citation(s) in RCA: 207] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2009] [Revised: 12/30/2009] [Accepted: 02/25/2010] [Indexed: 11/10/2022] Open
Abstract
ChIA-PET Tool can be used to process long-range chromatin interaction data. Results are visualized on a user-friendly genome browser. Chromatin interaction analysis with paired-end tag sequencing (ChIA-PET) is a new technology to study genome-wide long-range chromatin interactions bound by protein factors. Here we present ChIA-PET Tool, a software package for automatic processing of ChIA-PET sequence data, including linker filtering, mapping tags to reference genomes, identifying protein binding sites and chromatin interactions, and displaying the results on a graphical genome browser. ChIA-PET Tool is fast, accurate, comprehensive, user-friendly, and open source (available at http://chiapet.gis.a-star.edu.sg).
Collapse
|
62
|
Fullwood MJ, Han Y, Wei CL, Ruan X, Ruan Y. Chromatin interaction analysis using paired-end tag sequencing. ACTA ACUST UNITED AC 2010; Chapter 21:Unit 21.15.1-25. [PMID: 20069536 DOI: 10.1002/0471142727.mb2115s89] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Chromatin Interaction Analysis using Paired-End Tag sequencing (ChIA-PET) is a technique developed for large-scale, de novo analysis of higher-order chromatin structures. Cells are treated with formaldehyde to cross-link chromatin interactions, DNA segments bound by protein factors are enriched by chromatin immunoprecipitation, and interacting DNA fragments are then captured by proximity ligation. The Paired-End Tag (PET) strategy is applied to the construction of ChIA-PET libraries, which are sequenced by high-throughput next-generation sequencing technologies. Finally, raw PET sequences are subjected to bioinformatics analysis, resulting in a genome-wide map of binding sites and chromatin interactions mediated by the protein factor under study. This unit describes ChIA-PET for genome-wide analysis of chromatin interactions in mammalian cells, with the application of Roche/454 and Illumina sequencing technologies.
Collapse
|
63
|
Laurent L, Wong E, Li G, Huynh T, Tsirigos A, Ong CT, Low HM, Kin Sung KW, Rigoutsos I, Loring J, Wei CL. Dynamic changes in the human methylome during differentiation. Genome Res 2010; 20:320-31. [PMID: 20133333 DOI: 10.1101/gr.101907.109] [Citation(s) in RCA: 759] [Impact Index Per Article: 54.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
DNA methylation is a critical epigenetic regulator in mammalian development. Here, we present a whole-genome comparative view of DNA methylation using bisulfite sequencing of three cultured cell types representing progressive stages of differentiation: human embryonic stem cells (hESCs), a fibroblastic differentiated derivative of the hESCs, and neonatal fibroblasts. As a reference, we compared our maps with a methylome map of a fully differentiated adult cell type, mature peripheral blood mononuclear cells (monocytes). We observed many notable common and cell-type-specific features among all cell types. Promoter hypomethylation (both CG and CA) and higher levels of gene body methylation were positively correlated with transcription in all cell types. Exons were more highly methylated than introns, and sharp transitions of methylation occurred at exon-intron boundaries, suggesting a role for differential methylation in transcript splicing. Developmental stage was reflected in both the level of global methylation and extent of non-CpG methylation, with hESC highest, fibroblasts intermediate, and monocytes lowest. Differentiation-associated differential methylation profiles were observed for developmentally regulated genes, including the HOX clusters, other homeobox transcription factors, and pluripotence-associated genes such as POU5F1, TCF3, and KLF4. Our results highlight the value of high-resolution methylation maps, in conjunction with other systems-level analyses, for investigation of previously undetectable developmental regulatory mechanisms.
Collapse
|
64
|
Fullwood MJ, Liu MH, Pan YF, Liu J, Xu H, Mohamed YB, Orlov YL, Velkov S, Ho A, Mei PH, Chew EGY, Huang PYH, Welboren WJ, Han Y, Ooi HS, Ariyaratne PN, Vega VB, Luo Y, Tan PY, Choy PY, Wansa KDSA, Zhao B, Lim KS, Leow SC, Yow JS, Joseph R, Li H, Desai KV, Thomsen JS, Lee YK, Karuturi RKM, Herve T, Bourque G, Stunnenberg HG, Ruan X, Cacheux-Rataboul V, Sung WK, Liu ET, Wei CL, Cheung E, Ruan Y. An oestrogen-receptor-alpha-bound human chromatin interactome. Nature 2009; 462:58-64. [PMID: 19890323 PMCID: PMC2774924 DOI: 10.1038/nature08497] [Citation(s) in RCA: 1251] [Impact Index Per Article: 83.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2009] [Accepted: 09/11/2009] [Indexed: 12/17/2022]
Abstract
Genomes are organized into high-level three-dimensional structures, and DNA elements separated by long genomic distances can in principle interact functionally. Many transcription factors bind to regulatory DNA elements distant from gene promoters. Although distal binding sites have been shown to regulate transcription by long-range chromatin interactions at a few loci, chromatin interactions and their impact on transcription regulation have not been investigated in a genome-wide manner. Here we describe the development of a new strategy, chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) for the de novo detection of global chromatin interactions, with which we have comprehensively mapped the chromatin interaction network bound by oestrogen receptor alpha (ER-alpha) in the human genome. We found that most high-confidence remote ER-alpha-binding sites are anchored at gene promoters through long-range chromatin interactions, suggesting that ER-alpha functions by extensive chromatin looping to bring genes together for coordinated transcriptional regulation. We propose that chromatin interactions constitute a primary mechanism for regulating transcription in mammalian genomes.
Collapse
|
65
|
De Santa F, Narang V, Yap ZH, Tusi BK, Burgold T, Austenaa L, Bucci G, Caganova M, Notarbartolo S, Casola S, Testa G, Sung WK, Wei CL, Natoli G. Jmjd3 contributes to the control of gene expression in LPS-activated macrophages. EMBO J 2009; 28:3341-52. [PMID: 19779457 PMCID: PMC2752025 DOI: 10.1038/emboj.2009.271] [Citation(s) in RCA: 342] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2009] [Accepted: 08/17/2009] [Indexed: 12/29/2022] Open
Abstract
Jmjd3, a JmjC family histone demethylase, is induced by the transcription factor NF-kB in response to microbial stimuli. Jmjd3 erases H3K27me3, a histone mark associated with transcriptional repression and involved in lineage determination. However, the specific contribution of Jmjd3 induction and H3K27me3 demethylation to inflammatory gene expression remains unknown. Using chromatin immunoprecipitation-sequencing we found that Jmjd3 is preferentially recruited to transcription start sites characterized by high levels of H3K4me3, a marker of gene activity, and RNA polymerase II (Pol_II). Moreover, 70% of lipopolysaccharide (LPS)-inducible genes were found to be Jmjd3 targets. Although most Jmjd3 target genes were unaffected by its deletion, a few hundred genes, including inducible inflammatory genes, showed moderately impaired Pol_II recruitment and transcription. Importantly, most Jmjd3 target genes were not associated with detectable levels of H3K27me3, and transcriptional effects of Jmjd3 absence in the window of time analysed were uncoupled from measurable effects on this histone mark. These data show that Jmjd3 fine-tunes the transcriptional output of LPS-activated macrophages in an H3K27 demethylation-independent manner.
Collapse
|
66
|
Wong E, Wei CL. ChIP'ing the mammalian genome: technical advances and insights into functional elements. Genome Med 2009; 1:89. [PMID: 19804607 PMCID: PMC2768996 DOI: 10.1186/gm89] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Characterization of the functional components in mammalian genomes depends on our ability to completely elucidate the genetic and epigenetic regulatory networks of chromatin states and nuclear architecture. Such endeavors demand the availability of robust and effective approaches to characterizing protein-DNA associations in their native chromatin environments. Consider able progress has been made through the applica tion of chromatin immunoprecipitation (ChIP) to study chromatin biology in cells. Coupled with genome-wide analyses, ChIP-based assays enable us to take a global, unbiased and comprehensive view of transcriptional control, epigenetic regulation and chromatin structures, with high precision and versatility. The integrated knowledge derived from these studies is used to decipher gene regulatory networks and define genome organization. In this review, we discuss this powerful approach and its current advances. We also explore the possible future developments of ChIP-based approaches to interrogating long-range chromatin interactions and their impact on the mechanisms regulating gene expression.
Collapse
|
67
|
Fullwood MJ, Wei CL, Liu ET, Ruan Y. Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses. Genome Res 2009; 19:521-32. [PMID: 19339662 DOI: 10.1101/gr.074906.107] [Citation(s) in RCA: 259] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Comprehensive understanding of functional elements in the human genome will require thorough interrogation and comparison of individual human genomes and genomic structures. Such an endeavor will require improvements in the throughputs and costs of DNA sequencing. Next-generation sequencing platforms have impressively low costs and high throughputs but are limited by short read lengths. An immediate and widely recognized solution to this critical limitation is the paired-end tag (PET) sequencing for various applications, collectively called the PET sequencing strategy, in which short and paired tags are extracted from the ends of long DNA fragments for ultra-high-throughput sequencing. The PET sequences can be accurately mapped to the reference genome, thus demarcating the genomic boundaries of PET-represented DNA fragments and revealing the identities of the target DNA elements. PET protocols have been developed for the analyses of transcriptomes, transcription factor binding sites, epigenetic sites such as histone modification sites, and genome structures. The exclusive advantage of the PET technology is its ability to uncover linkages between the two ends of DNA fragments. Using this unique feature, unconventional fusion transcripts, genome structural variations, and even molecular interactions between distant genomic elements can be unraveled by PET analysis. Extensive use of PET data could lead to efficient assembly of individual human genomes, transcriptomes, and interactomes, enabling new biological and clinical insights. With its versatile and powerful nature for DNA analysis, the PET sequencing strategy has a bright future ahead.
Collapse
|
68
|
Zhao X, Ruan Y, Wei CL. Tackling the epigenome in the pluripotent stem cells. J Genet Genomics 2009; 35:403-12. [PMID: 18640620 DOI: 10.1016/s1673-8527(08)60058-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2008] [Revised: 05/23/2008] [Accepted: 05/24/2008] [Indexed: 11/26/2022]
Abstract
Embryonic stem cells are unique in their abilities of self-renewal and to differentiate into many, if not all, cellular lineages. Transcriptional regulation, epigenetic modifications and chromatin structures are the key modulators in controlling such pluripotency nature of embryonic stem cell genomes, particularly in the developmental decisions and the maintenance of cell fates. Among them, epigenetic regulation of gene expression is mediated partly by covalent modifications of core histone proteins including methylation, phosphorylation and acetylation. Moreover, the chromatins in stem cell genome appear as a highly organized structure containing distinct functional domains. Recent rapid progress of new technologies enables us to take a global, unbiased and comprehensive view of the epigenetic modifications and chromatin structures that contribute to gene expression regulation and cell identity during diverse developmental stages. Here, we summarized the latest advances made by high throughput approaches in profiling epigenetic modifications and chromatin conformations, with an emphasis on genome-wide analysis of histone modifications and their implications in pluripotency nature of embryonic stem cells.
Collapse
|
69
|
Lim CY, Tam WL, Zhang J, Ang HS, Jia H, Lipovich L, Ng HH, Wei CL, Sung WK, Robson P, Yang H, Lim B. Sall4 regulates distinct transcription circuitries in different blastocyst-derived stem cell lineages. Cell Stem Cell 2008; 3:543-54. [PMID: 18804426 DOI: 10.1016/j.stem.2008.08.004] [Citation(s) in RCA: 183] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2008] [Revised: 07/13/2008] [Accepted: 08/15/2008] [Indexed: 12/21/2022]
Abstract
Stem cells self-renew or differentiate under the governance of a stem-cell-specific transcriptional program, with each transcription factor orchestrating the activities of a particular set of genes. Here we demonstrate that a single transcription factor is able to regulate distinct core circuitries in two different blastocyst-derived stem cell lines, embryonic stem cells (ESCs) and extraembryonic endoderm (XEN) cells. The transcription factor Sall4 is required for early embryonic development and for ESC pluripotency. Sall4 is also expressed in XEN cells, and depletion of Sall4 disrupts self-renewal and induces differentiation. Genome-wide analysis reveals that Sall4 is regulating different gene sets in ESCs and XEN cells, and depletion of Sall4 targets in the respective cell types induces differentiation. With Oct4, Sox2, and Nanog, Sall4 forms a crucial interconnected autoregulatory network in ESCs. In XEN cells, Sall4 regulates the key XEN lineage-associated genes Gata4, Gata6, Sox7, and Sox17. Our findings demonstrate how Sall4 functions as an essential stemness factor for two different stem cell lines.
Collapse
|
70
|
Ablikim M, Bai JZ, Ban Y, Cai X, Chen HF, Chen HS, Chen HX, Chen JC, Chen J, Chen YB, Chu YP, Dai YS, Diao LY, Deng ZY, Dong QF, Du SX, Fang J, Fang SS, Fu CD, Gao CS, Gao YN, Gu SD, Gu YT, Guo YN, He KL, He M, Heng YK, Hou J, Hu HM, Hu JH, Hu T, Huang GS, Huang XT, Ji XB, Jiang XS, Jiang XY, Jiao JB, Jin DP, Jin S, Lai YF, Li G, Li HB, Li J, Li RY, Li SM, Li WD, Li WG, Li XL, Li XN, Li XQ, Liang YF, Liao HB, Liu BJ, Liu CX, Liu F, Liu F, Liu HH, Liu HM, Liu J, Liu JB, Liu JP, Liu J, Liu Q, Liu RG, Liu ZA, Lou YC, Lu F, Lu GR, Lu JG, Luo CL, Ma FC, Ma HL, Ma LL, Ma QM, Mao ZP, Mo XH, Nie J, Ping RG, Qi ND, Qin H, Qiu JF, Ren ZY, Rong G, Ruan XD, Shan LY, Shang L, Shen DL, Shen XY, Sheng HY, Sun HS, Sun SS, Sun YZ, Sun ZJ, Tang X, Tong GL, Wang DY, Wang L, Wang LL, Wang LS, Wang M, Wang P, Wang PL, Wang WF, Wang YF, Wang Z, Wang ZY, Wang Z, Wei CL, Wei DH, Weng Y, Wu N, Xia XM, Xie XX, Xu GF, Xu XP, Xu Y, Yan ML, Yang HX, Yang YX, Ye MH, Ye YX, Yu GW, Yuan CZ, Yuan Y, Zang SL, Zeng Y, Zhang BX, Zhang BY, Zhang CC, Zhang DH, Zhang HQ, Zhang HY, Zhang JW, Zhang JY, Zhang SH, Zhang XY, Zhang Y, Zhang ZX, Zhang ZP, Zhao DX, Zhao JW, Zhao MG, Zhao PP, Zhao WR, Zhao ZG, Zheng HQ, Zheng JP, Zheng ZP, Zhou L, Zhu KJ, Zhu QM, Zhu YC, Zhu YS, Zhu ZA, Zhuang BA, Zhuang XA, Zou BS. Anomalous line shape of the cross section for e{+}e{-}--> hadrons in the center-of-mass energy region between 3.650 and 3.872 GeV. PHYSICAL REVIEW LETTERS 2008; 101:102004. [PMID: 18851209 DOI: 10.1103/physrevlett.101.102004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2008] [Indexed: 05/26/2023]
Abstract
We observe an obvious anomalous line shape of the e;{+}e;{-}--> hadrons total cross sections in the energy region between 3.700 and 3.872 GeV. It is inconsistent with the explanation for only one simple psi(3770) resonance with a statistical significance of 7sigma. The anomalous line shape may be explained by two possible enhancements of the inclusive hadron production near the center-of-mass energies of 3.764 and 3.779 GeV, indicating that either there is likely a new structure in addition to the psi(3770) resonance around 3.773 GeV, or there are some physics effects reflecting the DD[over ] production dynamics.
Collapse
|
71
|
Bourque G, Leong B, Vega VB, Chen X, Lee YL, Srinivasan KG, Chew JL, Ruan Y, Wei CL, Ng HH, Liu ET. Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res 2008; 18:1752-62. [PMID: 18682548 DOI: 10.1101/gr.080663.108] [Citation(s) in RCA: 404] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Identification of lineage-specific innovations in genomic control elements is critical for understanding transcriptional regulatory networks and phenotypic heterogeneity. We analyzed, from an evolutionary perspective, the binding regions of seven mammalian transcription factors (ESR1, TP53, MYC, RELA, POU5F1, SOX2, and CTCF) identified on a genome-wide scale by different chromatin immunoprecipitation approaches and found that only a minority of sites appear to be conserved at the sequence level. Instead, we uncovered a pervasive association with genomic repeats by showing that a large fraction of the bona fide binding sites for five of the seven transcription factors (ESR1, TP53, POU5F1, SOX2, and CTCF) are embedded in distinctive families of transposable elements. Using the age of the repeats, we established that these repeat-associated binding sites (RABS) have been associated with significant regulatory expansions throughout the mammalian phylogeny. We validated the functional significance of these RABS by showing that they are over-represented in proximity of regulated genes and that the binding motifs within these repeats have undergone evolutionary selection. Our results demonstrate that transcriptional regulatory networks are highly dynamic in eukaryotic genomes and that transposable elements play an important role in expanding the repertoire of binding sites.
Collapse
|
72
|
Xu H, Wei CL, Lin F, Sung WK. An HMM approach to genome-wide identification of differential histone modification sites from ChIP-seq data. ACTA ACUST UNITED AC 2008; 24:2344-9. [PMID: 18667444 DOI: 10.1093/bioinformatics/btn402] [Citation(s) in RCA: 125] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
MOTIVATION Epigenetic modifications are one of the critical factors to regulate gene expression and genome function. Among different epigenetic modifications, the differential histone modification sites (DHMSs) are of great interest to study the dynamic nature of epigenetic and gene expression regulations among various cell types, stages or environmental responses. To capture the histone modifications at whole genome scale, ChIP-seq technology is becoming a robust and comprehensive approach. Thus the DHMSs are potentially identifiable by comparing two ChIP-seq libraries. However, little has been addressed on this issue in literature. RESULTS Aiming at identifying DHMSs, we propose an approach called ChIPDiff for the genome-wide comparison of histone modification sites identified by ChIP-seq. Based on the observations of ChIP fragment counts, the proposed approach employs a hidden Markov model (HMM) to infer the states of histone modification changes at each genomic location. We evaluated the performance of ChIPDiff by comparing the H3K27me3 modification sites between mouse embryonic stem cell (ESC) and neural progenitor cell (NPC). We demonstrated that the H3K27me3 DHMSs identified by our approach are of high sensitivity, specificity and technical reproducibility. ChIPDiff was further applied to uncover the differential H3K4me3 and H3K36me3 sites between different cell states. Interesting biological discoveries were achieved from such comparison in our study.
Collapse
|
73
|
Ablikim M, Bai JZ, Ban Y, Cai X, Chen HF, Chen HS, Chen HX, Chen JC, Chen J, Chen YB, Chu YP, Dai YS, Diao LY, Deng ZY, Dong QF, Du SX, Fang J, Fang SS, Fu CD, Gao CS, Gao YN, Gu SD, Gu YT, Guo YN, Guo ZJ, Harris FA, He KL, He M, Heng YK, Hou J, Hu HM, Hu JH, Hu T, Huang XT, Ji XB, Jiang XS, Jiang XY, Jiao JB, Jin DP, Jin S, Jin Y, Lai YF, Li G, Li HB, Li J, Li RY, Li SM, Li WD, Li WG, Li XL, Li XN, Li XQ, Liang YF, Liao HB, Liu BJ, Liu CX, Liu F, Liu F, Liu HH, Liu HM, Liu J, Liu JB, Liu JP, Liu J, Liu Q, Liu RG, Liu ZA, Lou YC, Lu F, Lu GR, Lu JG, Luo CL, Ma FC, Ma HL, Ma LL, Ma QM, Mao ZP, Mo XH, Nie J, Olsen SL, Ping RG, Qi ND, Qin H, Qiu JF, Ren ZY, Rong G, Ruan XD, Shan LY, Shang L, Shen CP, Shen DL, Shen XY, Sheng HY, Sun HS, Sun SS, Sun YZ, Sun ZJ, Tang X, Tong GL, Varner GS, Wang DY, Wang L, Wang LL, Wang LS, Wang M, Wang P, Wang PL, Wang YF, Wang Z, Wang ZY, Wang Z, Wei CL, Wei DH, Wiedner U, Weng Y, Wu N, Xia XM, Xie XX, Xu GF, Xu XP, Xu Y, Yan ML, Yang HX, Yang YX, Ye MH, Ye YX, Yu GW, Yuan CZ, Yuan Y, Zang SL, Zeng Y, Zhang BX, Zhang BY, Zhang CC, Zhang DH, Zhang HQ, Zhang HY, Zhang JW, Zhang JY, Zhang SH, Zhang XY, Zhang Y, Zhang ZX, Zhang ZP, Zhao DX, Zhao JW, Zhao MG, Zhao PP, Zhao WR, Zhao ZG, Zheng HQ, Zheng JP, Zheng ZP, Zhou L, Zhu KJ, Zhu QM, Zhu YC, Zhu YS, Zhu ZA, Zhuang BA, Zhuang XA, Zou BS. Search for the invisible decay of J/psi in psi(2S) --> pi(+)pi(-) J/psi. PHYSICAL REVIEW LETTERS 2008; 100:192001. [PMID: 18518438 DOI: 10.1103/physrevlett.100.192001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2007] [Indexed: 05/26/2023]
Abstract
Using psi(2S) --> pi(+)pi(-) J/psi events in a sample of 14.0 x 10(6) psi(2S) decays collected with the BES-II detector, a search for the decay of the J/psi to invisible final states is performed. No signal is found, and an upper limit at the 90% confidence level is determined to be 1.2 x 10(-2) for the ratio B(J/psi --> invisible)/B(J/psi-->mu(+)mu(-)). This is the first search for J/psi decays to invisible final states.
Collapse
|
74
|
Ablikim M, Bai JZ, Bai Y, Ban Y, Cai X, Chen HF, Chen HS, Chen HX, Chen JC, Chen J, Chen XD, Chen YB, Chu YP, Dai YS, Deng ZY, Du SX, Fang J, Fu CD, Gao CS, Gao YN, Gu SD, Gu YT, Guo YN, Guo ZJ, Harris FA, He KL, He M, Heng YK, Hou J, Hu HM, Hu T, Huang GS, Huang XT, Huang YP, Ji XB, Jiang XS, Jiao JB, Jin DP, Jin S, Lai YF, Li HB, Li J, Li RY, Li WD, Li WG, Li XL, Li XN, Li XQ, Liang YF, Liao HB, Liu BJ, Liu CX, Liu F, Liu F, Liu HH, Liu HM, Liu JB, Liu JP, Liu HB, Liu J, Liu Q, Liu RG, Liu S, Liu ZA, Lu F, Lu GR, Lu JG, Luo CL, Ma FC, Ma HL, Ma LL, Ma QM, Malik MQA, Mao ZP, Mo XH, Nie J, Olsen SL, Ping RG, Qi ND, Qin H, Qiu JF, Rong G, Ruan XD, Shan LY, Shang L, Shen CP, Shen DL, Shen XY, Sheng HY, Sun HS, Sun SS, Sun YZ, Sun ZJ, Tang X, Tian JP, Tong GL, Varner GS, Wan X, Wang L, Wang LL, Wang LS, Wang P, Wang PL, Wang WF, Wang YF, Wang Z, Wang ZY, Wei CL, Wei DH, Weng Y, Wu N, Xia XM, Xie XX, Xu GF, Xu XP, Xu Y, Yan ML, Yang HX, Yang M, Yang YX, Ye MH, Ye YX, Yu CX, Yu GW, Yuan CZ, Yuan Y, Zang SL, Zeng Y, Zhang BX, Zhang BY, Zhang CC, Zhang DH, Zhang HQ, Zhang HY, Zhang JW, Zhang JY, Zhang XY, Zhang YY, Zhang ZX, Zhang ZP, Zhao DX, Zhao JW, Zhao MG, Zhao PP, Zhao ZG, Zheng HQ, Zheng JP, Zheng ZP, Zhou BZL, Zhu KJ, Zhu QM, Zhu XW, Zhu YC, Zhu YS, Zhu ZA, Zhu ZL, Zhuang BA, Zou BS. Observation of Y(2175) in J/psi --> etaphif0 (980). PHYSICAL REVIEW LETTERS 2008; 100:102003. [PMID: 18352176 DOI: 10.1103/physrevlett.100.102003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2007] [Indexed: 05/26/2023]
Abstract
The decays of J/psi --> etaphif(0)(980)[eta --> gammagamma, phi --> K(+) K(-), f(0)(980) --> pi(+)pi(-)] are analyzed using a sample of 5.8 x 10(7) J/psi events collected with the BESII detector at the Beijing Electron-Positron Collider. A structure at around 2.18 GeV/c(2) with about 5 sigma significance is observed in the phif(0)(980) invariant mass spectrum. A fit with a Breit-Wigner function gives the peak mass and width of m = 2.186+/-0.010(stat)+/-0.006(syst) GeV/c(2) and Gamma = 0.065+/-0.023(stat)+/-0.017(syst) GeV/c(2), respectively, which are consistent with those of Y(2175), observed by the BABAR Collaboration in the initial-state radiation process e(+)e(-) --> gamma(ISR) phif(0)(980). The production branching ratio is determined to be Br(J/psi --> etaY(2175))Br(Y(2175)- -> phif(0)(980))Br(f(0)(980) --> pi(+)pi(-)) = [3.23+/-0.75(stat)+/-0.73(syst)] x 10(-4), assuming that the Y(2175) is a 1(--) state.
Collapse
|
75
|
Ng P, Wei CL, Ruan Y. Paired-end diTagging for transcriptome and genome analysis. CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 2008; Chapter 21:Unit 21.12. [PMID: 18265396 DOI: 10.1002/0471142727.mb2112s79] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The Paired-End diTagging (PET) procedure enables one to obtain sequence information from both termini of any contiguous DNA fragment. This is achieved by a series of enzymatic manipulations that introduce MmeI sites directly flanking each DNA insert during the construction of a plasmid library. Subsequent MmeI digestion and self-ligation results in the production of covalently-linked paired-end ditags (PETs) that can be extracted and then concatenated for efficient sequencing. By mapping the PET sequences to assembled genomes, the original DNA fragments from which the PETs were derived can be precisely localized. This unit details two applications of PET technology. In GIS-PET, ditagging of mRNA converted to full-length cDNA enables whole-transcriptome analysis, including novel gene identification, gene prediction validation, and gene expression studies. In ChIP-PET, ditagging of chromatin immunoprecipitation-enriched genomic DNA fragments enables the global mapping of transcription factor binding sites. A recent innovation (Multiplex Sequencing of Paired-End ditags; MS-PET) enables PETs to be sequenced using high-throughput 454 sequencing, greatly increasing the amount of data that can be collected in each run.
Collapse
|