1
|
Shortcut barcoding and early pooling for scalable multiplex single-cell reduced-representation CpG methylation sequencing at single nucleotide resolution. Nucleic Acids Res 2023; 51:e108. [PMID: 37870443 PMCID: PMC10681715 DOI: 10.1093/nar/gkad892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2023] [Revised: 09/25/2023] [Accepted: 10/04/2023] [Indexed: 10/24/2023] Open
Abstract
DNA methylation is essential for a wide variety of biological processes, yet the development of a highly efficient and robust technology remains a challenge for routine single-cell analysis. We developed a multiplex scalable single-cell reduced representation bisulfite sequencing (msRRBS) technology. It allows cell-specific barcoded DNA fragments of individual cells to be pooled before bisulfite conversion, free of enzymatic modification or physical capture of the DNA ends, and achieves read mapping rates of 62.5 ± 3.9%, covering 60.0 ± 1.4% of CpG islands and 71.6 ± 1.6% of promoters in K562 cells. Its reproducibility is shown in duplicates of bulk cells with close to perfect correlation (R = 0.97-0.99). At a low 1 Mb of clean reads, msRRBS provides highly consistent coverage of CpG islands and promoters, outperforming the conventional methods with orders of magnitude reduction in cost. Here, we use this method to characterize the distinct methylation patterns and cellular heterogeneity of six cell lines, plus leukemia and hepatocellular carcinoma models. Taking 4 h of hands-on time, msRRBS offers a unique, highly efficient approach for dissecting methylation heterogeneity in a variety of multicellular systems.
Collapse
|
2
|
Sample-multiplexing approaches for single-cell sequencing. Cell Mol Life Sci 2022; 79:466. [PMID: 35927335 PMCID: PMC11073057 DOI: 10.1007/s00018-022-04482-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 06/25/2022] [Accepted: 07/11/2022] [Indexed: 12/12/2022]
Abstract
Single-cell sequencing is widely used in biological and medical studies. However, its application with multiple samples is hindered by inefficient sample processing, high experimental costs, ambiguous identification of true single cells, and technical batch effects. Here, we introduce sample-multiplexing approaches for single-cell sequencing in transcriptomics, epigenomics, genomics, and multiomics. In single-cell transcriptomics, sample multiplexing uses variants of native or artificial features as sample markers, enabling sample pooling and decoding. Such features include: (1) natural genetic variation, (2) nucleotide-barcode anchoring on cellular or nuclear membranes, (3) nucleotide-barcode internalization to the cytoplasm or nucleus, (4) vector-based barcode expression in cells, and (5) nucleotide-barcode incorporation during library construction. Other single-cell omics methods are based on similar concepts, particularly single-cell combinatorial indexing. These methods overcome current challenges, while enabling super-loading of single cells. Finally, selection guidelines are presented that can accelerate technological application.
Collapse
|
3
|
Spatial transcriptome profiling by MERFISH reveals fetal liver hematopoietic stem cell niche architecture. Cell Discov 2021; 7:47. [PMID: 34183665 PMCID: PMC8238952 DOI: 10.1038/s41421-021-00266-1] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Accepted: 03/25/2021] [Indexed: 12/17/2022] Open
Abstract
The hematopoietic stem cell (HSC) niche has been extensively studied in bone marrow, yet a more systematic investigation into the microenvironment regulation of hematopoiesis in fetal liver is necessary. Here we investigate the spatial organization and transcriptional profile of individual cells in both wild type (WT) and Tet2−/− fetal livers, by multiplexed error robust fluorescence in situ hybridization. We find that specific pairs of fetal liver cell types are preferentially positioned next to each other. Ligand-receptor signaling molecule pairs such as Kitl and Kit are enriched in neighboring cell types. The majority of HSCs are in direct contact with endothelial cells (ECs) in both WT and Tet2−/− fetal livers. Loss of Tet2 increases the number of HSCs, and upregulates Wnt and Notch signaling genes in the HSC niche. Two subtypes of ECs, arterial ECs and sinusoidal ECs, and other cell types contribute distinct signaling molecules to the HSC niche. Collectively, this study provides a comprehensive picture and bioinformatic foundation for HSC spatial regulation in fetal liver.
Collapse
|
4
|
Colorectal Cancer Stem Cell States Uncovered by Simultaneous Single-Cell Analysis of Transcriptome and Telomeres. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2021; 8:2004320. [PMID: 33898197 PMCID: PMC8061397 DOI: 10.1002/advs.202004320] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 12/15/2020] [Indexed: 05/02/2023]
Abstract
Cancer stem cells (CSCs) presumably contribute to tumor progression and drug resistance, yet their definitive features have remained elusive. Here, simultaneous measurement of telomere length and transcriptome in the same cells enables systematic assessment of CSCs in primary colorectal cancer (CRC). The in-depth transcriptome profiled by SMART-seq2 is independently validated by high-throughput scRNA-seq using 10 × Genomics. It is found that rare CSCs exist in dormant state and display plasticity toward cancer epithelial cells (EPCs) that essentially are presumptive tumor-initiating cells (TICs), while both retaining the prominent signaling pathways including WNT, TGF-β, and HIPPO/YAP. Moreover, CSCs exhibit chromosome copy number variation (CNV) pattern resembling cancer EPCs but distinct from normal stem cells, suggesting the phylogenetic relationship between CSCs and cancer EPCs. Notably, CSCs maintain shorter telomeres and possess minimal telomerase activity consistent with their nonproliferative nature, unlike cancer EPCs. Additionally, the specific signature of CSCs particularly NOTUM, SMOC2, BAMBI, PHLDA1, and TNFRSF19 correlates with the prognosis of CRC. These findings characterize the heterogeneity of CSCs and their linkage to cancer EPCs/TICs, some of which are conventionally regarded as CSCs.
Collapse
|
5
|
Dysregulation of BRD4 Function Underlies the Functional Abnormalities of MeCP2 Mutant Neurons. Mol Cell 2020; 79:84-98.e9. [PMID: 32526163 DOI: 10.1016/j.molcel.2020.05.016] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Revised: 03/04/2020] [Accepted: 05/12/2020] [Indexed: 12/22/2022]
Abstract
Rett syndrome (RTT), mainly caused by mutations in methyl-CpG binding protein 2 (MeCP2), is one of the most prevalent intellectual disorders without effective therapies. Here, we used 2D and 3D human brain cultures to investigate MeCP2 function. We found that MeCP2 mutations cause severe abnormalities in human interneurons (INs). Surprisingly, treatment with a BET inhibitor, JQ1, rescued the molecular and functional phenotypes of MeCP2 mutant INs. We uncovered that abnormal increases in chromatin binding of BRD4 and enhancer-promoter interactions underlie the abnormal transcription in MeCP2 mutant INs, which were recovered to normal levels by JQ1. We revealed cell-type-specific transcriptome impairment in MeCP2 mutant region-specific human brain organoids that were rescued by JQ1. Finally, JQ1 ameliorated RTT-like phenotypes in mice. These data demonstrate that BRD4 dysregulation is a critical driver for RTT etiology and suggest that targeting BRD4 could be a potential therapeutic opportunity for RTT.
Collapse
|
6
|
A versatile flow-based assay for immunocyte-mediated cytotoxicity. J Immunol Methods 2019; 474:112668. [PMID: 31525367 DOI: 10.1016/j.jim.2019.112668] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Revised: 09/09/2019] [Accepted: 09/12/2019] [Indexed: 10/26/2022]
Abstract
Cell-mediated cytotoxicity is a critical function of the immune system in mounting defense against pathogens and cancers. Current methods that allow direct evaluation of cell-mediated cytotoxicity suffer from a wide-range of drawbacks. Here, we present a novel strategy to measure cytotoxicity that is direct, sensitive, rapid, and highly adaptable. Moreover, it allows accurate measurement of viability of both target and effector cells. Target cells are fluorescently labeled with a non-toxic, cell-permeable dye that covalently binds to cell proteins, including nuclear proteins. The labeled target cells are incubated with effector cells to begin killing. Following the killing reaction, the cell mixture is incubated with another dye that specifically stains proteins of dead cells, including nuclear proteins. In the final step, cell nuclei are released by Triton X-100, and analyzed by flow cytometry. This results in four nuclear staining patterns that separate target and effector nuclei as well as nuclei of live and dead cells. Analyzing nuclei, instead of cells, greatly reduces flow cytometry errors caused by the presence of target-effector cell aggregates. Target killing time can often be reduced to 2 h and the assay can be done in a high throughput format. We have successfully validated this assay in a variety of cytotoxicity scenarios including those mediated by NK-92 cells, Chimeric Antigen Receptor (CAR)-T cells, and Tumor Infiltrating Lymphocytes (TIL). Therefore, this technique is broadly applicable, highly sensitive and easily administered, making it a powerful tool to assess immunotherapy-based, cell-mediated cytotoxicity.
Collapse
|
7
|
Telomere Maintenance-Associated PML Is a Potential Specific Therapeutic Target of Human Colorectal Cancer. Transl Oncol 2019; 12:1164-1176. [PMID: 31207547 PMCID: PMC6580093 DOI: 10.1016/j.tranon.2019.05.010] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 05/13/2019] [Indexed: 02/06/2023] Open
Abstract
Telomere length maintenance is essential for cell proliferation, which is particularly prominent in cancer. We validate that the primary colorectal tumors exhibit heterogeneous telomere lengths but mostly (90%) short telomeres relative to normal tissues. Intriguingly, relatively short telomeres are associated with tumor malignancy as indicated by poorly differentiated state, and these tumors contain more cancer stem-like cells (CSLCs) identified by several commonly used markers CD44, EPHB2 or LGR5. Moreover, promyelocytic leukemia (PML) and ALT-associated PML nuclear bodies (APBs) are frequently found in tumors with short telomeres and high proliferation. In contrast, distant normal tissues rarely or only minimally express PML. Inhibition of PML and APBs by an ATR inhibitor decreases proliferation of CSLCs and organoids, suggesting a potential therapeutic target to progressive colorectal tumors. Together, telomere maintenance underling tumor progression is connected with CSLCs.
Collapse
|
8
|
hESC-Derived Thalamic Organoids Form Reciprocal Projections When Fused with Cortical Organoids. Cell Stem Cell 2019; 24:487-497.e7. [PMID: 30799279 DOI: 10.1016/j.stem.2018.12.015] [Citation(s) in RCA: 244] [Impact Index Per Article: 48.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2018] [Revised: 10/01/2018] [Accepted: 12/20/2018] [Indexed: 12/22/2022]
Abstract
Human brain organoid techniques have rapidly advanced to facilitate investigating human brain development and diseases. These efforts have largely focused on generating telencephalon due to its direct relevance in a variety of forebrain disorders. Despite its importance as a relay hub between cortex and peripheral tissues, the investigation of three-dimensional (3D) organoid models for the human thalamus has not been explored. Here, we describe a method to differentiate human embryonic stem cells (hESCs) to thalamic organoids (hThOs) that specifically recapitulate the development of thalamus. Single-cell RNA sequencing revealed a formation of distinct thalamic lineages, which diverge from telencephalic fate. Importantly, we developed a 3D system to create the reciprocal projections between thalamus and cortex by fusing the two distinct region-specific organoids representing the developing thalamus or cortex. Our study provides a platform for understanding human thalamic development and modeling circuit organizations and related disorders in the brain.
Collapse
|
9
|
Correction to: MISC: missing imputation for single-cell RNA sequencing data. BMC SYSTEMS BIOLOGY 2019; 13:13. [PMID: 30670065 PMCID: PMC6343234 DOI: 10.1186/s12918-019-0681-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
It was highlighted that the original article [1] contained a typesetting error in the last name of Allon Canaan. This was incorrectly captured as Allon Canaann in the original article which has since been updated.
Collapse
|
10
|
Local and global chromatin interactions are altered by large genomic deletions associated with human brain development. Nat Commun 2018; 9:5356. [PMID: 30559385 PMCID: PMC6297223 DOI: 10.1038/s41467-018-07766-x] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Accepted: 11/09/2018] [Indexed: 01/18/2023] Open
Abstract
Large copy number variants (CNVs) in the human genome are strongly associated with common neurodevelopmental, neuropsychiatric disorders such as schizophrenia and autism. Here we report on the epigenomic effects of the prominent large deletion CNVs on chromosome 22q11.2 and on chromosome 1q21.1. We use Hi-C analysis of long-range chromosome interactions, including haplotype-specific Hi-C analysis, ChIP-Seq analysis of regulatory histone marks, and RNA-Seq analysis of gene expression patterns. We observe changes on all the levels of analysis, within the deletion boundaries, in the deletion flanking regions, along chromosome 22q, and genome wide. We detect gene expression changes as well as pronounced and multilayered effects on chromatin states, chromosome folding and on the topological domains of the chromatin, that emanate from the large CNV locus. These findings suggest basic principles of how such large genomic deletions can alter nuclear organization and affect genomic molecular activity. Copy number variants in the human genome (CNVs) are associated with neurodevelopmental and psychiatric disorders such as schizophrenia and autism. Here the authors investigate how the large deletion CNV on chromosome 22q11.2 alters chromatin organization.
Collapse
|
11
|
Abstract
BACKGROUND Single-cell RNA sequencing (scRNA-seq) technology provides an effective way to study cell heterogeneity. However, due to the low capture efficiency and stochastic gene expression, scRNA-seq data often contains a high percentage of missing values. It has been showed that the missing rate can reach approximately 30% even after noise reduction. To accurately recover missing values in scRNA-seq data, we need to know where the missing data is; how much data is missing; and what are the values of these data. METHODS To solve these three problems, we propose a novel model with a hybrid machine learning method, namely, missing imputation for single-cell RNA-seq (MISC). To solve the first problem, we transformed it to a binary classification problem on the RNA-seq expression matrix. Then, for the second problem, we searched for the intersection of the classification results, zero-inflated model and false negative model results. Finally, we used the regression model to recover the data in the missing elements. RESULTS We compared the raw data without imputation, the mean-smooth neighbor cell trajectory, MISC on chronic myeloid leukemia data (CML), the primary somatosensory cortex and the hippocampal CA1 region of mouse brain cells. On the CML data, MISC discovered a trajectory branch from the CP-CML to the BC-CML, which provides direct evidence of evolution from CP to BC stem cells. On the mouse brain data, MISC clearly divides the pyramidal CA1 into different branches, and it is direct evidence of pyramidal CA1 in the subpopulations. In the meantime, with MISC, the oligodendrocyte cells became an independent group with an apparent boundary. CONCLUSIONS Our results showed that the MISC model improved the cell type classification and could be instrumental to study cellular heterogeneity. Overall, MISC is a robust missing data imputation model for single-cell RNA-seq data.
Collapse
|
12
|
Ritornello: high fidelity control-free chromatin immunoprecipitation peak calling. Nucleic Acids Res 2017; 45:e173. [PMID: 28981893 PMCID: PMC5716106 DOI: 10.1093/nar/gkx799] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2015] [Accepted: 08/30/2017] [Indexed: 02/03/2023] Open
Abstract
With the advent of next generation high-throughput DNA sequencing technologies, omics experiments have become the mainstay for studying diverse biological effects on a genome wide scale. Chromatin immunoprecipitation (ChIP-seq) is the omics technique that enables genome wide localization of transcription factor (TF) binding or epigenetic modification events. Since the inception of ChIP-seq in 2007, many methods have been developed to infer ChIP-target binding loci from the resultant reads after mapping them to a reference genome. However, interpreting these data has proven challenging, and as such these algorithms have several shortcomings, including susceptibility to false positives due to artifactual peaks, poor localization of binding sites and the requirement for a total DNA input control which increases the cost of performing these experiments. We present Ritornello, a new approach for finding TF-binding sites in ChIP-seq, with roots in digital signal processing that addresses all of these problems. We show that Ritornello generally performs equally or better than the peak callers tested and recommended by the ENCODE consortium, but in contrast, Ritornello does not require a matched total DNA input control to avoid false positives, effectively decreasing the sequencing cost to perform ChIP-seq. Ritornello is freely available at https://github.com/KlugerLab/Ritornello.
Collapse
|
13
|
Bisulfite-independent analysis of CpG island methylation enables genome-scale stratification of single cells. Nucleic Acids Res 2017; 45:e77. [PMID: 28126923 PMCID: PMC5605247 DOI: 10.1093/nar/gkx026] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Accepted: 01/20/2017] [Indexed: 01/03/2023] Open
Abstract
Conventional DNA bisulfite sequencing has been extended to single cell level, but the coverage consistency is insufficient for parallel comparison. Here we report a novel method for genome-wide CpG island (CGI) methylation sequencing for single cells (scCGI-seq), combining methylation-sensitive restriction enzyme digestion and multiple displacement amplification for selective detection of methylated CGIs. We applied this method to analyzing single cells from two types of hematopoietic cells, K562 and GM12878 and small populations of fibroblasts and induced pluripotent stem cells. The method detected 21 798 CGIs (76% of all CGIs) per cell, and the number of CGIs consistently detected from all 16 profiled single cells was 20 864 (72.7%), with 12 961 promoters covered. This coverage represents a substantial improvement over results obtained using single cell reduced representation bisulfite sequencing, with a 66-fold increase in the fraction of consistently profiled CGIs across individual cells. Single cells of the same type were more similar to each other than to other types, but also displayed epigenetic heterogeneity. The method was further validated by comparing the CpG methylation pattern, methylation profile of CGIs/promoters and repeat regions and 41 classes of known regulatory markers to the ENCODE data. Although not every minor methylation differences between cells are detectable, scCGI-seq provides a solid tool for unsupervised stratification of a heterogeneous cell population.
Collapse
|
14
|
Fusion of Regionally Specified hPSC-Derived Organoids Models Human Brain Development and Interneuron Migration. Cell Stem Cell 2017; 21:383-398.e7. [PMID: 28757360 DOI: 10.1016/j.stem.2017.07.007] [Citation(s) in RCA: 390] [Impact Index Per Article: 55.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Revised: 06/06/2017] [Accepted: 07/11/2017] [Indexed: 01/18/2023]
Abstract
Organoid techniques provide unique platforms to model brain development and neurological disorders. Whereas several methods for recapitulating corticogenesis have been described, a system modeling human medial ganglionic eminence (MGE) development, a critical ventral brain domain producing cortical interneurons and related lineages, has been lacking until recently. Here, we describe the generation of MGE and cortex-specific organoids from human pluripotent stem cells that recapitulate the development of MGE and cortex domains, respectively. Population and single-cell RNA sequencing (RNA-seq) profiling combined with bulk assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-seq) analyses revealed transcriptional and chromatin accessibility dynamics and lineage relationships during MGE and cortical organoid development. Furthermore, MGE and cortical organoids generated physiologically functional neurons and neuronal networks. Finally, fusing region-specific organoids followed by live imaging enabled analysis of human interneuron migration and integration. Together, our study provides a platform for generating domain-specific brain organoids and modeling human interneuron migration and offers deeper insight into molecular dynamics during human brain development.
Collapse
|
15
|
Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization. PeerJ 2017; 5:e2888. [PMID: 28133571 PMCID: PMC5251935 DOI: 10.7717/peerj.2888] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2016] [Accepted: 12/08/2016] [Indexed: 01/08/2023] Open
Abstract
Single-cell RNA-Sequencing (scRNA-Seq) is a fast-evolving technology that enables the understanding of biological processes at an unprecedentedly high resolution. However, well-suited bioinformatics tools to analyze the data generated from this new technology are still lacking. Here we investigate the performance of non-negative matrix factorization (NMF) method to analyze a wide variety of scRNA-Seq datasets, ranging from mouse hematopoietic stem cells to human glioblastoma data. In comparison to other unsupervised clustering methods including K-means and hierarchical clustering, NMF has higher accuracy in separating similar groups in various datasets. We ranked genes by their importance scores (D-scores) in separating these groups, and discovered that NMF uniquely identifies genes expressed at intermediate levels as top-ranked genes. Finally, we show that in conjugation with the modularity detection method FEM, NMF reveals meaningful protein-protein interaction modules. In summary, we propose that NMF is a desirable method to analyze heterogeneous single-cell RNA-Seq data. The NMF based subpopulation detection package is available at: https://github.com/lanagarmire/NMFEM.
Collapse
|
16
|
Evolution and heterogeneity of non-hereditary colorectal cancer revealed by single-cell exome sequencing. Oncogene 2016; 36:2857-2867. [PMID: 27941887 DOI: 10.1038/onc.2016.438] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Revised: 09/22/2016] [Accepted: 10/04/2016] [Indexed: 12/13/2022]
Abstract
Recently single-cell whole-exome sequencing (scWES) has deeply expanded and sharpened our knowledge of cancer evolution and subclonality. Herein, with scWES and matched bulk whole-exome sequencing (bulk WES) on two colorectal cancer (CRC) patients with normal or adenomatous polyps, we found that both the adenoma and cancer were of monoclonal origin, and both shared partial mutations in the same signaling pathways, but each showed a specific spectrum of heterogeneous somatic mutations. In addition, the adenoma and cancer further developed intratumor heterogeneity with the accumulation of nonrandom somatic mutations specifically in GPCR, PI3K-Akt and FGFR signaling pathways. We identified novel driver mutations that developed during adenoma and cancer evolution, particularly in OR1B1 (GPCR signaling pathway) for adenoma evolution, and LAMA1 (PI3K-Akt signaling pathway) and ADCY3 (FGFR signaling pathway) for CRC evolution. In summary, we demonstrated that both colorectal adenoma and CRC are monoclonal in origin, and the CRCs further diversified into different subclones with heterogeneous mutation profiles accumulating in GPCR, PI3K-Akt and FGFR signaling pathways. ScWES provides evidence for the importance of mutations in certain pathways that would not be as apparent from bulk sequencing of tumors, and can potentially establish whether specific mutations are mutually exclusive or occur sequentially in the same subclone of cells.
Collapse
|
17
|
Regulation of the DNA Methylation Landscape in Human Somatic Cell Reprogramming by the miR-29 Family. Stem Cell Reports 2016; 7:43-54. [PMID: 27373925 PMCID: PMC4945581 DOI: 10.1016/j.stemcr.2016.05.014] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2015] [Revised: 05/31/2016] [Accepted: 05/31/2016] [Indexed: 02/05/2023] Open
Abstract
Reprogramming to pluripotency after overexpression of OCT4, SOX2, KLF4, and MYC is accompanied by global genomic and epigenomic changes. Histone modification and DNA methylation states in induced pluripotent stem cells (iPSCs) have been shown to be highly similar to embryonic stem cells (ESCs). However, epigenetic differences still exist between iPSCs and ESCs. In particular, aberrant DNA methylation states found in iPSCs are a major concern when using iPSCs in a clinical setting. Thus, it is critical to find factors that regulate DNA methylation states in reprogramming. Here, we found that the miR-29 family is an important epigenetic regulator during human somatic cell reprogramming. Our global DNA methylation and hydroxymethylation analysis shows that DNA demethylation is a major event mediated by miR-29a depletion during early reprogramming, and that iPSCs derived from miR-29a depletion are epigenetically closer to ESCs. Our findings uncover an important miRNA-based approach to generate clinically robust iPSCs.
Collapse
|
18
|
A Molecular Chipper technology for CRISPR sgRNA library generation and functional mapping of noncoding regions. Nat Commun 2016; 7:11178. [PMID: 27025950 PMCID: PMC4820989 DOI: 10.1038/ncomms11178] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Accepted: 02/26/2016] [Indexed: 12/12/2022] Open
Abstract
Clustered regularly-interspaced palindromic repeats (CRISPR)-based genetic screens using single-guide-RNA (sgRNA) libraries have proven powerful to identify genetic regulators. Applying CRISPR screens to interrogate functional elements in noncoding regions requires generating sgRNA libraries that are densely covering, and ideally inexpensive, easy to implement and flexible for customization. Here we present a Molecular Chipper technology for generating dense sgRNA libraries for genomic regions of interest, and a proof-of-principle screen that identifies novel cis-regulatory domains for miR-142 biogenesis. The Molecular Chipper approach utilizes a combination of random fragmentation and a type III restriction enzyme to derive a densely covering sgRNA library from input DNA. Applying this approach to 17 microRNAs and their flanking regions and with a reporter for miR-142 activity, we identify both the pre-miR-142 region and two previously unrecognized cis-domains important for miR-142 biogenesis, with the latter regulating miR-142 processing. This strategy will be useful for identifying functional noncoding elements in mammalian genomes.
Collapse
|
19
|
Single-Cell Sequencing for Precise Cancer Research: Progress and Prospects. Cancer Res 2016; 76:1305-12. [PMID: 26941284 DOI: 10.1158/0008-5472.can-15-1907] [Citation(s) in RCA: 84] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2015] [Accepted: 10/22/2015] [Indexed: 11/16/2022]
Abstract
Advances in genomic technology have enabled the faithful detection and measurement of mutations and the gene expression profile of cancer cells at the single-cell level. Recently, several single-cell sequencing methods have been developed that permit the comprehensive and precise analysis of the cancer-cell genome, transcriptome, and epigenome. The use of these methods to analyze cancer cells has led to a series of unanticipated discoveries, such as the high heterogeneity and stochastic changes in cancer-cell populations, the new driver mutations and the complicated clonal evolution mechanisms, and the novel identification of biomarkers of variant tumors. These methods and the knowledge gained from their utilization could potentially improve the early detection and monitoring of rare cancer cells, such as circulating tumor cells and disseminated tumor cells, and promote the development of personalized and highly precise cancer therapy. Here, we discuss the current methods for single cancer-cell sequencing, with a strong focus on those practically used or potentially valuable in cancer research, including single-cell isolation, whole genome and transcriptome amplification, epigenome profiling, multi-dimensional sequencing, and next-generation sequencing and analysis. We also examine the current applications, challenges, and prospects of single cancer-cell sequencing.
Collapse
|
20
|
A comparative encyclopedia of DNA elements in the mouse genome. Nature 2015; 515:355-64. [PMID: 25409824 PMCID: PMC4266106 DOI: 10.1038/nature13992] [Citation(s) in RCA: 1135] [Impact Index Per Article: 126.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Accepted: 10/24/2014] [Indexed: 12/11/2022]
Abstract
The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Collapse
|
21
|
Co-detection and sequencing of genes and transcripts from the same single cells facilitated by a microfluidics platform. Sci Rep 2014; 4:6485. [PMID: 25255798 PMCID: PMC4175731 DOI: 10.1038/srep06485] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2013] [Accepted: 09/05/2014] [Indexed: 01/12/2023] Open
Abstract
Despite the recent advance of single-cell gene expression analyses, co-measurement of both genomic and transcriptional signatures at the single-cell level has not been realized. However such analysis is necessary in order to accurately delineate how genetic information is transcribed, expressed, and regulated to give rise to an enormously diverse range of cell phenotypes. Here we report on a microfluidics-facilitated approach that allows for controlled separation of cytoplasmic and nuclear contents of a single cell followed by on-chip amplification of genomic DNA and cytoplasmic mRNA. When coupled with off-chip polymerase chain reaction, gel electrophoresis and Sanger sequencing, a panel of genes and transcripts from the same single cell can be co-detected and sequenced. This platform is potentially an enabling tool to permit multiple genomic measurements performed on the same single cells and opens new opportunities to tackle a range of fundamental biology questions including non-genetic cell-to-cell variability, epigenetic regulation, and stem cell fate control. It also helps address clinical challenges such as diagnosing intra-tumor heterogeneity and dissecting complex cellular immune responses.
Collapse
|
22
|
Long intergenic non-coding RNA HOTAIRM1 regulates cell cycle progression during myeloid maturation in NB4 human promyelocytic leukemia cells. RNA Biol 2014; 11:777-87. [PMID: 24824789 DOI: 10.4161/rna.28828] [Citation(s) in RCA: 132] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
HOTAIRM1 is a long intergenic non-coding RNA encoded in the human HOXA gene cluster, with gene expression highly specific for maturing myeloid cells. Knockdown of HOTAIRM1 in the NB4 acute promyelocytic leukemia cell line retarded all-trans retinoid acid (ATRA)-induced granulocytic differentiation, resulting in a significantly larger population of immature and proliferating cells that maintained cell cycle progression from G1 to S phases. Correspondingly, HOTAIRM1 knockdown resulted in retained expression of many otherwise ATRA-suppressed cell cycle and DNA replication genes, and abated ATRA induction of cell surface leukocyte activation, defense response, and other maturation-related genes. Resistance to ATRA-induced cell cycle arrest at the G1/S phase transition in knockdown cells was accompanied by retained expression of ITGA4 (CD49d) and decreased induction of ITGAX (CD11c). The coupling of cell cycle progression with temporal dynamics in the expression patterns of these integrin genes suggests a regulated switch to control the transit from the proliferative phase to granulocytic maturation. Furthermore, ITGAX was among a small number of genes showing perturbation in transcript levels upon HOTAIRM1 knockdown even without ATRA treatment, suggesting a direct pathway of regulation. These results indicate that HOTAIRM1 provides a regulatory link in myeloid maturation by modulating integrin-controlled cell cycle progression at the gene expression level.
Collapse
|
23
|
Nonstochastic reprogramming from a privileged somatic cell state. Cell 2014; 156:649-62. [PMID: 24486105 DOI: 10.1016/j.cell.2014.01.020] [Citation(s) in RCA: 136] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2013] [Revised: 11/04/2013] [Accepted: 01/10/2014] [Indexed: 12/29/2022]
Abstract
Reprogramming somatic cells to induced pluripotency by Yamanaka factors is usually slow and inefficient and is thought to be a stochastic process. We identified a privileged somatic cell state, from which acquisition of pluripotency could occur in a nonstochastic manner. Subsets of murine hematopoietic progenitors are privileged whose progeny cells predominantly adopt the pluripotent fate with activation of endogenous Oct4 locus after four to five divisions in reprogramming conditions. Privileged cells display an ultrafast cell cycle of ∼8 hr. In fibroblasts, a subpopulation cycling at a similar ultrafast speed is observed after 6 days of factor expression and is increased by p53 knockdown. This ultrafast cycling population accounts for >99% of the bulk reprogramming activity in wild-type or p53 knockdown fibroblasts. Our data demonstrate that the stochastic nature of reprogramming can be overcome in a privileged somatic cell state and suggest that cell-cycle acceleration toward a critical threshold is an important bottleneck for reprogramming. PAPERCLIP:
Collapse
|
24
|
Abstract
PPARGC1A is a transcriptional coactivator that binds to and coactivates a variety of transcription factors (TFs) to regulate the expression of target genes. PPARGC1A plays a pivotal role in regulating energy metabolism and has been implicated in several human diseases, most notably type II diabetes. Previous studies have focused on the interplay between PPARGC1A and individual TFs, but little is known about how PPARGC1A combines with all of its partners across the genome to regulate transcriptional dynamics. In this study, we describe a core PPARGC1A transcriptional regulatory network operating in HepG2 cells treated with forskolin. We first mapped the genome-wide binding sites of PPARGC1A using chromatin-IP followed by high-throughput sequencing (ChIP-seq) and uncovered overrepresented DNA sequence motifs corresponding to known and novel PPARGC1A network partners. We then profiled six of these site-specific TF partners using ChIP-seq and examined their network connectivity and combinatorial binding patterns with PPARGC1A. Our analysis revealed extensive overlap of targets including a novel link between PPARGC1A and HSF1, a TF regulating the conserved heat shock response pathway that is misregulated in diabetes. Importantly, we found that different combinations of TFs bound to distinct functional sets of genes, thereby helping to reveal the combinatorial regulatory code for metabolic and other cellular processes. In addition, the different TFs often bound near the promoters and coding regions of each other's genes suggesting an intricate network of interdependent regulation. Overall, our study provides an important framework for understanding the systems-level control of metabolic gene expression in humans.
Collapse
|
25
|
|
26
|
Abstract
To complement the human Encyclopedia of DNA Elements (ENCODE) project and to enable a broad range of mouse genomics efforts, the Mouse ENCODE Consortium is applying the same experimental pipelines developed for human ENCODE to annotate the mouse genome.
Collapse
|
27
|
A multiprotein complex necessary for both transcription and DNA replication at the β-globin locus. EMBO J 2010; 29:3260-71. [PMID: 20808282 DOI: 10.1038/emboj.2010.204] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Accepted: 07/29/2010] [Indexed: 12/17/2022] Open
Abstract
DNA replication, repair, transcription and chromatin structure are intricately associated nuclear processes, but the molecular links between these events are often obscure. In this study, we have surveyed the protein complexes that bind at β-globin locus control region, and purified and characterized the function of one such multiprotein complex from human erythroleukemic K562 cells. We further validated the existence of this complex in human CD34+ cell-derived normal erythroid cells. This complex contains ILF2/ILF3 transcription factors, p300 acetyltransferase and proteins associated with DNA replication, transcription and repair. RNAi knockdown of ILF2, a DNA-binding component of this complex, abrogates the recruitment of the complex to its cognate DNA sequence and inhibits transcription, histone acetylation and usage of the origin of DNA replication at the β-globin locus. These results imply a direct link between mammalian DNA replication, transcription and histone acetylation mediated by a single multiprotein complex.
Collapse
|
28
|
Abstract
Differences in gene expression may play a major role in speciation and phenotypic diversity. We examined genome-wide differences in transcription factor (TF) binding in several humans and a single chimpanzee by using chromatin immunoprecipitation followed by sequencing. The binding sites of RNA polymerase II (PolII) and a key regulator of immune responses, nuclear factor kappaB (p65), were mapped in 10 lymphoblastoid cell lines, and 25 and 7.5% of the respective binding regions were found to differ between individuals. Binding differences were frequently associated with single-nucleotide polymorphisms and genomic structural variants, and these differences were often correlated with differences in gene expression, suggesting functional consequences of binding variation. Furthermore, comparing PolII binding between humans and chimpanzee suggests extensive divergence in TF binding. Our results indicate that many differences in individuals and species occur at the level of TF binding, and they provide insight into the genetic events responsible for these differences.
Collapse
|
29
|
Dynamics of alpha-globin locus chromatin structure and gene expression during erythroid differentiation of human CD34(+) cells in culture. Exp Hematol 2009; 37:1143-1156.e3. [PMID: 19607874 DOI: 10.1016/j.exphem.2009.07.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2009] [Revised: 06/05/2009] [Accepted: 07/07/2009] [Indexed: 01/17/2023]
Abstract
OBJECTIVE The aim of the present study has been to establish serum-free culture conditions for ex vivo expansion and differentiation of human CD34(+) cells into erythroid lineage and to study the chromatin structure, gene expression, and transcription factor recruitment at the alpha-globin locus in the developing erythron. MATERIALS AND METHODS A basal Iscove's modified Dulbecco's medium cell culture medium with 1% bovine serum albumin as a serum replacement and a combination of cytokines and growth factors was used for expansion and differentiation of the CD34(+) cells. Expression patterns of the alpha- and beta-like genes at various stages of erythropoiesis was studied by reverse transcriptase quantitative polymerase chain reaction analysis, profile of key erythroid transcription factors was investigated by Western blotting, and the chromatin structure and transcription factor recruitment at the alpha-globin locus was investigated by chromatin immunoprecipitation quantitative polymerase chain reaction analysis. RESULTS Human CD34(+) cells in the serum-free medium undergo near synchronous erythroid differentiation to yield large amount of cells at different differentiation stages. We observe distinct patterns of the histone modifications and transcription factor binding at the alpha-globin locus during erythroid differentiation of CD34(+) cells. Nuclear factor erythroid-derived 2 (NF-E2) was present at upstream activator sites even before addition of erythropoietin (EPO), while bound GATA-1 was only detectable after EPO treatment. After 7 days of EPO treatment, H3K4Me2 modification uniformly increases throughout the alpha-globin locus. Acetylation at H3K9 and binding of Pol II, NF-E2, and GATA-1 were restricted to certain hypersensitive sites of the enhancer and theta gene, and were conspicuously low at the alpha-like globin promoters. Rearrangement of the insulator binding factor CTCF took place at and around the alpha-globin locus as CD34(+) cells differentiated into erythroid pathway. CONCLUSION Our results indicate that remodeling of the upstream elements may be the primary event in activation of alpha-globin gene expression. Activation of alpha-globin genes upon EPO treatment involves initial binding of Pol II, downregulation of pre-existing factors like NF-E2, removal of CTCF from the locus, then rebinding of CTCF in an altered pattern, and concurrent or subsequent binding of transcription factors like GATA-1.
Collapse
|
30
|
Abstract
DNA methylation is an important component of epigenetic modifications, which influences the transcriptional machinery aberrant in many human diseases. In this study we present the first genome-wide integrative analysis of promoter methylation and gene expression for the identification of methylation markers in melanoma. Genome-wide promoter methylation and gene expression of eight early-passage human melanoma cell strains were compared with newborn and adult melanocytes. We used linear mixed effect models (LME) in combination with a series of filters based on the localization of promoter methylation relative to the transcription start site, overall promoter CpG content, and differential gene expression to discover DNA methylation markers. This approach identified 76 markers, of which 68 were hyper- and eight hypomethylated (LME, P < 0.05). Promoter methylation and differential gene expression of five markers (COL1A2, NPM2, HSPB6, DDIT4L, MT1G) were validated by sequencing of bisulfite-modified DNA and real-time reverse transcriptase PCR, respectively. Importantly, the incidence of promoter methylation of the validated markers increased moderately in early and significantly in advanced-stage melanomas, using early-passage cell strains and snap-frozen tissues (n = 18 and n = 24, respectively) compared with normal melanocytes and nevi (n = 11 and n = 9, respectively). Our approach allows robust identification of methylation markers that can be applied to other studies involving genome-wide promoter methylation. In conclusion, this study represents the first unbiased systematic effort to determine methylation markers in melanoma and revealed several novel genes regulated by promoter methylation that were not described in cancer cells before.
Collapse
|
31
|
Chimeric receptor mRNA transfection as a tool to generate antineoplastic lymphocytes. Hum Gene Ther 2009; 20:51-61. [PMID: 19025415 PMCID: PMC2855249 DOI: 10.1089/hum.2008.068] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2008] [Accepted: 10/15/2008] [Indexed: 11/12/2022] Open
Abstract
mRNA transfection is a useful approach for temporal cell reprogramming with minimal risk of transgene-mediated mutagenesis. We applied this to redirect lymphocyte cytotoxicity toward malignant cells. Using the chimeric immune receptor (CIR) constructs anti-CD19 CIR and 8H9 CIR, we achieved uniform expression of CIRs on virtually the entire population of lymphocytes. We reprogrammed CD3+ CD8+, CD3+ CD4+, and natural killer (NK ) cells toward autologous and allogeneic targets such as B cells, Daudi lymphoma, primary melanoma, breast ductal carcinoma, breast adenocarcinoma, and rhabdomyosarcoma. The reprogramming procedure is fast. Although most of the experiments were performed on lymphocytes obtained after 7-day activation, only 1-day activation of T cells with anti-CD3, anti-CD28 antibodies, and interleukin-2 is sufficient to develop both lymphocyte cytotoxicity and competence for mRNA transfer. The entire procedure, which includes lymphocyte activation and reprogramming, can be completed in 2 days. The efficiency of mRNA-modified human T cells was tested in a murine xenograft model. Human CD3+CD8+ lymphocytes expressing anti-CD19 CIR mRNA inhibited Daudi lymphoma growth in NOD=SCID mice. These results demonstrate that a mixed population of cytotoxic lymphocytes, including T cells together with NK cells, can be quickly and simultaneously reprogrammed by mRNA against autologous malignancies. With relatively minor modifications the described method of lymphocyte reprogramming can be scaled up for cancer therapy.
Collapse
|
32
|
Chimeric Receptor mRNA Transfection as a Tool to Generate Antineoplastic Lymphocytes. Hum Gene Ther 2008. [DOI: 10.1089/hgt.2008.068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
33
|
Genome-wide occupancy of SREBP1 and its partners NFY and SP1 reveals novel functional roles and combinatorial regulation of distinct classes of genes. PLoS Genet 2008; 4:e1000133. [PMID: 18654640 PMCID: PMC2478640 DOI: 10.1371/journal.pgen.1000133] [Citation(s) in RCA: 167] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2008] [Accepted: 06/18/2008] [Indexed: 01/23/2023] Open
Abstract
The sterol regulatory element-binding protein (SREBP) family member SREBP1 is a critical transcriptional regulator of cholesterol and fatty acid metabolism and has been implicated in insulin resistance, diabetes, and other diet-related diseases. We globally identified the promoters occupied by SREBP1 and its binding partners NFY and SP1 in a human hepatocyte cell line using chromatin immunoprecipitation combined with genome tiling arrays (ChIP-chip). We find that SREBP1 occupies the promoters of 1,141 target genes involved in diverse biological pathways, including novel targets with roles in lipid metabolism and insulin signaling. We also identify a conserved SREBP1 DNA-binding motif in SREBP1 target promoters, and we demonstrate that many SREBP1 target genes are transcriptionally activated by treatment with insulin and glucose using gene expression microarrays. Finally, we show that SREBP1 cooperates extensively with NFY and SP1 throughout the genome and that unique combinations of these factors target distinct functional pathways. Our results provide insight into the regulatory circuitry in which SREBP1 and its network partners coordinate a complex transcriptional response in the liver with cues from the diet. Transcription factors (TFs) are DNA-binding proteins that regulate the transcription of their target genes. TFs typically bind in proximity to the transcription start sites of their target genes in a region called the promoter. SREBP1 is a TF that increases the transcription of numerous genes involved in cholesterol and fat metabolism and has been linked to diet-related diseases such as insulin resistance and type 2 diabetes. Using microarray technology, we identified all of the promoters in the human genome that are bound by SREBP1 and two associated TFs called NFY and SP1 in a human liver cell line. Our findings greatly expand the number of genes and biological pathways that may be regulated by SREBP1 and reveal that different combinations of SREBP1 and its partners preferentially target genes involved in different pathways. Thus, in contrast to traditional studies that focus on individual genes, we have used a genomics approach to provide a novel global view of the regulatory circuitry in which SREBP1 and its partners coordinate a transcriptional response in the liver with cues from the diet.
Collapse
|
34
|
A genomic analysis of RNA polymerase II modification and chromatin architecture related to 3' end RNA polyadenylation. Genome Res 2008; 18:1224-37. [PMID: 18487515 DOI: 10.1101/gr.075804.107] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Genomic analyses have been applied extensively to analyze the process of transcription initiation in mammalian cells, but less to transcript 3' end formation and transcription termination. We used a novel approach to prepare 3' end fragments from polyadenylated RNA, and mapped the position of the poly(A) addition site using oligonucleotide arrays tiling 1% of the human genome. This approach revealed more 3' ends than had been annotated. The distribution of these ends relative to RNA polymerase II (PolII) and di- and trimethylated lysine 4 and lysine 36 of histone H3 was compared. A substantial fraction of unannotated 3' ends of RNA are intronic and antisense to the embedding gene. Poly(A) ends of annotated messages lie on average 2 kb upstream of the end of PolII binding (termination). Near the termination sites, and in some internal sites, unphosphorylated and C-terminal domain (CTD) serine 2 phosphorylated PolII (POLR2A) accumulate, suggesting pausing of the polymerase and perhaps dephosphorylation prior to release. Lysine 36 trimethylation occurs across transcribed genes, sometimes alternating with stretches of DNA in which lysine 36 dimethylation is more prominent. Lysine 36 methylation decreases at or near the site of polyadenylation, sometimes disappearing before disappearance of phosphorylated RNA PolII or release of PolII from DNA. Our results suggest that transcription termination loss of histone 3 lysine 36 methylation and later release of RNA polymerase. The latter is often associated with polymerase pausing. Overall, our study reveals extensive sites of poly(A) addition and provides insights into the events that occur during 3' end formation.
Collapse
|
35
|
Abstract
The developmental changes in expression of the beta like genes from embryonic to adult stages of human life are controlled at least partially at the level of the promoter sequences of these genes and their binding factors, and competition for promoter specific interactions with the locus control region (LCR). In recent years, the control of beta globin genes has also been investigated at the level of chromatin structure involving the chemical modification of histones and their remodelling by DNA dependent ATPases (SMARCA) containing protein complexes. The role of intergenic RNA is also being investigated with renewed interest. Although a wealth of information on the structure/function relationship of the LCR and globin promoters has been gathered over more than two decades, the fundamental nature of the control of these genes at the molecular level is still not completely understood. In the following pages, we intend to briefly describe the progress made in the field and discuss future directions.
Collapse
|
36
|
Abstract
Structural variation of the genome involves kilobase- to megabase-sized deletions, duplications, insertions, inversions, and complex combinations of rearrangements. We introduce high-throughput and massive paired-end mapping (PEM), a large-scale genome-sequencing method to identify structural variants (SVs) approximately 3 kilobases (kb) or larger that combines the rescue and capture of paired ends of 3-kb fragments, massive 454 sequencing, and a computational approach to map DNA reads onto a reference genome. PEM was used to map SVs in an African and in a putatively European individual and identified shared and divergent SVs relative to the reference genome. Overall, we fine-mapped more than 1000 SVs and documented that the number of SVs among humans is much larger than initially hypothesized; many of the SVs potentially affect gene function. The breakpoint junction sequences of more than 200 SVs were determined with a novel pooling strategy and computational analysis. Our analysis provided insights into the mechanisms of SV formation in humans.
Collapse
|
37
|
Systematic prediction and validation of breakpoints associated with copy-number variants in the human genome. Proc Natl Acad Sci U S A 2007; 104:10110-5. [PMID: 17551006 PMCID: PMC1891248 DOI: 10.1073/pnas.0703834104] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Copy-number variants (CNVs) are an abundant form of genetic variation in humans. However, approaches for determining exact CNV breakpoint sequences (physical deletion or duplication boundaries) across individuals, crucial for associating genotype to phenotype, have been lacking so far, and the vast majority of CNVs have been reported with approximate genomic coordinates only. Here, we report an approach, called BreakPtr, for fine-mapping CNVs (available from http://breakptr.gersteinlab.org). We statistically integrate both sequence characteristics and data from high-resolution comparative genome hybridization experiments in a discrete-valued, bivariate hidden Markov model. Incorporation of nucleotide-sequence information allows us to take into account the fact that recently duplicated sequences (e.g., segmental duplications) often coincide with breakpoints. In anticipation of an upcoming increase in CNV data, we developed an iterative, "active" approach to initially scoring with a preliminary model, performing targeted validations, retraining the model, and then rescoring, and a flexible parameterization system that intuitively collapses from a full model of 2,503 parameters to a core one of only 10. Using our approach, we accurately mapped >400 breakpoints on chromosome 22 and a region of chromosome 11, refining the boundaries of many previously approximately mapped CNVs. Four predicted breakpoints flanked known disease-associated deletions. We validated an additional four predicted CNV breakpoints by sequencing. Overall, our results suggest a predictive resolution of approximately 300 bp. This level of resolution enables more precise correlations between CNVs and across individuals than previously possible, allowing the study of CNV population frequencies. Further, it enabled us to demonstrate a clear Mendelian pattern of inheritance for one of the CNVs.
Collapse
|
38
|
Reply to “On the cloning of animals from terminally differentiated cells”. Nat Genet 2007. [DOI: 10.1038/ng0207-137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
39
|
Abstract
Transfection of human cells with DNA in biomedical applications carries the risk of insertional mutagenesis. Transfection with mRNA avoids this problem; however, in vitro production of mRNA, based on preliminary DNA template cloning in special vectors, is a laborious and time-consuming procedure. We report an efficient vectorfree method of mRNA production from polymerase chain reaction-generated DNA templates. For all cell types tested mRNA was transfected more readily than DNA, and its expression was highly uniform in cell populations. Even cell types relatively resistant to transfection with DNA could express transfected mRNA well. The level of mRNA expression could be controlled over a wide range by changing the amount of input RNA. Cells could be efficiently and simultaneously loaded with several different transcripts. To test a potential clinical application of this method, we transfected human T lymphocytes with mRNA encoding a chimeric immune receptor directed against CD19, a surface antigen widely expressed in leukemia and lymphoma. The transfected mRNA conferred powerful cytotoxicity to T cells against CD19+ targets from the same donor. These results demonstrate that this method can be applied to generate autologous T lymphocytes directed toward malignant cells.
Collapse
|
40
|
Differentiated cells are more efficient than adult stem cells for cloning by somatic cell nuclear transfer. Nat Genet 2006; 38:1323-8. [PMID: 17013394 DOI: 10.1038/ng1895] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2006] [Accepted: 09/05/2006] [Indexed: 01/29/2023]
Abstract
Since the creation of Dolly via somatic cell nuclear transfer (SCNT), more than a dozen species of mammals have been cloned using this technology. One hypothesis for the limited success of cloning via SCNT (1%-5%) is that the clones are likely to be derived from adult stem cells. Support for this hypothesis comes from the findings that the reproductive cloning efficiency for embryonic stem cells is five to ten times higher than that for somatic cells as donors and that cloned pups cannot be produced directly from cloned embryos derived from differentiated B and T cells or neuronal cells. The question remains as to whether SCNT-derived animal clones can be derived from truly differentiated somatic cells. We tested this hypothesis with mouse hematopoietic cells at different differentiation stages: hematopoietic stem cells, progenitor cells and granulocytes. We found that cloning efficiency increases over the differentiation hierarchy, and terminally differentiated postmitotic granulocytes yield cloned pups with the greatest cloning efficiency.
Collapse
|
41
|
|
42
|
FAT10/diubiquitin-like protein-deficient mice exhibit minimal phenotypic differences. Mol Cell Biol 2006; 26:5180-9. [PMID: 16782901 PMCID: PMC1489174 DOI: 10.1128/mcb.00966-05] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The FAT10 gene encodes a diubiquitin-like protein containing two tandem head-to-tail ubiquitin-like domains. There is a high degree of similarity between murine and human FAT10 sequences at both the mRNA and protein levels. In various cell lines, FAT10 expression was shown to be induced by gamma interferon or by tumor necrosis factor alpha. In addition, FAT10 expression was found to be up-regulated in some Epstein-Barr virus-infected B-cell lines, in activated dendritic cells, and in several epithelial tumors. However, forced expression of FAT10 in cultured cells was also found to produce apoptotic cell death. Overall, these findings suggest that FAT10 may modulate cellular growth or cellular viability. Here we describe the steps to generate, by genetic targeting, a FAT10 gene knockout mouse model. The FAT10 knockout homozygous mice are viable and fertile. No gross lesions or obvious histological differences were found in these mutated mice. Examination of lymphocyte populations from spleen, thymus, and bone marrow did not reveal any abnormalities. However, flow cytometry analysis demonstrated that the lymphocytes of FAT10 knockout mice were, on average, more prone to spontaneous apoptotic death. Physiologically, these mice demonstrated a high level of sensitivity toward endotoxin challenge. These findings indicate that FAT10 may function as a survival factor.
Collapse
|
43
|
Identification of coding single nucleotide polymorphisms and mutations by combination of genome tiling arrays and enrichment/depletion of mismatch cDNAs. Anal Biochem 2006; 356:117-24. [PMID: 16777053 DOI: 10.1016/j.ab.2006.05.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2006] [Revised: 05/11/2006] [Accepted: 05/11/2006] [Indexed: 11/27/2022]
Abstract
Genome tiling array technology combined with a method for both enrichment and depletion of mismatch-containing cDNA fragments offers a useful approach for detecting coding single nucleotide polymorphisms (cSNPs) and mutations in pooled cDNA samples. Enriched mismatch and perfect match cDNA samples from human primary melanoma cells and normal melanocytes were obtained by selection using mismatch repair thymine DNA glycosylase-bound beads. These cDNA samples were then labeled and hybridized to Encyclopedia of DNA Elements genome tiling arrays. The results revealed that the hybridization intensity values of potential cDNA variation regions of the enriched mismatch samples increased, whereas the hybridization intensity values of corresponding regions of the enriched perfect match samples decreased. Six potential mutations were confirmed by polymerase chain reaction product sequencing, including two novel heterozygous mutations in melanoma cells. We suggest that this strategy should increase the efficiency of both cSNP and mutation detection throughout the entire human genome and decrease the cost and complexity of genomewide analysis of cDNA variations.
Collapse
|
44
|
High-resolution mapping of DNA copy alterations in human chromosome 22 using high-density tiling oligonucleotide arrays. Proc Natl Acad Sci U S A 2006; 103:4534-9. [PMID: 16537408 PMCID: PMC1450206 DOI: 10.1073/pnas.0511340103] [Citation(s) in RCA: 118] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Deletions and amplifications of the human genomic sequence (copy number polymorphisms) are the cause of numerous diseases and a potential cause of phenotypic variation in the normal population. Comparative genomic hybridization (CGH) has been developed as a useful tool for detecting alterations in DNA copy number that involve blocks of DNA several kilobases or larger in size. We have developed high-resolution CGH (HR-CGH) to detect accurately and with relatively little bias the presence and extent of chromosomal aberrations in human DNA. Maskless array synthesis was used to construct arrays containing 385,000 oligonucleotides with isothermal probes of 45-85 bp in length; arrays tiling the beta-globin locus and chromosome 22q were prepared. Arrays with a 9-bp tiling path were used to map a 622-bp heterozygous deletion in the beta-globin locus. Arrays with an 85-bp tiling path were used to analyze DNA from patients with copy number changes in the pericentromeric region of chromosome 22q. Heterozygous deletions and duplications as well as partial triploidies and partial tetraploidies of portions of chromosome 22q were mapped with high resolution (typically up to 200 bp) in each patient, and the precise breakpoints of two deletions were confirmed by DNA sequencing. Additional peaks potentially corresponding to known and novel additional CNPs were also observed. Our results demonstrate that HR-CGH allows the detection of copy number changes in the human genome at an unprecedented level of resolution.
Collapse
|
45
|
Multi-protein complexes at the beta-globin locus. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2006; 5:62-5. [PMID: 16769681 DOI: 10.1093/bfgp/ell001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Biochemical analysis of the beta-globin gene function has led to the identification of several multi-protein complexes at the locus control region (LCR), insulator and promoters. This review briefly summarizes these multi-protein complexes and discusses their contribution towards the regulation of the beta-globin gene expression.
Collapse
|
46
|
Abstract
Mature B cells, upon activation, progressively differentiate through centroblasts into centrocytes and finally to plasmacytes that express large amounts of selected immunoglobulins. A significant part of this maturation is thought to involve induction of the unfolded protein response (UPR). We have compared gene expression in normal germinal center centroblasts, centrocytes, lymphoblastoid cells undergoing induced UPR, and the CCL155 plasmacytoma cell line. In the centroblast to centrocyte transition there is a change in the expression of a relatively small number of genes. These include a limited subset of the genes upregulated by a fully activated UPR as well as a small number of other transcription factors, some disulphide isomerases, and other genes. This is consistent with a model in which this transition is mediated by changes in the levels of expression of transcription factor B-lymphocyte-induced maturation protein 1 (Blimp1) (PRDM1), BACH2, X-box binding protein 1 (XBP1), interferon regulatory factor 4 (IRF4), and possibly vitamin D receptor (VDR) expression, together with post-transcriptional changes and a limited induction of aspects of the UPR.
Collapse
|
47
|
Abstract
The biochemistry of early stages of hematopoietic differentiation is difficult to study because only relatively small numbers of precursor cells are available. The murine EML cell line is a multipotential cell line that can be used to model some of these steps. We found that the lineage- EML precursor cells can be separated into two populations based on cell surface markers including CD34. Both populations contain similar levels of stem cell factor (SCF) receptor (c-Kit) but only the CD34+ population shows a growth response when treated with SCF. Conversely, the CD34- population will grow in the presence of the cytokine IL-3. The human beta-globin locus control region hypersensitive site 2 plays different roles on beta-globin transcription in the CD34+ and CD34- populations. The two populations are present in about equal amounts in culture, and the CD34+ population rapidly regenerates the mixed population when grown in the presence of SCF. We suggest that this system may mimic a normal developmental transition in hematopoiesis.
Collapse
|
48
|
Werner protein protects nonproliferating cells from oxidative DNA damage. Mol Cell Biol 2005; 25:10492-506. [PMID: 16287861 PMCID: PMC1291253 DOI: 10.1128/mcb.25.23.10492-10506.2005] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2004] [Revised: 01/10/2005] [Accepted: 08/31/2005] [Indexed: 02/06/2023] Open
Abstract
Werner syndrome, caused by mutations of the WRN gene, mimics many changes of normal aging. Although roles for WRN protein in DNA replication, recombination, and telomere maintenance have been suggested, the pathology of rapidly dividing cells is not a feature of Werner syndrome. To identify cellular events that are specifically vulnerable to WRN deficiency, we used RNA interference (RNAi) to knockdown WRN or BLM (the RecQ helicase mutated in Bloom syndrome) expression in primary human fibroblasts. Withdrawal of WRN or BLM produced accelerated cellular senescence phenotype and DNA damage response in normal fibroblasts, as evidenced by induction of gammaH2AX and 53BP1 nuclear foci. After WRN depletion, the induction of these foci was seen most prominently in nondividing cells. Growth in physiological (3%) oxygen or in the presence of an antioxidant prevented the development of the DNA damage foci in WRN-depleted cells, whereas acute oxidative stress led to inefficient repair of the lesions. Furthermore, WRN RNAi-induced DNA damage was suppressed by overexpression of the telomere-binding protein TRF2. These conditions, however, did not prevent the DNA damage response in BLM-ablated cells, suggesting a distinct role for WRN in DNA homeostasis in vivo. Thus, manifestations of Werner syndrome may reflect an impaired ability of slowly dividing cells to limit oxidative DNA damage.
Collapse
|
49
|
Heterogeneous nuclear ribonucleoprotein C1/C2, MeCP1, and SWI/SNF form a chromatin remodeling complex at the beta-globin locus control region. Proc Natl Acad Sci U S A 2005; 102:15012-7. [PMID: 16217013 PMCID: PMC1257739 DOI: 10.1073/pnas.0507596102] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Locus control regions (LCRs) are regulatory DNA sequences that are situated many kilobases away from their cognate promoters. LCRs protect transgenes from position effect variegation and heterochromatinization and also promote copy-number dependence of the levels of transgene expression. In this work, we describe the biochemical purification of a previously undescribed LCR-associated remodeling complex (LARC) that consists of heterogeneous nuclear ribonucleoprotein C1/C2, nucleosome remodeling SWI/SNF, and nucleosome remodeling and deacetylating (NuRD)/MeCP1 as a single homogeneous complex. LARC binds to the hypersensitive 2 (HS2)-Maf recognition element (MARE) DNA in a sequence-specific manner and remodels nucleosomes. Heterogeneous nuclear ribonucleoprotein C1/C2, previously known as a general RNA binding protein, provides a sequence-specific DNA recognition element for LARC, and the LARC DNA-recognition sequence is essential for the enhancement of transcription by HS2. Independently of the initiation of transcription, LARC becomes associated with beta-like globin promoters.
Collapse
|
50
|
Abstract
The hematopoietic system offers many advantages as a model for understanding general aspects of lineage choice and specification. Using oligonucleotide microarrays, we compared gene expression patterns of multiple purified hematopoietic cell populations, including neutrophils, monocytes, macrophages, resting, centrocytic, and centroblastic B lymphocytes, dendritic cells, and hematopoietic stem cells. Some of these cells were studied under both resting and stimulated conditions. We studied the collective behavior of subsets of genes derived from the Biocarta database of functional pathways, hand-tuned groupings of genes into broad functional categories based on the Gene Ontology database, and the metabolic pathways in the Kyoto Encyclopedia of Genes and Genomes database. Principal component analysis revealed strikingly pervasive differences in relative levels of gene expression among cell lineages that involve most of the subsets examined. These results indicate that many processes in these cells behave differently in different lineages. Much of the variation among lineages was captured by the first few principal components. Principal components biplots were found to provide a convenient visual display of the contributions of the various genes within the subsets in lineage discrimination. Moreover, by applying tree-constructing methodologies borrowed from phylogenetics to the expression data from differentiated cells and stem cells, we reconstructed a tree of relationships that resembled the established hematopoietic program of lineage development. Thus, the mRNA expression data implicitly contained information about developmental relationships among cell types.
Collapse
|