1
|
SCENIC+: single-cell multiomic inference of enhancers and gene regulatory networks. Nat Methods 2023; 20:1355-1367. [PMID: 37443338 PMCID: PMC10482700 DOI: 10.1038/s41592-023-01938-4] [Citation(s) in RCA: 67] [Impact Index Per Article: 67.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Accepted: 06/06/2023] [Indexed: 07/15/2023]
Abstract
Joint profiling of chromatin accessibility and gene expression in individual cells provides an opportunity to decipher enhancer-driven gene regulatory networks (GRNs). Here we present a method for the inference of enhancer-driven GRNs, called SCENIC+. SCENIC+ predicts genomic enhancers along with candidate upstream transcription factors (TFs) and links these enhancers to candidate target genes. To improve both recall and precision of TF identification, we curated and clustered a motif collection with more than 30,000 motifs. We benchmarked SCENIC+ on diverse datasets from different species, including human peripheral blood mononuclear cells, ENCODE cell lines, melanoma cell states and Drosophila retinal development. Next, we exploit SCENIC+ predictions to study conserved TFs, enhancers and GRNs between human and mouse cell types in the cerebral cortex. Finally, we use SCENIC+ to study the dynamics of gene regulation along differentiation trajectories and the effect of TF perturbations on cell state. SCENIC+ is available at scenicplus.readthedocs.io .
Collapse
|
2
|
Abstract
The Drosophila brain is a frequently used model in neuroscience. Single-cell transcriptome analysis1-6, three-dimensional morphological classification7 and electron microscopy mapping of the connectome8,9 have revealed an immense diversity of neuronal and glial cell types that underlie an array of functional and behavioural traits in the fly. The identities of these cell types are controlled by gene regulatory networks (GRNs), involving combinations of transcription factors that bind to genomic enhancers to regulate their target genes. Here, to characterize GRNs at the cell-type level in the fly brain, we profiled the chromatin accessibility of 240,919 single cells spanning 9 developmental timepoints and integrated these data with single-cell transcriptomes. We identify more than 95,000 regulatory regions that are used in different neuronal cell types, of which 70,000 are linked to developmental trajectories involving neurogenesis, reprogramming and maturation. For 40 cell types, uniquely accessible regions were associated with their expressed transcription factors and downstream target genes through a combination of motif discovery, network inference and deep learning, creating enhancer GRNs. The enhancer architectures revealed by DeepFlyBrain lead to a better understanding of neuronal regulatory diversity and can be used to design genetic driver lines for cell types at specific timepoints, facilitating their characterization and manipulation.
Collapse
|
3
|
A scalable SCENIC workflow for single-cell gene regulatory network analysis. Nat Protoc 2020; 15:2247-2276. [PMID: 32561888 DOI: 10.1038/s41596-020-0336-2] [Citation(s) in RCA: 420] [Impact Index Per Article: 105.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 04/17/2020] [Indexed: 11/09/2022]
Abstract
This protocol explains how to perform a fast SCENIC analysis alongside standard best practices steps on single-cell RNA-sequencing data using software containers and Nextflow pipelines. SCENIC reconstructs regulons (i.e., transcription factors and their target genes) assesses the activity of these discovered regulons in individual cells and uses these cellular activity patterns to find meaningful clusters of cells. Here we present an improved version of SCENIC with several advances. SCENIC has been refactored and reimplemented in Python (pySCENIC), resulting in a tenfold increase in speed, and has been packaged into containers for ease of use. It is now also possible to use epigenomic track databases, as well as motifs, to refine regulons. In this protocol, we explain the different steps of SCENIC: the workflow starts from the count matrix depicting the gene abundances for all cells and consists of three stages. First, coexpression modules are inferred using a regression per-target approach (GRNBoost2). Next, the indirect targets are pruned from these modules using cis-regulatory motif discovery (cisTarget). Lastly, the activity of these regulons is quantified via an enrichment score for the regulon's target genes (AUCell). Nonlinear projection methods can be used to display visual groupings of cells based on the cellular activity patterns of these regulons. The results can be exported as a loom file and visualized in the SCope web application. This protocol is illustrated on two use cases: a peripheral blood mononuclear cell data set and a panel of single-cell RNA-sequencing cancer experiments. For a data set of 10,000 genes and 50,000 cells, the pipeline runs in <2 h.
Collapse
|
4
|
Identification of genomic enhancers through spatial integration of single-cell transcriptomics and epigenomics. Mol Syst Biol 2020; 16:e9438. [PMID: 32431014 PMCID: PMC7237818 DOI: 10.15252/msb.20209438] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Revised: 03/13/2020] [Accepted: 03/18/2020] [Indexed: 01/02/2023] Open
Abstract
Single-cell technologies allow measuring chromatin accessibility and gene expression in each cell, but jointly utilizing both layers to map bona fide gene regulatory networks and enhancers remains challenging. Here, we generate independent single-cell RNA-seq and single-cell ATAC-seq atlases of the Drosophila eye-antennal disc and spatially integrate the data into a virtual latent space that mimics the organization of the 2D tissue using ScoMAP (Single-Cell Omics Mapping into spatial Axes using Pseudotime ordering). To validate spatially predicted enhancers, we use a large collection of enhancer-reporter lines and identify ~ 85% of enhancers in which chromatin accessibility and enhancer activity are coupled. Next, we infer enhancer-to-gene relationships in the virtual space, finding that genes are mostly regulated by multiple, often redundant, enhancers. Exploiting cell type-specific enhancers, we deconvolute cell type-specific effects of bulk-derived chromatin accessibility QTLs. Finally, we discover that Prospero drives neuronal differentiation through the binding of a GGG motif. In summary, we provide a comprehensive spatial characterization of gene regulation in a 2D tissue.
Collapse
|
5
|
cisTopic: cis-regulatory topic modeling on single-cell ATAC-seq data. Nat Methods 2019; 16:397-400. [PMID: 30962623 DOI: 10.1038/s41592-019-0367-1] [Citation(s) in RCA: 205] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Accepted: 02/28/2019] [Indexed: 12/17/2022]
Abstract
We present cisTopic, a probabilistic framework used to simultaneously discover coaccessible enhancers and stable cell states from sparse single-cell epigenomics data ( http://github.com/aertslab/cistopic ). Using a compendium of single-cell ATAC-seq datasets from differentiating hematopoietic cells, brain and transcription factor perturbations, we demonstrate that topic modeling can be exploited for robust identification of cell types, enhancers and relevant transcription factors. cisTopic provides insight into the mechanisms underlying regulatory heterogeneity in cell populations.
Collapse
|
6
|
Toward Minimal Residual Disease-Directed Therapy in Melanoma. Cell 2018; 174:843-855.e19. [PMID: 30017245 DOI: 10.1016/j.cell.2018.06.025] [Citation(s) in RCA: 397] [Impact Index Per Article: 66.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Revised: 04/13/2018] [Accepted: 06/12/2018] [Indexed: 01/05/2023]
Abstract
Many patients with advanced cancers achieve dramatic responses to a panoply of therapeutics yet retain minimal residual disease (MRD), which ultimately results in relapse. To gain insights into the biology of MRD, we applied single-cell RNA sequencing to malignant cells isolated from BRAF mutant patient-derived xenograft melanoma cohorts exposed to concurrent RAF/MEK-inhibition. We identified distinct drug-tolerant transcriptional states, varying combinations of which co-occurred within MRDs from PDXs and biopsies of patients on treatment. One of these exhibited a neural crest stem cell (NCSC) transcriptional program largely driven by the nuclear receptor RXRG. An RXR antagonist mitigated accumulation of NCSCs in MRD and delayed the development of resistance. These data identify NCSCs as key drivers of resistance and illustrate the therapeutic potential of MRD-directed therapy. They also highlight how gene regulatory network architecture reprogramming may be therapeutically exploited to limit cellular heterogeneity, a key driver of disease progression and therapy resistance.
Collapse
|
7
|
Phenotype molding of stromal cells in the lung tumor microenvironment. Nat Med 2018; 24:1277-1289. [PMID: 29988129 DOI: 10.1038/s41591-018-0096-5] [Citation(s) in RCA: 927] [Impact Index Per Article: 154.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Accepted: 05/16/2018] [Indexed: 02/07/2023]
Abstract
Cancer cells are embedded in the tumor microenvironment (TME), a complex ecosystem of stromal cells. Here, we present a 52,698-cell catalog of the TME transcriptome in human lung tumors at single-cell resolution, validated in independent samples where 40,250 additional cells were sequenced. By comparing with matching non-malignant lung samples, we reveal a highly complex TME that profoundly molds stromal cells. We identify 52 stromal cell subtypes, including novel subpopulations in cell types hitherto considered to be homogeneous, as well as transcription factors underlying their heterogeneity. For instance, we discover fibroblasts expressing different collagen sets, endothelial cells downregulating immune cell homing and genes coregulated with established immune checkpoint transcripts and correlating with T-cell activity. By assessing marker genes for these cell subtypes in bulk RNA-sequencing data from 1,572 patients, we illustrate how these correlate with survival, while immunohistochemistry for selected markers validates them as separate cellular entities in an independent series of lung tumors. Hence, in providing a comprehensive catalog of stromal cells types and by characterizing their phenotype and co-optive behavior, this resource provides deeper insights into lung cancer biology that will be helpful in advancing lung cancer diagnosis and therapy.
Collapse
|
8
|
The transcription factor Grainy head primes epithelial enhancers for spatiotemporal activation by displacing nucleosomes. Nat Genet 2018; 50:1011-1020. [PMID: 29867222 PMCID: PMC6031307 DOI: 10.1038/s41588-018-0140-x] [Citation(s) in RCA: 85] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Accepted: 04/06/2018] [Indexed: 12/21/2022]
Abstract
Transcriptional enhancers function as docking platforms for combinations of transcription factors (TFs) to control gene expression. How enhancer sequences determine nucleosome occupancy, TF recruitment and transcriptional activation in vivo remains unclear. Using ATAC-seq across a panel of Drosophila inbred strains, we found that SNPs affecting binding sites of the TF Grainy head (Grh) causally determine the accessibility of epithelial enhancers. We show that deletion and ectopic expression of Grh cause loss and gain of DNA accessibility, respectively. However, although Grh binding is necessary for enhancer accessibility, it is insufficient to activate enhancers. Finally, we show that human Grh homologs-GRHL1, GRHL2 and GRHL3-function similarly. We conclude that Grh binding is necessary and sufficient for the opening of epithelial enhancers but not for their activation. Our data support a model positing that complex spatiotemporal expression patterns are controlled by regulatory hierarchies in which pioneer factors, such as Grh, establish tissue-specific accessible chromatin landscapes upon which other factors can act.
Collapse
|
9
|
Abstract
Single-cell techniques are advancing rapidly and are yielding unprecedented insight into cellular heterogeneity. Mapping the gene regulatory networks (GRNs) underlying cell states provides attractive opportunities to mechanistically understand this heterogeneity. In this review, we discuss recently emerging methods to map GRNs from single-cell transcriptomics data, tackling the challenge of increased noise levels and data sparsity compared with bulk data, alongside increasing data volumes. Next, we discuss how new techniques for single-cell epigenomics, such as single-cell ATAC-seq and single-cell DNA methylation profiling, can be used to decipher gene regulatory programmes. We finally look forward to the application of single-cell multi-omics and perturbation techniques that will likely play important roles for GRN inference in the future.
Collapse
|
10
|
A Single-Cell Transcriptome Atlas of the Aging Drosophila Brain. Cell 2018; 174:982-998.e20. [PMID: 29909982 PMCID: PMC6086935 DOI: 10.1016/j.cell.2018.05.057] [Citation(s) in RCA: 407] [Impact Index Per Article: 67.8] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2017] [Revised: 03/30/2018] [Accepted: 05/25/2018] [Indexed: 02/06/2023]
Abstract
The diversity of cell types and regulatory states in the brain, and how these change during aging, remains largely unknown. We present a single-cell transcriptome atlas of the entire adult Drosophila melanogaster brain sampled across its lifespan. Cell clustering identified 87 initial cell clusters that are further subclustered and validated by targeted cell-sorting. Our data show high granularity and identify a wide range of cell types. Gene network analyses using SCENIC revealed regulatory heterogeneity linked to energy consumption. During aging, RNA content declines exponentially without affecting neuronal identity in old brains. This single-cell brain atlas covers nearly all cells in the normal brain and provides the tools to study cellular diversity alongside other Drosophila and mammalian single-cell datasets in our unique single-cell analysis platform: SCope (http://scope.aertslab.org). These results, together with SCope, allow comprehensive exploration of all transcriptional states of an entire aging brain. A single-cell atlas of the adult fly brain during aging Network inference reveals regulatory states related to oxidative phosphorylation Cell identity is retained during aging despite exponential decline of gene expression SCope: An online tool to explore and compare single-cell datasets across species
Collapse
|
11
|
Comparative oncogenomics identifies tyrosine kinase FES as a tumor suppressor in melanoma. J Clin Invest 2017; 127:2310-2325. [PMID: 28463229 DOI: 10.1172/jci91291] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Accepted: 03/02/2017] [Indexed: 01/11/2023] Open
Abstract
Identification and functional validation of oncogenic drivers are essential steps toward advancing cancer precision medicine. Here, we have presented a comprehensive analysis of the somatic genomic landscape of the widely used BRAFV600E- and NRASQ61K-driven mouse models of melanoma. By integrating the data with publically available genomic, epigenomic, and transcriptomic information from human clinical samples, we confirmed the importance of several genes and pathways previously implicated in human melanoma, including the tumor-suppressor genes phosphatase and tensin homolog (PTEN), cyclin dependent kinase inhibitor 2A (CDKN2A), LKB1, and others. Importantly, this approach also identified additional putative melanoma drivers with prognostic and therapeutic relevance. Surprisingly, one of these genes encodes the tyrosine kinase FES. Whereas FES is highly expressed in normal human melanocytes, FES expression is strongly decreased in over 30% of human melanomas. This downregulation correlates with poor overall survival. Correspondingly, engineered deletion of Fes accelerated tumor progression in a BRAFV600E-driven mouse model of melanoma. Together, these data implicate FES as a driver of melanoma progression and demonstrate the potential of cross-species oncogenomic approaches combined with mouse modeling to uncover impactful mutations and oncogenic driver alleles with clinical importance in the treatment of human cancer.
Collapse
|
12
|
Identification of expression patterns in the progression of disease stages by integration of transcriptomic data. BMC Bioinformatics 2016; 17:432. [PMID: 28185568 PMCID: PMC5133487 DOI: 10.1186/s12859-016-1290-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Background In the study of complex diseases using genome-wide expression data from clinical samples, a difficult case is the identification and mapping of the gene signatures associated to the stages that occur in the progression of a disease. The stages usually correspond to different subtypes or classes of the disease, and the difficulty to identify them often comes from patient heterogeneity and sample variability that can hide the biomedical relevant changes that characterize each stage, making standard differential analysis inadequate or inefficient. Results We propose a methodology to study diseases or disease stages ordered in a sequential manner (e.g. from early stages with good prognosis to more acute or serious stages associated to poor prognosis). The methodology is applied to diseases that have been studied obtaining genome-wide expression profiling of cohorts of patients at different stages. The approach allows searching for consistent expression patterns along the progression of the disease through two major steps: (i) identifying genes with increasing or decreasing trends in the progression of the disease; (ii) clustering the increasing/decreasing gene expression patterns using an unsupervised approach to reveal whether there are consistent patterns and find genes altered at specific disease stages. The first step is carried out using Gamma rank correlation to identify genes whose expression correlates with a categorical variable that represents the stages of the disease. The second step is done using a Self Organizing Map (SOM) to cluster the genes according to their progressive profiles and identify specific patterns. Both steps are done after normalization of the genomic data to allow the integration of multiple independent datasets. In order to validate the results and evaluate their consistency and biological relevance, the methodology is applied to datasets of three different diseases: myelodysplastic syndrome, colorectal cancer and Alzheimer’s disease. A software script written in R, named genediseasePatterns, is provided to allow the use and application of the methodology. Conclusion The method presented allows the analysis of the progression of complex and heterogeneous diseases that can be divided in pathological stages. It identifies gene groups whose expression patterns change along the advance of the disease, and it can be applied to different types of genomic data studying cohorts of patients in different states. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1290-4) contains supplementary material, which is available to authorized users.
Collapse
|
13
|
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd. Nat Commun 2016; 7:12846. [PMID: 27667448 PMCID: PMC5052684 DOI: 10.1038/ncomms12846] [Citation(s) in RCA: 156] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 08/05/2016] [Indexed: 12/14/2022] Open
Abstract
Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization.
Collapse
|
14
|
Analyse multiple disease subtypes and build associated gene networks using genome-wide expression profiles. BMC Genomics 2015; 16 Suppl 5:S3. [PMID: 26040557 PMCID: PMC4460584 DOI: 10.1186/1471-2164-16-s5-s3] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND Despite the large increase of transcriptomic studies that look for gene signatures on diseases, there is still a need for integrative approaches that obtain separation of multiple pathological states providing robust selection of gene markers for each disease subtype and information about the possible links or relations between those genes. RESULTS We present a network-oriented and data-driven bioinformatic approach that searches for association of genes and diseases based on the analysis of genome-wide expression data derived from microarrays or RNA-Seq studies. The approach aims to (i) identify gene sets associated to different pathological states analysed together; (ii) identify a minimum subset within these genes that unequivocally differentiates and classifies the compared disease subtypes; (iii) provide a measurement of the discriminant power of these genes and (iv) identify links between the genes that characterise each of the disease subtypes. This bioinformatic approach is implemented in an R package, named geNetClassifier, available as an open access tool in Bioconductor. To illustrate the performance of the tool, we applied it to two independent datasets: 250 samples from patients with four major leukemia subtypes analysed using expression arrays; another leukemia dataset analysed with RNA-Seq that includes a subtype also present in the previous set. The results show the selection of key deregulated genes recently reported in the literature and assigned to the leukemia subtypes studied. We also show, using these independent datasets, the selection of similar genes in a network built for the same disease subtype. CONCLUSIONS The construction of gene networks related to specific disease subtypes that include parameters such as gene-to-gene association, gene disease specificity and gene discriminant power can be very useful to draw gene-disease maps and to unravel the molecular features that characterize specific pathological states. The application of the bioinformatic tool here presented shows a neat way to achieve such molecular characterization of the diseases using genome-wide expression data.
Collapse
|
15
|
Functional Gene Networks: R/Bioc package to generate and analyse gene networks derived from functional enrichment and clustering. ACTA ACUST UNITED AC 2015; 31:1686-8. [PMID: 25600944 PMCID: PMC4426835 DOI: 10.1093/bioinformatics/btu864] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2014] [Accepted: 12/29/2014] [Indexed: 12/04/2022]
Abstract
Summary: Functional Gene Networks (FGNet) is an R/Bioconductor package that generates gene networks derived from the results of functional enrichment analysis (FEA) and annotation clustering. The sets of genes enriched with specific biological terms (obtained from a FEA platform) are transformed into a network by establishing links between genes based on common functional annotations and common clusters. The network provides a new view of FEA results revealing gene modules with similar functions and genes that are related to multiple functions. In addition to building the functional network, FGNet analyses the similarity between the groups of genes and provides a distance heatmap and a bipartite network of functionally overlapping genes. The application includes an interface to directly perform FEA queries using different external tools: DAVID, GeneTerm Linker, TopGO or GAGE; and a graphical interface to facilitate the use. Availability and implementation:FGNet is available in Bioconductor, including a tutorial. URL: http://bioconductor.org/packages/release/bioc/html/FGNet.html Contact:jrivas@usal.es Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
|
16
|
Combined analysis of genome-wide expression and copy number profiles to identify key altered genomic regions in cancer. BMC Genomics 2012; 13 Suppl 5:S5. [PMID: 23095915 PMCID: PMC3476997 DOI: 10.1186/1471-2164-13-s5-s5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Analysis of DNA copy number alterations and gene expression changes in human samples have been used to find potential target genes in complex diseases. Recent studies have combined these two types of data using different strategies, but focusing on finding gene-based relationships. However, it has been proposed that these data can be used to identify key genomic regions, which may enclose causal genes under the assumption that disease-associated gene expression changes are caused by genomic alterations.
Collapse
|
17
|
Genome-wide profiling of methylation identifies novel targets with aberrant hypermethylation and reduced expression in low-risk myelodysplastic syndromes. Leukemia 2012; 27:610-8. [PMID: 22936014 DOI: 10.1038/leu.2012.253] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Gene expression profiling signatures may be used to classify the subtypes of Myelodysplastic syndrome (MDS) patients. However, there are few reports on the global methylation status in MDS. The integration of genome-wide epigenetic regulatory marks with gene expression levels would provide additional information regarding the biological differences between MDS and healthy controls. Gene expression and methylation status were measured using high-density microarrays. A total of 552 differentially methylated CpG loci were identified as being present in low-risk MDS; hypermethylated genes were more frequent than hypomethylated genes. In addition, mRNA expression profiling identified 1005 genes that significantly differed between low-risk MDS and the control group. Integrative analysis of the epigenetic and expression profiles revealed that 66.7% of the hypermethylated genes were underexpressed in low-risk MDS cases. Gene network analysis revealed molecular mechanisms associated with the low-risk MDS group, including altered apoptosis pathways. The two key apoptotic genes BCL2 and ETS1 were identified as silenced genes. In addition, the immune response and micro RNA biogenesis were affected by the hypermethylation and underexpression of IL27RA and DICER1. Our integrative analysis revealed that aberrant epigenetic regulation is a hallmark of low-risk MDS patients and could have a central role in these diseases.
Collapse
|
18
|
235 An integrative analysis of methylation and gene expression profiling in low-risk myelodysplastic syndromes (MDS). Leuk Res 2011. [DOI: 10.1016/s0145-2126(11)70237-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|