1
|
Jurkowska RZ. Role of epigenetic mechanisms in the pathogenesis of chronic respiratory diseases and response to inhaled exposures: From basic concepts to clinical applications. Pharmacol Ther 2024:108732. [PMID: 39426605 DOI: 10.1016/j.pharmthera.2024.108732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Revised: 08/15/2024] [Accepted: 10/11/2024] [Indexed: 10/21/2024]
Abstract
Epigenetic modifications are chemical groups in our DNA (and chromatin) that determine which genes are active and which are shut off. Importantly, they integrate environmental signals to direct cellular function. Upon chronic environmental exposures, the epigenetic signature of lung cells gets altered, triggering aberrant gene expression programs that can lead to the development of chronic lung diseases. In addition to driving disease, epigenetic marks can serve as attractive lung disease biomarkers, due to early onset, disease specificity, and stability, warranting the need for more epigenetic research in the lung field. Despite substantial progress in mapping epigenetic alterations (mostly DNA methylation) in chronic lung diseases, the molecular mechanisms leading to their establishment are largely unknown. This review is meant as a guide for clinicians and lung researchers interested in epigenetic regulation with a focus on DNA methylation. It provides a short introduction to the main epigenetic mechanisms (DNA methylation, histone modifications and non-coding RNA) and the machinery responsible for their establishment and removal. It presents examples of epigenetic dysregulation across a spectrum of chronic lung diseases and discusses the current state of epigenetic therapies. Finally, it introduces the concept of epigenetic editing, an exciting novel approach to dissecting the functional role of epigenetic modifications. The promise of this emerging technology for the functional study of epigenetic mechanisms in cells and its potential future use in the clinic is further discussed.
Collapse
Affiliation(s)
- Renata Z Jurkowska
- Division of Biomedicine, School of Biosciences, Cardiff University, Cardiff, UK.
| |
Collapse
|
2
|
Xu H, Zhang G, Chen J. A novel method for cell deconvolution using DNA methylation in PCA space. BMC Genomics 2024; 25:798. [PMID: 39179972 PMCID: PMC11344294 DOI: 10.1186/s12864-024-10652-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Accepted: 07/22/2024] [Indexed: 08/26/2024] Open
Abstract
BACKGROUND In this study, we present a novel method for reference-based cell deconvolution using data from DNA methylation arrays. Different from existing methods like IDOL-Ext, which operate on probe-level data, our approach represents features in the principal component analysis (PCA) space for cell type deconvolution. RESULTS Our method's accuracy in estimating cell compositions is validated across various public datasets, including blood samples from glioma patients. It demonstrates precision comparable to IDOL-Ext, with R2 values ranging from 0.73 to 0.99 for most cell types, while offering improved discrimination between similar cell types, particularly T cell subtypes in glioma patient samples (R2 0.42-0.75 vs. 0.36-0.66 for IDOL-Ext). However, both methods showed lower accuracy for certain cell types, such as memory CD8 T cells in glioma patients (R2 0.42 vs. 0.36 for IDOL-Ext), highlighting the challenges in distinguishing closely related cell populations. We have made this method available as an R package "BloodCellDecon" on GitHub. CONCLUSIONS Our study confirms the efficacy of cell type deconvolution in PCA space. The results indicate wide-ranging applicability and potential for adaptation to other forms of genomic data.
Collapse
Affiliation(s)
- Huan Xu
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Ge Zhang
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
- Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children's Hospital Medical Center and March of Dimes Prematurity Research Center Ohio Collaborative, Cincinnati, OH, USA
| | - Jing Chen
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA.
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA.
| |
Collapse
|
3
|
Jeong Y, Ronen J, Kopp W, Lutsik P, Akalin A. scMaui: a widely applicable deep learning framework for single-cell multiomics integration in the presence of batch effects and missing data. BMC Bioinformatics 2024; 25:257. [PMID: 39107690 PMCID: PMC11304929 DOI: 10.1186/s12859-024-05880-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 07/23/2024] [Indexed: 08/10/2024] Open
Abstract
The recent advances in high-throughput single-cell sequencing have created an urgent demand for computational models which can address the high complexity of single-cell multiomics data. Meticulous single-cell multiomics integration models are required to avoid biases towards a specific modality and overcome sparsity. Batch effects obfuscating biological signals must also be taken into account. Here, we introduce a new single-cell multiomics integration model, Single-cell Multiomics Autoencoder Integration (scMaui) based on variational product-of-experts autoencoders and adversarial learning. scMaui calculates a joint representation of multiple marginal distributions based on a product-of-experts approach which is especially effective for missing values in the modalities. Furthermore, it overcomes limitations seen in previous VAE-based integration methods with regard to batch effect correction and restricted applicable assays. It handles multiple batch effects independently accepting both discrete and continuous values, as well as provides varied reconstruction loss functions to cover all possible assays and preprocessing pipelines. We demonstrate that scMaui achieves superior performance in many tasks compared to other methods. Further downstream analyses also demonstrate its potential in identifying relations between assays and discovering hidden subpopulations.
Collapse
Affiliation(s)
- Yunhee Jeong
- Division of Cancer Epigenomics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, Heidelberg, Germany
- Faculty of Mathematics and Informatics, Heidelberg University, Im Neuenheimer Feld 205, Heidelberg, Germany
| | - Jonathan Ronen
- Bioinformatics and Omics Data Science Platform, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, Berlin, Germany
- Inceptive Nucleics, Inc., Palo Alto, CA, USA
| | - Wolfgang Kopp
- Bioinformatics and Omics Data Science Platform, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, Berlin, Germany
- Roche Diagnostics GmbH, Penzberg, Germany
| | - Pavlo Lutsik
- Division of Cancer Epigenomics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, Heidelberg, Germany.
- Department of Oncology, Catholic University (KU) Leuven, Leuven, Belgium.
| | - Altuna Akalin
- Bioinformatics and Omics Data Science Platform, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, Berlin, Germany.
| |
Collapse
|
4
|
Unterman I, Avrahami D, Katsman E, Triche TJ, Glaser B, Berman BP. CelFiE-ISH: a probabilistic model for multi-cell type deconvolution from single-molecule DNA methylation haplotypes. Genome Biol 2024; 25:151. [PMID: 38858759 PMCID: PMC11163775 DOI: 10.1186/s13059-024-03275-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Accepted: 05/13/2024] [Indexed: 06/12/2024] Open
Abstract
Deconvolution methods infer quantitative cell type estimates from bulk measurement of mixed samples including blood and tissue. DNA methylation sequencing measures multiple CpGs per read, but few existing deconvolution methods leverage this within-read information. We develop CelFiE-ISH, which extends an existing method (CelFiE) to use within-read haplotype information. CelFiE-ISH outperforms CelFiE and other existing methods, achieving 30% better accuracy and more sensitive detection of rare cell types. We also demonstrate the importance of marker selection and of tailoring markers for haplotype-aware methods. While here we use gold-standard short-read sequencing data, haplotype-aware methods will be well-suited for long-read sequencing.
Collapse
Affiliation(s)
- Irene Unterman
- Department of Developmental Biology and Cancer Research, Institute for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Dana Avrahami
- Department of Developmental Biology and Cancer Research, Institute for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
- Department of Endocrinology and Metabolism, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Efrat Katsman
- Department of Developmental Biology and Cancer Research, Institute for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Timothy J Triche
- Center for Epigenetics, Van Andel Research Institute, Grand Rapids, MI, USA
| | - Benjamin Glaser
- Department of Endocrinology and Metabolism, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel.
| | - Benjamin P Berman
- Department of Developmental Biology and Cancer Research, Institute for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel.
| |
Collapse
|
5
|
De Ridder K, Che H, Leroy K, Thienpont B. Benchmarking of methods for DNA methylome deconvolution. Nat Commun 2024; 15:4134. [PMID: 38755121 PMCID: PMC11099101 DOI: 10.1038/s41467-024-48466-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 04/30/2024] [Indexed: 05/18/2024] Open
Abstract
Defining the number and abundance of different cell types in tissues is important for understanding disease mechanisms as well as for diagnostic and prognostic purposes. Typically, this is achieved by immunohistological analyses, cell sorting, or single-cell RNA-sequencing. Alternatively, cell-specific DNA methylome information can be leveraged to deconvolve cell fractions from a bulk DNA mixture. However, comprehensive benchmarking of deconvolution methods and modalities was not yet performed. Here we evaluate 16 deconvolution algorithms, developed either specifically for DNA methylome data or more generically. We assess the performance of these algorithms, and the effect of normalization methods, while modeling variables that impact deconvolution performance, including cell abundance, cell type similarity, reference panel size, method for methylome profiling (array or sequencing), and technical variation. We observe differences in algorithm performance depending on each these variables, emphasizing the need for tailoring deconvolution analyses. The complexity of the reference, marker selection method, number of marker loci and, for sequencing-based assays, sequencing depth have a marked influence on performance. By developing handles to select the optimal analysis configuration, we provide a valuable source of information for studies aiming to deconvolve array- or sequencing-based methylation data.
Collapse
Affiliation(s)
- Kobe De Ridder
- Laboratory for Functional Epigenetics, Department of Human Genetics, KU Leuven, 3000, Leuven, Belgium
| | - Huiwen Che
- Laboratory for Functional Epigenetics, Department of Human Genetics, KU Leuven, 3000, Leuven, Belgium
| | - Kaat Leroy
- Laboratory for Functional Epigenetics, Department of Human Genetics, KU Leuven, 3000, Leuven, Belgium
| | - Bernard Thienpont
- Laboratory for Functional Epigenetics, Department of Human Genetics, KU Leuven, 3000, Leuven, Belgium.
- KU Leuven Institute for Single Cell Omics (LISCO), KU Leuven, 3000, Leuven, Belgium.
- KU Leuven Cancer Institute (LKI), KU Leuven, 3000, Leuven, Belgium.
| |
Collapse
|
6
|
Ferro dos Santos MR, Giuili E, De Koker A, Everaert C, De Preter K. Computational deconvolution of DNA methylation data from mixed DNA samples. Brief Bioinform 2024; 25:bbae234. [PMID: 38762790 PMCID: PMC11102637 DOI: 10.1093/bib/bbae234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 03/30/2024] [Accepted: 04/30/2024] [Indexed: 05/20/2024] Open
Abstract
In this review, we provide a comprehensive overview of the different computational tools that have been published for the deconvolution of bulk DNA methylation (DNAm) data. Here, deconvolution refers to the estimation of cell-type proportions that constitute a mixed sample. The paper reviews and compares 25 deconvolution methods (supervised, unsupervised or hybrid) developed between 2012 and 2023 and compares the strengths and limitations of each approach. Moreover, in this study, we describe the impact of the platform used for the generation of methylation data (including microarrays and sequencing), the applied data pre-processing steps and the used reference dataset on the deconvolution performance. Next to reference-based methods, we also examine methods that require only partial reference datasets or require no reference set at all. In this review, we provide guidelines for the use of specific methods dependent on the DNA methylation data type and data availability.
Collapse
Affiliation(s)
- Maísa R Ferro dos Santos
- VIB-UGent Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Zwijnaarde, Belgium
- Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium
| | - Edoardo Giuili
- VIB-UGent Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Zwijnaarde, Belgium
- Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium
| | - Andries De Koker
- VIB-UGent Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Zwijnaarde, Belgium
- Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium
| | - Celine Everaert
- VIB-UGent Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Zwijnaarde, Belgium
- Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium
| | - Katleen De Preter
- VIB-UGent Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Zwijnaarde, Belgium
- Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium
| |
Collapse
|
7
|
Zhang X, Hu Y, Vandenhoudt RE, Yan C, Marconi VC, Cohen MH, Wang Z, Justice AC, Aouizerat BE, Xu K. Computationally inferred cell-type specific epigenome-wide DNA methylation analysis unveils distinct methylation patterns among immune cells for HIV infection in three cohorts. PLoS Pathog 2024; 20:e1012063. [PMID: 38466776 PMCID: PMC10957090 DOI: 10.1371/journal.ppat.1012063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 03/21/2024] [Accepted: 02/20/2024] [Indexed: 03/13/2024] Open
Abstract
BACKGROUND Epigenome-wide association studies (EWAS) have identified CpG sites associated with HIV infection in blood cells in bulk, which offer limited knowledge of cell-type specific methylation patterns associated with HIV infection. In this study, we aim to identify differentially methylated CpG sites for HIV infection in immune cell types: CD4+ T-cells, CD8+ T-cells, B cells, Natural Killer (NK) cells, and monocytes. METHODS Applying a computational deconvolution method, we performed a cell-type based EWAS for HIV infection in three independent cohorts (Ntotal = 1,382). DNA methylation in blood or in peripheral blood mononuclear cells (PBMCs) was profiled by an array-based method and then deconvoluted by Tensor Composition Analysis (TCA). The TCA-computed CpG methylation in each cell type was first benchmarked by bisulfite DNA methylation capture sequencing in a subset of the samples. Cell-type EWAS of HIV infection was performed in each cohort separately and a meta-EWAS was conducted followed by gene set enrichment analysis. RESULTS The meta-analysis unveiled a total of 2,021 cell-type unique significant CpG sites for five inferred cell types. Among these inferred cell-type unique CpG sites, the concordance rate in the three cohorts ranged from 96% to 100% in each cell type. Cell-type level meta-EWAS unveiled distinct patterns of HIV-associated differential CpG methylation, where 74% of CpG sites were unique to individual cell types (false discovery rate, FDR <0.05). CD4+ T-cells had the largest number of unique HIV-associated CpG sites (N = 1,624) compared to any other cell type. Genes harboring significant CpG sites are involved in immunity and HIV pathogenesis (e.g. CD4+ T-cells: NLRC5, CX3CR1, B cells: IFI44L, NK cells: IL12R, monocytes: IRF7), and in oncogenesis (e.g. CD4+ T-cells: BCL family, PRDM16, monocytes: PRDM16, PDCD1LG2). HIV-associated CpG sites were enriched among genes involved in HIV pathogenesis and oncogenesis that were enriched among interferon-α and -γ, TNF-α, inflammatory response, and apoptotic pathways. CONCLUSION Our findings uncovered computationally inferred cell-type specific modifications in the host epigenome for people with HIV that contribute to the growing body of evidence regarding HIV pathogenesis.
Collapse
Affiliation(s)
- Xinyu Zhang
- Department of Psychiatry, Yale School of Medicine, New Haven, Connecticut, United States of America
- VA Connecticut Healthcare System, West Haven, Connecticut, United States of America
| | - Ying Hu
- Center for Biomedical Information and Information Technology, National Cancer Institute, Rockville, Maryland, United States of America
| | - Ral E. Vandenhoudt
- Department of Psychiatry, Yale School of Medicine, New Haven, Connecticut, United States of America
- VA Connecticut Healthcare System, West Haven, Connecticut, United States of America
| | - Chunhua Yan
- Center for Biomedical Information and Information Technology, National Cancer Institute, Rockville, Maryland, United States of America
| | - Vincent C. Marconi
- Division of Infectious Diseases, Emory University School of Medicine and Department of Global Health, Rollins School of Public Health, Emory University, Georgia, United States of America
- Atlanta Veterans Affairs Healthcare System, Decatur, Georgia, United States of America
| | - Mardge H. Cohen
- Department of Medicine, Stroger Hospital of Cook County, Chicago, Illinois, United States of America
| | - Zuoheng Wang
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, United States of America
| | - Amy C. Justice
- VA Connecticut Healthcare System, West Haven, Connecticut, United States of America
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, United States of America
- Department of Internal Medicine, Yale School of Medicine, New Haven, Connecticut, United States of America
| | - Bradley E. Aouizerat
- Translational Research Center, College of Dentistry, New York University, New York, New York, United States of America
- Department of Oral and Maxillofacial Surgery, College of Dentistry, New York University, New York, New York, United States of America
| | - Ke Xu
- Department of Psychiatry, Yale School of Medicine, New Haven, Connecticut, United States of America
- VA Connecticut Healthcare System, West Haven, Connecticut, United States of America
- Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, Connecticut, United States of America
| |
Collapse
|
8
|
Garmire LX, Li Y, Huang Q, Xu C, Teichmann SA, Kaminski N, Pellegrini M, Nguyen Q, Teschendorff AE. Challenges and perspectives in computational deconvolution of genomics data. Nat Methods 2024; 21:391-400. [PMID: 38374264 DOI: 10.1038/s41592-023-02166-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 12/26/2023] [Indexed: 02/21/2024]
Abstract
Deciphering cell-type heterogeneity is crucial for systematically understanding tissue homeostasis and its dysregulation in diseases. Computational deconvolution is an efficient approach for estimating cell-type abundances from a variety of omics data. Despite substantial methodological progress in computational deconvolution in recent years, challenges are still outstanding. Here we enlist four important challenges related to computational deconvolution: the quality of the reference data, generation of ground truth data, limitations of computational methodologies, and benchmarking design and implementation. Finally, we make recommendations on reference data generation, new directions of computational methodologies, and strategies to promote rigorous benchmarking.
Collapse
Affiliation(s)
- Lana X Garmire
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| | - Yijun Li
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Qianhui Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Chuan Xu
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | | | - Naftali Kaminski
- Pulmonary, Critical Care & Sleep Medicine, Yale University School of Medicine, New Haven, CT, USA
| | - Matteo Pellegrini
- Molecular, Cell and Developmental Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Quan Nguyen
- Institute for Molecular Bioscience, The University of Queensland and QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Andrew E Teschendorff
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- UCL Cancer Institute, University College London, London, UK
| |
Collapse
|
9
|
Piraud M, Camero A, Götz M, Kesselheim S, Steinbach P, Weigel T. Providing AI expertise as an infrastructure in academia. PATTERNS (NEW YORK, N.Y.) 2023; 4:100819. [PMID: 37602219 PMCID: PMC10436042 DOI: 10.1016/j.patter.2023.100819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/22/2023]
Abstract
Artificial intelligence (AI) is proliferating and developing faster than any domain scientist can adapt. To support the scientific enterprise in the Helmholtz association, a network of AI specialists has been set up to disseminate AI expertise as an infrastructure among domain scientists. As this effort exposes an evolutionary step in science organization in Germany, this article aspires to describe our setup, goals, and motivations. We comment on past experiences, current developments, and future ideas as we bring our expertise as an infrastructure closer to scientists across our organization. We hope that this offers a brief yet insightful view of our activities as well as inspiration for other science organizations.
Collapse
Affiliation(s)
- Marie Piraud
- Helmholtz AI, Germany
- Helmholtz Munich, Neuherberg, Germany
| | - Andrés Camero
- Helmholtz AI, Germany
- German Aerospace Center, Wessling, Germany
| | - Markus Götz
- Helmholtz AI, Germany
- Karlsruhe Insitute of Technology, Eggenstein-Leopoldshafen, Germany
| | | | - Peter Steinbach
- Helmholtz AI, Germany
- Helmholtz-Zentrum Dresden-Rossendorf, Dresden, Germany
| | - Tobias Weigel
- Helmholtz AI, Germany
- Hereon, Geesthacht, Germany
- German Climate Computing Center, Hamburg, Germany
| |
Collapse
|
10
|
Keukeleire P, Makrodimitris S, Reinders M. Cell type deconvolution of methylated cell-free DNA at the resolution of individual reads. NAR Genom Bioinform 2023; 5:lqad048. [PMID: 37274121 PMCID: PMC10236360 DOI: 10.1093/nargab/lqad048] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 02/28/2023] [Accepted: 05/19/2023] [Indexed: 06/06/2023] Open
Abstract
Cell-free DNA (cfDNA) are DNA fragments originating from dying cells that are detectable in bodily fluids, such as the plasma. Accelerated cell death, for example caused by disease, induces an elevated concentration of cfDNA. As a result, determining the cell type origins of cfDNA molecules can provide information about an individual's health. In this work, we aim to increase the sensitivity of methylation-based cell type deconvolution by adapting an existing method, CelFiE, which uses the methylation beta values of individual CpG sites to estimate cell type proportions. Our new method, CelFEER, instead differentiates cell types by the average methylation values within individual reads. We additionally improved the originally reported performance of CelFiE by using a new approach for finding marker regions that are differentially methylated between cell types. We show that CelFEER estimates cell type proportions with a higher correlation (r = 0.94 ± 0.04) than CelFiE (r = 0.86 ± 0.09) on simulated mixtures of cell types. Moreover, we show that the cell type proportion estimated by CelFEER can differentiate between ALS patients and healthy controls, between pregnant women in their first and third trimester, and between pregnant women with and without gestational diabetes.
Collapse
Affiliation(s)
| | | | - Marcel Reinders
- To whom correspondence should be addressed. Tel: +31 15 27 86424;
| |
Collapse
|
11
|
Zhang X, Hu Y, Vandenhoudt RE, Yan C, Marconi VC, Cohen MH, Justice AC, Aouizerat BE, Xu K. Cell-type specific EWAS identifies genes involved in HIV pathogenesis and oncogenesis among people with HIV infection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.21.533691. [PMID: 36993343 PMCID: PMC10055405 DOI: 10.1101/2023.03.21.533691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Epigenome-wide association studies (EWAS) of heterogenous blood cells have identified CpG sites associated with chronic HIV infection, which offer limited knowledge of cell-type specific methylation patterns associated with HIV infection. Applying a computational deconvolution method validated by capture bisulfite DNA methylation sequencing, we conducted a cell type-based EWAS and identified differentially methylated CpG sites specific for chronic HIV infection among five immune cell types in blood: CD4+ T-cells, CD8+ T-cells, B cells, Natural Killer (NK) cells, and monocytes in two independent cohorts (N total =1,134). Differentially methylated CpG sites for HIV-infection were highly concordant between the two cohorts. Cell-type level meta-EWAS revealed distinct patterns of HIV-associated differential CpG methylation, where 67% of CpG sites were unique to individual cell types (false discovery rate, FDR <0.05). CD4+ T-cells had the largest number of HIV-associated CpG sites (N=1,472) compared to any other cell type. Genes harboring statistically significant CpG sites are involved in immunity and HIV pathogenesis (e.g. CX3CR1 in CD4+ T-cells, CCR7 in B cells, IL12R in NK cells, LCK in monocytes). More importantly, HIV-associated CpG sites were overrepresented for hallmark genes involved in cancer pathology ( FDR <0.05) (e.g. BCL family, PRDM16, PDCD1LGD, ESR1, DNMT3A, NOTCH2 ). HIV-associated CpG sites were enriched among genes involved in HIV pathogenesis and oncogenesis such as Kras-signaling, interferon-α and -γ, TNF-α, inflammatory, and apoptotic pathways. Our findings are novel, uncovering cell-type specific modifications in the host epigenome for people with HIV that contribute to the growing body of evidence regarding pathogen-induced epigenetic oncogenicity, specifically on HIV and its comorbidity with cancers.
Collapse
|