Zhang W, Li Z, Wei N, Wu HJ, Zheng X. Detection of differentially methylated CpG sites between tumor samples with uneven tumor purities.
Bioinformatics 2020;
36:2017-2024. [PMID:
31769783 DOI:
10.1093/bioinformatics/btz885]
[Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 11/14/2019] [Accepted: 11/23/2019] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION
Inference of differentially methylated (DM) CpG sites between two groups of tumor samples with different geno- or pheno-types is a critical step to uncover the epigenetic mechanism of tumorigenesis, and identify biomarkers for cancer subtyping. However, as a major source of confounding factor, uneven distributions of tumor purity between two groups of tumor samples will lead to biased discovery of DM sites if not properly accounted for.
RESULTS
We here propose InfiniumDM, a generalized least square model to adjust tumor purity effect for differential methylation analysis. Our method is applicable to a variety of experimental designs including with or without normal controls, different sources of normal tissue contaminations. We compared our method with conventional methods including minfi, limma and limma corrected by tumor purity using simulated datasets. Our method shows significantly better performance at different levels of differential methylation thresholds, sample sizes, mean purity deviations and so on. We also applied the proposed method to breast cancer samples from TCGA database to further evaluate its performance. Overall, both simulation and real data analyses demonstrate favorable performance over existing methods serving similar purpose.
AVAILABILITY AND IMPLEMENTATION
InfiniumDM is a part of R package InfiniumPurify, which is freely available from GitHub (https://github.com/Xiaoqizheng/InfiniumPurify).
SUPPLEMENTARY INFORMATION
Supplementary data are available at Bioinformatics online.
Collapse