Zheng A, Lamkin M, Qiu Y, Ren K, Goren A, Gymrek M. A flexible ChIP-sequencing simulation toolkit.
BMC Bioinformatics 2021;
22:201. [PMID:
33879052 PMCID:
PMC8056602 DOI:
10.1186/s12859-021-04097-5]
[Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Accepted: 03/22/2021] [Indexed: 11/17/2022] Open
Abstract
Background
A major challenge in evaluating quantitative ChIP-seq analyses, such as peak calling and differential binding, is a lack of reliable ground truth data. Accurate simulation of ChIP-seq data can mitigate this challenge, but existing frameworks are either too cumbersome to apply genome-wide or unable to model a number of important experimental conditions in ChIP-seq.
Results
We present ChIPs, a toolkit for rapidly simulating ChIP-seq data using statistical models of key experimental steps. We demonstrate how ChIPs can be used for a range of applications, including benchmarking analysis tools and evaluating the impact of various experimental parameters. ChIPs is implemented as a standalone command-line program written in C++ and is available from https://github.com/gymreklab/chips.
Conclusions
ChIPs is an efficient ChIP-seq simulation framework that generates realistic datasets over a flexible range of experimental conditions. It can serve as an important component in various ChIP-seq analyses where ground truth data are needed.
Supplementary Information
The online version contains supplementary material available at 10.1186/s12859-021-04097-5.
Collapse