Balázs Z, Gitchev T, Ivanković I, Krauthammer M. Fragmentstein-facilitating data reuse for cell-free DNA fragment analysis.
Bioinformatics 2024;
40:btae017. [PMID:
38224549 PMCID:
PMC10805340 DOI:
10.1093/bioinformatics/btae017]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 12/09/2023] [Accepted: 01/09/2024] [Indexed: 01/17/2024] Open
Abstract
SUMMARY
Method development for the analysis of cell-free DNA (cfDNA) sequencing data is impeded by limited data sharing due to the strict control of sensitive genomic data. An existing solution for facilitating data sharing removes nucleotide-level information from raw cfDNA sequencing data, keeping alignment coordinates only. This simplified format can be publicly shared and would, theoretically, suffice for common functional analyses of cfDNA data. However, current bioinformatics software requires nucleotide-level information and cannot process the simplified format. We present Fragmentstein, a command-line tool for converting non-sensitive cfDNA-fragmentation data into alignment mapping (BAM) files. Fragmentstein complements fragment coordinates with sequence information from a reference genome to reconstruct BAM files. We demonstrate the utility of Fragmentstein by showing the feasibility of copy number variant (CNV), nucleosome occupancy, and fragment length analyses from non-sensitive fragmentation data.
AVAILABILITY AND IMPLEMENTATION
Implemented in bash, Fragmentstein is available at https://github.com/uzh-dqbm-cmi/fragmentstein, licensed under GNU GPLv3.
Collapse