1
|
O'Connell KA, Yosufzai ZB, Campbell RA, Lobb CJ, Engelken HT, Gorrell LM, Carlson TB, Catana JJ, Mikdadi D, Bonazzi VR, Klenk JA. Accelerating genomic workflows using NVIDIA Parabricks. BMC Bioinformatics 2023; 24:221. [PMID: 37259021 PMCID: PMC10230726 DOI: 10.1186/s12859-023-05292-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 04/15/2023] [Indexed: 06/02/2023] Open
Abstract
BACKGROUND As genome sequencing becomes better integrated into scientific research, government policy, and personalized medicine, the primary challenge for researchers is shifting from generating raw data to analyzing these vast datasets. Although much work has been done to reduce compute times using various configurations of traditional CPU computing infrastructures, Graphics Processing Units (GPUs) offer opportunities to accelerate genomic workflows by orders of magnitude. Here we benchmark one GPU-accelerated software suite called NVIDIA Parabricks on Amazon Web Services (AWS), Google Cloud Platform (GCP), and an NVIDIA DGX cluster. We benchmarked six variant calling pipelines, including two germline callers (HaplotypeCaller and DeepVariant) and four somatic callers (Mutect2, Muse, LoFreq, SomaticSniper). RESULTS We achieved up to 65 × acceleration with germline variant callers, bringing HaplotypeCaller runtimes down from 36 h to 33 min on AWS, 35 min on GCP, and 24 min on the NVIDIA DGX. Somatic callers exhibited more variation between the number of GPUs and computing platforms. On cloud platforms, GPU-accelerated germline callers resulted in cost savings compared with CPU runs, whereas some somatic callers were more expensive than CPU runs because their GPU acceleration was not sufficient to overcome the increased GPU cost. CONCLUSIONS Germline variant callers scaled well with the number of GPUs across platforms, whereas somatic variant callers exhibited more variation in the number of GPUs with the fastest runtimes, suggesting that, at least with the version of Parabricks used here, these workflows are less GPU optimized and require benchmarking on the platform of choice before being deployed at production scales. Our study demonstrates that GPUs can be used to greatly accelerate genomic workflows, thus bringing closer to grasp urgent societal advances in the areas of biosurveillance and personalized medicine.
Collapse
Affiliation(s)
- Kyle A O'Connell
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | | | - Ross A Campbell
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Collin J Lobb
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Haley T Engelken
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Laura M Gorrell
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Thad B Carlson
- Cloud Managed Services, Deloitte Consulting LLP, Detroit, MI, 48226, USA
| | - Josh J Catana
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Dina Mikdadi
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Vivien R Bonazzi
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA.
| | - Juergen A Klenk
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA.
| |
Collapse
|
2
|
Nwadiugwu M. RNA-seq analysis of phagocytic cells from murine epididymal white adipose tissue shows immunosenescence and age-related phosphorus metabolism. Hum Cell 2022; 35:572-582. [PMID: 35032296 DOI: 10.1007/s13577-021-00663-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 12/15/2021] [Indexed: 12/01/2022]
Abstract
The underlying state of alterations in adipose tissue is hypothesized to be as a result of age-related changes. Young and aged mice have been documented to show distinct gene expression and distinct macrophage-specific adipose tissue regulation. However, more biological understanding is required to know the processes associated with these conditions in relation to the aging process. Transcriptional profiling with RNA-seq analysis was used to determine differentially expressed genes in young (2 months old) and aged (20 months old) mice macrophage-enriched phagocytic stromal vascular fractions of pooled epididymal white adipose tissue using data obtained from gene expression omnibus. Results showed distinct differentially expressed genes in young and aged mice with a p value cutoff of 0.05 and dissimilarities in the young and aged epididymal white adipose tissue phagocytic cells. Functional enrichment showed activation of cytokine-cytokine receptor interaction pathways, phosphorus metabolic processes and inflammatory pathways such as IL-17, TNF, NF-kappa B, and TGF-β, while AMPK, PPAR and oxidative phosphorylation were suppressed. The analysis showed that aging is linked with phagocytic cell decline, accumulated cellular damages, inflammation, immunosenescence and increased phosphorus metabolism. Interventions that reduce phosphate-containing compound could improve phosphorus metabolism in old age to prolong lifespan and better health.
Collapse
Affiliation(s)
- Martin Nwadiugwu
- Leonard Davis School of Gerontology, University of Southern California, 3715 McClintock Ave, Los Angeles, CA, 90089, USA.
| |
Collapse
|