1
|
Connor R, Shakya M, Yarmosh DA, Maier W, Martin R, Bradford R, Brister JR, Chain PSG, Copeland CA, di Iulio J, Hu B, Ebert P, Gunti J, Jin Y, Katz KS, Kochergin A, LaRosa T, Li J, Li PE, Lo CC, Rashid S, Maiorova ES, Xiao C, Zalunin V, Purcell L, Pruitt KD. Recommendations for Uniform Variant Calling of SARS-CoV-2 Genome Sequence across Bioinformatic Workflows. Viruses 2024; 16:430. [PMID: 38543795 PMCID: PMC10975397 DOI: 10.3390/v16030430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 02/12/2024] [Accepted: 02/16/2024] [Indexed: 04/01/2024] Open
Abstract
Genomic sequencing of clinical samples to identify emerging variants of SARS-CoV-2 has been a key public health tool for curbing the spread of the virus. As a result, an unprecedented number of SARS-CoV-2 genomes were sequenced during the COVID-19 pandemic, which allowed for rapid identification of genetic variants, enabling the timely design and testing of therapies and deployment of new vaccine formulations to combat the new variants. However, despite the technological advances of deep sequencing, the analysis of the raw sequence data generated globally is neither standardized nor consistent, leading to vastly disparate sequences that may impact identification of variants. Here, we show that for both Illumina and Oxford Nanopore sequencing platforms, downstream bioinformatic protocols used by industry, government, and academic groups resulted in different virus sequences from same sample. These bioinformatic workflows produced consensus genomes with differences in single nucleotide polymorphisms, inclusion and exclusion of insertions, and/or deletions, despite using the same raw sequence as input datasets. Here, we compared and characterized such discrepancies and propose a specific suite of parameters and protocols that should be adopted across the field. Consistent results from bioinformatic workflows are fundamental to SARS-CoV-2 and future pathogen surveillance efforts, including pandemic preparation, to allow for a data-driven and timely public health response.
Collapse
Affiliation(s)
- Ryan Connor
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
| | - Migun Shakya
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
| | - David A. Yarmosh
- American Type Culture Collection, Manassas, VA 20110, USA; (D.A.Y.); (R.B.); (S.R.)
- BEI Resources, Manassas, VA 20110, USA
| | - Wolfgang Maier
- Galaxy Europe Team, University of Freiburg, 79085 Freiburg, Germany;
| | - Ross Martin
- Clinical Virology Department, Gilead Sciences, Foster City, CA 94404, USA; (R.M.); (J.L.); (E.S.M.)
| | - Rebecca Bradford
- American Type Culture Collection, Manassas, VA 20110, USA; (D.A.Y.); (R.B.); (S.R.)
- BEI Resources, Manassas, VA 20110, USA
| | - J. Rodney Brister
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
| | - Patrick S. G. Chain
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
| | | | - Julia di Iulio
- Vir Biotechnology Inc., San Francisco, CA 94158, USA; (J.d.I.); (L.P.)
| | - Bin Hu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
| | - Philip Ebert
- Eli Lilly and Company, Indianapolis, IN 46225, USA;
| | - Jonathan Gunti
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
| | - Yumi Jin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
| | - Kenneth S. Katz
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
| | - Andrey Kochergin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
| | - Tré LaRosa
- Deloitte Consulting LLP, Rosslyn, VA 22209, USA; (C.A.C.); (T.L.)
| | - Jiani Li
- Clinical Virology Department, Gilead Sciences, Foster City, CA 94404, USA; (R.M.); (J.L.); (E.S.M.)
| | - Po-E Li
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
| | - Chien-Chi Lo
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
| | - Sujatha Rashid
- American Type Culture Collection, Manassas, VA 20110, USA; (D.A.Y.); (R.B.); (S.R.)
| | - Evguenia S. Maiorova
- Clinical Virology Department, Gilead Sciences, Foster City, CA 94404, USA; (R.M.); (J.L.); (E.S.M.)
| | - Chunlin Xiao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
| | - Vadim Zalunin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
| | - Lisa Purcell
- Vir Biotechnology Inc., San Francisco, CA 94158, USA; (J.d.I.); (L.P.)
| | - Kim D. Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
| |
Collapse
|
2
|
Connor R, Yarmosh DA, Maier W, Shakya M, Martin R, Bradford R, Brister JR, Chain PS, Copeland CA, di Iulio J, Hu B, Ebert P, Gunti J, Jin Y, Katz KS, Kochergin A, LaRosa T, Li J, Li PE, Lo CC, Rashid S, Maiorova ES, Xiao C, Zalunin V, Pruitt KD. Towards increased accuracy and reproducibility in SARS-CoV-2 next generation sequence analysis for public health surveillance. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.11.03.515010. [PMID: 36380755 PMCID: PMC9645426 DOI: 10.1101/2022.11.03.515010] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
During the COVID-19 pandemic, SARS-CoV-2 surveillance efforts integrated genome sequencing of clinical samples to identify emergent viral variants and to support rapid experimental examination of genome-informed vaccine and therapeutic designs. Given the broad range of methods applied to generate new viral genomes, it is critical that consensus and variant calling tools yield consistent results across disparate pipelines. Here we examine the impact of sequencing technologies (Illumina and Oxford Nanopore) and 7 different downstream bioinformatic protocols on SARS-CoV-2 variant calling as part of the NIH Accelerating COVID-19 Therapeutic Interventions and Vaccines (ACTIV) Tracking Resistance and Coronavirus Evolution (TRACE) initiative, a public-private partnership established to address the COVID-19 outbreak. Our results indicate that bioinformatic workflows can yield consensus genomes with different single nucleotide polymorphisms, insertions, and/or deletions even when using the same raw sequence input datasets. We introduce the use of a specific suite of parameters and protocols that greatly improves the agreement among pipelines developed by diverse organizations. Such consistency among bioinformatic pipelines is fundamental to SARS-CoV-2 and future pathogen surveillance efforts. The application of analysis standards is necessary to more accurately document phylogenomic trends and support data-driven public health responses.
Collapse
Affiliation(s)
- Ryan Connor
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - David A Yarmosh
- American Type Culture Collection, 10807 University Blvd, Manassas, VA 20110, USA
- BEI Resources
| | - Wolfgang Maier
- Galaxy Europe Team, University of Freiburg, Freiburg, Germany
| | - Migun Shakya
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
| | - Ross Martin
- Clinical Virology Department, Gilead Sciences, 333 Lakeside Dr, Foster City, CA 94404, USA
| | - Rebecca Bradford
- American Type Culture Collection, 10807 University Blvd, Manassas, VA 20110, USA
- BEI Resources
| | - J Rodney Brister
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Patrick Sg Chain
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
| | - Courtney A Copeland
- Deloitte Consulting LLP, 1919 North Lynn St, Suite 1500, Rosslyn, VA 22209 USA
| | | | - Bin Hu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
| | | | - Jonathan Gunti
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Yumi Jin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Kenneth S Katz
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Andrey Kochergin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Tré LaRosa
- Deloitte Consulting LLP, 1919 North Lynn St, Suite 1500, Rosslyn, VA 22209 USA
| | - Jiani Li
- Clinical Virology Department, Gilead Sciences, 333 Lakeside Dr, Foster City, CA 94404, USA
| | - Po-E Li
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
| | - Chien-Chi Lo
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
| | - Sujatha Rashid
- American Type Culture Collection, 10807 University Blvd, Manassas, VA 20110, USA
| | - Evguenia S Maiorova
- Clinical Virology Department, Gilead Sciences, 333 Lakeside Dr, Foster City, CA 94404, USA
| | - Chunlin Xiao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Vadim Zalunin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
3
|
Communicating regulatory high-throughput sequencing data using BioCompute Objects. Drug Discov Today 2022; 27:1108-1114. [DOI: 10.1016/j.drudis.2022.01.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 08/20/2021] [Accepted: 01/18/2022] [Indexed: 11/23/2022]
|
4
|
Komatsu TE, Hodowanec AC, Colberg-Poley AM, Pikis A, Singer ME, O'Rear JJ, Donaldson EF. In-depth genomic analyses identified novel letermovir resistance-associated substitutions in the cytomegalovirus UL56 and UL89 gene products. Antiviral Res 2019; 169:104549. [DOI: 10.1016/j.antiviral.2019.104549] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Revised: 06/27/2019] [Accepted: 07/01/2019] [Indexed: 01/08/2023]
|
5
|
Yuen MF, Gane EJ, Kim DJ, Weilert F, Yuen Chan HL, Lalezari J, Hwang SG, Nguyen T, Flores O, Hartman G, Liaw S, Lenz O, Kakuda TN, Talloen W, Schwabe C, Klumpp K, Brown N. Antiviral Activity, Safety, and Pharmacokinetics of Capsid Assembly Modulator NVR 3-778 in Patients with Chronic HBV Infection. Gastroenterology 2019; 156:1392-1403.e7. [PMID: 30625297 DOI: 10.1053/j.gastro.2018.12.023] [Citation(s) in RCA: 101] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Revised: 12/11/2018] [Accepted: 12/27/2018] [Indexed: 02/07/2023]
Abstract
BACKGROUND & AIMS NVR 3-778 is a first-in-class hepatitis B virus (HBV) capsid assembly modulator that can inhibit HBV replication. We performed a proof-of-concept study to examine the safety, pharmacokinetics, and antiviral activity of NVR 3-778 in patients with chronic HBV infection. METHODS We performed a phase 1 study in 73 hepatitis B envelope antigen (HBeAg)-positive patients with chronic HBV infection without cirrhosis. In a 2-part study (part 1 in New Zealand and part 2 in Hong Kong, Singapore, Taiwan, Korea, and the United States), patients were randomly assigned to groups that were given oral NVR 3-778 (100 mg, 200 mg, or 400 mg daily or 600 mg or 1000 mg twice daily) or placebo for 4 weeks. Additional groups received combination treatment with pegylated interferon (pegIFN) and NVR 3-778 (600 mg twice daily) or pegIFN with placebo. RESULTS Reductions in serum levels of HBV DNA and HBV RNA were observed in patients receiving ≥1200 mg/d NVR 3-778. The largest mean reduction in HBV DNA was observed in the group given NVR 3-778 plus pegIFN (1.97 log10 IU/mL), compared with the groups given NVR 3-778 or pegIFN alone (1.43 log10 IU/mL and 1.06 log10 IU/mL, respectively). The mean reduction in HBV RNA was also greatest in the group given NVR 3-778 plus pegIFN (2.09 log10 copies/mL), compared with the groups given NVR 3-778 or pegIFN alone (1.42 log10 copies/mL and 0.89 log10 copies/mL, respectively). There was no significant mean reduction in HBsAg during the 4-week treatment period. There were no discontinuations and no pattern of dose-related adverse effects with NVR 3-778. CONCLUSIONS In a phase 1 study of HBeAg-positive patients with chronic HBV infection without cirrhosis, NVR 3-778 was well tolerated and demonstrated antiviral activity. The agent reduced serum levels of HBV DNA and HBV RNA, to the greatest extent in combination with pegIFN. The observed reductions in HBV RNA confirmed the novel mechanism of NVR 3-778. Clinicaltrials.gov no. NCT02112799 (single-center) and NCT02401737 (multicenter).
Collapse
Affiliation(s)
- Man Fung Yuen
- Department of Medicine, Queen Mary Hospital, University of Hong Kong, Hong Kong.
| | | | - Dong Joon Kim
- Department of Internal Medicine, Hallym University, Chuncheon Sacred Heart Hospital, Gangwon-do, Republic of Korea
| | | | - Henry Lik Yuen Chan
- Department of Medicine and Therapeutics, Prince of Wales Hospital, The Chinese University of Hong Kong, Hong Kong
| | | | - Seong Gyu Hwang
- Department of Internal Medicine, CHA Bundang Medical Center, Gyeonggi-do, Republic of Korea
| | - Tuan Nguyen
- Research and Education, Inc., San Diego, California
| | | | - George Hartman
- Chemistry, Novira Therapeutics, Doylestown, Pennsylvania
| | - Sandy Liaw
- Clinical Operations, Novira Therapeutics, Doylestown, Pennsylvania
| | | | - Thomas N Kakuda
- Clinical Pharmacology, Janssen Biopharma, South San Francisco, California
| | | | | | - Klaus Klumpp
- Discovery Research, Novira Therapeutics, Doylestown, Pennsylvania
| | | |
Collapse
|
6
|
Harrington PR, Komatsu TE, Deming DJ, Donaldson EF, O'Rear JJ, Naeger LK. Impact of hepatitis C virus polymorphisms on direct-acting antiviral treatment efficacy: Regulatory analyses and perspectives. Hepatology 2018; 67:2430-2448. [PMID: 29194682 DOI: 10.1002/hep.29693] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Revised: 11/20/2017] [Accepted: 11/27/2017] [Indexed: 12/21/2022]
Abstract
UNLABELLED Several highly effective, interferon-free, direct-acting antiviral (DAA)-based regimens are available for the treatment of chronic hepatitis C virus (HCV) infection. Despite impressive efficacy overall, a small proportion of patients in registrational trials experienced treatment failure, which in some cases was associated with the detection of HCV resistance-associated substitutions (RASs) at baseline. In this article, we describe methods and key findings from independent regulatory analyses investigating the impact of baseline nonstructural (NS) 3 Q80K and NS5A RASs on the efficacy of current United States Food and Drug Administration (FDA)-approved regimens for patients with HCV genotype (GT) 1 or GT3 infection. These analyses focused on clinical trials that included patients who were previously naïve to the DAA class(es) in their investigational regimen and characterized the impact of baseline RASs that were enriched in the viral population as natural or transmitted polymorphisms (i.e., not drug-selected RASs). We used a consistent approach to optimize comparability of results across different DAA regimens and patient populations, including the use of a 15% sensitivity cutoff for next-generation sequencing results and standardized lists of NS5A RASs. These analyses confirmed that detection of NS3 Q80K or NS5A baseline RASs was associated with reduced treatment efficacy for multiple DAA regimens, but their impact was often minimized with the use of an intensified treatment regimen, such as a longer treatment duration and/or addition of ribavirin. We discuss the drug resistance-related considerations that contributed to pretreatment resistance testing and treatment recommendations in drug labeling for FDA-approved DAA regimens. CONCLUSION Independent regulatory analyses confirmed that baseline HCV RASs can reduce the efficacy of certain DAA-based regimens in selected patient groups. However, highly effective treatment options are available for patients with or without baseline RASs. (Hepatology 2018;67:2430-2448).
Collapse
Affiliation(s)
- Patrick R Harrington
- Division of Antiviral Products, Office of Antimicrobial Products, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD
| | - Takashi E Komatsu
- Division of Antiviral Products, Office of Antimicrobial Products, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD
| | - Damon J Deming
- Division of Antiviral Products, Office of Antimicrobial Products, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD
| | - Eric F Donaldson
- Division of Antiviral Products, Office of Antimicrobial Products, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD
| | - Julian J O'Rear
- Division of Antiviral Products, Office of Antimicrobial Products, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD
| | - Lisa K Naeger
- Division of Antiviral Products, Office of Antimicrobial Products, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD
| |
Collapse
|
7
|
MinVar: A rapid and versatile tool for HIV-1 drug resistance genotyping by deep sequencing. J Virol Methods 2016; 240:7-13. [PMID: 27867045 DOI: 10.1016/j.jviromet.2016.11.008] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Revised: 10/17/2016] [Accepted: 11/11/2016] [Indexed: 02/08/2023]
Abstract
Genotypic monitoring of drug-resistance mutations (DRMs) in HIV-1 infected individuals is strongly recommended to guide selection of the initial antiretroviral therapy (ART) and changes of drug regimens. Traditionally, mutations conferring drug resistance are detected by population sequencing of the reverse transcribed viral RNA encoding the HIV-1 enzymes target by ART, followed by manual analysis and interpretation of Sanger sequencing traces. This process is labor intensive, relies on subjective interpretation from the operator, and offers limited sensitivity as only mutations above 20% frequency can be reliably detected. Here we present MinVar, a pipeline for the analysis of deep sequencing data, which allows reliable and automated detection of DRMs down to 5%. We evaluated MinVar with data from amplicon sequencing of defined mixtures of molecular virus clones with known DRM and plasma samples of viremic HIV-1 infected individuals and we compared it to VirVarSeq, another virus variant detection tool exclusively working on Illumina deep sequencing data. MinVar was designed to be compatible with a diverse range of sequencing platforms and allows the detection of DRMs and insertions/deletions from deep sequencing data without the need to perform additional bioinformatics analysis, a prerequisite to a widespread implementation of HIV-1 genotyping using deep sequencing in routine diagnostic settings.
Collapse
|