1
|
Li X, Muñoz JF, Gade L, Argimon S, Bougnoux ME, Bowers JR, Chow NA, Cuesta I, Farrer RA, Maufrais C, Monroy-Nieto J, Pradhan D, Uehling J, Vu D, Yeats CA, Aanensen DM, d’Enfert C, Engelthaler DM, Eyre DW, Fisher MC, Hagen F, Meyer W, Singh G, Alastruey-Izquierdo A, Litvintseva AP, Cuomo CA. Comparing genomic variant identification protocols for Candida auris. Microb Genom 2023; 9:mgen000979. [PMID: 37043380 PMCID: PMC10210944 DOI: 10.1099/mgen.0.000979] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 02/09/2023] [Indexed: 04/13/2023] Open
Abstract
Genomic analyses are widely applied to epidemiological, population genetic and experimental studies of pathogenic fungi. A wide range of methods are employed to carry out these analyses, typically without including controls that gauge the accuracy of variant prediction. The importance of tracking outbreaks at a global scale has raised the urgency of establishing high-accuracy pipelines that generate consistent results between research groups. To evaluate currently employed methods for whole-genome variant detection and elaborate best practices for fungal pathogens, we compared how 14 independent variant calling pipelines performed across 35 Candida auris isolates from 4 distinct clades and evaluated the performance of variant calling, single-nucleotide polymorphism (SNP) counts and phylogenetic inference results. Although these pipelines used different variant callers and filtering criteria, we found high overall agreement of SNPs from each pipeline. This concordance correlated with site quality, as SNPs discovered by a few pipelines tended to show lower mapping quality scores and depth of coverage than those recovered by all pipelines. We observed that the major differences between pipelines were due to variation in read trimming strategies, SNP calling methods and parameters, and downstream filtration criteria. We calculated specificity and sensitivity for each pipeline by aligning three isolates with chromosomal level assemblies and found that the GATK-based pipelines were well balanced between these metrics. Selection of trimming methods had a greater impact on SAMtools-based pipelines than those using GATK. Phylogenetic trees inferred by each pipeline showed high consistency at the clade level, but there was more variability between isolates from a single outbreak, with pipelines that used more stringent cutoffs having lower resolution. This project generated two truth datasets useful for routine benchmarking of C. auris variant calling, a consensus VCF of genotypes discovered by 10 or more pipelines across these 35 diverse isolates and variants for 2 samples identified from whole-genome alignments. This study provides a foundation for evaluating SNP calling pipelines and developing best practices for future fungal genomic studies.
Collapse
Affiliation(s)
- Xiao Li
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - José F. Muñoz
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Lalitha Gade
- Mycotic Diseases Branch, Centers for Disease Control and Prevention, US Department of Health and Human Services, Atlanta, GA, 30329, USA
| | - Silvia Argimon
- Centre for Genomic Pathogen Surveillance, Big Data Institute, University of Oxford, Oxford, UK
| | - Marie-Elisabeth Bougnoux
- Institut Pasteur, Université Paris Cité, INRAE, USC2019, Unité Biologie et Pathogénicité Fongiques, Paris, France
- Université Paris Cité, Hôpital Necker-Enfants-Malades, Unité de Parasitologie-Mycologie, Assistance Publique des Hôpitaux de Paris, Paris, France
| | - Jolene R. Bowers
- Translational Genomics Research Institute, Pathogen and Microbiome Division, Flagstaff, AZ 86005, USA
| | - Nancy A. Chow
- Mycotic Diseases Branch, Centers for Disease Control and Prevention, US Department of Health and Human Services, Atlanta, GA, 30329, USA
| | - Isabel Cuesta
- Mycology Reference Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III, Madrid, Spain
| | - Rhys A. Farrer
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Medical Research Council Centre for Medical Mycology, University of Exeter, Exeter, EX4 4PY, UK
| | - Corinne Maufrais
- Institut Pasteur, Université Paris Cité, INRAE, USC2019, Unité Biologie et Pathogénicité Fongiques, Paris, France
- Institut Pasteur, Université Paris Cité, CNRS USR 3756, Hub de Bioinformatique et Biostatistique, Paris, France
| | - Juan Monroy-Nieto
- Translational Genomics Research Institute, Pathogen and Microbiome Division, Flagstaff, AZ 86005, USA
| | - Dibyabhaba Pradhan
- All India Institute of Medical Sciences, Ansari Nagar, New Delhi, 110029, India
| | - Jessie Uehling
- Botany and Plant Pathology, Oregon State University, Corvallis, OR 97330, USA
| | - Duong Vu
- Westerdijk Fungal Biodiversity Institute, Uppsalalaan 8, 3584CT, Utrecht, Netherlands
| | - Corin A. Yeats
- Centre for Genomic Pathogen Surveillance, Big Data Institute, University of Oxford, Oxford, UK
| | - David M. Aanensen
- Centre for Genomic Pathogen Surveillance, Big Data Institute, University of Oxford, Oxford, UK
| | - Christophe d’Enfert
- Institut Pasteur, Université Paris Cité, INRAE, USC2019, Unité Biologie et Pathogénicité Fongiques, Paris, France
| | - David M. Engelthaler
- Translational Genomics Research Institute, Pathogen and Microbiome Division, Flagstaff, AZ 86005, USA
| | - David W. Eyre
- NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Matthew C. Fisher
- MRC Centre for Global Infectious Disease Analysis, Imperial College London, London, UK
| | - Ferry Hagen
- Westerdijk Fungal Biodiversity Institute, Uppsalalaan 8, 3584CT, Utrecht, Netherlands
- Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Amsterdam, Netherlands
- Department of Medical Microbiology, University Medical Center Utrecht, Utrecht, Netherlands
| | - Wieland Meyer
- Sydney Medical School, University of Sydney, Sydney, NSW 2050, Australia
| | - Gagandeep Singh
- All India Institute of Medical Sciences, Ansari Nagar, New Delhi, 110029, India
| | - Ana Alastruey-Izquierdo
- Mycology Reference Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III, Madrid, Spain
| | - Anastasia P. Litvintseva
- Mycotic Diseases Branch, Centers for Disease Control and Prevention, US Department of Health and Human Services, Atlanta, GA, 30329, USA
| | | |
Collapse
|