Reference Citation Analysis

For: Ho SFS, Wheeler NE, Millard AD, van Schaik W. Gauge your phage: benchmarking of bacteriophage identification tools in metagenomic sequencing data. Microbiome 2023;11:84. [PMID: 37085924 PMCID: PMC10120246 DOI: 10.1186/s40168-023-01533-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Received: 06/11/2022] [Accepted: 03/22/2023] [Indexed: 05/03/2023]

Cited by Other Article(s)

Rahimian M, Panahi B. Metagenome sequence data mining for viral interaction studies: Review on progress and prospects. Virus Res 2024;349:199450. [PMID: 39151562 DOI: 10.1016/j.virusres.2024.199450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/13/2024] [Revised: 08/11/2024] [Accepted: 08/13/2024] [Indexed: 08/19/2024]

Ridgway R, Lu H, Blower TR, Evans NJ, Ainsworth S. Genomic and taxonomic evaluation of 38 Treponema prophage sequences. BMC Genomics 2024;25:549. [PMID: 38824509 PMCID: PMC11144348 DOI: 10.1186/s12864-024-10461-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/01/2023] [Accepted: 05/28/2024] [Indexed: 06/03/2024]

Dantas CWD, Martins DT, Nogueira WG, Alegria OVC, Ramos RTJ. Tools and methodology to in silico phage discovery in freshwater environments. Front Microbiol 2024;15:1390726. [PMID: 38881659 PMCID: PMC11176557 DOI: 10.3389/fmicb.2024.1390726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/23/2024] [Accepted: 05/16/2024] [Indexed: 06/18/2024]

Wu LY, Wijesekara Y, Piedade GJ, Pappas N, Brussaard CPD, Dutilh BE. Benchmarking bioinformatic virus identification tools using real-world metagenomic data across biomes. Genome Biol 2024;25:97. [PMID: 38622738 PMCID: PMC11020464 DOI: 10.1186/s13059-024-03236-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/06/2023] [Accepted: 04/01/2024] [Indexed: 04/17/2024]

Hegarty B, Riddell V J, Bastien E, Langenfeld K, Lindback M, Saini JS, Wing A, Zhang J, Duhaime M. Benchmarking informatics approaches for virus discovery: caution is needed when combining in silico identification methods. mSystems 2024;9:e0110523. [PMID: 38376167 PMCID: PMC10949488 DOI: 10.1128/msystems.01105-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/26/2023] [Accepted: 01/24/2024] [Indexed: 02/21/2024]

Abstract

Understanding the ecological impacts of viruses on natural and engineered ecosystems relies on the accurate identification of viral sequences from community sequencing data. To maximize viral recovery from metagenomes, researchers frequently combine viral identification tools. However, the effectiveness of this strategy is unknown. Here, we benchmarked combinations of six widely used informatics tools for viral identification and analysis (VirSorter, VirSorter2, VIBRANT, DeepVirFinder, CheckV, and Kaiju), called "rulesets." Rulesets were tested against mock metagenomes composed of taxonomically diverse sequence types and diverse aquatic metagenomes to assess the effects of the degree of viral enrichment and habitat on tool performance. We found that six rulesets achieved equivalent accuracy [Matthews Correlation Coefficient (MCC) = 0.77, Padj ≥ 0.05]. Each contained VirSorter2, and five used our "tuning removal" rule designed to remove non-viral contamination. While DeepVirFinder, VIBRANT, and VirSorter were each found once in these high-accuracy rulesets, they were not found in combination with each other: combining tools does not lead to optimal performance. Our validation suggests that the MCC plateau at 0.77 is partly caused by inaccurate labeling within reference sequence databases. In aquatic metagenomes, our highest MCC ruleset identified more viral sequences in virus-enriched (44%-46%) than in cellular metagenomes (7%-19%). While improved algorithms may lead to more accurate viral identification tools, this should be done in tandem with careful curation of sequence databases. We recommend using the VirSorter2 ruleset and our empirically derived tuning removal rule. Our analysis provides insight into methods for in silico viral identification and will enable more robust viral identification from metagenomic data sets.

IMPORTANCE

The identification of viruses from environmental metagenomes using informatics tools has offered critical insights in microbial ecology. However, it remains difficult for researchers to know which tools optimize viral recovery for their specific study. In an attempt to recover more viruses, studies are increasingly combining the outputs from multiple tools without validating this approach. After benchmarking combinations of six viral identification tools against mock metagenomes and environmental samples, we found that these tools should only be combined cautiously. Two to four tool combinations maximized viral recovery and minimized non-viral contamination compared with either the single-tool or the five- to six-tool ones. By providing a rigorous overview of the behavior of in silico viral identification strategies and a pipeline to replicate our process, our findings guide the use of existing viral identification tools and offer a blueprint for feature engineering of new tools that will lead to higher-confidence viral discovery in microbiome studies.
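
The Matthews Correlation Coefficient (MCC) quoted in the abstract above summarizes a binary confusion matrix in a single value between -1 and 1. The following is a minimal sketch of the standard formula, not the authors' pipeline; the function name and the counts are hypothetical and chosen only for illustration.

```python
import math


def matthews_corrcoef(tp: int, fp: int, tn: int, fn: int) -> float:
    """Return the MCC for a binary confusion matrix.

    tp/fp/tn/fn are counts of true positives, false positives,
    true negatives and false negatives (here: viral vs. non-viral calls).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Common convention: MCC is defined as 0 when any marginal is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for illustration only (not taken from the study).
mcc = matthews_corrcoef(tp=80, fp=10, tn=90, fn=20)
print(f"MCC = {mcc:.3f}")
```

Unlike plain accuracy, MCC uses all four cells of the confusion matrix, which makes it less misleading when viral and non-viral sequences are present in very different proportions.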

Liu GY, Yu D, Fan MM, Zhang X, Jin ZY, Tang C, Liu XF. Antimicrobial resistance crisis: could artificial intelligence be the solution? Mil Med Res 2024;11:7. [PMID: 38254241 PMCID: PMC10804841 DOI: 10.1186/s40779-024-00510-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/18/2023] [Accepted: 01/08/2024] [Indexed: 01/24/2024]

Miao Y, Sun Z, Ma C, Lin C, Wang G, Yang C. VirGrapher: a graph-based viral identifier for long sequences from metagenomes. Brief Bioinform 2024;25:bbae036. [PMID: 38343326 PMCID: PMC10859693 DOI: 10.1093/bib/bbae036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/13/2023] [Revised: 01/15/2024] [Accepted: 01/18/2024] [Indexed: 02/15/2024]

Zhang H, Zhang H, Du H, Yu X, Xu Y. The insights into the phage communities of fermented foods in the age of viral metagenomics. Crit Rev Food Sci Nutr 2024:1-13. [PMID: 38214674 DOI: 10.1080/10408398.2023.2299323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/13/2024]

Owens LA, Friant S, Martorelli Di Genova B, Knoll LJ, Contreras M, Noya-Alarcon O, Dominguez-Bello MG, Goldberg TL. VESPA: an optimized protocol for accurate metabarcoding-based characterization of vertebrate eukaryotic endosymbiont and parasite assemblages. Nat Commun 2024;15:402. [PMID: 38195557 PMCID: PMC10776621 DOI: 10.1038/s41467-023-44521-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/04/2023] [Accepted: 12/15/2023] [Indexed: 01/11/2024]

Kerkvliet JJ, Bossers A, Kers JG, Meneses R, Willems R, Schürch AC. Metagenomic assembly is the main bottleneck in the identification of mobile genetic elements. PeerJ 2024;12:e16695. [PMID: 38188174 PMCID: PMC10771768 DOI: 10.7717/peerj.16695] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 09/04/2023] [Accepted: 11/28/2023] [Indexed: 01/09/2024]

Abstract

Antimicrobial resistance genes (ARG) are commonly found on acquired mobile genetic elements (MGEs) such as plasmids or transposons. Understanding the spread of resistance genes associated with mobile elements (mARGs) across different hosts and environments requires linking ARGs to the existing mobile reservoir within bacterial communities. However, reconstructing mARGs in metagenomic data from diverse ecosystems poses computational challenges, including genome fragment reconstruction (assembly), high-throughput annotation of MGEs, and identification of their association with ARGs. Recently, several bioinformatics tools have been developed to identify assembled fragments of plasmids, phages, and insertion sequence (IS) elements in metagenomic data. These methods can help in understanding the dissemination of mARGs. To streamline the process of identifying mARGs in multiple samples, we combined these tools in an automated high-throughput open-source pipeline, MetaMobilePicker, that identifies ARGs associated with plasmids, IS elements and phages, starting from short metagenomic sequencing reads. This pipeline was used to identify these three elements on a simplified simulated metagenome dataset, comprising whole genome sequences from seven clinically relevant bacterial species containing 55 ARGs, nine plasmids and five phages. The results demonstrated moderate precision for the identification of plasmids (0.57) and phages (0.71), and moderate sensitivity of identification of IS elements (0.58) and ARGs (0.70). In this study, we aim to assess the main causes of this moderate performance of the MGE prediction tools in a comprehensive manner. We conducted a systematic benchmark, considering metagenomic read coverage, contig length cutoffs and investigating the performance of the classification algorithms. Our analysis revealed that the metagenomic assembly process is the primary bottleneck when linking ARGs to identified MGEs in short-read metagenomics sequencing experiments rather than ARGs and MGEs identification by the different tools.
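
The precision and sensitivity figures in the abstract above follow the usual definitions: precision is the fraction of predicted elements that are real, and sensitivity (recall) is the fraction of real elements that were found. The snippet below is a rough sketch of those two ratios only; the helper names and the counts are hypothetical and are not taken from the MetaMobilePicker benchmark.

```python
def precision(tp: int, fp: int) -> float:
    """Fraction of predicted elements that are true elements."""
    predicted = tp + fp
    return tp / predicted if predicted else 0.0


def sensitivity(tp: int, fn: int) -> float:
    """Fraction of true elements that were recovered (also called recall)."""
    actual = tp + fn
    return tp / actual if actual else 0.0


# Hypothetical counts for illustration only: 12 correct predictions,
# 4 spurious predictions, and 8 true elements that were missed.
p = precision(tp=12, fp=4)
s = sensitivity(tp=12, fn=8)
print(f"precision = {p:.2f}, sensitivity = {s:.2f}")
```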

Roach MJ, Beecroft SJ, Mihindukulasuriya KA, Wang L, Paredes A, Cárdenas LAC, Henry-Cocks K, Lima LFO, Dinsdale EA, Edwards RA, Handley SA. Hecatomb: an integrated software platform for viral metagenomics. Gigascience 2024;13:giae020. [PMID: 38832467 PMCID: PMC11148595 DOI: 10.1093/gigascience/giae020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/17/2023] [Revised: 01/18/2024] [Accepted: 04/08/2024] [Indexed: 06/05/2024]

Affiliation(s)

Michael J Roach: Flinders Accelerator for Microbiome Exploration, Flinders University, Adelaide, SA, Australia; Adelaide Centre for Epigenetics, University of Adelaide, Adelaide, SA, 5005, Australia; South Australian Immunogenomics Cancer Institute, University of Adelaide, Adelaide, SA, 5005, Australia
Sarah J Beecroft: Harry Perkins Institute of Medical Research, Perth, WA, 6009, Australia
Kathie A Mihindukulasuriya: Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO, 63110, USA; The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA
Leran Wang: Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO, 63110, USA; The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA
Anne Paredes: Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO, 63110, USA
Luis Alberto Chica Cárdenas: Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO, 63110, USA; The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA
Kara Henry-Cocks: Flinders Accelerator for Microbiome Exploration, Flinders University, Adelaide, SA, Australia
Lais Farias Oliveira Lima: Biology Department, San Diego State University, San Diego, CA, 92182, USA
Elizabeth A Dinsdale: Flinders Accelerator for Microbiome Exploration, Flinders University, Adelaide, SA, Australia
Robert A Edwards: Flinders Accelerator for Microbiome Exploration, Flinders University, Adelaide, SA, Australia
Scott A Handley: Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO, 63110, USA; The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA

Young GR, Nelson A, Stewart CJ, Smith DL. Bacteriophage communities are a reservoir of unexplored microbial diversity in neonatal health and disease. Curr Opin Microbiol 2023;75:102379. [PMID: 37647765 DOI: 10.1016/j.mib.2023.102379] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/02/2023] [Revised: 07/30/2023] [Accepted: 08/02/2023] [Indexed: 09/01/2023]

Rangel-Pineros G, Almeida A, Beracochea M, Sakharova E, Marz M, Reyes Muñoz A, Hölzer M, Finn RD. VIRify: An integrated detection, annotation and taxonomic classification pipeline using virus-specific protein profile hidden Markov models. PLoS Comput Biol 2023;19:e1011422. [PMID: 37639475 PMCID: PMC10491390 DOI: 10.1371/journal.pcbi.1011422] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 10/24/2022] [Revised: 09/08/2023] [Accepted: 08/09/2023] [Indexed: 08/31/2023]