1
|
Tembrock LR, Zink FA, Gilligan TM. Viral Prevalence and Genomic Xenology in the Coevolution of HzNV-2 (Nudiviridae) with Host Helicoverpa zea (Lepidoptera: Noctuidae). INSECTS 2023; 14:797. [PMID: 37887809 PMCID: PMC10607169 DOI: 10.3390/insects14100797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 09/16/2023] [Accepted: 09/24/2023] [Indexed: 10/28/2023]
Abstract
Insect viruses have been described from numerous lineages, yet patterns of genetic exchange and viral prevalence, which are essential to understanding host-virus coevolution, are rarely studied. In Helicoverpa zea, the virus HzNV-2 can cause deformity of male and female genitalia, resulting in sterility. Using ddPCR, we found that male H. zea with malformed genitalia (agonadal) contained high levels of HzNV-2 DNA, confirming previous work. HzNV-2 was found to be prevalent throughout the United States, at more than twice the rate of the baculovirus HaSNPV, and that it contained several host-acquired DNA sequences. HzNV-2 possesses four recently endogenized lepidopteran genes and several more distantly related genes, including one gene with a bacteria-like sequence found in both host and virus. Among the recently acquired genes is cytosolic serine hydroxymethyltransferase (cSHMT). In nearly all tested H. zea, cSHMT contained a 200 bp transposable element (TE) that was not found in cSHMT of the sister species H. armigera. No other virus has been found with host cSHMT, and the study of this shared copy, including possible interactions, may yield new insights into the function of this gene with possible applications to insect biological control, and gene editing.
Collapse
Affiliation(s)
- Luke R. Tembrock
- Department of Agricultural Biology, Colorado State University, Fort Collins, CO 80523, USA
| | - Frida A. Zink
- Department of Agricultural Biology, Colorado State University, Fort Collins, CO 80523, USA
| | - Todd M. Gilligan
- USDA-APHIS-PPQ-Science & Technology, Identification Technology Program, Fort Collins, CO 80526, USA
| |
Collapse
|
2
|
Raju RS, Al Nahid A, Chondrow Dev P, Islam R. VirusTaxo: Taxonomic classification of viruses from the genome sequence using k-mer enrichment. Genomics 2022; 114:110414. [PMID: 35718090 DOI: 10.1016/j.ygeno.2022.110414] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Revised: 04/27/2022] [Accepted: 06/13/2022] [Indexed: 11/04/2022]
Abstract
Classification of viruses into their taxonomic ranks (e.g., order, family, and genus) provides a framework to organize an abundant population of viruses. Next-generation metagenomic sequencing technologies lead to a rapid increase in generating sequencing data of viruses which require bioinformatics tools to analyze the taxonomy. Many metagenomic taxonomy classifiers have been developed to study microbiomes, but it is particularly challenging to assign the taxonomy of diverse virus sequences and there is a growing need for dedicated methods to be developed that are optimized to classify virus sequences into their taxa. For taxonomic classification of viruses from metagenomic sequences, we developed VirusTaxo using diverse (e.g., 402 DNA and 280 RNA) genera of viruses. VirusTaxo has an average accuracy of 93% at genus level prediction in DNA and RNA viruses. VirusTaxo outperformed existing taxonomic classifiers of viruses where it assigned taxonomy of a larger fraction of metagenomic contigs compared to other methods. Benchmarking of VirusTaxo on a collection of SARS-CoV-2 sequencing libraries and metavirome datasets suggests that VirusTaxo can characterize virus taxonomy from highly diverse contigs and provide a reliable decision on the taxonomy of viruses.
Collapse
Affiliation(s)
- Rajan Saha Raju
- Department of Computer Science and Engineering, Shahjalal University of Science and Technology, Sylhet 3114, Bangladesh
| | - Abdullah Al Nahid
- Department of Biochemistry and Molecular Biology, School of Life Sciences, Shahjalal University of Science and Technology, Sylhet 3114, Bangladesh
| | - Preonath Chondrow Dev
- Department of Biochemistry and Molecular Biology, School of Life Sciences, Shahjalal University of Science and Technology, Sylhet 3114, Bangladesh
| | - Rashedul Islam
- Omics Lab, Dhaka, Bangladesh; Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC V5Z 4S6, Canada.
| |
Collapse
|
3
|
Andrade-Martínez JS, Camelo Valera LC, Chica Cárdenas LA, Forero-Junco L, López-Leal G, Moreno-Gallego JL, Rangel-Pineros G, Reyes A. Computational Tools for the Analysis of Uncultivated Phage Genomes. Microbiol Mol Biol Rev 2022; 86:e0000421. [PMID: 35311574 PMCID: PMC9199400 DOI: 10.1128/mmbr.00004-21] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Over a century of bacteriophage research has uncovered a plethora of fundamental aspects of their biology, ecology, and evolution. Furthermore, the introduction of community-level studies through metagenomics has revealed unprecedented insights on the impact that phages have on a range of ecological and physiological processes. It was not until the introduction of viral metagenomics that we began to grasp the astonishing breadth of genetic diversity encompassed by phage genomes. Novel phage genomes have been reported from a diverse range of biomes at an increasing rate, which has prompted the development of computational tools that support the multilevel characterization of these novel phages based solely on their genome sequences. The impact of these technologies has been so large that, together with MAGs (Metagenomic Assembled Genomes), we now have UViGs (Uncultivated Viral Genomes), which are now officially recognized by the International Committee for the Taxonomy of Viruses (ICTV), and new taxonomic groups can now be created based exclusively on genomic sequence information. Even though the available tools have immensely contributed to our knowledge of phage diversity and ecology, the ongoing surge in software programs makes it challenging to keep up with them and the purpose each one is designed for. Therefore, in this review, we describe a comprehensive set of currently available computational tools designed for the characterization of phage genome sequences, focusing on five specific analyses: (i) assembly and identification of phage and prophage sequences, (ii) phage genome annotation, (iii) phage taxonomic classification, (iv) phage-host interaction analysis, and (v) phage microdiversity.
Collapse
Affiliation(s)
- Juan Sebastián Andrade-Martínez
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - Laura Carolina Camelo Valera
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - Luis Alberto Chica Cárdenas
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - Laura Forero-Junco
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
- Department of Plant and Environmental Science, University of Copenhagen, Frederiksberg, Denmark
| | - Gamaliel López-Leal
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - J. Leonardo Moreno-Gallego
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Guillermo Rangel-Pineros
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
- The GLOBE Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Alejandro Reyes
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri, USA
| |
Collapse
|
4
|
Abstract
Viruses are the most abundant biological entity on Earth, infect cellular organisms from all domains of life, and are central players in the global biosphere. Over the last century, the discovery and characterization of viruses have progressed steadily alongside much of modern biology. In terms of outright numbers of novel viruses discovered, however, the last few years have been by far the most transformative for the field. Advances in methods for identifying viral sequences in genomic and metagenomic datasets, coupled to the exponential growth of environmental sequencing, have greatly expanded the catalog of known viruses and fueled the tremendous growth of viral sequence databases. Development and implementation of new standards, along with careful study of the newly discovered viruses, have transformed and will continue to transform our understanding of microbial evolution, ecology, and biogeochemical cycles, leading to new biotechnological innovations across many diverse fields, including environmental, agricultural, and biomedical sciences.
Collapse
Affiliation(s)
- Lee Call
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; ,
| | - Stephen Nayfach
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; ,
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; ,
| |
Collapse
|
5
|
Pons JC, Paez-Espino D, Riera G, Ivanova N, Kyrpides NC, Llabrés M. VPF-Class: Taxonomic assignment and host prediction of uncultivated viruses based on viral protein families. Bioinformatics 2021; 37:1805-1813. [PMID: 33471063 PMCID: PMC8830756 DOI: 10.1093/bioinformatics/btab026] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 12/11/2020] [Accepted: 01/13/2021] [Indexed: 12/03/2022] Open
Abstract
Motivation Two key steps in the analysis of uncultured viruses recovered from metagenomes are the taxonomic classification of the viral sequences and the identification of putative host(s). Both steps rely mainly on the assignment of viral proteins to orthologs in cultivated viruses. Viral Protein Families (VPFs) can be used for the robust identification of new viral sequences in large metagenomics datasets. Despite the importance of VPF information for viral discovery, VPFs have not yet been explored for determining viral taxonomy and host targets. Results In this work, we classified the set of VPFs from the IMG/VR database and developed VPF-Class. VPF-Class is a tool that automates the taxonomic classification and host prediction of viral contigs based on the assignment of their proteins to a set of classified VPFs. Applying VPF-Class on 731K uncultivated virus contigs from the IMG/VR database, we were able to classify 363K contigs at the genus level and predict the host of over 461K contigs. In the RefSeq database, VPF-class reported an accuracy of nearly 100% to classify dsDNA, ssDNA and retroviruses, at the genus level, considering a membership ratio and a confidence score of 0.2. The accuracy in host prediction was 86.4%, also at the genus level, considering a membership ratio of 0.3 and a confidence score of 0.5. And, in the prophages dataset, the accuracy in host prediction was 86% considering a membership ratio of 0.6 and a confidence score of 0.8. Moreover, from the Global Ocean Virome dataset, over 817K viral contigs out of 1 million were classified. Availability and implementation The implementation of VPF-Class can be downloaded from https://github.com/biocom-uib/vpf-tools. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Joan Carles Pons
- Department of Mathematics and Computer Science, University of the Balearic Islands, Palma, 07122, Spain
| | | | - Gabriel Riera
- Department of Mathematics and Computer Science, University of the Balearic Islands, Palma, 07122, Spain
| | - Natalia Ivanova
- Department of Energy Joint Genome Institute, Berkeley, 94720, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Berkeley, 94720, USA
| | - Mercè Llabrés
- Department of Mathematics and Computer Science, University of the Balearic Islands, Palma, 07122, Spain
| |
Collapse
|
6
|
Gibbs A. Binomial nomenclature for virus species: a long view. Arch Virol 2020; 165:3079-3083. [PMID: 33025196 DOI: 10.1007/s00705-020-04828-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2020] [Accepted: 08/31/2020] [Indexed: 01/02/2023]
Abstract
On several occasions over the past century it has been proposed that Latinized (Linnaean) binomial names (LBs) should be used for the formal names of virus species, and the opinions expressed in the early debates are still valid. The use of LBs would be sensible for the current Taxonomy if confined to the names of the specific and generic taxa of viruses of which some basic biological properties are known (e.g. ecology, hosts and virions); there is no advantage in filling the literature with formal names for partly described viruses or virus-like gene sequences. The ICTV should support the time-honoured convention that LBs are only used with biological (phylogenetic) classifications. Recent changes have left the ICTV Taxonomy and its Code uncoordinated, and its aims and audience uncertain.
Collapse
Affiliation(s)
- Adrian Gibbs
- Emeritus Faculty, Australian National University, Canberra, ACT 2601, Australia.
| |
Collapse
|