1
|
Alarcón-Schumacher T, Lücking D, Erdmann S. Revisiting evolutionary trajectories and the organization of the Pleolipoviridae family. PLoS Genet 2023; 19:e1010998. [PMID: 37831715 PMCID: PMC10599561 DOI: 10.1371/journal.pgen.1010998] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 10/25/2023] [Accepted: 09/26/2023] [Indexed: 10/15/2023] Open
Abstract
Archaeal pleomorphic viruses belonging to the Pleolipoviridae family represent an enigmatic group as they exhibit unique genomic features and are thought to have evolved through recombination with different archaeal plasmids. However, most of our understanding of the diversity and evolutionary trajectories of this clade comes from a handful of isolated representatives. Here we present 164 new genomes of pleolipoviruses obtained from metagenomic data of Australian hypersaline lakes and publicly available metagenomic data. We perform a comprehensive analysis on the diversity and evolutionary relationships of the newly discovered viruses and previously described pleolipoviruses. We propose to classify the viruses into five genera within the Pleolipoviridae family, with one new genus represented only by virus genomes retrieved in this study. Our data support the current hypothesis that pleolipoviruses reshaped their genomes through recombining with multiple different groups of plasmids, which is reflected in the diversity of their predicted replication strategies. We show that the proposed genus Epsilonpleolipovirus has evolutionary ties to pRN1-like plasmids from Sulfolobus, suggesting that this group could be infecting other archaeal phyla. Interestingly, we observed that the genome size of pleolipoviruses is correlated to the presence or absence of an integrase. Analyses of the host range revealed that all but one virus exhibit an extremely narrow range, and we show that the predicted tertiary structure of the spike protein is strongly associated with the host family, suggesting a specific adaptation to the host S-layer glycoprotein organization.
Collapse
Affiliation(s)
| | - Dominik Lücking
- Max-Planck-Institute for Marine Microbiology, Bremen, Germany
| | - Susanne Erdmann
- Max-Planck-Institute for Marine Microbiology, Bremen, Germany
| |
Collapse
|
2
|
Geller AM, Levy A. "What I cannot create, I do not understand": elucidating microbe-microbe interactions to facilitate plant microbiome engineering. Curr Opin Microbiol 2023; 72:102283. [PMID: 36868050 DOI: 10.1016/j.mib.2023.102283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Revised: 01/22/2023] [Accepted: 01/24/2023] [Indexed: 03/05/2023]
Abstract
Plant-microbe interactions are important for both physiological and pathological processes. Despite the significance of plant-microbe interactions, microbe-microbe interactions themselves represent an important, complex, dynamic network that warrants deeper investigation. To understand how microbe-microbe interactions affect plant microbiomes, one approach is to systematically understand all the factors involved in successful engineering of a microbial community. This follows the physicist Richard Feynman's declaration: "what I cannot create, I do not understand". This review highlights recent studies that focus on aspects that we believe are important for building (ergo understanding) microbe-microbe interactions in the plant environment, including pairwise screening, intelligent application of cross-feeding models, spatial distributions of microbes, and understudied interactions between bacteria and fungi, phages, and protists. We offer a framework for systematic collection and centralized integration of data of plant microbiomes that could organize all the factors that can help ecologists understand microbiomes and help synthetic ecologists engineer beneficial microbiomes.
Collapse
Affiliation(s)
- Alexander M Geller
- Department of Plant Pathology and Microbiology, Institute of Environmental Science, Robert H. Smith Faculty of Agriculture, Food, and Environment, The Hebrew University of Jerusalem, Rehovot 7610001, Israel
| | - Asaf Levy
- Department of Plant Pathology and Microbiology, Institute of Environmental Science, Robert H. Smith Faculty of Agriculture, Food, and Environment, The Hebrew University of Jerusalem, Rehovot 7610001, Israel.
| |
Collapse
|
3
|
Santamaría RI, Bustos P, Van Cauwenberghe J, González V. Hidden diversity of double-stranded DNA phages in symbiotic Rhizobium species. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200468. [PMID: 34839703 PMCID: PMC8628074 DOI: 10.1098/rstb.2020.0468] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
In this study, we addressed the extent of diversification of phages associated with nitrogen-fixing symbiotic Rhizobium species. Despite the ecological and economic importance of the Rhizobium genus, little is known about the diversity of the associated phages. A thorough assessment of viral diversity requires investigating both lytic phages and prophages harboured in diverse Rhizobium genomes. Protein-sharing networks identified 56 viral clusters (VCs) among a set of 425 isolated phages and predicted prophages. The VCs formed by phages had more proteins in common and a higher degree of synteny, and they group together in clades in the associated phylogenetic tree. By contrast, the VCs of prophages showed significant genetic variation and gene loss, with selective pressure on the remaining genes. Some VCs were found in various Rhizobium species and geographical locations, suggesting that they have wide host ranges. Our results indicate that the VCs represent distinct taxonomic units, probably representing taxa equivalent to genera or even species. The finding of previously undescribed phage taxa indicates the need for further exploration of the diversity of phages associated with Rhizobium species. This article is part of the theme issue 'The secret lives of microbial mobile genetic elements'.
Collapse
Affiliation(s)
- Rosa I. Santamaría
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| | - Patricia Bustos
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| | - Jannick Van Cauwenberghe
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Mexico,Department of Integrative Biology, University of California, Berkeley, CA, USA
| | - Víctor González
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| |
Collapse
|
4
|
Rangel-Pineros G, Millard A, Michniewski S, Scanlan D, Sirén K, Reyes A, Petersen B, Clokie MR, Sicheritz-Pontén T. From Trees to Clouds: PhageClouds for Fast Comparison of ∼640,000 Phage Genomic Sequences and Host-Centric Visualization Using Genomic Network Graphs. PHAGE (NEW ROCHELLE, N.Y.) 2021; 2:194-203. [PMID: 36147515 PMCID: PMC9041511 DOI: 10.1089/phage.2021.0008] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/14/2023]
Abstract
Background: Fast and computationally efficient strategies are required to explore genomic relationships within an increasingly large and diverse phage sequence space. Here, we present PhageClouds, a novel approach using a graph database of phage genomic sequences and their intergenomic distances to explore the phage genomic sequence space. Methods: A total of 640,000 phage genomic sequences were retrieved from a variety of databases and public virome assemblies. Intergenomic distances were calculated with dashing, an alignment-free method suitable for handling massive data sets. These data were used to build a Neo4j® graph database. Results: PhageClouds supported the search of related phages among all complete phage genomes from GenBank for a single query phage in just 10 s. Moreover, PhageClouds expanded the number of closely related phage sequences detected for both finished and draft phage genomes, in comparison with searches exclusively targeting phage entries from GenBank. Conclusions: PhageClouds is a novel resource that will facilitate the analysis of phage genomic sequences and the characterization of assembled phage genomes.
Collapse
Affiliation(s)
- Guillermo Rangel-Pineros
- Section for Evolutionary Genomics, The GLOBE Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogota, Colombia
| | - Andrew Millard
- Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Slawomir Michniewski
- Warwick Medical School of Life Sciences, University of Warwick, Coventry, United Kingdom
| | - David Scanlan
- Warwick Medical School of Life Sciences, University of Warwick, Coventry, United Kingdom
| | - Kimmo Sirén
- Section for Evolutionary Genomics, The GLOBE Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Alejandro Reyes
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogota, Colombia
| | - Bent Petersen
- Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Kedah, Malaysia
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Martha R.J. Clokie
- Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Thomas Sicheritz-Pontén
- Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Kedah, Malaysia
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
5
|
Ramos-Barbero MD, Viver T, Zabaleta A, Senel E, Gomariz M, Antigüedad I, Santos F, Martínez-García M, Rosselló-Móra R, Antón J. Ancient saltern metagenomics: tracking changes in microbes and their viruses from the underground to the surface. Environ Microbiol 2021; 23:3477-3498. [PMID: 34110059 DOI: 10.1111/1462-2920.15630] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Revised: 05/26/2021] [Accepted: 06/06/2021] [Indexed: 11/28/2022]
Abstract
Microbial communities in hypersaline underground waters derive from ancient organisms trapped within the evaporitic salt crystals and are part of the poorly known subterranean biosphere. Here, we characterized the viral and prokaryotic assemblages present in the hypersaline springs that dissolve Triassic-Keuper evaporite rocks and feed the Añana Salt Valley (Araba/Alava, Basque Country, Spain). Four underground water samples (around 23% total salinity) with different levels of exposure to the open air were analysed by means of microscopy and metagenomics. Cells and viruses in the spring water had lower concentrations than what are normally found in hypersaline environments and seemed to be mostly inactive. Upon exposure to the open air, there was an increase in activity of both cells and viruses as well as a selection of phylotypes. The underground water was inhabited by a rich community harbouring a diverse set of genes coding for retinal binding proteins. A total of 35 viral contigs from 15 to 104 kb, representing partial or total viral genomes, were assembled and their evolutionary changes through the spring system were followed by SNP analysis and metagenomic island tracking. Overall, both the viral and the prokaryotic assemblages changed quickly upon exposure to the open air conditions.
Collapse
Affiliation(s)
- Mª Dolores Ramos-Barbero
- Department of Physiology, Genetics and Microbiology, University of Alicante, 03690 San Vicent del Raspeig, Alicante, Spain
| | - Tomeu Viver
- Marine Microbiology Group, Department of Animal and Microbial Diversity, Mediterranean Institute of Advanced Studies (IMEDEA; CSIC-UIB), Esporles, Illes Balears, 07190, Spain
| | - Ane Zabaleta
- Hydro-Environmental Processes Group, Geology Department, Science and Technology Faculty, University of the Basque Country UPV/EHU, Leioa, 48940, Spain
| | - Ece Senel
- Department of Physiology, Genetics and Microbiology, University of Alicante, 03690 San Vicent del Raspeig, Alicante, Spain.,Department of Biology, Institute of Graduate Programs, Eskisehir Technical University, Yunusemre Campus, Eskisehir, 26470, Turkey
| | - María Gomariz
- Department of Physiology, Genetics and Microbiology, University of Alicante, 03690 San Vicent del Raspeig, Alicante, Spain
| | - Iñaki Antigüedad
- Hydro-Environmental Processes Group, Geology Department, Science and Technology Faculty, University of the Basque Country UPV/EHU, Leioa, 48940, Spain
| | - Fernando Santos
- Department of Physiology, Genetics and Microbiology, University of Alicante, 03690 San Vicent del Raspeig, Alicante, Spain
| | - Manuel Martínez-García
- Department of Physiology, Genetics and Microbiology, University of Alicante, 03690 San Vicent del Raspeig, Alicante, Spain
| | - Ramon Rosselló-Móra
- Marine Microbiology Group, Department of Animal and Microbial Diversity, Mediterranean Institute of Advanced Studies (IMEDEA; CSIC-UIB), Esporles, Illes Balears, 07190, Spain
| | - Josefa Antón
- Department of Physiology, Genetics and Microbiology, University of Alicante, 03690 San Vicent del Raspeig, Alicante, Spain
| |
Collapse
|
6
|
Claverie JM. Fundamental Difficulties Prevent the Reconstruction of the Deep Phylogeny of Viruses. Viruses 2020; 12:E1130. [PMID: 33036160 PMCID: PMC7600955 DOI: 10.3390/v12101130] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 10/01/2020] [Accepted: 10/03/2020] [Indexed: 12/11/2022] Open
Abstract
The extension of virology beyond its traditional medical, veterinary, or agricultural applications, now called environmental virology, has shown that viruses are both the most numerous and diverse biological entities on Earth. In particular, virus isolations from unicellular eukaryotic hosts (heterotrophic and photosynthetic protozoans) revealed numerous viral types previously unexpected in terms of virion structure, gene content, or mode of replication. Complemented by large-scale metagenomic analyses, these discoveries have rekindled interest in the enigma of the origin of viruses, for which a description encompassing all their diversity remains not available. Several laboratories have repeatedly tackled the deep reconstruction of the evolutionary history of viruses, using various methods of molecular phylogeny applied to the few shared "core" genes detected in certain virus groups (e.g., the Nucleocytoviricota). Beyond the practical difficulties of establishing reliable homology relationships from extremely divergent sequences, I present here conceptual arguments highlighting several fundamental limitations plaguing the reconstruction of the deep evolutionary history of viruses, and even more the identification of their unique or multiple origin(s). These arguments also underline the risk of establishing premature high level viral taxonomic classifications. Those limitations are direct consequences of the random mechanisms governing the reductive/retrogressive evolution of all obligate intracellular parasites.
Collapse
Affiliation(s)
- Jean-Michel Claverie
- Structural & Genomic Information Laboratory (IGS, UMR 7256), Mediterranean Institute of Microbiology (FR3479), Aix-Marseille University and CNRS, 13288 Marseille, France
| |
Collapse
|
7
|
Khot V, Strous M, Hawley AK. Computational approaches in viral ecology. Comput Struct Biotechnol J 2020; 18:1605-1612. [PMID: 32670501 PMCID: PMC7334295 DOI: 10.1016/j.csbj.2020.06.019] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 06/09/2020] [Accepted: 06/10/2020] [Indexed: 01/21/2023] Open
Abstract
Dynamic virus-host interactions play a critical role in regulating microbial community structure and function. Yet for decades prior to the genomics era, viruses were largely overlooked in microbial ecology research, as only low-throughput culture-based methods of discovering viruses were available. With the advent of metagenomics, culture-independent techniques have provided exciting opportunities to discover and study new viruses. Here, we review recently developed computational methods for identifying viral sequences, exploring viral diversity in environmental samples, and predicting hosts from metagenomic sequence data. Methods to analyze viruses in silico utilize unconventional approaches to tackle challenges unique to viruses, such as vast diversity, mosaic viral genomes, and the lack of universal marker genes. As the field of viral ecology expands exponentially, computational advances have become increasingly important to gain insight into the role viruses in diverse habitats.
Collapse
Affiliation(s)
- Varada Khot
- Department of Geoscience, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Marc Strous
- Department of Geoscience, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Alyse K. Hawley
- Department of Geoscience, University of Calgary, Calgary, AB T2N 1N4, Canada
| |
Collapse
|
8
|
Jacobs-Sera D, Abad LA, Alvey RM, Anders KR, Aull HG, Bhalla SS, Blumer LS, Bollivar DW, Bonilla JA, Butela KA, Coomans RJ, Cresawn SG, D'Elia T, Diaz A, Divens AM, Edgington NP, Frederick GD, Gainey MD, Garlena RA, Grant KW, Gurney SMR, Hendrickson HL, Hughes LE, Kenna MA, Klyczek KK, Kotturi H, Mavrich TN, McKinney AL, Merkhofer EC, Moberg Parker J, Molloy SD, Monti DL, Pape-Zambito DA, Pollenz RS, Pope WH, Reyna NS, Rinehart CA, Russell DA, Shaffer CD, Sivanathan V, Stoner TH, Stukey J, Sunnen CN, Tolsma SS, Tsourkas PK, Wallen JR, Ware VC, Warner MH, Washington JM, Westover KM, Whitefleet-Smith JL, Wiersma-Koch HI, Williams DC, Zack KM, Hatfull GF. Genomic diversity of bacteriophages infecting Microbacterium spp. PLoS One 2020; 15:e0234636. [PMID: 32555720 PMCID: PMC7302621 DOI: 10.1371/journal.pone.0234636] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Accepted: 05/29/2020] [Indexed: 11/19/2022] Open
Abstract
The bacteriophage population is vast, dynamic, old, and genetically diverse. The genomics of phages that infect bacterial hosts in the phylum Actinobacteria show them to not only be diverse but also pervasively mosaic, and replete with genes of unknown function. To further explore this broad group of bacteriophages, we describe here the isolation and genomic characterization of 116 phages that infect Microbacterium spp. Most of the phages are lytic, and can be grouped into twelve clusters according to their overall relatedness; seven of the phages are singletons with no close relatives. Genome sizes vary from 17.3 kbp to 97.7 kbp, and their G+C% content ranges from 51.4% to 71.4%, compared to ~67% for their Microbacterium hosts. The phages were isolated on five different Microbacterium species, but typically do not efficiently infect strains beyond the one on which they were isolated. These Microbacterium phages contain many novel features, including very large viral genes (13.5 kbp) and unusual fusions of structural proteins, including a fusion of VIP2 toxin and a MuF-like protein into a single gene. These phages and their genetic components such as integration systems, recombineering tools, and phage-mediated delivery systems, will be useful resources for advancing Microbacterium genetics.
Collapse
Affiliation(s)
- Deborah Jacobs-Sera
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Lawrence A. Abad
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Richard M. Alvey
- Department of Biology, Illinois Wesleyan University, Bloomington, Illinois, United States of America
| | - Kirk R. Anders
- Department of Biology, Gonzaga University, Spokane, Washington, United States of America
| | - Haley G. Aull
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Suparna S. Bhalla
- Department of Natural Sciences, Mount Saint Mary College, Newburgh, New York, United States of America
| | - Lawrence S. Blumer
- Department of Biology, Morehouse College, Atlanta, Georgia, United States of America
| | - David W. Bollivar
- Department of Biology, Illinois Wesleyan University, Bloomington, Illinois, United States of America
| | - J. Alfred Bonilla
- Department of Biology, University of Wisconsin-River Falls, River Falls, Wisconsin, United States of America
| | - Kristen A. Butela
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Roy J. Coomans
- Department of Biology, North Carolina A&T State University, Greensboro, North Carolina, United States of America
| | - Steven G. Cresawn
- Department of Biology, James Madison University, Harrisonburg, Virginia, United States of America
| | - Tom D'Elia
- Department of Biological Sciences, Indian River State College, Fort Pierce, Florida, United States of America
| | - Arturo Diaz
- Department of Biology, La Sierra University, Riverside, California, United States of America
| | - Ashley M. Divens
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Nicholas P. Edgington
- Department of Biology, Southern Connecticut State University, New Haven, Connecticut, United States of America
| | - Gregory D. Frederick
- Department of Biology and Kinesiology, LeTourneau University, Longview, Texas, United States of America
| | - Maria D. Gainey
- Department of Chemistry & Physics, Western Carolina University, Cullowhee, North Carolina, United States of America
| | - Rebecca A. Garlena
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Kenneth W. Grant
- Department of Pathology, Wake Forest Baptist Health, Winston-Salem, North Carolina, United States of America
| | - Susan M. R. Gurney
- Department of Biology, Drexel University, Philadelphia, Pennsylvania, United States of America
| | | | - Lee E. Hughes
- Department of Biological Sciences, University of North Texas, Denton, Texas, United States of America
| | - Margaret A. Kenna
- Department of Biological Sciences, Lehigh University, Bethlehem, Pennsylvania, United States of America
| | - Karen K. Klyczek
- Department of Biology, University of Wisconsin-River Falls, River Falls, Wisconsin, United States of America
| | - Hari Kotturi
- Department of Biology, University of Central Oklahoma, Edmond, Oklahoma, United States of America
| | - Travis N. Mavrich
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Angela L. McKinney
- Department of Biology, Nebraska Wesleyan University, Lincoln, Nebraska, United States of America
| | - Evan C. Merkhofer
- Department of Natural Sciences, Mount Saint Mary College, Newburgh, New York, United States of America
| | - Jordan Moberg Parker
- Department of Microbiology, Immunology, & Molecular Genetics, University of California, Los Angeles, California, United States of America
| | - Sally D. Molloy
- Department of Molecular and Biomedical Sciences, University of Maine, Orono, Maine, United States of America
| | - Denise L. Monti
- Department of Biology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Dana A. Pape-Zambito
- Department of Biological Sciences, University of the Sciences in Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Richard S. Pollenz
- Department Cell Biology, Microbiology and Molecular Biology, University of South Florida, Tampa, Florida, United States of America
| | - Welkin H. Pope
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Nathan S. Reyna
- Department of Biology, Ouachita Baptist University, Arkadelphia, Arkansas, United States of America
| | - Claire A. Rinehart
- Department of Biology, Western Kentucky University, Bowling Green, Kentucky, United States of America
| | - Daniel A. Russell
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Christopher D. Shaffer
- Department of Biology, University of Washington in St. Louis, St. Louis, Missouri, United States of America
| | - Viknesh Sivanathan
- Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
| | - Ty H. Stoner
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Joseph Stukey
- Biology Department, Hope College, Holland, Michigan, United States of America
| | - C. Nicole Sunnen
- Department of Biological Sciences, University of the Sciences, Philadelphia, Pennsylvania, United States of America
| | - Sara S. Tolsma
- Biology Department, Northwestern College, Orange City, Iowa, United States of America
| | - Philippos K. Tsourkas
- School of Life Sciences, University of Nevada, Las Vegas, Las Vegas, Nevada, United States of America
| | - Jamie R. Wallen
- Department of Chemistry & Physics, Western Carolina University, Cullowhee, North Carolina, United States of America
| | - Vassie C. Ware
- Department of Biological Sciences, Lehigh University, Bethlehem, Pennsylvania, United States of America
| | - Marcie H. Warner
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | | | - Kristi M. Westover
- Department of Biology, Winthrop University, Rock Hill, South Carolina, United States of America
| | - JoAnn L. Whitefleet-Smith
- Department of Biology & Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, United States of America
| | - Helen I. Wiersma-Koch
- Department of Biological Sciences, Indian River State College, Fort Pierce, Florida, United States of America
| | - Daniel C. Williams
- Department of Biology, Coastal Carolina University, Conway, South Carolina, United States of America
| | - Kira M. Zack
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Graham F. Hatfull
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
9
|
Koonin EV, Dolja VV, Krupovic M, Varsani A, Wolf YI, Yutin N, Zerbini FM, Kuhn JH. Global Organization and Proposed Megataxonomy of the Virus World. Microbiol Mol Biol Rev 2020; 84:e00061-19. [PMID: 32132243 PMCID: PMC7062200 DOI: 10.1128/mmbr.00061-19] [Citation(s) in RCA: 324] [Impact Index Per Article: 81.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Viruses and mobile genetic elements are molecular parasites or symbionts that coevolve with nearly all forms of cellular life. The route of virus replication and protein expression is determined by the viral genome type. Comparison of these routes led to the classification of viruses into seven "Baltimore classes" (BCs) that define the major features of virus reproduction. However, recent phylogenomic studies identified multiple evolutionary connections among viruses within each of the BCs as well as between different classes. Due to the modular organization of virus genomes, these relationships defy simple representation as lines of descent but rather form complex networks. Phylogenetic analyses of virus hallmark genes combined with analyses of gene-sharing networks show that replication modules of five BCs (three classes of RNA viruses and two classes of reverse-transcribing viruses) evolved from a common ancestor that encoded an RNA-directed RNA polymerase or a reverse transcriptase. Bona fide viruses evolved from this ancestor on multiple, independent occasions via the recruitment of distinct cellular proteins as capsid subunits and other structural components of virions. The single-stranded DNA (ssDNA) viruses are a polyphyletic class, with different groups evolving by recombination between rolling-circle-replicating plasmids, which contributed the replication protein, and positive-sense RNA viruses, which contributed the capsid protein. The double-stranded DNA (dsDNA) viruses are distributed among several large monophyletic groups and arose via the combination of distinct structural modules with equally diverse replication modules. Phylogenomic analyses reveal the finer structure of evolutionary connections among RNA viruses and reverse-transcribing viruses, ssDNA viruses, and large subsets of dsDNA viruses. Taken together, these analyses allow us to outline the global organization of the virus world. Here, we describe the key aspects of this organization and propose a comprehensive hierarchical taxonomy of viruses.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Valerian V Dolja
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
| | - Mart Krupovic
- Institut Pasteur, Archaeal Virology Unit, Department of Microbiology, Paris, France
| | - Arvind Varsani
- The Biodesign Center for Fundamental and Applied Microbiomics, Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, Arizona, USA
- Structural Biology Research Unit, Department of Clinical Laboratory Sciences, University of Cape Town, Observatory, Cape Town, South Africa
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Natalya Yutin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - F Murilo Zerbini
- Departamento de Fitopatologia/Bioagro, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil
| | - Jens H Kuhn
- Integrated Research Facility at Fort Detrick, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, Maryland, USA
| |
Collapse
|
10
|
Tisza MJ, Pastrana DV, Welch NL, Stewart B, Peretti A, Starrett GJ, Pang YYS, Krishnamurthy SR, Pesavento PA, McDermott DH, Murphy PM, Whited JL, Miller B, Brenchley J, Rosshart SP, Rehermann B, Doorbar J, Ta'ala BA, Pletnikova O, Troncoso JC, Resnick SM, Bolduc B, Sullivan MB, Varsani A, Segall AM, Buck CB. Discovery of several thousand highly diverse circular DNA viruses. eLife 2020; 9:51971. [PMID: 32014111 PMCID: PMC7000223 DOI: 10.7554/elife.51971] [Citation(s) in RCA: 116] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 01/06/2020] [Indexed: 12/18/2022] Open
Abstract
Although millions of distinct virus species likely exist, only approximately 9000 are catalogued in GenBank's RefSeq database. We selectively enriched for the genomes of circular DNA viruses in over 70 animal samples, ranging from nematodes to human tissue specimens. A bioinformatics pipeline, Cenote-Taker, was developed to automatically annotate over 2500 complete genomes in a GenBank-compliant format. The new genomes belong to dozens of established and emerging viral families. Some appear to be the result of previously undescribed recombination events between ssDNA and ssRNA viruses. In addition, hundreds of circular DNA elements that do not encode any discernable similarities to previously characterized sequences were identified. To characterize these ‘dark matter’ sequences, we used an artificial neural network to identify candidate viral capsid proteins, several of which formed virus-like particles when expressed in culture. These data further the understanding of viral sequence diversity and allow for high throughput documentation of the virosphere. When scientists hunt for new DNA sequences, sometimes they get a lot more than they bargained for. Such is the case in metagenomic surveys, which analyze not just DNA of a particular organism, but all the DNA in an environment at large. A vexing problem with these surveys is the overwhelming number of DNA sequences detected that are so different from any known microbe that they cannot be classified using traditional approaches. However, some of these “known unknowns” are undoubtedly viral sequences, because only a fraction of the enormous diversity of viruses has been characterized. This “viral dark matter” is a major obstacle for those studying viruses. This led Tisza et al. to attempt to classify some of the unknown viral sequences in their metagenomic surveys. The search, which specifically focused on viruses with circular DNA genomes, detected over 2,500 circular viral genomes. Intensive analysis revealed that many of these genomes had similar makeup to previously discovered viruses, but hundreds of them were totally different from any known virus, based on typical methods of comparison. Computational analysis of genes that were conserved among some of these brand-new circular sequences often revealed virus-like features. Experiments on a few of these genes showed that they encoded proteins capable of forming particles reminiscent of characteristic viral shells, implying that these new sequences are indeed viruses. Tisza et al. have added the 2,500 newly characterized viral sequences to the publicly accessible GenBank database, and the sequences are being considered for the more authoritative RefSeq database, which currently contains around 9,000 complete viral genomes. The expanded databases will hopefully now better equip scientists to explore the enormous diversity of viruses and help medics and veterinarians to detect disease-causing viruses in humans and other animals.
Collapse
Affiliation(s)
- Michael J Tisza
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Diana V Pastrana
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Nicole L Welch
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Brittany Stewart
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Alberto Peretti
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Gabriel J Starrett
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Yuk-Ying S Pang
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Siddharth R Krishnamurthy
- Metaorganism Immunity Section, Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, United States
| | - Patricia A Pesavento
- Department of Pathology, Microbiology, and Immunology, University of California, Davis, Davis, United States
| | - David H McDermott
- Molecular Signaling Section, Laboratory of Molecular Immunology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, United States
| | - Philip M Murphy
- Molecular Signaling Section, Laboratory of Molecular Immunology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, United States
| | - Jessica L Whited
- Department of Orthopedic Surgery, Harvard Medical School, The Harvard Stem Cell Institute, Brigham and Women's Hospital, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States.,Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, United States
| | - Bess Miller
- Department of Orthopedic Surgery, Harvard Medical School, The Harvard Stem Cell Institute, Brigham and Women's Hospital, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States
| | - Jason Brenchley
- Barrier Immunity Section, Lab of Viral Diseases, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Cambridge, United States
| | - Stephan P Rosshart
- Immunology Section, Liver Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, United States
| | - Barbara Rehermann
- Immunology Section, Liver Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, United States
| | - John Doorbar
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
| | | | - Olga Pletnikova
- Department of Pathology (Neuropathology), Johns Hopkins University School of Medicine, Baltimore, United States
| | - Juan C Troncoso
- Department of Pathology (Neuropathology), Johns Hopkins University School of Medicine, Baltimore, United States
| | - Susan M Resnick
- Laboratory of Behavioral Neuroscience, National Institute on Aging, National Institutes of Health, Baltimore, United States
| | - Ben Bolduc
- Department of Microbiology, Ohio State University, Columbus, United States
| | - Matthew B Sullivan
- Department of Microbiology, Ohio State University, Columbus, United States.,Civil Environmental and Geodetic Engineering, Ohio State University, Columbus, United States
| | - Arvind Varsani
- The Biodesign Center of Fundamental and Applied Microbiomics, School of Life Sciences, Center for Evolution and Medicine, Arizona State University, Tempe, United States.,Structural Biology Research Unit, Department of Clinical Laboratory Sciences, University of Cape Town, Rondebosch, South Africa
| | - Anca M Segall
- Viral Information Institute and Department of Biology, San Diego State University, San Diego, United States
| | - Christopher B Buck
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| |
Collapse
|
11
|
Li X, Wang H, Tong W, Feng L, Wang L, Rahman SU, Wei G, Tao S. Exploring the evolutionary dynamics of Rhizobium plasmids through bipartite network analysis. Environ Microbiol 2019; 22:934-951. [PMID: 31361937 DOI: 10.1111/1462-2920.14762] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Revised: 06/24/2019] [Accepted: 07/25/2019] [Indexed: 10/26/2022]
Abstract
The genus Rhizobium usually has a multipartite genome architecture with a chromosome and several plasmids, making these bacteria a perfect candidate for plasmid biology studies. As there are no universally shared genes among typical plasmids, network analyses can complement traditional phylogenetics in a broad-scale study of plasmid evolution. Here, we present an exhaustive analysis of 216 plasmids from 49 complete genomes of Rhizobium by constructing a bipartite network that consists of two classes of nodes, the plasmids and homologous protein families that connect them. Dissection of the network using a hierarchical clustering strategy reveals extensive variety, with 34 homologous plasmid clusters. Four large clusters including one cluster of symbiotic plasmids and two clusters of chromids carrying some truly essential genes are widely distributed among Rhizobium. In contrast, the other clusters are quite small and rare. Symbiotic clusters and rare accessory clusters are exogenetic and do not appear to have co-evolved with the common accessory clusters; the latter ones have a large coding potential and functional complementarity for different lifestyles in Rhizobium. The bipartite network also provides preliminary evidence of Rhizobium plasmid variation and formation including genetic exchange, plasmid fusion and fission, exogenetic plasmid transfer, host plant selection, and environmental adaptation.
Collapse
Affiliation(s)
- Xiangchen Li
- State Key Laboratory of Crop Stress Biology in Arid Areas, Shaanxi Key Laboratory of Agricultural and Environmental Microbiology, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, 712100, China.,Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Hao Wang
- State Key Laboratory of Crop Stress Biology in Arid Areas, Shaanxi Key Laboratory of Agricultural and Environmental Microbiology, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, 712100, China.,Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Wenjun Tong
- State Key Laboratory of Crop Stress Biology in Arid Areas, Shaanxi Key Laboratory of Agricultural and Environmental Microbiology, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Li Feng
- College of Enology, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Lina Wang
- State Key Laboratory of Crop Stress Biology in Arid Areas, Shaanxi Key Laboratory of Agricultural and Environmental Microbiology, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, 712100, China.,Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Siddiq Ur Rahman
- State Key Laboratory of Crop Stress Biology in Arid Areas, Shaanxi Key Laboratory of Agricultural and Environmental Microbiology, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, 712100, China.,Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, 712100, China.,Department of Computer Science and Bioinformatics, Khushal Khan Khattak University, Karak, Khyber Pakhtunkhwa, 27200, Pakistan
| | - Gehong Wei
- State Key Laboratory of Crop Stress Biology in Arid Areas, Shaanxi Key Laboratory of Agricultural and Environmental Microbiology, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Shiheng Tao
- State Key Laboratory of Crop Stress Biology in Arid Areas, Shaanxi Key Laboratory of Agricultural and Environmental Microbiology, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, 712100, China.,Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, 712100, China
| |
Collapse
|
12
|
Villarreal LP, Witzany G. That is life: communicating RNA networks from viruses and cells in continuous interaction. Ann N Y Acad Sci 2019; 1447:5-20. [PMID: 30865312 DOI: 10.1111/nyas.14040] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Revised: 01/13/2019] [Accepted: 01/31/2019] [Indexed: 02/06/2023]
Abstract
All the conserved detailed results of evolution stored in DNA must be read, transcribed, and translated via an RNA-mediated process. This is required for the development and growth of each individual cell. Thus, all known living organisms fundamentally depend on these RNA-mediated processes. In most cases, they are interconnected with other RNAs and their associated protein complexes and function in a strictly coordinated hierarchy of temporal and spatial steps (i.e., an RNA network). Clearly, all cellular life as we know it could not function without these key agents of DNA replication, namely rRNA, tRNA, and mRNA. Thus, any definition of life that lacks RNA functions and their networks misses an essential requirement for RNA agents that inherently regulate and coordinate (communicate to) cells, tissues, organs, and organisms. The precellular evolution of RNAs occurred at the core of the emergence of cellular life and the question remained of how both precellular and cellular levels are interconnected historically and functionally. RNA networks and RNA communication can interconnect these levels. With the reemergence of virology in evolution, it became clear that communicating viruses and subviral infectious genetic parasites are bridging these two levels by invading, integrating, coadapting, exapting, and recombining constituent parts in host genomes for cellular requirements in gene regulation and coordination aims. Therefore, a 21st century understanding of life is of an inherently social process based on communicating RNA networks, in which viruses and cells continuously interact.
Collapse
Affiliation(s)
- Luis P Villarreal
- Department of Molecular Biology and Biochemistry, University of California, Irvine, California
| | | |
Collapse
|
13
|
Wolf YI, Kazlauskas D, Iranzo J, Lucía-Sanz A, Kuhn JH, Krupovic M, Dolja VV, Koonin EV. Origins and Evolution of the Global RNA Virome. mBio 2018; 9:e02329-18. [PMID: 30482837 PMCID: PMC6282212 DOI: 10.1128/mbio.02329-18] [Citation(s) in RCA: 322] [Impact Index Per Article: 53.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 10/31/2018] [Indexed: 01/12/2023] Open
Abstract
Viruses with RNA genomes dominate the eukaryotic virome, reaching enormous diversity in animals and plants. The recent advances of metaviromics prompted us to perform a detailed phylogenomic reconstruction of the evolution of the dramatically expanded global RNA virome. The only universal gene among RNA viruses is the gene encoding the RNA-dependent RNA polymerase (RdRp). We developed an iterative computational procedure that alternates the RdRp phylogenetic tree construction with refinement of the underlying multiple-sequence alignments. The resulting tree encompasses 4,617 RNA virus RdRps and consists of 5 major branches; 2 of the branches include positive-sense RNA viruses, 1 is a mix of positive-sense (+) RNA and double-stranded RNA (dsRNA) viruses, and 2 consist of dsRNA and negative-sense (-) RNA viruses, respectively. This tree topology implies that dsRNA viruses evolved from +RNA viruses on at least two independent occasions, whereas -RNA viruses evolved from dsRNA viruses. Reconstruction of RNA virus evolution using the RdRp tree as the scaffold suggests that the last common ancestors of the major branches of +RNA viruses encoded only the RdRp and a single jelly-roll capsid protein. Subsequent evolution involved independent capture of additional genes, in particular, those encoding distinct RNA helicases, enabling replication of larger RNA genomes and facilitating virus genome expression and virus-host interactions. Phylogenomic analysis reveals extensive gene module exchange among diverse viruses and horizontal virus transfer between distantly related hosts. Although the network of evolutionary relationships within the RNA virome is bound to further expand, the present results call for a thorough reevaluation of the RNA virus taxonomy.IMPORTANCE The majority of the diverse viruses infecting eukaryotes have RNA genomes, including numerous human, animal, and plant pathogens. Recent advances of metagenomics have led to the discovery of many new groups of RNA viruses in a wide range of hosts. These findings enable a far more complete reconstruction of the evolution of RNA viruses than was attainable previously. This reconstruction reveals the relationships between different Baltimore classes of viruses and indicates extensive transfer of viruses between distantly related hosts, such as plants and animals. These results call for a major revision of the existing taxonomy of RNA viruses.
Collapse
Affiliation(s)
- Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Darius Kazlauskas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
- Département de Microbiologie, Institut Pasteur, Paris, France
| | - Jaime Iranzo
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Adriana Lucía-Sanz
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
- Centro Nacional de Biotecnología, Madrid, Spain
| | - Jens H Kuhn
- Integrated Research Facility at Fort Detrick, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, Maryland, USA
| | - Mart Krupovic
- Département de Microbiologie, Institut Pasteur, Paris, France
| | - Valerian V Dolja
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| |
Collapse
|
14
|
Aiewsakun P, Adriaenssens EM, Lavigne R, Kropinski AM, Simmonds P. Evaluation of the genomic diversity of viruses infecting bacteria, archaea and eukaryotes using a common bioinformatic platform: steps towards a unified taxonomy. J Gen Virol 2018; 99:1331-1343. [PMID: 30016225 PMCID: PMC6230767 DOI: 10.1099/jgv.0.001110] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Accepted: 06/13/2018] [Indexed: 01/01/2023] Open
Abstract
Genome Relationship Applied to Virus Taxonomy (GRAViTy) is a genetics-based tool that computes sequence relatedness between viruses. Composite generalized Jaccard (CGJ) distances combine measures of homology between encoded viral genes and similarities in genome organizational features (gene orders and orientations). This scoring framework effectively recapitulates the current, largely morphology and phenotypic-based, family-level classification of eukaryotic viruses. Eukaryotic virus families typically formed monophyletic groups with consistent CGJ distance cut-off dividing between and within family divergence ranges. In the current study, a parallel analysis of prokaryotic virus families revealed quite different sequence relationships, particularly those of tailed phage families (Siphoviridae, Myoviridae and Podoviridae), where members of the same family were generally far more divergent and often not detectably homologous to each other. Analysis of the 20 currently classified prokaryotic virus families indeed split them into 70 separate clusters of tailed phages genetically equivalent to family-level assignments of eukaryotic viruses. It further divided several bacterial (Sphaerolipoviridae, Tectiviridae) and archaeal (Lipothrixviridae) families. We also found that the subfamily-level groupings of tailed phages were generally more consistent with the family assignments of eukaryotic viruses, and this supports ongoing reclassifications, including Spounavirinae and Vi1virus taxa as new virus families. The current study applied a common benchmark with which to compare taxonomies of eukaryotic and prokaryotic viruses. The findings support the planned shift away from traditional morphology-based classifications of prokaryotic viruses towards a genome-based taxonomy. They demonstrate the feasibility of a unified taxonomy of viruses into which the vast body of metagenomic viral sequences may be consistently assigned.
Collapse
Affiliation(s)
- Pakorn Aiewsakun
- Nuffield Department of Medicine, University of Oxford, Peter Medawar Building, South Parks, Oxford, OX1 3SY, UK
- Department of Microbiology, Faculty of Science, Mahidol University, Bangkok, 10400, Thailand
| | - Evelien M. Adriaenssens
- Institute of Integrative Biology, University of Liverpool, Biosciences Building, Crown Street, L69 7ZB Liverpool, UK
| | - Rob Lavigne
- Department of Biosystems, Laboratory of Gene Technology, KU Leuven. Kasteelpark Arenberg 21, Box 2462, 3001 Leuven, Belgium
| | - Andrew M. Kropinski
- Departments of Food Science, and Pathobiology, University of Guelph, 50 Stone Rd E, Guelph, ON, N1G 2W1, Canada
| | - Peter Simmonds
- Nuffield Department of Medicine, University of Oxford, Peter Medawar Building, South Parks, Oxford, OX1 3SY, UK
| |
Collapse
|
15
|
Simmonds P, Aiewsakun P. Virus classification - where do you draw the line? Arch Virol 2018; 163:2037-2046. [PMID: 30039318 PMCID: PMC6096723 DOI: 10.1007/s00705-018-3938-z] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Accepted: 07/03/2018] [Indexed: 11/23/2022]
Abstract
High-throughput sequencing (HTS) and its use in recovering and assembling novel virus sequences from environmental, human clinical, veterinary and plant samples has unearthed a vast new catalogue of viruses. Their classification, known by their sequences alone, sets a major challenge to traditional virus taxonomy, especially at the family and species levels, which have been historically based largely on descriptive taxon definitions. These typically entail some knowledge of their phenotypic properties, including replication strategies, virion structure and clinical and epidemiological features, such as host range, geographical distribution and disease outcomes. Little to no information on these attributes is available, however, for viruses identified in metagenomic datasets. If such viruses are to be included in virus taxonomy, their assignments will have to be guided largely or entirely by metrics of genetic relatedness. The immediate problem here is that the International Committee on Taxonomy of Viruses (ICTV), an organisation that authorises the taxonomic classification of viruses, provides little or no guidance on how similar or how divergent viruses must be in order to be considered members of new species or new families. We have recently developed a method for scoring genomic (dis)similarity between viruses (Genome Relationships Applied to Virus Taxonomy - GRAViTy) among the eukaryotic and prokaryotic viruses currently classified by the ICTV. At the family and genus levels, we found large-scale consistency between genetic relationships and their taxonomic assignments for eukaryotic viruses of all genome configurations and genome sizes. Family assignments of prokaryotic viruses have, however, been made at a quite different genetic level, and groupings currently classified as sub-families are a much better match to the eukaryotic virus family level. These findings support the ongoing reorganisation of bacteriophage taxonomy by the ICTV Phage Study Group. A rapid and objective means to explore metagenomic viral diversity and make evidence-based assignments for such viruses at each taxonomic layer is essential. Analysis of sequences by GRAViTy provides evidence that family (and genus) assignments of currently classified viruses are largely underpinned by genomic relatedness, and these features could serve as a guide towards an evidence-based classification of metagenomic viruses in the future.
Collapse
Affiliation(s)
- Peter Simmonds
- Nuffield Department of Medicine, University of Oxford, Peter Medawar Building, South Parks Road, Oxford, OX1 3SY UK
| | - Pakorn Aiewsakun
- Nuffield Department of Medicine, University of Oxford, Peter Medawar Building, South Parks Road, Oxford, OX1 3SY UK
- Department of Microbiology, Faculty of Science, Mahidol University, Bangkok, 10400 Thailand
| |
Collapse
|
16
|
Abstract
Due to their dependence on cellular organisms for metabolism and replication, viruses are typically named and assigned to species according to their genome structure and the original host that they infect. But because viruses often infect multiple hosts and the numbers of distinct lineages within a host can be vast, their delineation into species is often dictated by arbitrary sequence thresholds, which are highly inconsistent across lineages. Here we apply an approach to determine the boundaries of viral species based on the detection of gene flow within populations, thereby defining viral species according to the biological species concept (BSC). Despite the potential for gene transfer between highly divergent genomes, viruses, like the cellular organisms they infect, assort into reproductively isolated groups and can be organized into biological species. This approach revealed that BSC-defined viral species are often congruent with the taxonomic partitioning based on shared gene contents and host tropism, and that bacteriophages can similarly be classified in biological species. These results open the possibility to use a single, universal definition of species that is applicable across cellular and acellular lifeforms.
Collapse
|
17
|
Aiewsakun P, Simmonds P. The genomic underpinnings of eukaryotic virus taxonomy: creating a sequence-based framework for family-level virus classification. MICROBIOME 2018; 6:38. [PMID: 29458427 PMCID: PMC5819261 DOI: 10.1186/s40168-018-0422-7] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Accepted: 02/07/2018] [Indexed: 05/14/2023]
Abstract
BACKGROUND The International Committee on Taxonomy of Viruses (ICTV) classifies viruses into families, genera and species and provides a regulated system for their nomenclature that is universally used in virus descriptions. Virus taxonomic assignments have traditionally been based upon virus phenotypic properties such as host range, virion morphology and replication mechanisms, particularly at family level. However, gene sequence comparisons provide a clearer guide to their evolutionary relationships and provide the only information that may guide the incorporation of viruses detected in environmental (metagenomic) studies that lack any phenotypic data. RESULTS The current study sought to determine whether the existing virus taxonomy could be reproduced by examination of genetic relationships through the extraction of protein-coding gene signatures and genome organisational features. We found large-scale consistency between genetic relationships and taxonomic assignments for viruses of all genome configurations and genome sizes. The analysis pipeline that we have called 'Genome Relationships Applied to Virus Taxonomy' (GRAViTy) was highly effective at reproducing the current assignments of viruses at family level as well as inter-family groupings into orders. Its ability to correctly differentiate assigned viruses from unassigned viruses, and classify them into the correct taxonomic group, was evaluated by threefold cross-validation technique. This predicted family membership of eukaryotic viruses with close to 100% accuracy and specificity potentially enabling the algorithm to predict assignments for the vast corpus of metagenomic sequences consistently with ICTV taxonomy rules. In an evaluation run of GRAViTy, over one half (460/921) of (near)-complete genome sequences from several large published metagenomic eukaryotic virus datasets were assigned to 127 novel family-level groupings. If corroborated by other analysis methods, these would potentially more than double the number of eukaryotic virus families in the ICTV taxonomy. CONCLUSIONS A rapid and objective means to explore metagenomic viral diversity and make informed recommendations for their assignments at each taxonomic layer is essential. GRAViTy provides one means to make rule-based assignments at family and order levels in a manner that preserves the integrity and underlying organisational principles of the current ICTV taxonomy framework. Such methods are increasingly required as the vast virosphere is explored.
Collapse
Affiliation(s)
- Pakorn Aiewsakun
- Nuffield Department of Medicine, University of Oxford, Peter Medawar Building, South Parks Road, Oxford, OX1 3SY UK
| | - Peter Simmonds
- Nuffield Department of Medicine, University of Oxford, Peter Medawar Building, South Parks Road, Oxford, OX1 3SY UK
| |
Collapse
|
18
|
Mushegian A, Karin EL, Pupko T. Sequence analysis of malacoherpesvirus proteins: Pan-herpesvirus capsid module and replication enzymes with an ancient connection to "Megavirales". Virology 2018; 513:114-128. [PMID: 29065352 PMCID: PMC7172337 DOI: 10.1016/j.virol.2017.10.009] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2017] [Revised: 10/08/2017] [Accepted: 10/09/2017] [Indexed: 12/30/2022]
Abstract
The order Herpesvirales includes animal viruses with large double-strand DNA genomes replicating in the nucleus. The main capsid protein in the best-studied family Herpesviridae contains a domain with HK97-like fold related to bacteriophage head proteins, and several virion maturation factors are also homologous between phages and herpesviruses. The origin of herpesvirus DNA replication proteins is less well understood. While analyzing the genomes of herpesviruses in the family Malacohepresviridae, we identified nearly 30 families of proteins conserved in other herpesviruses, including several phage-related domains in morphogenetic proteins. Herpesvirus DNA replication factors have complex evolutionary history: some are related to cellular proteins, but others are closer to homologs from large nucleocytoplasmic DNA viruses. Phylogenetic analyses suggest that the core replication machinery of herpesviruses may have been recruited from the same pool as in the case of other large DNA viruses of eukaryotes.
Collapse
Affiliation(s)
- Arcady Mushegian
- Division of Molecular and Cellular Biosciences, National Science Foundation, 2415 Eisenhower Avenue, Alexandria, VA 22314, USA.
| | - Eli Levy Karin
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel-Aviv University, Tel-Aviv 69978, Israel
| | - Tal Pupko
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel-Aviv University, Tel-Aviv 69978, Israel
| |
Collapse
|
19
|
Shakya M, Soucy SM, Zhaxybayeva O. Insights into origin and evolution of α-proteobacterial gene transfer agents. Virus Evol 2017; 3:vex036. [PMID: 29250433 PMCID: PMC5721377 DOI: 10.1093/ve/vex036] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Several bacterial and archaeal lineages produce nanostructures that morphologically resemble small tailed viruses, but, unlike most viruses, contain apparently random pieces of the host genome. Since these elements can deliver the packaged DNA to other cells, they were dubbed gene transfer agents (GTAs). Because many genes involved in GTA production have viral homologs, it has been hypothesized that the GTA ancestor was a virus. Whether GTAs represent an atypical virus, a defective virus, or a virus co-opted by the prokaryotes for some function, remains to be elucidated. To evaluate these possibilities, we examined the distribution and evolutionary histories of genes that encode a GTA in the α-proteobacterium Rhodobacter capsulatus (RcGTA). We report that although homologs of many individual RcGTA genes are abundant across bacteria and their viruses, RcGTA-like genomes are mainly found in one subclade of α-proteobacteria. When compared with the viral homologs, genes of the RcGTA-like genomes evolve significantly slower, and do not have higher %A+T nucleotides than their host chromosomes. Moreover, they appear to reside in stable regions of the bacterial chromosomes that are generally conserved across taxonomic orders. These findings argue against RcGTA being an atypical or a defective virus. Our phylogenetic analyses suggest that RcGTA ancestor likely originated in the lineage that gave rise to contemporary α-proteobacterial orders Rhizobiales, Rhodobacterales, Caulobacterales, Parvularculales, and Sphingomonadales, and since that time the RcGTA-like element has co-evolved with its host chromosomes. Such evolutionary history is compatible with maintenance of these elements by bacteria due to some selective advantage. As for many other prokaryotic traits, horizontal gene transfer played a substantial role in the evolution of RcGTA-like elements, not only in shaping its genome components within the orders, but also in occasional dissemination of RcGTA-like regions across the orders and even to different bacterial phyla.
Collapse
Affiliation(s)
- Migun Shakya
- Department of Biological Sciences, Dartmouth College, 78 College Street, Hanover, NH 03755, USA
| | - Shannon M Soucy
- Department of Biological Sciences, Dartmouth College, 78 College Street, Hanover, NH 03755, USA
| | - Olga Zhaxybayeva
- Department of Biological Sciences, Dartmouth College, 78 College Street, Hanover, NH 03755, USA.,Department of Computer Science, Dartmouth College, 6211 Sudikoff Lab, Hanover, NH 03755, USA
| |
Collapse
|
20
|
Krupovic M, Cvirkaite-Krupovic V, Iranzo J, Prangishvili D, Koonin EV. Viruses of archaea: Structural, functional, environmental and evolutionary genomics. Virus Res 2017; 244:181-193. [PMID: 29175107 DOI: 10.1016/j.virusres.2017.11.025] [Citation(s) in RCA: 139] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2017] [Revised: 11/20/2017] [Accepted: 11/20/2017] [Indexed: 11/18/2022]
Abstract
Viruses of archaea represent one of the most enigmatic parts of the virosphere. Most of the characterized archaeal viruses infect extremophilic hosts and display remarkable diversity of virion morphotypes, many of which have never been observed among viruses of bacteria or eukaryotes. The uniqueness of the virion morphologies is matched by the distinctiveness of the genomes of these viruses, with ∼75% of genes encoding unique proteins, refractory to functional annotation based on sequence analyses. In this review, we summarize the state-of-the-art knowledge on various aspects of archaeal virus genomics. First, we outline how structural and functional genomics efforts provided valuable insights into the functions of viral proteins and revealed intricate details of the archaeal virus-host interactions. We then highlight recent metagenomics studies, which provided a glimpse at the diversity of uncultivated viruses associated with the ubiquitous archaea in the oceans, including Thaumarchaeota, Marine Group II Euryarchaeota, and others. These findings, combined with the recent discovery that archaeal viruses mediate a rapid turnover of thaumarchaea in the deep sea ecosystems, illuminate the prominent role of these viruses in the biosphere. Finally, we discuss the origins and evolution of archaeal viruses and emphasize the evolutionary relationships between viruses and non-viral mobile genetic elements. Further exploration of the archaeal virus diversity as well as functional studies on diverse virus-host systems are bound to uncover novel, unexpected facets of the archaeal virome.
Collapse
Affiliation(s)
- Mart Krupovic
- Department of Microbiology, Institut Pasteur, 25 rue du Dr. Roux, Paris 75015, Paris, France.
| | | | - Jaime Iranzo
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA
| | - David Prangishvili
- Department of Microbiology, Institut Pasteur, 25 rue du Dr. Roux, Paris 75015, Paris, France
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA
| |
Collapse
|
21
|
Abstract
One of the most prominent features of archaea is the extraordinary diversity of their DNA viruses. Many archaeal viruses differ substantially in morphology from bacterial and eukaryotic viruses and represent unique virus families. The distinct nature of archaeal viruses also extends to the gene composition and architectures of their genomes and the properties of the proteins that they encode. Environmental research has revealed prominent roles of archaeal viruses in influencing microbial communities in ocean ecosystems, and recent metagenomic studies have uncovered new groups of archaeal viruses that infect extremophiles and mesophiles in diverse habitats. In this Review, we summarize recent advances in our understanding of the genomic and morphological diversity of archaeal viruses and the molecular biology of their life cycles and virus-host interactions, including interactions with archaeal CRISPR-Cas systems. We also examine the potential origins and evolution of archaeal viruses and discuss their place in the global virosphere.
Collapse
|