51
|
Zhang H, Chen P, Ma H, Woińska M, Liu D, Cooper DR, Peng G, Peng Y, Deng L, Minor W, Zheng H. virusMED: an atlas of hotspots of viral proteins. IUCRJ 2021; 8:S2052252521009076. [PMID: 34614039 PMCID: PMC8479994 DOI: 10.1107/s2052252521009076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 09/02/2021] [Indexed: 06/13/2023]
Abstract
Metal binding sites, antigen epitopes and drug binding sites are the hotspots in viral proteins that control how viruses interact with their hosts. virusMED (virus Metal binding sites, Epitopes and Drug binding sites) is a rich internet application based on a database of atomic interactions around hotspots in 7041 experimentally determined viral protein structures. 25306 hotspots from 805 virus strains from 75 virus families were characterized, including influenza, HIV-1 and SARS-CoV-2 viruses. Just as Google Maps organizes and annotates points of interest, virusMED presents the positions of individual hotspots on each viral protein and creates an atlas upon which newly characterized functional sites can be placed as they are being discovered. virusMED contains an extensive set of annotation tags about the virus species and strains, viral hosts, viral proteins, metal ions, specific antibodies and FDA-approved drugs, which permits rapid screening of hotspots on viral proteins tailored to a particular research problem. The virusMED portal (https://virusmed.biocloud.top) can serve as a window to a valuable resource for many areas of virus research and play a critical role in the rational design of new preventative and therapeutic agents targeting viral infections.
Collapse
Affiliation(s)
- HuiHui Zhang
- Hunan University College of Biology, Bioinformatics Center, Hunan 410082, People’s Republic of China
| | - Pei Chen
- Hunan University College of Biology, Bioinformatics Center, Hunan 410082, People’s Republic of China
| | - Haojie Ma
- Hunan University College of Biology, Bioinformatics Center, Hunan 410082, People’s Republic of China
| | - Magdalena Woińska
- Biological and Chemical Research Centre, Chemistry Department, University of Warsaw, Żwirki i Wigury 101, 02-089 Warsaw, Poland
- University of Virginia, Charlottesville, VA 22908, USA
| | - Dejian Liu
- Hunan University College of Biology, Bioinformatics Center, Hunan 410082, People’s Republic of China
| | | | - Guo Peng
- Hunan University College of Biology, Bioinformatics Center, Hunan 410082, People’s Republic of China
| | - Yousong Peng
- Hunan University College of Biology, Bioinformatics Center, Hunan 410082, People’s Republic of China
| | - Lei Deng
- Hunan University College of Biology, Bioinformatics Center, Hunan 410082, People’s Republic of China
- Hunan Provincial Key Laboratory of Medical Virology, People’s Republic of China
| | - Wladek Minor
- University of Virginia, Charlottesville, VA 22908, USA
| | - Heping Zheng
- Hunan University College of Biology, Bioinformatics Center, Hunan 410082, People’s Republic of China
- Hunan Provincial Key Laboratory of Medical Virology, People’s Republic of China
| |
Collapse
|
52
|
Garneau JR, Legrand V, Marbouty M, Press MO, Vik DR, Fortier LC, Sullivan MB, Bikard D, Monot M. High-throughput identification of viral termini and packaging mechanisms in virome datasets using PhageTermVirome. Sci Rep 2021; 11:18319. [PMID: 34526611 PMCID: PMC8443750 DOI: 10.1038/s41598-021-97867-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Accepted: 08/27/2021] [Indexed: 11/13/2022] Open
Abstract
Viruses that infect bacteria (phages) are increasingly recognized for their importance in diverse ecosystems but identifying and annotating them in large-scale sequence datasets is still challenging. Although efficient scalable virus identification tools are emerging, defining the exact ends (termini) of phage genomes is still particularly difficult. The proper identification of termini is crucial, as it helps in characterizing the packaging mechanism of bacteriophages and provides information on various aspects of phage biology. Here, we introduce PhageTermVirome (PTV) as a tool for the easy and rapid high-throughput determination of phage termini and packaging mechanisms using modern large-scale metagenomics datasets. We successfully tested the PTV algorithm on a mock virome dataset and then used it on two real virome datasets to achieve the rapid identification of more than 100 phage termini and packaging mechanisms, with just a few hours of computing time. Because PTV allows the identification of free fully formed viral particles (by recognition of termini present only in encapsidated DNA), it can also complement other virus identification softwares to predict the true viral origin of contigs in viral metagenomics datasets. PTV is a novel and unique tool for high-throughput characterization of phage genomes, including phage termini identification and characterization of genome packaging mechanisms. This software should help researchers better visualize, map and study the virosphere. PTV is freely available for downloading and installation at https://gitlab.pasteur.fr/vlegrand/ptv.
Collapse
Affiliation(s)
| | - Véronique Legrand
- Infrastructure et Ingénierie Scientifique, Institut Pasteur, 75015, Paris, France
| | - Martial Marbouty
- Institut Pasteur, Unité Régulation Spatiale des Génomes, UMR 3525, CNRS, 75015, Paris, France
| | | | - Dean R Vik
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
| | - Louis-Charles Fortier
- Faculty of Medicine and Health Sciences, Department of Microbiology and Infectious Diseases, Université de Sherbrooke, Sherbrooke, QC, J1E 4K8, Canada
| | - Matthew B Sullivan
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
| | - David Bikard
- Département de Microbiologie, Institut Pasteur, Groupe Biologie de Synthèse, 75015, Paris, France
| | - Marc Monot
- Biomics Platform, C2RT, Institut Pasteur, 75015, Paris, France.
| |
Collapse
|
53
|
Lee YJ, Dai N, Müller SI, Guan C, Parker MJ, Fraser ME, Walsh SE, Sridar J, Mulholland A, Nayak K, Sun Z, Lin YC, Comb DG, Marks K, Gonzalez R, Dowling DP, Bandarian V, Saleh L, Corrêa IR, Weigele PR. Pathways of thymidine hypermodification. Nucleic Acids Res 2021; 50:3001-3017. [PMID: 34522950 PMCID: PMC8989533 DOI: 10.1093/nar/gkab781] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 08/25/2021] [Accepted: 09/12/2021] [Indexed: 11/15/2022] Open
Abstract
The DNAs of bacterial viruses are known to contain diverse, chemically complex modifications to thymidine that protect them from the endonuclease-based defenses of their cellular hosts, but whose biosynthetic origins are enigmatic. Up to half of thymidines in the Pseudomonas phage M6, the Salmonella phage ViI, and others, contain exotic chemical moieties synthesized through the post-replicative modification of 5-hydroxymethyluridine (5-hmdU). We have determined that these thymidine hypermodifications are derived from free amino acids enzymatically installed on 5-hmdU. These appended amino acids are further sculpted by various enzyme classes such as radical SAM isomerases, PLP-dependent decarboxylases, flavin-dependent lyases and acetyltransferases. The combinatorial permutations of thymidine hypermodification genes found in viral metagenomes from geographically widespread sources suggests an untapped reservoir of chemical diversity in DNA hypermodifications.
Collapse
Affiliation(s)
- Yan-Jiun Lee
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Nan Dai
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Stephanie I Müller
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Chudi Guan
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Mackenzie J Parker
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Morgan E Fraser
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Shannon E Walsh
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Janani Sridar
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Andrew Mulholland
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Krutika Nayak
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Zhiyi Sun
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Yu-Cheng Lin
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Donald G Comb
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Katherine Marks
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Reyaz Gonzalez
- Chemistry Department, University of Massachusetts Boston, 100 William T. Morrissey Blvd. Boston, MA02125, USA
| | - Daniel P Dowling
- Chemistry Department, University of Massachusetts Boston, 100 William T. Morrissey Blvd. Boston, MA02125, USA
| | - Vahe Bandarian
- Department of Chemistry, University of Utah, 315 South 1400 East Salt Lake City, UT 84112, USA
| | - Lana Saleh
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Ivan R Corrêa
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| | - Peter R Weigele
- Research Department, New England Biolabs, Inc., 240 County Road, Ipswich, MA01938, USA
| |
Collapse
|
54
|
Chen Y, Wang Y, Paez-Espino D, Polz MF, Zhang T. Prokaryotic viruses impact functional microorganisms in nutrient removal and carbon cycle in wastewater treatment plants. Nat Commun 2021; 12:5398. [PMID: 34518545 PMCID: PMC8438041 DOI: 10.1038/s41467-021-25678-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Accepted: 08/24/2021] [Indexed: 11/09/2022] Open
Abstract
As one of the largest biotechnological applications, activated sludge (AS) systems in wastewater treatment plants (WWTPs) harbor enormous viruses, with 10-1,000-fold higher concentrations than in natural environments. However, the compositional variation and host-connections of AS viruses remain poorly explored. Here, we report a catalogue of ~50,000 prokaryotic viruses from six WWTPs, increasing the number of described viral species of AS by 23-fold, and showing the very high viral diversity which is largely unknown (98.4-99.6% of total viral contigs). Most viral genera are represented in more than one AS system with 53 identified across all. Viral infection widely spans 8 archaeal and 58 bacterial phyla, linking viruses with aerobic/anaerobic heterotrophs, and other functional microorganisms controlling nitrogen/phosphorous removal. Notably, Mycobacterium, notorious for causing AS foaming, is associated with 402 viral genera. Our findings expand the current AS virus catalogue and provide reference for the phage treatment to control undesired microorganisms in WWTPs.
Collapse
Affiliation(s)
- Yiqiang Chen
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong, China
| | - Yulin Wang
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong, China
| | - David Paez-Espino
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Martin F Polz
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
- Division of Microbial Ecology, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria
| | - Tong Zhang
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong, China.
| |
Collapse
|
55
|
Wu R, Davison MR, Gao Y, Nicora CD, Mcdermott JE, Burnum-Johnson KE, Hofmockel KS, Jansson JK. Moisture modulates soil reservoirs of active DNA and RNA viruses. Commun Biol 2021; 4:992. [PMID: 34446837 PMCID: PMC8390657 DOI: 10.1038/s42003-021-02514-2] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 07/18/2021] [Indexed: 02/07/2023] Open
Abstract
Soil is known to harbor viruses, but the majority are uncharacterized and their responses to environmental changes are unknown. Here, we used a multi-omics approach (metagenomics, metatranscriptomics and metaproteomics) to detect active DNA viruses and RNA viruses in a native prairie soil and to determine their responses to extremes in soil moisture. The majority of transcribed DNA viruses were bacteriophage, but some were assigned to eukaryotic hosts, mainly insects. We also demonstrated that higher soil moisture increased transcription of a subset of DNA viruses. Metaproteome data validated that the specific viral transcripts were translated into proteins, including chaperonins known to be essential for virion replication and assembly. The soil viral chaperonins were phylogenetically distinct from previously described marine viral chaperonins. The soil also had a high abundance of RNA viruses, with highest representation of Reoviridae. Leviviridae were the most diverse RNA viruses in the samples, with higher amounts in wet soil. This study demonstrates that extreme shifts in soil moisture have dramatic impacts on the composition, activity and potential functions of both DNA and RNA soil viruses.
Collapse
Affiliation(s)
- Ruonan Wu
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Michelle R Davison
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Yuqian Gao
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Carrie D Nicora
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Jason E Mcdermott
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Kristin E Burnum-Johnson
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Kirsten S Hofmockel
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
- Department of Agronomy, Iowa State University, Ames, IA, USA
| | - Janet K Jansson
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA.
| |
Collapse
|
56
|
Zayulina KS, Elcheninov AG, Toshchakov SV, Kochetkova TV, Novikov AA, Blamey JM, Kublanov IV. Novel hyperthermophilic crenarchaeon Infirmifilum lucidum gen. nov. sp. nov., reclassification of Thermofilum uzonense as Infirmifilum uzonense comb. nov. and assignment of the family Thermofilaceae to the order Thermofilales ord. nov. Syst Appl Microbiol 2021; 44:126230. [PMID: 34293647 DOI: 10.1016/j.syapm.2021.126230] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 06/25/2021] [Accepted: 07/02/2021] [Indexed: 02/01/2023]
Abstract
A novel hyperthermophilic crenarchaeon, strain 3507LTT, was isolated from a terrestrial hot spring near Tinguiririca volcano, Chile. Cells were non-motile thin, slightly curved filamentous rods. It grew at 73-93 °C and pH range of 5 to 7.5 with an optimum at 85 °C and pH 6.0-6.7. The presence of culture broth filtrate of another hyperthemophilic archaeon as well as yeast extract was obligatory for growth of the novel isolate. Strain 3507LTT is an anaerobic chemoorganoheterotroph, fermenting monosaccharides, disaccharides and polysaccharides (lichenan, starch, xanthan gum, xyloglucan, alpha-cellulose and amorphous cellulose). No growth stimulation was detected when nitrate, thiosulfate, selenate or elemental sulfur were added as the electron acceptors. The complete genome of strain 3507LTT consisted of a single circular chromosome with size of 1.63 Mbp. The DNA G+C content was 53.9%. According to the 16S rRNA gene sequence as well as conserved protein sequences phylogenetic analyses, strain 3507LTT together with Thermofilum uzonense formed a separate cluster within a Thermofilaceae family (Thermoproteales/Thermoprotei/Crenarchaeota). Based on phenotypic characteristics, phylogeny as well as AAI comparisons, a novel genus and species Infirmifilum lucidum strain 3507LTT (=VKM B-3376T = KCTC 15797T) gen. nov. sp. nov. was proposed. Its closest relative, Thermofilum uzonense strain 1807-2T should be reclassified as Infirmifilum uzonense strain 1807-2T comb. nov. Finally, based on phylogenomic and comparative genome analyses of representatives of Thermofilaceae family and other representatives of Thermoproteales order, a proposal of transfer of the family Thermofilaceae into a separate order Thermofilales ord. nov. was made.
Collapse
Affiliation(s)
- Kseniya S Zayulina
- Winogradsky Institute of Microbiology, Research Center of Biotechnology RAS, 7/2 Prospekt 60-letiya Oktyabrya, 117312 Moscow, Russia.
| | - Alexander G Elcheninov
- Winogradsky Institute of Microbiology, Research Center of Biotechnology RAS, 7/2 Prospekt 60-letiya Oktyabrya, 117312 Moscow, Russia
| | - Stepan V Toshchakov
- Winogradsky Institute of Microbiology, Research Center of Biotechnology RAS, 7/2 Prospekt 60-letiya Oktyabrya, 117312 Moscow, Russia
| | - Tatiana V Kochetkova
- Winogradsky Institute of Microbiology, Research Center of Biotechnology RAS, 7/2 Prospekt 60-letiya Oktyabrya, 117312 Moscow, Russia
| | - Andrei A Novikov
- Gubkin University, 65-1, Leninsky prospect, 119991 Moscow, Russia
| | - Jenny M Blamey
- Fundacion Biociencia, Jose Domingo Cañas, 2280 Ñuñoa, Santiago, Chile; Facultad de Química y Biología, Universidad de Santiago de Chile, Alameda 3363, Estación Central, Santiago, Chile
| | - Ilya V Kublanov
- Winogradsky Institute of Microbiology, Research Center of Biotechnology RAS, 7/2 Prospekt 60-letiya Oktyabrya, 117312 Moscow, Russia
| |
Collapse
|
57
|
Jahn MT, Lachnit T, Markert SM, Stigloher C, Pita L, Ribes M, Dutilh BE, Hentschel U. Lifestyle of sponge symbiont phages by host prediction and correlative microscopy. THE ISME JOURNAL 2021; 15:2001-2011. [PMID: 33603147 PMCID: PMC8245591 DOI: 10.1038/s41396-021-00900-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 12/22/2020] [Accepted: 01/18/2021] [Indexed: 01/31/2023]
Abstract
Bacteriophages (phages) are ubiquitous elements in nature, but their ecology and role in animals remains little understood. Sponges represent the oldest known extant animal-microbe symbiosis and are associated with dense and diverse microbial consortia. Here we investigate the tripartite interaction between phages, bacterial symbionts, and the sponge host. We combined imaging and bioinformatics to tackle important questions on who the phage hosts are and what the replication mode and spatial distribution within the animal is. This approach led to the discovery of distinct phage-microbe infection networks in sponge versus seawater microbiomes. A new correlative in situ imaging approach ('PhageFISH-CLEM') localised phages within bacterial symbiont cells, but also within phagocytotically active sponge cells. We postulate that the phagocytosis of free virions by sponge cells modulates phage-bacteria ratios and ultimately controls infection dynamics. Prediction of phage replication strategies indicated a distinct pattern, where lysogeny dominates the sponge microbiome, likely fostered by sponge host-mediated virion clearance, while lysis dominates in seawater. Collectively, this work provides new insights into phage ecology within sponges, highlighting the importance of tripartite animal-phage-bacterium interplay in holobiont functioning. We anticipate that our imaging approach will be instrumental to further understanding of viral distribution and cellular association in animal hosts.
Collapse
Affiliation(s)
- M T Jahn
- GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany.
- Department of Zoology and Department of Biochemistry, University of Oxford, Oxford, UK.
| | - T Lachnit
- Christian-Albrechts-University of Kiel, Kiel, Germany
| | - S M Markert
- Imaging Core Facility, Biocenter, University of Würzburg, Würzburg, Germany
| | - C Stigloher
- Imaging Core Facility, Biocenter, University of Würzburg, Würzburg, Germany
| | - L Pita
- GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
| | - M Ribes
- Institut de Ciències del Mar (ICM-CSIC), Barcelona, Spain
| | - B E Dutilh
- Theoretical Biology and Bioinformatics, Utrecht University, Utrecht, The Netherlands
| | - U Hentschel
- GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
- Christian-Albrechts-University of Kiel, Kiel, Germany
| |
Collapse
|
58
|
Danko D, Bezdan D, Afshin EE, Ahsanuddin S, Bhattacharya C, Butler DJ, Chng KR, Donnellan D, Hecht J, Jackson K, Kuchin K, Karasikov M, Lyons A, Mak L, Meleshko D, Mustafa H, Mutai B, Neches RY, Ng A, Nikolayeva O, Nikolayeva T, Png E, Ryon KA, Sanchez JL, Shaaban H, Sierra MA, Thomas D, Young B, Abudayyeh OO, Alicea J, Bhattacharyya M, Blekhman R, Castro-Nallar E, Cañas AM, Chatziefthimiou AD, Crawford RW, De Filippis F, Deng Y, Desnues C, Dias-Neto E, Dybwad M, Elhaik E, Ercolini D, Frolova A, Gankin D, Gootenberg JS, Graf AB, Green DC, Hajirasouliha I, Hastings JJA, Hernandez M, Iraola G, Jang S, Kahles A, Kelly FJ, Knights K, Kyrpides NC, Łabaj PP, Lee PKH, Leung MHY, Ljungdahl PO, Mason-Buck G, McGrath K, Meydan C, Mongodin EF, Moraes MO, Nagarajan N, Nieto-Caballero M, Noushmehr H, Oliveira M, Ossowski S, Osuolale OO, Özcan O, Paez-Espino D, Rascovan N, Richard H, Rätsch G, Schriml LM, Semmler T, Sezerman OU, Shi L, Shi T, Siam R, Song LH, Suzuki H, Court DS, Tighe SW, Tong X, Udekwu KI, Ugalde JA, Valentine B, Vassilev DI, Vayndorf EM, Velavan TP, Wu J, Zambrano MM, Zhu J, Zhu S, Mason CE. A global metagenomic map of urban microbiomes and antimicrobial resistance. Cell 2021; 184:3376-3393.e17. [PMID: 34043940 PMCID: PMC8238498 DOI: 10.1016/j.cell.2021.05.002] [Citation(s) in RCA: 146] [Impact Index Per Article: 48.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 03/05/2021] [Accepted: 04/29/2021] [Indexed: 01/14/2023]
Abstract
We present a global atlas of 4,728 metagenomic samples from mass-transit systems in 60 cities over 3 years, representing the first systematic, worldwide catalog of the urban microbial ecosystem. This atlas provides an annotated, geospatial profile of microbial strains, functional characteristics, antimicrobial resistance (AMR) markers, and genetic elements, including 10,928 viruses, 1,302 bacteria, 2 archaea, and 838,532 CRISPR arrays not found in reference databases. We identified 4,246 known species of urban microorganisms and a consistent set of 31 species found in 97% of samples that were distinct from human commensal organisms. Profiles of AMR genes varied widely in type and density across cities. Cities showed distinct microbial taxonomic signatures that were driven by climate and geographic differences. These results constitute a high-resolution global metagenomic atlas that enables discovery of organisms and genes, highlights potential public health and forensic applications, and provides a culture-independent view of AMR burden in cities.
Collapse
Affiliation(s)
- David Danko
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Daniela Bezdan
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA; Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany; NGS Competence Center Tübingen (NCCT), University of Tübingen, Tübingen, Germany
| | - Evan E Afshin
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | | | - Chandrima Bhattacharya
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Daniel J Butler
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Kern Rei Chng
- Genome Institute of Singapore, A(∗)STAR, Singapore, Singapore
| | - Daisy Donnellan
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Jochen Hecht
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Katelyn Jackson
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Katerina Kuchin
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Mikhail Karasikov
- ETH Zurich, Department of Computer Science, Biomedical Informatics Group, Zurich, Switzerland; University Hospital Zurich, Biomedical Informatics Research, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Abigail Lyons
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Lauren Mak
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Dmitry Meleshko
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Harun Mustafa
- ETH Zurich, Department of Computer Science, Biomedical Informatics Group, Zurich, Switzerland; University Hospital Zurich, Biomedical Informatics Research, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Beth Mutai
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain; Kenya Medical Research Institute - Kisumu, Kisumu, Kenya
| | - Russell Y Neches
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Amanda Ng
- Genome Institute of Singapore, A(∗)STAR, Singapore, Singapore
| | | | | | - Eileen Png
- Genome Institute of Singapore, A(∗)STAR, Singapore, Singapore
| | - Krista A Ryon
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Jorge L Sanchez
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Heba Shaaban
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Maria A Sierra
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Dominique Thomas
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Ben Young
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Omar O Abudayyeh
- Massachusetts Institute of Technology, McGovern Institute for Brain Research, Cambridge, MA, USA
| | - Josue Alicea
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Malay Bhattacharyya
- Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India; Centre for Artificial Intelligence and Machine Learning, Indian Statistical Institute, Kolkata, India
| | | | - Eduardo Castro-Nallar
- Universidad Andres Bello, Center for Bioinformatics and Integrative Biology, Facultad de Ciencias de la Vida, Santiago, Chile
| | - Ana M Cañas
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Aspassia D Chatziefthimiou
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | | | - Francesca De Filippis
- Department of Agricultural Sciences, Division of Microbiology, University of Naples Federico II, Naples, Italy; Task Force on Microbiome Studies, University of Naples Federico II, Naples, Italy
| | - Youping Deng
- University of Hawaii John A. Burns School of Medicine, Honolulu, HI, USA
| | - Christelle Desnues
- Aix-Marseille Université, Mediterranean Institute of Oceanology, Université de Toulon, CNRS, IRD, UM 110, Marseille, France
| | - Emmanuel Dias-Neto
- Medical Genomics group, A.C.Camargo Cancer Center, São Paulo - SP, Brazil
| | - Marius Dybwad
- Norwegian Defence Research Establishment FFI, Kjeller, Norway
| | - Eran Elhaik
- Department of Biology, Lund University, Lund, Sweden
| | - Danilo Ercolini
- Department of Agricultural Sciences, Division of Microbiology, University of Naples Federico II, Naples, Italy; Task Force on Microbiome Studies, University of Naples Federico II, Naples, Italy
| | - Alina Frolova
- Institute of Molecular Biology and Genetics of National Academy of Sciences of Ukraine, Kyiv, Ukraine; Kyiv Academic University, Kyiv, Ukraine
| | - Dennis Gankin
- Massachusetts Institute of Technology, McGovern Institute for Brain Research, Cambridge, MA, USA
| | - Jonathan S Gootenberg
- Massachusetts Institute of Technology, McGovern Institute for Brain Research, Cambridge, MA, USA
| | | | - David C Green
- Department of Analytical, Environmental and Forensic Sciences, King's College London, London, UK
| | - Iman Hajirasouliha
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Jaden J A Hastings
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | | | - Gregorio Iraola
- Microbial Genomics Laboratory, Institut Pasteur de Montevideo, Montevideo, Uruguay; Center for Integrative Biology, Universidad Mayor, Santiago de Chile, Santiago, Chile; Wellcome Sanger Institute, Hinxton, UK
| | | | - Andre Kahles
- ETH Zurich, Department of Computer Science, Biomedical Informatics Group, Zurich, Switzerland; Kyiv Academic University, Kyiv, Ukraine; C+, Research Center in Technologies for Society, School of Engineering, Universidad del Desarrollo, Santiago, Chile
| | - Frank J Kelly
- Department of Analytical, Environmental and Forensic Sciences, King's College London, London, UK
| | - Kaymisha Knights
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Nikos C Kyrpides
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Paweł P Łabaj
- State Key Laboratory of Genetic Engineering (SKLGE) and MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China; Małopolska Centre of Biotechnology, Jagiellonian University, Kraków, Poland; Boku University Viennna, Vienna, Austria
| | - Patrick K H Lee
- School of Energy and Environment, City University of Hong Kong, Hong Kong SAR, China
| | - Marcus H Y Leung
- School of Energy and Environment, City University of Hong Kong, Hong Kong SAR, China
| | - Per O Ljungdahl
- Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden
| | - Gabriella Mason-Buck
- Department of Analytical, Environmental and Forensic Sciences, King's College London, London, UK
| | - Ken McGrath
- Microba, 388 Queen St, Brisbane City, QLD 4000, Australia
| | - Cem Meydan
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Emmanuel F Mongodin
- University of Maryland School of Medicine, Institute for Genome Sciences, Baltimore, MD, USA
| | | | | | | | - Houtan Noushmehr
- University of São Paulo, Ribeirão Preto Medical School, Ribeirão Preto - SP, Brazil
| | - Manuela Oliveira
- Instituto de Patologia e Imunologia Molecular da Universidade do Porto, Porto, Portugal
| | - Stephan Ossowski
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain; Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany; NGS Competence Center Tübingen (NCCT), University of Tübingen, Tübingen, Germany
| | - Olayinka O Osuolale
- Applied Environmental Metagenomics and Infectious Diseases Research (AEMIDR), Department of Biological Sciences, Elizade University, Ilara-Mokin, Nigeria
| | - Orhan Özcan
- Acibadem Mehmet Ali Aydınlar University, Istanbul, Turkey
| | - David Paez-Espino
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nicolás Rascovan
- Microbial Paleogenomics Unit, Institut Pasteur, CNRS UMR2000, Paris 75015, France
| | - Hugues Richard
- Sorbonne University, Faculty of Science, Institute of Biology Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France; Robert Koch Institute, Berlin, Germany
| | - Gunnar Rätsch
- ETH Zurich, Department of Computer Science, Biomedical Informatics Group, Zurich, Switzerland; University Hospital Zurich, Biomedical Informatics Research, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Lynn M Schriml
- University of Maryland School of Medicine, Institute for Genome Sciences, Baltimore, MD, USA
| | | | | | - Leming Shi
- Center for Pharmacogenomics, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China; State Key Laboratory of Genetic Engineering (SKLGE) and MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China
| | - Tieliu Shi
- The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Rania Siam
- University of Medicine and Health Sciences, St. Kitts, West Indies and American University in Cairo, Cairo, Egypt
| | - Le Huu Song
- 108 Military Central Hospital, Hanoi, Vietnam; Vietnamese-German Center for Medical Research (VG-CARE), Hanoi, Vietnam
| | | | - Denise Syndercombe Court
- Department of Analytical, Environmental and Forensic Sciences, King's College London, London, UK
| | | | - Xinzhao Tong
- School of Energy and Environment, City University of Hong Kong, Hong Kong SAR, China
| | - Klas I Udekwu
- Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden; SciLife EVP, Department of Aquatic Sciences Assessment, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Juan A Ugalde
- Millennium Initiative for Collaborative Research on Bacterial Resistance, Santiago, Chile; C+, Research Center in Technologies for Society, School of Engineering, Universidad del Desarrollo, Santiago, Chile
| | - Brandon Valentine
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Dimitar I Vassilev
- Faculty of Mathematics and Informatics, Sofia University "St. Kliment Ohridski," Sofia, Bulgaria
| | - Elena M Vayndorf
- Institute of Arctic Biology, University of Alaska, Fairbanks, Fairbanks, AK, USA
| | - Thirumalaisamy P Velavan
- Institute of Tropical Medicine, Univeristätsklinikum Tübingen, Tübingen, Germany; Faculty of Medicine, Duy Tan University, Da Nang, Vietnam
| | - Jun Wu
- The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | | | - Jifeng Zhu
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Sibo Zhu
- State Key Laboratory of Genetic Engineering (SKLGE) and MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China; Department of Epidemiology, School of Public Health, Fudan University, Shanghai, China
| | - Christopher E Mason
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA; The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, USA.
| |
Collapse
|
59
|
Coutinho FH, Zaragoza-Solas A, López-Pérez M, Barylski J, Zielezinski A, Dutilh BE, Edwards R, Rodriguez-Valera F. RaFAH: Host prediction for viruses of Bacteria and Archaea based on protein content. PATTERNS 2021; 2:100274. [PMID: 34286299 PMCID: PMC8276007 DOI: 10.1016/j.patter.2021.100274] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 11/23/2020] [Accepted: 05/07/2021] [Indexed: 02/06/2023]
Abstract
Culture-independent approaches have recently shed light on the genomic diversity of viruses of prokaryotes. One fundamental question when trying to understand their ecological roles is: which host do they infect? To tackle this issue we developed a machine-learning approach named Random Forest Assignment of Hosts (RaFAH), that uses scores to 43,644 protein clusters to assign hosts to complete or fragmented genomes of viruses of Archaea and Bacteria. RaFAH displayed performance comparable with that of other methods for virus-host prediction in three different benchmarks encompassing viruses from RefSeq, single amplified genomes, and metagenomes. RaFAH was applied to assembled metagenomic datasets of uncultured viruses from eight different biomes of medical, biotechnological, and environmental relevance. Our analyses led to the identification of 537 sequences of archaeal viruses representing unknown lineages, whose genomes encode novel auxiliary metabolic genes, shedding light on how these viruses interfere with the host molecular machinery. RaFAH is available at https://sourceforge.net/projects/rafah/. RaFAH was developed to predict the hosts of viruses of Bacteria and Archaea RaFAH displayed comparable or superior performance to other host-prediction tools RaFAH performed well across viromes from eight different ecosystems RaFAH identified hundreds of genomic sequences as derived from viruses of Archaea
Viruses that infect Bacteria and Archaea are ubiquitous and extremely abundant. Recent advances have led to the discovery of many thousands of complete and partial genomes of these biological entities. Understanding the biology of these viruses and how they influence their ecosystems depends on knowing which hosts they infect. We developed a tool that uses data from complete or fragmented genomes to predict the hosts of viruses using a machine-learning approach. Our tool, RaFAH, displayed performance comparable with or superior to that of other host-prediction tools. In addition, it identified hundreds of sequences as derived from the genomes of viruses of Archaea, which are one of the least characterized fractions of the global virosphere.
Collapse
Affiliation(s)
- Felipe Hernandes Coutinho
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, Aptdo. 18., Ctra. Alicante-Valencia N-332, s/n, San Juan de Alicante, 03550 Alicante, Spain
| | - Asier Zaragoza-Solas
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, Aptdo. 18., Ctra. Alicante-Valencia N-332, s/n, San Juan de Alicante, 03550 Alicante, Spain
| | - Mario López-Pérez
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, Aptdo. 18., Ctra. Alicante-Valencia N-332, s/n, San Juan de Alicante, 03550 Alicante, Spain
| | - Jakub Barylski
- Molecular Virology Research Unit, Faculty of Biology, Adam Mickiewicz University Poznan, 61-614 Poznan, Poland
| | - Andrzej Zielezinski
- Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University Poznan, 61-614 Poznan, Poland
| | - Bas E Dutilh
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboud University Medical Centre/Radboud Institute for Molecular Life Sciences, 6525 GA Nijmegen, the Netherlands.,Theoretical Biology and Bioinformatics, Science for Life, Utrecht University (UU), 3584 CH Utrecht, the Netherlands
| | - Robert Edwards
- College of Science and Engineering, Flinders University, Bedford Park, SA 5042, Australia
| | - Francisco Rodriguez-Valera
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, Aptdo. 18., Ctra. Alicante-Valencia N-332, s/n, San Juan de Alicante, 03550 Alicante, Spain.,Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| |
Collapse
|
60
|
Ecology of inorganic sulfur auxiliary metabolism in widespread bacteriophages. Nat Commun 2021; 12:3503. [PMID: 34108477 PMCID: PMC8190135 DOI: 10.1038/s41467-021-23698-5] [Citation(s) in RCA: 80] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 05/12/2021] [Indexed: 02/05/2023] Open
Abstract
Microbial sulfur metabolism contributes to biogeochemical cycling on global scales. Sulfur metabolizing microbes are infected by phages that can encode auxiliary metabolic genes (AMGs) to alter sulfur metabolism within host cells but remain poorly characterized. Here we identified 191 phages derived from twelve environments that encoded 227 AMGs for oxidation of sulfur and thiosulfate (dsrA, dsrC/tusE, soxC, soxD and soxYZ). Evidence for retention of AMGs during niche-differentiation of diverse phage populations provided evidence that auxiliary metabolism imparts measurable fitness benefits to phages with ramifications for ecosystem biogeochemistry. Gene abundance and expression profiles of AMGs suggested significant contributions by phages to sulfur and thiosulfate oxidation in freshwater lakes and oceans, and a sensitive response to changing sulfur concentrations in hydrothermal environments. Overall, our study provides fundamental insights on the distribution, diversity, and ecology of phage auxiliary metabolism associated with sulfur and reinforces the necessity of incorporating viral contributions into biogeochemical configurations.
Collapse
|
61
|
Berg M, Goudeau D, Olmsted C, McMahon KD, Yitbarek S, Thweatt JL, Bryant DA, Eloe-Fadrosh EA, Malmstrom RR, Roux S. Host population diversity as a driver of viral infection cycle in wild populations of green sulfur bacteria with long standing virus-host interactions. THE ISME JOURNAL 2021; 15:1569-1584. [PMID: 33452481 PMCID: PMC8163819 DOI: 10.1038/s41396-020-00870-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Revised: 09/29/2020] [Accepted: 12/07/2020] [Indexed: 01/29/2023]
Abstract
Temperate phages are viruses of bacteria that can establish two types of infection: a lysogenic infection in which the virus replicates with the host cell without producing virions, and a lytic infection where the host cell is eventually destroyed, and new virions are released. While both lytic and lysogenic infections are routinely observed in the environment, the ecological and evolutionary processes regulating these viral dynamics are still not well understood, especially for uncultivated virus-host pairs. Here, we characterized the long-term dynamics of uncultivated viruses infecting green sulfur bacteria (GSB) in a model freshwater lake (Trout Bog Lake, TBL). As no GSB virus has been formally described yet, we first used two complementary approaches to identify new GSB viruses from TBL; one in vitro based on flow cytometry cell sorting, the other in silico based on CRISPR spacer sequences. We then took advantage of existing TBL metagenomes covering the 2005-2018 period to examine the interactions between GSB and their viruses across years and seasons. From our data, GSB populations in TBL were constantly associated with at least 2-8 viruses each, including both lytic and temperate phages. The dominant GSB population in particular was consistently associated with two prophages with a nearly 100% infection rate for >10 years. We illustrate with a theoretical model that such an interaction can be stable given a low, but persistent, level of prophage induction in low-diversity host populations. Overall, our data suggest that lytic and lysogenic viruses can readily co-infect the same host population, and that host strain-level diversity might be an important factor controlling virus-host dynamics including lytic/lysogeny switch.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Simon Roux
- Joint Genome Institute, Berkeley, CA, USA.
| |
Collapse
|
62
|
Roux S, Paul BG, Bagby SC, Nayfach S, Allen MA, Attwood G, Cavicchioli R, Chistoserdova L, Gruninger RJ, Hallam SJ, Hernandez ME, Hess M, Liu WT, McAllister TA, O'Malley MA, Peng X, Rich VI, Saleska SR, Eloe-Fadrosh EA. Ecology and molecular targets of hypermutation in the global microbiome. Nat Commun 2021; 12:3076. [PMID: 34031405 PMCID: PMC8144416 DOI: 10.1038/s41467-021-23402-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 04/27/2021] [Indexed: 02/04/2023] Open
Abstract
Changes in the sequence of an organism's genome, i.e., mutations, are the raw material of evolution. The frequency and location of mutations can be constrained by specific molecular mechanisms, such as diversity-generating retroelements (DGRs). DGRs have been characterized from cultivated bacteria and bacteriophages, and perform error-prone reverse transcription leading to mutations being introduced in specific target genes. DGR loci were also identified in several metagenomes, but the ecological roles and evolutionary drivers of these DGRs remain poorly understood. Here, we analyze a dataset of >30,000 DGRs from public metagenomes, establish six major lineages of DGRs including three primarily encoded by phages and seemingly used to diversify host attachment proteins, and demonstrate that DGRs are broadly active and responsible for >10% of all amino acid changes in some organisms. Overall, these results highlight the constraints under which DGRs evolve, and elucidate several distinct roles these elements play in natural communities.
Collapse
Affiliation(s)
- Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Blair G Paul
- Marine Biological Laboratory, Woods Hole, MA, USA
| | - Sarah C Bagby
- Department of Biology, Case Western Reserve University, Cleveland, OH, USA
| | - Stephen Nayfach
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Graeme Attwood
- AgResearch Limited, Grasslands Research Centre, Palmerston North, New Zealand
| | | | | | - Robert J Gruninger
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, Lethbridge, Alberta, Canada
| | - Steven J Hallam
- Department of Microbiology and Immunology, University of British Columbia, Vancouver, Canada
- Graduate Program in Bioinformatics, University of British Columbia, Genome Sciences Centre, Vancouver, Canada
- Genome Science and Technology Program, University of British Columbia, Vancouver, Canada
- Life Sciences Institute, University of British Columbia, Vancouver, Canada
- ECOSCOPE Training Program, University of British Columbia, Vancouver, Canada
| | - Maria E Hernandez
- Instituto de Ecología A.C. Red de Manejo Biotechnológico de Recursos. Xalapa, Veracruz, México
| | | | - Wen-Tso Liu
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Tim A McAllister
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, Lethbridge, Alberta, Canada
| | - Michelle A O'Malley
- Department of Chemical Engineering, University of California Santa Barbara, Santa Barbara, CA, USA
| | - Xuefeng Peng
- Marine Science Institute, University of California Santa Barbara, Santa Barbara, CA, USA
| | | | | | - Emiley A Eloe-Fadrosh
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
63
|
Nayfach S, Roux S, Seshadri R, Udwary D, Varghese N, Schulz F, Wu D, Paez-Espino D, Chen IM, Huntemann M, Palaniappan K, Ladau J, Mukherjee S, Reddy TBK, Nielsen T, Kirton E, Faria JP, Edirisinghe JN, Henry CS, Jungbluth SP, Chivian D, Dehal P, Wood-Charlson EM, Arkin AP, Tringe SG, Visel A, Woyke T, Mouncey NJ, Ivanova NN, Kyrpides NC, Eloe-Fadrosh EA. A genomic catalog of Earth's microbiomes. Nat Biotechnol 2021; 39:499-509. [PMID: 33169036 PMCID: PMC8041624 DOI: 10.1038/s41587-020-0718-6] [Citation(s) in RCA: 348] [Impact Index Per Article: 116.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2019] [Accepted: 09/28/2020] [Indexed: 01/02/2023]
Abstract
The reconstruction of bacterial and archaeal genomes from shotgun metagenomes has enabled insights into the ecology and evolution of environmental and host-associated microbiomes. Here we applied this approach to >10,000 metagenomes collected from diverse habitats covering all of Earth's continents and oceans, including metagenomes from human and animal hosts, engineered environments, and natural and agricultural soils, to capture extant microbial, metabolic and functional potential. This comprehensive catalog includes 52,515 metagenome-assembled genomes representing 12,556 novel candidate species-level operational taxonomic units spanning 135 phyla. The catalog expands the known phylogenetic diversity of bacteria and archaea by 44% and is broadly available for streamlined comparative analyses, interactive exploration, metabolic modeling and bulk download. We demonstrate the utility of this collection for understanding secondary-metabolite biosynthetic potential and for resolving thousands of new host linkages to uncultivated viruses. This resource underscores the value of genome-centric approaches for revealing genomic properties of uncultivated microorganisms that affect ecosystem processes.
Collapse
Affiliation(s)
| | - Simon Roux
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | | - Dongying Wu
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | - I-Min Chen
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | | - T B K Reddy
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | | | | - Sean P Jungbluth
- DOE Joint Genome Institute, Berkeley, CA, USA
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Dylan Chivian
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Paramvir Dehal
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Adam P Arkin
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Axel Visel
- DOE Joint Genome Institute, Berkeley, CA, USA
| | - Tanja Woyke
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | |
Collapse
|
64
|
Rational Design of Profile Hidden Markov Models for Viral Classification and Discovery. Bioinformatics 2021. [DOI: 10.36255/exonpublications.bioinformatics.2021.ch9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] Open
|
65
|
Abstract
Viruses are ubiquitous and abundant in the oceans, and viral metagenomes (viromes) have been investigated extensively via several large-scale ocean sequencing projects. However, there have not been any systematic viromic studies in estuaries. Here, we investigated the viromes of the Delaware Bay and Chesapeake Bay, two Mid-Atlantic estuaries. Deep sequencing generated a total of 48,190 assembled viral sequences (>5 kb) and 26,487 viral populations (9,204 virus clusters and 17,845 singletons), including 319 circular viral contigs between 7.5 kb and 161.8 kb. Unknown viruses represented the vast majority of the dominant populations, while the composition of known viruses, such as pelagiphage and cyanophage, appeared to be relatively consistent across a wide range of salinity gradients and in different seasons. A difference between estuarine and ocean viromes was reflected by the proportions of Myoviridae, Podoviridae, Siphoviridae, Phycodnaviridae, and a few well-studied virus representatives. The difference in viral community between the Delaware Bay and Chesapeake Bay is significantly more pronounced than the difference caused by temperature or salinity, indicating strong local profiles caused by the unique ecology of each estuary. Interestingly, a viral contig similar to phages infecting Acinetobacter baumannii (“Iraqibacter”) was found to be highly abundant in the Delaware Bay but not in the Chesapeake Bay, the source of which is yet to be identified. Highly abundant viruses in both estuaries have close hits to viral sequences derived from the marine single-cell genomes or long-read single-molecule sequencing, suggesting that important viruses are still waiting to be discovered in the estuarine environment. IMPORTANCE This is the first systematic study about spatial and temporal variation of virioplankton communities in estuaries using deep metagenomics sequencing. It is among the highest-quality viromic data sets to date, showing remarkably consistent sequencing depth and quality across samples. Our results indicate that there exists a large pool of abundant and diverse viruses in estuaries that have not yet been cultivated, their genomes only available thanks to single-cell genomics or single-molecule sequencing, demonstrating the importance of these methods for viral discovery. The spatiotemporal pattern of these abundant uncultivated viruses is more variable than that of cultured viruses. Despite strong environmental gradients, season and location had surprisingly little impact on the viral community within an estuary, but we saw a significant distinction between the two estuaries and also between estuarine and open ocean viromes.
Collapse
|
66
|
Abstract
Oral bacteriophages (or phages), especially periodontal ones, constitute a growing area of interest, but research on oral phages is still in its infancy. Phages are bacterial viruses that may persist as intracellular parasitic deoxyribonucleic acid (DNA) or use bacterial metabolism to replicate and cause bacterial lysis. The microbiomes of saliva, oral mucosa, and dental plaque contain active phage virions, bacterial lysogens (ie, carrying dormant prophages), and bacterial strains containing short fragments of phage DNA. In excess of 2000 oral phages have been confirmed or predicted to infect species of the phyla Actinobacteria (>300 phages), Bacteroidetes (>300 phages), Firmicutes (>1000 phages), Fusobacteria (>200 phages), and Proteobacteria (>700 phages) and three additional phyla (few phages only). This article assesses the current knowledge of the diversity of the oral phage population and the mechanisms by which phages may impact the ecology of oral biofilms. The potential use of phage-based therapy to control major periodontal pathogens is also discussed.
Collapse
Affiliation(s)
- Szymon P Szafrański
- Department of Prosthetic Dentistry and Biomedical Materials Science, Hannover Medical School, Hannover, Germany
| | - Jørgen Slots
- Division of Periodontology, Diagnostic Sciences and Dental Hygiene, Ostrow School of Dentistry of USC, University of Southern California, Los Angeles, California, USA
| | - Meike Stiesch
- Department of Prosthetic Dentistry and Biomedical Materials Science, Hannover Medical School, Hannover, Germany
| |
Collapse
|
67
|
Nuidate T, Kuaphiriyakul A, Surachat K, Mittraparp-arthorn P. Induction and Genome Analysis of HY01, a Newly Reported Prophage from an Emerging Shrimp Pathogen Vibrio campbellii. Microorganisms 2021; 9:400. [PMID: 33671959 PMCID: PMC7919010 DOI: 10.3390/microorganisms9020400] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 02/11/2021] [Accepted: 02/12/2021] [Indexed: 12/18/2022] Open
Abstract
Vibrio campbellii is an emerging aquaculture pathogen that causes luminous vibriosis in farmed shrimp. Although prophages in various aquaculture pathogens have been widely reported, there is still limited knowledge regarding prophages in the genome of pathogenic V. campbellii. Here, we describe the full-genome sequence of a prophage named HY01, induced from the emerging shrimp pathogen V. campbellii HY01. The phage HY01 was induced by mitomycin C and was morphologically characterized as long tailed phage. V. campbellii phage HY01 is composed of 41,772 bp of dsDNA with a G+C content of 47.45%. A total of 60 open reading frames (ORFs) were identified, of which 31 could be predicted for their biological functions. Twenty seven out of 31 predicted protein coding regions were matched with several encoded proteins of various Enterobacteriaceae, Pseudomonadaceae, Vibrionaceae, and other phages of Gram-negative bacteria. Interestingly, the comparative genome analysis revealed that the phage HY01 was only distantly related to Vibrio phage Va_PF430-3_p42 of fish pathogen V. anguillarum but differed in genomic size and gene organization. The phylogenetic tree placed the phage together with Siphoviridae family. Additionally, a survey of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) spacers revealed two matching sequences between phage HY01 genome and viral spacer sequence of Vibrio spp. The spacer results combined with the synteny results suggest that the evolution of V. campbellii phage HY01 is driven by the horizontal genetic exchange between bacterial families belonging to the class of Gammaproteobacteria.
Collapse
Affiliation(s)
- Taiyeebah Nuidate
- Division of Biological Science, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand; (T.N.); (A.K.)
| | - Aphiwat Kuaphiriyakul
- Division of Biological Science, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand; (T.N.); (A.K.)
| | - Komwit Surachat
- Division of Computational Science, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand;
- Molecular Evolution and Computational Biology Research Unit, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand
| | - Pimonsri Mittraparp-arthorn
- Division of Biological Science, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand; (T.N.); (A.K.)
- Molecular Evolution and Computational Biology Research Unit, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand
| |
Collapse
|
68
|
Philippe C, Moineau S. The endless battle between phages and CRISPR-Cas systems in Streptococcus thermophilus. Biochem Cell Biol 2021; 99:397-402. [PMID: 33534660 DOI: 10.1139/bcb-2020-0593] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
This review describes the contribution of basic research on phage-bacteria interactions to the understanding of CRISPR-Cas systems and their various applications. It focuses on the natural function of CRISPR-Cas systems as adaptive defense mechanisms against mobile genetic elements such as bacteriophage genomes and plasmids. Some of the advances in the characterization of the type II-A CRISPR-Cas system of Streptococcus thermophilus and Streptococcus pyogenes led to the development of the CRISPR-Cas9 genome-editing technology. We mostly discuss the 3 stages of the CRISPR-Cas system in S. thermophilus, namely the adaptation stage, which is unique to this resistance mechanism; the CRISPR RNA biogenesis; and the DNA-cutting activity in the interference stage to protect bacteria against phages. Finally, we look into applications of CRISPR-Cas in microbiology, including overcoming limitations in genome editing.
Collapse
Affiliation(s)
- Cécile Philippe
- Département de biochimie, de microbiologie, et de bio-informatique, Faculté des sciences et de génie, Université Laval, Québec, QC G1V 0A6, Canada.,Groupe de recherche en écologie buccale, Faculté de médecine dentaire, Université Laval, Québec, QC G1V 0A6, Canada
| | - Sylvain Moineau
- Département de biochimie, de microbiologie, et de bio-informatique, Faculté des sciences et de génie, Université Laval, Québec, QC G1V 0A6, Canada.,Groupe de recherche en écologie buccale, Faculté de médecine dentaire, Université Laval, Québec, QC G1V 0A6, Canada.,Félix d'Hérelle Reference Center for Bacterial Viruses, Université Laval, Québec, QC G1V 0A6, Canada
| |
Collapse
|
69
|
Camarillo-Guerrero LF, Almeida A, Rangel-Pineros G, Finn RD, Lawley TD. Massive expansion of human gut bacteriophage diversity. Cell 2021; 184:1098-1109.e9. [PMID: 33606979 PMCID: PMC7895897 DOI: 10.1016/j.cell.2021.01.029] [Citation(s) in RCA: 260] [Impact Index Per Article: 86.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 11/02/2020] [Accepted: 01/19/2021] [Indexed: 12/25/2022]
Abstract
Bacteriophages drive evolutionary change in bacterial communities by creating gene flow networks that fuel ecological adaptions. However, the extent of viral diversity and its prevalence in the human gut remains largely unknown. Here, we introduce the Gut Phage Database, a collection of ∼142,000 non-redundant viral genomes (>10 kb) obtained by mining a dataset of 28,060 globally distributed human gut metagenomes and 2,898 reference genomes of cultured gut bacteria. Host assignment revealed that viral diversity is highest in the Firmicutes phyla and that ∼36% of viral clusters (VCs) are not restricted to a single species, creating gene flow networks across phylogenetically distinct bacterial species. Epidemiological analysis uncovered 280 globally distributed VCs found in at least 5 continents and a highly prevalent phage clade with features reminiscent of p-crAssphage. This high-quality, large-scale catalog of phage genomes will improve future virome studies and enable ecological and evolutionary analysis of human gut bacteriophages.
Collapse
Affiliation(s)
- Luis F Camarillo-Guerrero
- Host-Microbiota Interactions Laboratory, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK.
| | - Alexandre Almeida
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SA, UK; Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Guillermo Rangel-Pineros
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SA, UK; Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogota 111711, Colombia
| | - Robert D Finn
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Trevor D Lawley
- Host-Microbiota Interactions Laboratory, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK.
| |
Collapse
|
70
|
Guo J, Bolduc B, Zayed AA, Varsani A, Dominguez-Huerta G, Delmont TO, Pratama AA, Gazitúa MC, Vik D, Sullivan MB, Roux S. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. MICROBIOME 2021; 9:37. [PMID: 33522966 PMCID: PMC7852108 DOI: 10.1186/s40168-020-00990-y] [Citation(s) in RCA: 399] [Impact Index Per Article: 133.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2020] [Accepted: 12/29/2020] [Indexed: 05/19/2023]
Abstract
BACKGROUND Viruses are a significant player in many biosphere and human ecosystems, but most signals remain "hidden" in metagenomic/metatranscriptomic sequence datasets due to the lack of universal gene markers, database representatives, and insufficiently advanced identification tools. RESULTS Here, we introduce VirSorter2, a DNA and RNA virus identification tool that leverages genome-informed database advances across a collection of customized automatic classifiers to improve the accuracy and range of virus sequence detection. When benchmarked against genomes from both isolated and uncultivated viruses, VirSorter2 uniquely performed consistently with high accuracy (F1-score > 0.8) across viral diversity, while all other tools under-detected viruses outside of the group most represented in reference databases (i.e., those in the order Caudovirales). Among the tools evaluated, VirSorter2 was also uniquely able to minimize errors associated with atypical cellular sequences including eukaryotic genomes and plasmids. Finally, as the virosphere exploration unravels novel viral sequences, VirSorter2's modular design makes it inherently able to expand to new types of viruses via the design of new classifiers to maintain maximal sensitivity and specificity. CONCLUSION With multi-classifier and modular design, VirSorter2 demonstrates higher overall accuracy across major viral groups and will advance our knowledge of virus evolution, diversity, and virus-microbe interaction in various ecosystems. Source code of VirSorter2 is freely available ( https://bitbucket.org/MAVERICLab/virsorter2 ), and VirSorter2 is also available both on bioconda and as an iVirus app on CyVerse ( https://de.cyverse.org/de ). Video abstract.
Collapse
Affiliation(s)
- Jiarong Guo
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
| | - Ben Bolduc
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
| | - Ahmed A Zayed
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
| | - Arvind Varsani
- The Biodesign Center for Fundamental and Applied Microbiomics, Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, 85287, USA
- Structural Biology Research Unit, Department of Integrative Biomedical Sciences, University of Cape Town, Observatory, Cape Town, 7701, South Africa
| | | | - Tom O Delmont
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | | | | | - Dean Vik
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
| | - Matthew B Sullivan
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA.
- Civil, Environmental and Geodetic Engineering, Ohio State University, Columbus, OH, 43210, USA.
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA.
| | - Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
| |
Collapse
|
71
|
Rasmussen TS, Jakobsen RR, Castro-Mejía JL, Kot W, Thomsen AR, Vogensen FK, Nielsen DS, Hansen AK. Inter-vendor variance of enteric eukaryotic DNA viruses in specific pathogen free C57BL/6N mice. Res Vet Sci 2021; 136:1-5. [PMID: 33548686 DOI: 10.1016/j.rvsc.2021.01.022] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 01/16/2021] [Accepted: 01/26/2021] [Indexed: 02/06/2023]
Abstract
The laboratory mouse strain C57BL/6 is widely used as an animal model for various applications. It is becoming increasingly clear that the bacterial enteric community highly influences the phenotype. Eukaryotic viruses represent a sparsely investigated member of the enteric microbiome that might also affect the phenotype. We here investigated the presence of enteric eukaryotic DNA viruses (EDVs) in specific pathogen-free (SPF) C57BL/6N mice purchased from three vendors upon arrival and after being fed a low-fat diet (LFD) or high-fat diet (HFD). We detected genetic fragments of EDVs belonging to the viral families of Herpes-, Mimi-, Baculo- and Phycodnaviridae represented by two genera; Chlorovirus and Prasinovirus. The EDVs were detected in the mice upon arrival and persisted for 13 weeks. However, these signals of EDVs were only detected at notable levels in mice fed LFD from 2 out of 3 vendors, which suggested that the enteric composition of these EDVs were affected by both vendor (p < 0.003) and different dietary regimes (p < 0.013). This highlights the need of additional studies assessing the potential function of these EDVs that may influence the mouse phenotype and the reproducibility of animal studies using this C57BL/6N substrain.
Collapse
Affiliation(s)
| | | | | | - Witold Kot
- Department of Plant and Environmental Sciences, University of Copenhagen, Frederiksberg, Denmark
| | - Allan Randrup Thomsen
- Department of Immunology and Microbiology, University of Copenhagen, Copenhagen, Denmark
| | - Finn Kvist Vogensen
- Department of Food Science, University of Copenhagen, Frederiksberg, Denmark
| | | | - Axel Kornerup Hansen
- Department of Veterinary and Animal Sciences, University of Copenhagen, Frederiksberg, Denmark.
| |
Collapse
|
72
|
Pons JC, Paez-Espino D, Riera G, Ivanova N, Kyrpides NC, Llabrés M. VPF-Class: Taxonomic assignment and host prediction of uncultivated viruses based on viral protein families. Bioinformatics 2021; 37:1805-1813. [PMID: 33471063 PMCID: PMC8830756 DOI: 10.1093/bioinformatics/btab026] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 12/11/2020] [Accepted: 01/13/2021] [Indexed: 12/03/2022] Open
Abstract
Motivation Two key steps in the analysis of uncultured viruses recovered from metagenomes are the taxonomic classification of the viral sequences and the identification of putative host(s). Both steps rely mainly on the assignment of viral proteins to orthologs in cultivated viruses. Viral Protein Families (VPFs) can be used for the robust identification of new viral sequences in large metagenomics datasets. Despite the importance of VPF information for viral discovery, VPFs have not yet been explored for determining viral taxonomy and host targets. Results In this work, we classified the set of VPFs from the IMG/VR database and developed VPF-Class. VPF-Class is a tool that automates the taxonomic classification and host prediction of viral contigs based on the assignment of their proteins to a set of classified VPFs. Applying VPF-Class on 731K uncultivated virus contigs from the IMG/VR database, we were able to classify 363K contigs at the genus level and predict the host of over 461K contigs. In the RefSeq database, VPF-class reported an accuracy of nearly 100% to classify dsDNA, ssDNA and retroviruses, at the genus level, considering a membership ratio and a confidence score of 0.2. The accuracy in host prediction was 86.4%, also at the genus level, considering a membership ratio of 0.3 and a confidence score of 0.5. And, in the prophages dataset, the accuracy in host prediction was 86% considering a membership ratio of 0.6 and a confidence score of 0.8. Moreover, from the Global Ocean Virome dataset, over 817K viral contigs out of 1 million were classified. Availability and implementation The implementation of VPF-Class can be downloaded from https://github.com/biocom-uib/vpf-tools. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Joan Carles Pons
- Department of Mathematics and Computer Science, University of the Balearic Islands, Palma, 07122, Spain
| | | | - Gabriel Riera
- Department of Mathematics and Computer Science, University of the Balearic Islands, Palma, 07122, Spain
| | - Natalia Ivanova
- Department of Energy Joint Genome Institute, Berkeley, 94720, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Berkeley, 94720, USA
| | - Mercè Llabrés
- Department of Mathematics and Computer Science, University of the Balearic Islands, Palma, 07122, Spain
| |
Collapse
|
73
|
Lai S, Jia L, Subramanian B, Pan S, Zhang J, Dong Y, Chen WH, Zhao XM. mMGE: a database for human metagenomic extrachromosomal mobile genetic elements. Nucleic Acids Res 2021; 49:D783-D791. [PMID: 33074335 PMCID: PMC7778953 DOI: 10.1093/nar/gkaa869] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 09/18/2020] [Accepted: 09/24/2020] [Indexed: 12/15/2022] Open
Abstract
Extrachromosomal mobile genetic elements (eMGEs), including phages and plasmids, that can move across different microbes, play important roles in genome evolution and shaping the structure of microbial communities. However, we still know very little about eMGEs, especially their abundances, distributions and putative functions in microbiomes. Thus, a comprehensive description of eMGEs is of great utility. Here we present mMGE, a comprehensive catalog of 517 251 non-redundant eMGEs, including 92 492 plasmids and 424 759 phages, derived from diverse body sites of 66 425 human metagenomic samples. About half the eMGEs could be further grouped into 70 074 clusters using relaxed criteria (referred as to eMGE clusters below). We provide extensive annotations of the identified eMGEs including sequence characteristics, taxonomy affiliation, gene contents and their prokaryotic hosts. We also calculate the prevalence, both within and across samples for each eMGE and eMGE cluster, enabling users to see putative associations of eMGEs with human phenotypes or their distribution preferences. All eMGE records can be browsed or queried in multiple ways, such as eMGE clusters, metagenomic samples and associated hosts. The mMGE is equipped with a user-friendly interface and a BLAST server, facilitating easy access/queries to all its contents easily. mMGE is freely available for academic use at: https://mgedb.comp-sysbio.org.
Collapse
Affiliation(s)
- Senying Lai
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Longhao Jia
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Balakrishnan Subramanian
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Shaojun Pan
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Jinglong Zhang
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Yanqi Dong
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Wei-Hua Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
- College of Life Science, Henan Normal University, Xinxiang, Henan 453007, China
| | - Xing-Ming Zhao
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Ministry of Education, Shanghai 200433, China
- Research Institute of Intelligent Complex System, Fudan University, Shanghai 200433, China
| |
Collapse
|
74
|
Roux S, Páez-Espino D, Chen IMA, Palaniappan K, Ratner A, Chu K, Reddy TBK, Nayfach S, Schulz F, Call L, Neches RY, Woyke T, Ivanova NN, Eloe-Fadrosh EA, Kyrpides NC. IMG/VR v3: an integrated ecological and evolutionary framework for interrogating genomes of uncultivated viruses. Nucleic Acids Res 2021; 49:D764-D775. [PMID: 33137183 PMCID: PMC7778971 DOI: 10.1093/nar/gkaa946] [Citation(s) in RCA: 190] [Impact Index Per Article: 63.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 10/02/2020] [Accepted: 10/09/2020] [Indexed: 12/28/2022] Open
Abstract
Viruses are integral components of all ecosystems and microbiomes on Earth. Through pervasive infections of their cellular hosts, viruses can reshape microbial community structure and drive global nutrient cycling. Over the past decade, viral sequences identified from genomes and metagenomes have provided an unprecedented view of viral genome diversity in nature. Since 2016, the IMG/VR database has provided access to the largest collection of viral sequences obtained from (meta)genomes. Here, we present the third version of IMG/VR, composed of 18 373 cultivated and 2 314 329 uncultivated viral genomes (UViGs), nearly tripling the total number of sequences compared to the previous version. These clustered into 935 362 viral Operational Taxonomic Units (vOTUs), including 188 930 with two or more members. UViGs in IMG/VR are now reported as single viral contigs, integrated proviruses or genome bins, and are annotated with a new standardized pipeline including genome quality estimation using CheckV, taxonomic classification reflecting the latest ICTV update, and expanded host taxonomy prediction. The new IMG/VR interface enables users to efficiently browse, search, and select UViGs based on genome features and/or sequence similarity. IMG/VR v3 is available at https://img.jgi.doe.gov/vr, and the underlying data are available to download at https://genome.jgi.doe.gov/portal/IMG_VR.
Collapse
Affiliation(s)
- Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - David Páez-Espino
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - I-Min A Chen
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Krishna Palaniappan
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Anna Ratner
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Ken Chu
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - T B K Reddy
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Stephen Nayfach
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Frederik Schulz
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Lee Call
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Russell Y Neches
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Tanja Woyke
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Natalia N Ivanova
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Emiley A Eloe-Fadrosh
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| |
Collapse
|
75
|
Chen IMA, Chu K, Palaniappan K, Ratner A, Huang J, Huntemann M, Hajek P, Ritter S, Varghese N, Seshadri R, Roux S, Woyke T, Eloe-Fadrosh EA, Ivanova NN, Kyrpides N. The IMG/M data management and analysis system v.6.0: new tools and advanced capabilities. Nucleic Acids Res 2021; 49:D751-D763. [PMID: 33119741 PMCID: PMC7778900 DOI: 10.1093/nar/gkaa939] [Citation(s) in RCA: 262] [Impact Index Per Article: 87.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Revised: 10/04/2020] [Accepted: 10/07/2020] [Indexed: 12/22/2022] Open
Abstract
The Integrated Microbial Genomes & Microbiomes system (IMG/M: https://img.jgi.doe.gov/m/) contains annotated isolate genome and metagenome datasets sequenced at the DOE's Joint Genome Institute (JGI), submitted by external users, or imported from public sources such as NCBI. IMG v 6.0 includes advanced search functions and a new tool for statistical analysis of mixed sets of genomes and metagenome bins. The new IMG web user interface also has a new Help page with additional documentation and webinar tutorials to help users better understand how to use various IMG functions and tools for their research. New datasets have been processed with the prokaryotic annotation pipeline v.5, which includes extended protein family assignments.
Collapse
Affiliation(s)
- I-Min A Chen
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Ken Chu
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Krishnaveni Palaniappan
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Anna Ratner
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Jinghua Huang
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Marcel Huntemann
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Patrick Hajek
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Stephan Ritter
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Neha Varghese
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Rekha Seshadri
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Simon Roux
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Tanja Woyke
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Emiley A Eloe-Fadrosh
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Natalia N Ivanova
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| |
Collapse
|
76
|
Zhang J, Cook J, Nearing JT, Zhang J, Raudonis R, Glick BR, Langille MGI, Cheng Z. Harnessing the plant microbiome to promote the growth of agricultural crops. Microbiol Res 2021; 245:126690. [PMID: 33460987 DOI: 10.1016/j.micres.2020.126690] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Revised: 12/11/2020] [Accepted: 12/30/2020] [Indexed: 12/11/2022]
Abstract
The rhizosphere microbiome is composed of diverse microbial organisms, including archaea, viruses, fungi, bacteria as well as eukaryotic microorganisms, which occupy a narrow region of soil directly associated with plant roots. The interactions between these microorganisms and the plant can be commensal, beneficial or pathogenic. These microorganisms can also interact with each other, either competitively or synergistically. Promoting plant growth by harnessing the soil microbiome holds tremendous potential for providing an environmentally friendly solution to the increasing food demands of the world's rapidly growing population, while also helping to alleviate the associated environmental and societal issues of large-scale food production. There recently have been many studies on the disease suppression and plant growth promoting abilities of the rhizosphere microbiome; however, these findings largely have not been translated into the field. Therefore, additional research into the dynamic interactions between crop plants, the rhizosphere microbiome and the environment are necessary to better guide the harnessing of the microbiome to increase crop yield and quality. This review explores the biotic and abiotic interactions that occur within the plant's rhizosphere as well as current agricultural practices, and how these biotic and abiotic factors, as well as human practices, impact the plant microbiome. Additionally, some limitations, safety considerations, and future directions to the study of the plant microbiome are discussed.
Collapse
Affiliation(s)
- Janie Zhang
- Department of Microbiology and Immunology, Dalhousie University, Halifax, NS, Canada
| | - Jamie Cook
- Department of Microbiology and Immunology, Dalhousie University, Halifax, NS, Canada
| | - Jacob T Nearing
- Department of Microbiology and Immunology, Dalhousie University, Halifax, NS, Canada
| | - Junzeng Zhang
- Aquatic and Crop Resource Development Research Centre, National Research Council of Canada, Halifax, NS, Canada
| | - Renee Raudonis
- Department of Microbiology and Immunology, Dalhousie University, Halifax, NS, Canada
| | - Bernard R Glick
- Department of Biology, University of Waterloo, Waterloo, ON, Canada
| | - Morgan G I Langille
- Department of Microbiology and Immunology, Dalhousie University, Halifax, NS, Canada; Department of Pharmacology, Dalhousie University, Halifax, NS, Canada; CGEB-Integrated Microbiome Resource (IMR), Dalhousie University, Halifax, NS, Canada
| | - Zhenyu Cheng
- Department of Microbiology and Immunology, Dalhousie University, Halifax, NS, Canada.
| |
Collapse
|
77
|
Yahara K, Suzuki M, Hirabayashi A, Suda W, Hattori M, Suzuki Y, Okazaki Y. Long-read metagenomics using PromethION uncovers oral bacteriophages and their interaction with host bacteria. Nat Commun 2021; 12:27. [PMID: 33397904 PMCID: PMC7782811 DOI: 10.1038/s41467-020-20199-9] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2020] [Accepted: 11/17/2020] [Indexed: 12/11/2022] Open
Abstract
Bacteriophages (phages), or bacterial viruses, are very diverse and highly abundant worldwide, including as a part of the human microbiomes. Although a few metagenomic studies have focused on oral phages, they relied on short-read sequencing. Here, we conduct a long-read metagenomic study of human saliva using PromethION. Our analyses, which integrate both PromethION and HiSeq data of >30 Gb per sample with low human DNA contamination, identify hundreds of viral contigs; 0-43.8% and 12.5-56.3% of the confidently predicted phages and prophages, respectively, do not cluster with those reported previously. Our analyses demonstrate enhanced scaffolding, and the ability to place a prophage in its host genomic context and enable its taxonomic classification. Our analyses also identify a Streptococcus phage/prophage group and nine jumbo phages/prophages. 86% of the phage/prophage group and 67% of the jumbo phages/prophages contain remote homologs of antimicrobial resistance genes. Pan-genome analysis of the phages/prophages reveals remarkable diversity, identifying 0.3% and 86.4% of the genes as core and singletons, respectively. Furthermore, our study suggests that oral phages present in human saliva are under selective pressure to escape CRISPR immunity. Our study demonstrates the power of long-read metagenomics utilizing PromethION in uncovering bacteriophages and their interaction with host bacteria.
Collapse
Affiliation(s)
- Koji Yahara
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Tokyo, Japan.
| | - Masato Suzuki
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Aki Hirabayashi
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Wataru Suda
- Laboratory for Microbiome Science, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Masahira Hattori
- Laboratory for Microbiome Science, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Yutaka Suzuki
- Laboratory of Systems Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Bunkyo City, Japan
| | - Yusuke Okazaki
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan
| |
Collapse
|
78
|
Metagenomic compendium of 189,680 DNA viruses from the human gut microbiome. Nat Microbiol 2021; 6:960-970. [PMID: 34168315 PMCID: PMC8241571 DOI: 10.1038/s41564-021-00928-6] [Citation(s) in RCA: 198] [Impact Index Per Article: 66.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Accepted: 05/25/2021] [Indexed: 02/05/2023]
Abstract
Bacteriophages have important roles in the ecology of the human gut microbiome but are under-represented in reference databases. To address this problem, we assembled the Metagenomic Gut Virus catalogue that comprises 189,680 viral genomes from 11,810 publicly available human stool metagenomes. Over 75% of genomes represent double-stranded DNA phages that infect members of the Bacteroidia and Clostridia classes. Based on sequence clustering we identified 54,118 candidate viral species, 92% of which were not found in existing databases. The Metagenomic Gut Virus catalogue improves detection of viruses in stool metagenomes and accounts for nearly 40% of CRISPR spacers found in human gut Bacteria and Archaea. We also produced a catalogue of 459,375 viral protein clusters to explore the functional potential of the gut virome. This revealed tens of thousands of diversity-generating retroelements, which use error-prone reverse transcription to mutate target genes and may be involved in the molecular arms race between phages and their bacterial hosts.
Collapse
|
79
|
CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol 2020; 39:578-585. [PMID: 33349699 PMCID: PMC8116208 DOI: 10.1038/s41587-020-00774-7] [Citation(s) in RCA: 528] [Impact Index Per Article: 132.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 11/12/2020] [Indexed: 02/07/2023]
Abstract
Millions of new viral sequences have been identified from metagenomes, but the quality and completeness of these sequences vary considerably. Here we present CheckV, an automated pipeline for identifying closed viral genomes, estimating the completeness of genome fragments and removing flanking host regions from integrated proviruses. CheckV estimates completeness by comparing sequences with a large database of complete viral genomes, including 76,262 identified from a systematic search of publicly available metagenomes, metatranscriptomes and metaviromes. After validation on mock datasets and comparison to existing methods, we applied CheckV to large and diverse collections of metagenome-assembled viral sequences, including IMG/VR and the Global Ocean Virome. This revealed 44,652 high-quality viral genomes (that is, >90% complete), although the vast majority of sequences were small fragments, which highlights the challenge of assembling viral genomes from short-read metagenomes. Additionally, we found that removal of host contamination substantially improved the accurate identification of auxiliary metabolic genes and interpretation of viral-encoded functions.
Collapse
|
80
|
Hufsky F, Beerenwinkel N, Meyer IM, Roux S, Cook GM, Kinsella CM, Lamkiewicz K, Marquet M, Nieuwenhuijse DF, Olendraite I, Paraskevopoulou S, Young F, Dijkman R, Ibrahim B, Kelly J, Le Mercier P, Marz M, Ramette A, Thiel V. The International Virus Bioinformatics Meeting 2020. Viruses 2020; 12:E1398. [PMID: 33291220 PMCID: PMC7762161 DOI: 10.3390/v12121398] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 12/01/2020] [Indexed: 12/16/2022] Open
Abstract
The International Virus Bioinformatics Meeting 2020 was originally planned to take place in Bern, Switzerland, in March 2020. However, the COVID-19 pandemic put a spoke in the wheel of almost all conferences to be held in 2020. After moving the conference to 8-9 October 2020, we got hit by the second wave and finally decided at short notice to go fully online. On the other hand, the pandemic has made us even more aware of the importance of accelerating research in viral bioinformatics. Advances in bioinformatics have led to improved approaches to investigate viral infections and outbreaks. The International Virus Bioinformatics Meeting 2020 has attracted approximately 120 experts in virology and bioinformatics from all over the world to join the two-day virtual meeting. Despite concerns being raised that virtual meetings lack possibilities for face-to-face discussion, the participants from this small community created a highly interactive scientific environment, engaging in lively and inspiring discussions and suggesting new research directions and questions. The meeting featured five invited and twelve contributed talks, on the four main topics: (1) proteome and RNAome of RNA viruses, (2) viral metagenomics and ecology, (3) virus evolution and classification and (4) viral infections and immunology. Further, the meeting featured 20 oral poster presentations, all of which focused on specific areas of virus bioinformatics. This report summarizes the main research findings and highlights presented at the meeting.
Collapse
Affiliation(s)
- Franziska Hufsky
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Niko Beerenwinkel
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, 4058 Basel, Switzerland
| | - Irmtraud M. Meyer
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, 10115 Berlin, Germany
- Department of Biology, Chemistry and Pharmacy, Institute of Chemistry and Biochemistry, Freie Universität Berlin, 14195 Berlin, Germany
| | - Simon Roux
- Lawrence Berkeley National Laboratory, DOE Joint Genome Institute, Berkeley, CA 94720, USA;
| | - Georgia May Cook
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Department of Pathology, Division of Virology, University of Cambridge, Cambridge CB2 1TN, UK
| | - Cormac M. Kinsella
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Laboratory of Experimental Virology, Department of Medical Microbiology and Infection Prevention, Amsterdam UMC, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
| | - Kevin Lamkiewicz
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Mike Marquet
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- CaSe Group, Institut für Infektionsmedizin und Krankenhaushygiene, Universitätsklinikum Jena, 07743 Jena, Germany
| | - David F. Nieuwenhuijse
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Viroscience Department, Erasmus MC, 3015 GD Rotterdam, The Netherlands
| | - Ingrida Olendraite
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Department of Pathology, Division of Virology, University of Cambridge, Cambridge CB2 1TN, UK
| | - Sofia Paraskevopoulou
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Institute of Virology, Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, 10117 Berlin, Germany
| | - Francesca Young
- MRC-University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK;
| | - Ronald Dijkman
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Institute of Virology and Immunology, University of Bern, 3012 Bern, Switzerland
- Department of Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland
- Institute for Infectious Diseases, University of Bern, 3012 Bern, Switzerland
| | - Bashar Ibrahim
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Centre for Applied Mathematics and Bioinformatics, Hawally 32093, Kuwait
- Department of Mathematics and Natural Sciences Gulf University for Science and Technology, Hawally 32093, Kuwait
| | - Jenna Kelly
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Institute of Virology and Immunology, University of Bern, 3012 Bern, Switzerland
| | - Philippe Le Mercier
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, 1205 Geneva, Switzerland
| | - Manja Marz
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Alban Ramette
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Institute for Infectious Diseases, University of Bern, 3012 Bern, Switzerland
| | - Volker Thiel
- European Virus Bioinformatics Center, 07743 Jena, Germany; (N.B.); (I.M.M.); (G.M.C.); (C.M.K.); (K.L.); (M.M.); (D.F.N.); (I.O.); (S.P.); (R.D.); (B.I.); (J.K.); (P.L.M.); (M.M.); (A.R.); (V.T.)
- Institute of Virology and Immunology, University of Bern, 3012 Bern, Switzerland
| |
Collapse
|
81
|
Kumar PS, Dabdoub SM, Ganesan SM. Probing periodontal microbial dark matter using metataxonomics and metagenomics. Periodontol 2000 2020; 85:12-27. [PMID: 33226714 DOI: 10.1111/prd.12349] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Our view of the periodontal microbial community has been shaped by a century or more of cultivation-based and microscopic investigations. While these studies firmly established the infection-mediated etiology of periodontal diseases, it was apparent from the very early days that periodontal microbiology suffered from what Staley and Konopka described as the "great plate count anomaly", in that these culturable bacteria were only a minor part of what was visible under the microscope. For nearly a century, much effort has been devoted to finding the right tools to investigate this uncultivated majority, also known as "microbial dark matter". The discovery that DNA was an effective tool to "see" microbial dark matter was a significant breakthrough in environmental microbiology, and oral microbiologists were among the earliest to capitalize on these advances. By identifying the order in which nucleotides are arranged in a stretch of DNA (DNA sequencing) and creating a repository of these sequences, sequence databases were created. Computational tools that used probability-driven analysis of these sequences enabled the discovery of new and unsuspected species and ascribed novel functions to these species. This review will trace the development of DNA sequencing as a quantitative, open-ended, comprehensive approach to characterize microbial communities in their native environments, and explore how this technology has shifted traditional dogmas on how the oral microbiome promotes health and its role in disease causation and perpetuation.
Collapse
Affiliation(s)
- Purnima S Kumar
- Department of Periodontology, College of Dentistry, The Ohio State University, Columbus, Ohio, USA
| | - Shareef M Dabdoub
- Department of Periodontology, College of Dentistry, The Ohio State University, Columbus, Ohio, USA
| | - Sukirth M Ganesan
- Department of Periodontics, College of Dentistry and Dental Clinics, The University of Iowa, Iowa City, Iowa, USA
| |
Collapse
|
82
|
Gregory AC, Zablocki O, Zayed AA, Howell A, Bolduc B, Sullivan MB. The Gut Virome Database Reveals Age-Dependent Patterns of Virome Diversity in the Human Gut. Cell Host Microbe 2020; 28:724-740.e8. [PMID: 32841606 PMCID: PMC7443397 DOI: 10.1016/j.chom.2020.08.003] [Citation(s) in RCA: 280] [Impact Index Per Article: 70.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Revised: 07/14/2020] [Accepted: 08/06/2020] [Indexed: 12/12/2022]
Abstract
The gut microbiome profoundly affects human health and disease, and their infecting viruses are likely as important, but often missed because of reference database limitations. Here, we (1) built a human Gut Virome Database (GVD) from 2,697 viral particle or microbial metagenomes from 1,986 individuals representing 16 countries, (2) assess its effectiveness, and (3) report a meta-analysis that reveals age-dependent patterns across healthy Westerners. The GVD contains 33,242 unique viral populations (approximately species-level taxa) and improves average viral detection rates over viral RefSeq and IMG/VR nearly 182-fold and 2.6-fold, respectively. GVD meta-analyses show highly personalized viromes, reveal that inter-study variability from technical artifacts is larger than any "disease" effect at the population level, and document how viral diversity changes from human infancy into senescence. Together, this compact foundational resource, these standardization guidelines, and these meta-analysis findings provide a systematic toolkit to help maximize our understanding of viral roles in health and disease.
Collapse
Affiliation(s)
- Ann C Gregory
- Department of Microbiology, Ohio State University, Columbus, OH 43210, USA
| | - Olivier Zablocki
- Department of Microbiology, Ohio State University, Columbus, OH 43210, USA; Center of Microbiome Science, Ohio State University, Columbus, OH 43210, USA
| | - Ahmed A Zayed
- Department of Microbiology, Ohio State University, Columbus, OH 43210, USA; Center of Microbiome Science, Ohio State University, Columbus, OH 43210, USA
| | - Allison Howell
- Department of Microbiology, Ohio State University, Columbus, OH 43210, USA
| | - Benjamin Bolduc
- Department of Microbiology, Ohio State University, Columbus, OH 43210, USA; Center of Microbiome Science, Ohio State University, Columbus, OH 43210, USA
| | - Matthew B Sullivan
- Department of Microbiology, Ohio State University, Columbus, OH 43210, USA; Department of Civil, Environmental and Geodetic Engineering, Ohio State University, Columbus, OH 43210, USA; Center of Microbiome Science, Ohio State University, Columbus, OH 43210, USA.
| |
Collapse
|
83
|
Abstract
Viruses are extremely diverse and modulate important biological and ecological processes globally. However, much of viral diversity remains uncultured and yet to be discovered. Several powerful culture-independent tools, in particular metagenomics, have substantially advanced virus discovery. Among those tools is single-virus genomics, which yields sequenced reference genomes from individual sorted virus particles without the need for cultivation. This new method complements virus culturing and metagenomic approaches and its advantages include targeted investigation of specific virus groups and investigation of genomic microdiversity within viral populations. In this Review, we provide a brief history of single-virus genomics, outline how this emergent method has facilitated advances in virus ecology and discuss its current limitations and future potential. Finally, we address how this method may synergistically intersect with other single-virus and single-cell approaches.
Collapse
|
84
|
A distinct lineage of Caudovirales that encodes a deeply branching multi-subunit RNA polymerase. Nat Commun 2020; 11:4506. [PMID: 32908149 PMCID: PMC7481178 DOI: 10.1038/s41467-020-18281-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Accepted: 08/14/2020] [Indexed: 01/27/2023] Open
Abstract
Bacteriophages play critical roles in the biosphere, but their vast genomic diversity has obscured their evolutionary origins, and phylogenetic analyses have traditionally been hindered by their lack of universal phylogenetic marker genes. In this study we mine metagenomic data and identify a clade of Caudovirales that encodes the β and β' subunits of multi-subunit RNA polymerase (RNAP), a high-resolution phylogenetic marker which enables detailed evolutionary analyses. Our RNAP phylogeny revealed that the Caudovirales RNAP forms a clade distinct from cellular homologs, suggesting an ancient acquisition of this enzyme. Within these multimeric RNAP-encoding Caudovirales (mReC), we find that the similarity of major capsid proteins and terminase large subunits further suggests they form a distinct clade with common evolutionary origin. Our study characterizes a clade of RNAP-encoding Caudovirales and suggests the ancient origin of this enzyme in this group, underscoring the important role of viruses in the early evolution of life on Earth.
Collapse
|
85
|
Hryckowian AJ, Merrill BD, Porter NT, Van Treuren W, Nelson EJ, Garlena RA, Russell DA, Martens EC, Sonnenburg JL. Bacteroides thetaiotaomicron-Infecting Bacteriophage Isolates Inform Sequence-Based Host Range Predictions. Cell Host Microbe 2020; 28:371-379.e5. [PMID: 32652063 PMCID: PMC8045012 DOI: 10.1016/j.chom.2020.06.011] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Revised: 04/22/2020] [Accepted: 06/12/2020] [Indexed: 12/21/2022]
Abstract
Our emerging view of the gut microbiome largely focuses on bacteria, while less is known about other microbial components, such as bacteriophages (phages). Though phages are abundant in the gut, very few phages have been isolated from this ecosystem. Here, we report the genomes of 27 phages from the United States and Bangladesh that infect the prevalent human gut bacterium Bacteroides thetaiotaomicron. These phages are mostly distinct from previously sequenced phages with the exception of two, which are crAss-like phages. We compare these isolates to existing human gut metagenomes, revealing similarities to previously inferred phages and additional unexplored phage diversity. Finally, we use host tropisms of these phages to identify alleles of phage structural genes associated with infectivity. This work provides a detailed view of the gut's "viral dark matter" and a framework for future efforts to further integrate isolation- and sequencing-focused efforts to understand gut-resident phages.
Collapse
Affiliation(s)
- Andrew J Hryckowian
- Department of Microbiology & Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA.
| | - Bryan D Merrill
- Department of Microbiology & Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Nathan T Porter
- Department of Microbiology & Immunology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - William Van Treuren
- Department of Microbiology & Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Eric J Nelson
- Emerging Pathogens Institute, University of Florida, Gainesville, FL 32611, USA
| | - Rebecca A Garlena
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Daniel A Russell
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Eric C Martens
- Department of Microbiology & Immunology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Justin L Sonnenburg
- Department of Microbiology & Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA.
| |
Collapse
|
86
|
Bartel J, Varadarajan AR, Sura T, Ahrens CH, Maaß S, Becher D. Optimized Proteomics Workflow for the Detection of Small Proteins. J Proteome Res 2020; 19:4004-4018. [DOI: 10.1021/acs.jproteome.0c00286] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Jürgen Bartel
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, D-17489 Greifswald, Germany
| | - Adithi R. Varadarajan
- Agroscope, Research Group Molecular Diagnostics, Genomics & Bioinformatics and SIB Swiss Institute of Bioinformatics, CH-8820 Wädenswil, Switzerland
| | - Thomas Sura
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, D-17489 Greifswald, Germany
| | - Christian H. Ahrens
- Agroscope, Research Group Molecular Diagnostics, Genomics & Bioinformatics and SIB Swiss Institute of Bioinformatics, CH-8820 Wädenswil, Switzerland
| | - Sandra Maaß
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, D-17489 Greifswald, Germany
| | - Dörte Becher
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, D-17489 Greifswald, Germany
| |
Collapse
|
87
|
Panwar P, Allen MA, Williams TJ, Hancock AM, Brazendale S, Bevington J, Roux S, Páez-Espino D, Nayfach S, Berg M, Schulz F, Chen IMA, Huntemann M, Shapiro N, Kyrpides NC, Woyke T, Eloe-Fadrosh EA, Cavicchioli R. Influence of the polar light cycle on seasonal dynamics of an Antarctic lake microbial community. MICROBIOME 2020; 8:116. [PMID: 32772914 PMCID: PMC7416419 DOI: 10.1186/s40168-020-00889-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 06/30/2020] [Indexed: 05/10/2023]
Abstract
BACKGROUND Cold environments dominate the Earth's biosphere and microbial activity drives ecosystem processes thereby contributing greatly to global biogeochemical cycles. Polar environments differ to all other cold environments by experiencing 24-h sunlight in summer and no sunlight in winter. The Vestfold Hills in East Antarctica contains hundreds of lakes that have evolved from a marine origin only 3000-7000 years ago. Ace Lake is a meromictic (stratified) lake from this region that has been intensively studied since the 1970s. Here, a total of 120 metagenomes representing a seasonal cycle and four summers spanning a 10-year period were analyzed to determine the effects of the polar light cycle on microbial-driven nutrient cycles. RESULTS The lake system is characterized by complex sulfur and hydrogen cycling, especially in the anoxic layers, with multiple mechanisms for the breakdown of biopolymers present throughout the water column. The two most abundant taxa are phototrophs (green sulfur bacteria and cyanobacteria) that are highly influenced by the seasonal availability of sunlight. The extent of the Chlorobium biomass thriving at the interface in summer was captured in underwater video footage. The Chlorobium abundance dropped from up to 83% in summer to 6% in winter and 1% in spring, before rebounding to high levels. Predicted Chlorobium viruses and cyanophage were also abundant, but their levels did not negatively correlate with their hosts. CONCLUSION Over-wintering expeditions in Antarctica are logistically challenging, meaning insight into winter processes has been inferred from limited data. Here, we found that in contrast to chemolithoautotrophic carbon fixation potential of Southern Ocean Thaumarchaeota, this marine-derived lake evolved a reliance on photosynthesis. While viruses associated with phototrophs also have high seasonal abundance, the negative impact of viral infection on host growth appeared to be limited. The microbial community as a whole appears to have developed a capacity to generate biomass and remineralize nutrients, sufficient to sustain itself between two rounds of sunlight-driven summer-activity. In addition, this unique metagenome dataset provides considerable opportunity for future interrogation of eukaryotes and their viruses, abundant uncharacterized taxa (i.e. dark matter), and for testing hypotheses about endemic species in polar aquatic ecosystems. Video Abstract.
Collapse
Affiliation(s)
- Pratibha Panwar
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
| | - Michelle A Allen
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
| | - Timothy J Williams
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
| | - Alyce M Hancock
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
- Institute for Marine and Antarctic Studies, University of Tasmania, 20 Castray Esplanade, Battery Point, Tasmania, Australia
| | - Sarah Brazendale
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
- , 476 Lancaster Rd, Pegarah, Australia
| | - James Bevington
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
| | - Simon Roux
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - David Páez-Espino
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
- Mammoth BioSciences, 279 East Grand Ave, South San Francisco, CA, USA
| | - Stephen Nayfach
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Maureen Berg
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Frederik Schulz
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - I-Min A Chen
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - Nicole Shapiro
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - Tanja Woyke
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - Ricardo Cavicchioli
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia.
| |
Collapse
|
88
|
Wu Z, Waneka G, Broz AK, King CR, Sloan DB. MSH1 is required for maintenance of the low mutation rates in plant mitochondrial and plastid genomes. Proc Natl Acad Sci U S A 2020. [PMID: 32601224 DOI: 10.1073/pnas.2001998117/suppl_file/pnas.2001998117.sd01.xlsx] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/15/2023] Open
Abstract
Mitochondrial and plastid genomes in land plants exhibit some of the slowest rates of sequence evolution observed in any eukaryotic genome, suggesting an exceptional ability to prevent or correct mutations. However, the mechanisms responsible for this extreme fidelity remain unclear. We tested seven candidate genes involved in cytoplasmic DNA replication, recombination, and repair (POLIA, POLIB, MSH1, RECA3, UNG, FPG, and OGG1) for effects on mutation rates in the model angiosperm Arabidopsis thaliana by applying a highly accurate DNA sequencing technique (duplex sequencing) that can detect newly arisen mitochondrial and plastid mutations even at low heteroplasmic frequencies. We find that disrupting MSH1 (but not the other candidate genes) leads to massive increases in the frequency of point mutations and small indels and changes to the mutation spectrum in mitochondrial and plastid DNA. We also used droplet digital PCR to show transmission of de novo heteroplasmies across generations in msh1 mutants, confirming a contribution to heritable mutation rates. This dual-targeted gene is part of an enigmatic lineage within the mutS mismatch repair family that we find is also present outside of green plants in multiple eukaryotic groups (stramenopiles, alveolates, haptophytes, and cryptomonads), as well as certain bacteria and viruses. MSH1 has previously been shown to limit ectopic recombination in plant cytoplasmic genomes. Our results point to a broader role in recognition and correction of errors in plant mitochondrial and plastid DNA sequence, leading to greatly suppressed mutation rates perhaps via initiation of double-stranded breaks and repair pathways based on faithful homologous recombination.
Collapse
Affiliation(s)
- Zhiqiang Wu
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, 518120 Shenzhen, China
- Department of Biology, Colorado State University, Fort Collins, CO 80523
| | - Gus Waneka
- Department of Biology, Colorado State University, Fort Collins, CO 80523
| | - Amanda K Broz
- Department of Biology, Colorado State University, Fort Collins, CO 80523
| | - Connor R King
- Department of Biology, Colorado State University, Fort Collins, CO 80523
| | - Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, CO 80523
| |
Collapse
|
89
|
Seppey M, Manni M, Zdobnov EM. LEMMI: a continuous benchmarking platform for metagenomics classifiers. Genome Res 2020; 30:1208-1216. [PMID: 32616517 PMCID: PMC7462069 DOI: 10.1101/gr.260398.119] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 06/25/2020] [Indexed: 11/24/2022]
Abstract
Studies of microbiomes are booming, along with the diversity of computational approaches to make sense out of the sequencing data and the volumes of accumulated microbial genotypes. A swift evaluation of newly published methods and their improvements against established tools is necessary to reduce the time between the methods' release and their adoption in microbiome analyses. The LEMMI platform offers a novel approach for benchmarking software dedicated to metagenome composition assessments based on read classification. It enables the integration of newly published methods in an independent and centralized benchmark designed to be continuously open to new submissions. This allows developers to be proactive regarding comparative evaluations and guarantees that any promising methods can be assessed side by side with established tools quickly after their release. Moreover, LEMMI enforces an effective distribution through software containers to ensure long-term availability of all methods. Here, we detail the LEMMI workflow and discuss the performances of some previously unevaluated tools. We see this platform eventually as a community-driven effort in which method developers can showcase novel approaches and get unbiased benchmarks for publications, and users can make informed choices and obtain standardized and easy-to-use tools.
Collapse
Affiliation(s)
- Mathieu Seppey
- Department of Genetic Medicine and Development, University of Geneva Medical School and Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| | - Mosè Manni
- Department of Genetic Medicine and Development, University of Geneva Medical School and Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| | - Evgeny M Zdobnov
- Department of Genetic Medicine and Development, University of Geneva Medical School and Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| |
Collapse
|
90
|
Khan Mirzaei M, Xue J, Costa R, Ru J, Schulz S, Taranu ZE, Deng L. Challenges of Studying the Human Virome - Relevant Emerging Technologies. Trends Microbiol 2020; 29:171-181. [PMID: 32622559 DOI: 10.1016/j.tim.2020.05.021] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Revised: 05/27/2020] [Accepted: 05/28/2020] [Indexed: 01/17/2023]
Abstract
In this review we provide an overview of current challenges and advances in bacteriophage research within the growing field of viromics. In particular, we discuss, from a human virome study perspective, the current and emerging technologies available, their limitations in terms of de novo discoveries, and possible solutions to overcome present experimental and computational biases associated with low abundance of viral DNA or RNA. We summarize recent breakthroughs in metagenomics assembling tools and single-cell analysis, which have the potential to increase our understanding of phage biology, diversity, and interactions with both the microbial community and the human body. We expect that these recent and future advances in the field of viromics will have a strong impact on how we develop phage-based therapeutic approaches.
Collapse
Affiliation(s)
- Mohammadali Khan Mirzaei
- Institute of Virology, Helmholtz Centre Munich and Technical University of Munich, Neuherberg, Bavaria 85764, Germany
| | - Jinling Xue
- Institute of Virology, Helmholtz Centre Munich and Technical University of Munich, Neuherberg, Bavaria 85764, Germany
| | - Rita Costa
- Institute of Virology, Helmholtz Centre Munich and Technical University of Munich, Neuherberg, Bavaria 85764, Germany
| | - Jinlong Ru
- Institute of Virology, Helmholtz Centre Munich and Technical University of Munich, Neuherberg, Bavaria 85764, Germany
| | - Sarah Schulz
- Institute of Virology, Helmholtz Centre Munich and Technical University of Munich, Neuherberg, Bavaria 85764, Germany
| | - Zofia E Taranu
- Aquatic Contaminants Research Division (ACRD), Environment and Climate Change Canada (ECCC), Montréal, QC H2Y 2E7, Canada
| | - Li Deng
- Institute of Virology, Helmholtz Centre Munich and Technical University of Munich, Neuherberg, Bavaria 85764, Germany.
| |
Collapse
|
91
|
MSH1 is required for maintenance of the low mutation rates in plant mitochondrial and plastid genomes. Proc Natl Acad Sci U S A 2020; 117:16448-16455. [PMID: 32601224 DOI: 10.1073/pnas.2001998117] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Mitochondrial and plastid genomes in land plants exhibit some of the slowest rates of sequence evolution observed in any eukaryotic genome, suggesting an exceptional ability to prevent or correct mutations. However, the mechanisms responsible for this extreme fidelity remain unclear. We tested seven candidate genes involved in cytoplasmic DNA replication, recombination, and repair (POLIA, POLIB, MSH1, RECA3, UNG, FPG, and OGG1) for effects on mutation rates in the model angiosperm Arabidopsis thaliana by applying a highly accurate DNA sequencing technique (duplex sequencing) that can detect newly arisen mitochondrial and plastid mutations even at low heteroplasmic frequencies. We find that disrupting MSH1 (but not the other candidate genes) leads to massive increases in the frequency of point mutations and small indels and changes to the mutation spectrum in mitochondrial and plastid DNA. We also used droplet digital PCR to show transmission of de novo heteroplasmies across generations in msh1 mutants, confirming a contribution to heritable mutation rates. This dual-targeted gene is part of an enigmatic lineage within the mutS mismatch repair family that we find is also present outside of green plants in multiple eukaryotic groups (stramenopiles, alveolates, haptophytes, and cryptomonads), as well as certain bacteria and viruses. MSH1 has previously been shown to limit ectopic recombination in plant cytoplasmic genomes. Our results point to a broader role in recognition and correction of errors in plant mitochondrial and plastid DNA sequence, leading to greatly suppressed mutation rates perhaps via initiation of double-stranded breaks and repair pathways based on faithful homologous recombination.
Collapse
|
92
|
Abstract
Wastewater is a rich source of microbial life and contains bacteria, viruses, and other microbes found in human waste as well as environmental runoff sources. As part of an effort to characterize the New York City wastewater metagenome, we profiled the viral community of sewage samples across all five boroughs of NYC and found that local sampling sites have unique sets of viruses. We focused on bacteriophages, or viruses of bacteria, to understand how they may influence the microbial ecology of this system. We identified several new clusters of phages and successfully associated them with bacterial hosts, providing insight into virus-host interactions in urban wastewater. This study provides a first look into the viral communities present across the wastewater system in NYC and points to their functional importance in this environment. Bacteriophages are abundant members of all microbiomes studied to date, influencing microbial communities through interactions with their bacterial hosts. Despite their functional importance and ubiquity, phages have been underexplored in urban environments compared to their bacterial counterparts. We profiled the viral communities in New York City (NYC) wastewater using metagenomic data collected in November 2014 from 14 wastewater treatment plants. We show that phages accounted for the largest viral component of the sewage samples and that specific virus communities were associated with local environmental conditions within boroughs. The vast majority of the virus sequences had no homology matches in public databases, forming an average of 1,700 unique virus clusters (putative genera). These new clusters contribute to elucidating the overwhelming proportion of data that frequently goes unidentified in viral metagenomic studies. We assigned potential hosts to these phages, which appear to infect a wide range of bacterial genera, often outside their presumed host. We determined that infection networks form a modular-nested pattern, indicating that phages include a range of host specificities, from generalists to specialists, with most interactions organized into distinct groups. We identified genes in viral contigs involved in carbon and sulfur cycling, suggesting functional importance of viruses in circulating pathways and gene functions in the wastewater environment. In addition, we identified virophage genes as well as a nearly complete novel virophage genome. These findings provide an understanding of phage abundance and diversity in NYC wastewater, previously uncharacterized, and further examine geographic patterns of phage-host association in urban environments. IMPORTANCE Wastewater is a rich source of microbial life and contains bacteria, viruses, and other microbes found in human waste as well as environmental runoff sources. As part of an effort to characterize the New York City wastewater metagenome, we profiled the viral community of sewage samples across all five boroughs of NYC and found that local sampling sites have unique sets of viruses. We focused on bacteriophages, or viruses of bacteria, to understand how they may influence the microbial ecology of this system. We identified several new clusters of phages and successfully associated them with bacterial hosts, providing insight into virus-host interactions in urban wastewater. This study provides a first look into the viral communities present across the wastewater system in NYC and points to their functional importance in this environment.
Collapse
|
93
|
Kieft K, Zhou Z, Anantharaman K. VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. MICROBIOME 2020; 8:90. [PMID: 32522236 PMCID: PMC7288430 DOI: 10.1186/s40168-020-00867-0] [Citation(s) in RCA: 391] [Impact Index Per Article: 97.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 05/13/2020] [Indexed: 05/08/2023]
Abstract
BACKGROUND Viruses are central to microbial community structure in all environments. The ability to generate large metagenomic assemblies of mixed microbial and viral sequences provides the opportunity to tease apart complex microbiome dynamics, but these analyses are currently limited by the tools available for analyses of viral genomes and assessing their metabolic impacts on microbiomes. DESIGN Here we present VIBRANT, the first method to utilize a hybrid machine learning and protein similarity approach that is not reliant on sequence features for automated recovery and annotation of viruses, determination of genome quality and completeness, and characterization of viral community function from metagenomic assemblies. VIBRANT uses neural networks of protein signatures and a newly developed v-score metric that circumvents traditional boundaries to maximize identification of lytic viral genomes and integrated proviruses, including highly diverse viruses. VIBRANT highlights viral auxiliary metabolic genes and metabolic pathways, thereby serving as a user-friendly platform for evaluating viral community function. VIBRANT was trained and validated on reference virus datasets as well as microbiome and virome data. RESULTS VIBRANT showed superior performance in recovering higher quality viruses and concurrently reduced the false identification of non-viral genome fragments in comparison to other virus identification programs, specifically VirSorter, VirFinder, and MARVEL. When applied to 120,834 metagenome-derived viral sequences representing several human and natural environments, VIBRANT recovered an average of 94% of the viruses, whereas VirFinder, VirSorter, and MARVEL achieved less powerful performance, averaging 48%, 87%, and 71%, respectively. Similarly, VIBRANT identified more total viral sequence and proteins when applied to real metagenomes. When compared to PHASTER, Prophage Hunter, and VirSorter for the ability to extract integrated provirus regions from host scaffolds, VIBRANT performed comparably and even identified proviruses that the other programs did not. To demonstrate applications of VIBRANT, we studied viromes associated with Crohn's disease to show that specific viral groups, namely Enterobacteriales-like viruses, as well as putative dysbiosis associated viral proteins are more abundant compared to healthy individuals, providing a possible viral link to maintenance of diseased states. CONCLUSIONS The ability to accurately recover viruses and explore viral impacts on microbial community metabolism will greatly advance our understanding of microbiomes, host-microbe interactions, and ecosystem dynamics. Video Abstract.
Collapse
Affiliation(s)
- Kristopher Kieft
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Zhichao Zhou
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Karthik Anantharaman
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| |
Collapse
|
94
|
Young F, Rogers S, Robertson DL. Predicting host taxonomic information from viral genomes: A comparison of feature representations. PLoS Comput Biol 2020; 16:e1007894. [PMID: 32453718 PMCID: PMC7307784 DOI: 10.1371/journal.pcbi.1007894] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Revised: 06/22/2020] [Accepted: 04/21/2020] [Indexed: 12/13/2022] Open
Abstract
The rise in metagenomics has led to an exponential growth in virus discovery. However, the majority of these new virus sequences have no assigned host. Current machine learning approaches to predicting virus host interactions have a tendency to focus on nucleotide features, ignoring other representations of genomic information. Here we investigate the predictive potential of features generated from four different ‘levels’ of viral genome representation: nucleotide, amino acid, amino acid properties and protein domains. This more fully exploits the biological information present in the virus genomes. Over a hundred and eighty binary datasets for infecting versus non-infecting viruses at all taxonomic ranks of both eukaryote and prokaryote hosts were compiled. The viral genomes were converted into the four different levels of genome representation and twenty feature sets were generated by extracting k-mer compositions and predicted protein domains. We trained and tested Support Vector Machine, SVM, classifiers to compare the predictive capacity of each of these feature sets for each dataset. Our results show that all levels of genome representation are consistently predictive of host taxonomy and that prediction k-mer composition improves with increasing k-mer length for all k-mer based features. Using a phylogenetically aware holdout method, we demonstrate that the predictive feature sets contain signals reflecting both the evolutionary relationship between the viruses infecting related hosts, and host-mimicry. Our results demonstrate that incorporating a range of complementary features, generated purely from virus genome sequences, leads to improved accuracy for a range of virus host prediction tasks enabling computational assignment of host taxonomic information. Elucidating the host of a newly identified virus species is an important challenge, with applications from knowing the source species of a newly emerged pathogen to understanding the bacteriophage-host relationships within the microbiome of any of earth’s ecosystems. Current high throughput methods used to identify viruses within biological or environmental samples have resulted in an unprecedented increase in virus discovery. However, for the majority of these virus genomes the host species/taxonomic classification remains unknown. To address this gap in our knowledge there is a need for fast, accurate computational methods for the assignment of putative host taxonomic information. Machine learning is an ideal approach but to maximise predictive accuracy the viral genomes need to be represented in a format (sets of features) that makes the discriminative information available to the machine learning algorithm. Here, we compare different types of features derived from the same viral genomes for their ability to predict host information. Our results demonstrate that all these feature sets are predictive of host taxonomy and when combined have the potential to improve accuracy over the use of individual feature sets across many virus host prediction applications.
Collapse
Affiliation(s)
- Francesca Young
- MRC-University of Glasgow Centre For Virus Research, Glasgow, United Kingdom
| | - Simon Rogers
- School of Computing Science, University of Glasgow, Glasgow, United Kingdom
| | - David L. Robertson
- MRC-University of Glasgow Centre For Virus Research, Glasgow, United Kingdom
- * E-mail:
| |
Collapse
|
95
|
Carr VR, Shkoporov A, Hill C, Mullany P, Moyes DL. Probing the Mobilome: Discoveries in the Dynamic Microbiome. Trends Microbiol 2020; 29:158-170. [PMID: 32448763 DOI: 10.1016/j.tim.2020.05.003] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 04/30/2020] [Accepted: 05/05/2020] [Indexed: 02/06/2023]
Abstract
There has been an explosion of metagenomic data representing human, animal, and environmental microbiomes. This provides an unprecedented opportunity for comparative and longitudinal studies of many functional aspects of the microbiome that go beyond taxonomic classification, such as profiling genetic determinants of antimicrobial resistance, interactions with the host, potentially clinically relevant functions, and the role of mobile genetic elements (MGEs). One of the most important but least studied of these aspects are the MGEs, collectively referred to as the 'mobilome'. Here we elaborate on the benefits and limitations of using different metagenomic protocols, discuss the relative merits of various sequencing technologies, and highlight relevant bioinformatics tools and pipelines to predict the presence of MGEs and their microbial hosts.
Collapse
Affiliation(s)
- Victoria R Carr
- Centre for Host-Microbiome Interactions, Faculty of Dentistry, Oral and Craniofacial Sciences, King's College London, London, UK; The Alan Turing Institute, British Library, London, UK.
| | - Andrey Shkoporov
- APC Microbiome Ireland, School of Microbiology, University College Cork, Cork, Ireland
| | - Colin Hill
- APC Microbiome Ireland, School of Microbiology, University College Cork, Cork, Ireland
| | - Peter Mullany
- Eastman Dental Institute, University College London, London, UK
| | - David L Moyes
- Centre for Host-Microbiome Interactions, Faculty of Dentistry, Oral and Craniofacial Sciences, King's College London, London, UK.
| |
Collapse
|
96
|
Abstract
SAR11 clade members are among the most abundant bacteria on Earth. Their study is complicated by their great diversity and difficulties in being grown and manipulated in the laboratory. On the other hand, and due to their extraordinary abundance, metagenomic data sets provide enormous richness of information about these microbes. Given the major role played by phages in the lifestyle and evolution of prokaryotic cells, the contribution of several new bacteriophage genomes preying on this clade opens windows into the infection strategies and life cycle of its viruses. Such strategies could provide models of attack of large-genome phages preying on streamlined aquatic microbes. The SAR11 clade is one of the most abundant bacterioplankton groups in surface waters of most of the oceans and lakes. However, only 15 SAR11 phages have been isolated thus far, and only one of them belongs to the Myoviridae family (pelagimyophages). Here, we have analyzed 26 sequences of myophages that putatively infect the SAR11 clade. They have been retrieved by mining ca. 45 Gbp aquatic assembled cellular metagenomes and viromes. Most of the myophages were obtained from the cellular fraction (0.2 μm), indicating a bias against this type of virus in viromes. We have found the first myophages that putatively infect Candidatus Fonsibacter (freshwater SAR11) and another group putatively infecting bathypelagic SAR11 phylogroup Ic. The genomes have similar sizes and maintain overall synteny in spite of low average nucleotide identity values, revealing high similarity to marine cyanomyophages. Pelagimyophages recruited metagenomic reads widely from several locations but always much more from cellular metagenomes than from viromes, opposite to what happens with pelagipodophages. Comparing the genomes resulted in the identification of a hypervariable island that is related to host recognition. Interestingly, some genes in these islands could be related to host cell wall synthesis and coinfection avoidance. A cluster of curli-related proteins was widespread among the genomes, although its function is unclear. IMPORTANCE SAR11 clade members are among the most abundant bacteria on Earth. Their study is complicated by their great diversity and difficulties in being grown and manipulated in the laboratory. On the other hand, and due to their extraordinary abundance, metagenomic data sets provide enormous richness of information about these microbes. Given the major role played by phages in the lifestyle and evolution of prokaryotic cells, the contribution of several new bacteriophage genomes preying on this clade opens windows into the infection strategies and life cycle of its viruses. Such strategies could provide models of attack of large-genome phages preying on streamlined aquatic microbes.
Collapse
|
97
|
Tisza MJ, Pastrana DV, Welch NL, Stewart B, Peretti A, Starrett GJ, Pang YYS, Krishnamurthy SR, Pesavento PA, McDermott DH, Murphy PM, Whited JL, Miller B, Brenchley J, Rosshart SP, Rehermann B, Doorbar J, Ta'ala BA, Pletnikova O, Troncoso JC, Resnick SM, Bolduc B, Sullivan MB, Varsani A, Segall AM, Buck CB. Discovery of several thousand highly diverse circular DNA viruses. eLife 2020; 9:51971. [PMID: 32014111 PMCID: PMC7000223 DOI: 10.7554/elife.51971] [Citation(s) in RCA: 116] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 01/06/2020] [Indexed: 12/18/2022] Open
Abstract
Although millions of distinct virus species likely exist, only approximately 9000 are catalogued in GenBank's RefSeq database. We selectively enriched for the genomes of circular DNA viruses in over 70 animal samples, ranging from nematodes to human tissue specimens. A bioinformatics pipeline, Cenote-Taker, was developed to automatically annotate over 2500 complete genomes in a GenBank-compliant format. The new genomes belong to dozens of established and emerging viral families. Some appear to be the result of previously undescribed recombination events between ssDNA and ssRNA viruses. In addition, hundreds of circular DNA elements that do not encode any discernable similarities to previously characterized sequences were identified. To characterize these ‘dark matter’ sequences, we used an artificial neural network to identify candidate viral capsid proteins, several of which formed virus-like particles when expressed in culture. These data further the understanding of viral sequence diversity and allow for high throughput documentation of the virosphere. When scientists hunt for new DNA sequences, sometimes they get a lot more than they bargained for. Such is the case in metagenomic surveys, which analyze not just DNA of a particular organism, but all the DNA in an environment at large. A vexing problem with these surveys is the overwhelming number of DNA sequences detected that are so different from any known microbe that they cannot be classified using traditional approaches. However, some of these “known unknowns” are undoubtedly viral sequences, because only a fraction of the enormous diversity of viruses has been characterized. This “viral dark matter” is a major obstacle for those studying viruses. This led Tisza et al. to attempt to classify some of the unknown viral sequences in their metagenomic surveys. The search, which specifically focused on viruses with circular DNA genomes, detected over 2,500 circular viral genomes. Intensive analysis revealed that many of these genomes had similar makeup to previously discovered viruses, but hundreds of them were totally different from any known virus, based on typical methods of comparison. Computational analysis of genes that were conserved among some of these brand-new circular sequences often revealed virus-like features. Experiments on a few of these genes showed that they encoded proteins capable of forming particles reminiscent of characteristic viral shells, implying that these new sequences are indeed viruses. Tisza et al. have added the 2,500 newly characterized viral sequences to the publicly accessible GenBank database, and the sequences are being considered for the more authoritative RefSeq database, which currently contains around 9,000 complete viral genomes. The expanded databases will hopefully now better equip scientists to explore the enormous diversity of viruses and help medics and veterinarians to detect disease-causing viruses in humans and other animals.
Collapse
Affiliation(s)
- Michael J Tisza
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Diana V Pastrana
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Nicole L Welch
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Brittany Stewart
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Alberto Peretti
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Gabriel J Starrett
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Yuk-Ying S Pang
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Siddharth R Krishnamurthy
- Metaorganism Immunity Section, Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, United States
| | - Patricia A Pesavento
- Department of Pathology, Microbiology, and Immunology, University of California, Davis, Davis, United States
| | - David H McDermott
- Molecular Signaling Section, Laboratory of Molecular Immunology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, United States
| | - Philip M Murphy
- Molecular Signaling Section, Laboratory of Molecular Immunology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, United States
| | - Jessica L Whited
- Department of Orthopedic Surgery, Harvard Medical School, The Harvard Stem Cell Institute, Brigham and Women's Hospital, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States.,Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, United States
| | - Bess Miller
- Department of Orthopedic Surgery, Harvard Medical School, The Harvard Stem Cell Institute, Brigham and Women's Hospital, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States
| | - Jason Brenchley
- Barrier Immunity Section, Lab of Viral Diseases, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Cambridge, United States
| | - Stephan P Rosshart
- Immunology Section, Liver Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, United States
| | - Barbara Rehermann
- Immunology Section, Liver Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, United States
| | - John Doorbar
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
| | | | - Olga Pletnikova
- Department of Pathology (Neuropathology), Johns Hopkins University School of Medicine, Baltimore, United States
| | - Juan C Troncoso
- Department of Pathology (Neuropathology), Johns Hopkins University School of Medicine, Baltimore, United States
| | - Susan M Resnick
- Laboratory of Behavioral Neuroscience, National Institute on Aging, National Institutes of Health, Baltimore, United States
| | - Ben Bolduc
- Department of Microbiology, Ohio State University, Columbus, United States
| | - Matthew B Sullivan
- Department of Microbiology, Ohio State University, Columbus, United States.,Civil Environmental and Geodetic Engineering, Ohio State University, Columbus, United States
| | - Arvind Varsani
- The Biodesign Center of Fundamental and Applied Microbiomics, School of Life Sciences, Center for Evolution and Medicine, Arizona State University, Tempe, United States.,Structural Biology Research Unit, Department of Clinical Laboratory Sciences, University of Cape Town, Rondebosch, South Africa
| | - Anca M Segall
- Viral Information Institute and Department of Biology, San Diego State University, San Diego, United States
| | - Christopher B Buck
- Lab of Cellular Oncology, National Cancer Institute, National Institutes of Health, Bethesda, United States
| |
Collapse
|
98
|
Schulz F, Roux S, Paez-Espino D, Jungbluth S, Walsh DA, Denef VJ, McMahon KD, Konstantinidis KT, Eloe-Fadrosh EA, Kyrpides NC, Woyke T. Giant virus diversity and host interactions through global metagenomics. Nature 2020; 578:432-436. [PMID: 31968354 PMCID: PMC7162819 DOI: 10.1038/s41586-020-1957-x] [Citation(s) in RCA: 148] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Accepted: 01/09/2020] [Indexed: 12/11/2022]
Abstract
Our current knowledge about nucleocytoplasmic large DNA viruses (NCLDVs) is largely derived from viral isolates that are co-cultivated with protists and algae. Here we reconstructed 2,074 NCLDV genomes from sampling sites across the globe by building on the rapidly increasing amount of publicly available metagenome data. This led to an 11-fold increase in phylogenetic diversity and a parallel 10-fold expansion in functional diversity. Analysis of 58,023 major capsid proteins from large and giant viruses using metagenomic data revealed the global distribution patterns and cosmopolitan nature of these viruses. The discovered viral genomes encoded a wide range of proteins with putative roles in photosynthesis and diverse substrate transport processes, indicating that host reprogramming is probably a common strategy in the NCLDVs. Furthermore, inferences of horizontal gene transfer connected viral lineages to diverse eukaryotic hosts. We anticipate that the global diversity of NCLDVs that we describe here will establish giant viruses-which are associated with most major eukaryotic lineages-as important players in ecosystems across Earth's biomes.
Collapse
Affiliation(s)
- Frederik Schulz
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - David Paez-Espino
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Sean Jungbluth
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - David A Walsh
- Groupe de recherche interuniversitaire en limnologie, Department of Biology, Concordia University, Montréal, Québec, Canada
| | - Vincent J Denef
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
| | - Katherine D McMahon
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Department of Civil and Environmental Engineering, University of Wisconsin-Madison, Madison, WI, USA
| | | | - Emiley A Eloe-Fadrosh
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Tanja Woyke
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
99
|
A jumbo phage that forms a nucleus-like structure evades CRISPR-Cas DNA targeting but is vulnerable to type III RNA-based immunity. Nat Microbiol 2019; 5:48-55. [PMID: 31819217 DOI: 10.1038/s41564-019-0612-5] [Citation(s) in RCA: 103] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 10/17/2019] [Indexed: 12/26/2022]
Abstract
CRISPR-Cas systems provide bacteria with adaptive immunity against bacteriophages1. However, DNA modification2,3, the production of anti-CRISPR proteins4,5 and potentially other strategies enable phages to evade CRISPR-Cas. Here, we discovered a Serratia jumbo phage that evades type I CRISPR-Cas systems, but is sensitive to type III immunity. Jumbo phage infection resulted in a nucleus-like structure enclosed by a proteinaceous phage shell-a phenomenon only reported recently for distantly related Pseudomonas phages6,7. All three native CRISPR-Cas complexes in Serratia-type I-E, I-F and III-A-were spatially excluded from the phage nucleus and phage DNA was not targeted. However, the type III-A system still arrested jumbo phage infection by targeting phage RNA in the cytoplasm in a process requiring Cas7, Cas10 and an accessory nuclease. Type III, but not type I, systems frequently targeted nucleus-forming jumbo phages that were identified in global viral sequence datasets. The ability to recognize jumbo phage RNA and elicit immunity probably contributes to the presence of both RNA- and DNA-targeting CRISPR-Cas systems in many bacteria1,8. Together, our results support the model that jumbo phage nucleus-like compartments serve as a barrier to DNA-targeting, but not RNA-targeting, defences, and that this phenomenon is widespread among jumbo phages.
Collapse
|
100
|
Wang Z, Zhao J, Wang L, Li C, Liu J, Zhang L, Zhang Y. A Novel Benthic Phage Infecting Shewanella with Strong Replication Ability. Viruses 2019; 11:v11111081. [PMID: 31752437 PMCID: PMC6893657 DOI: 10.3390/v11111081] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 11/17/2019] [Indexed: 12/31/2022] Open
Abstract
The coastal sediments were considered to contain diverse phages playing important roles in driving biogeochemical cycles based on genetic analysis. However, till now, benthic phages in coastal sediments were very rarely isolated, which largely limits our understanding of their biological characteristics. Here, we describe a novel lytic phage (named Shewanella phage S0112) isolated from the coastal sediments of the Yellow Sea infecting a sediment bacterium of the genus Shewanella. The phage has a very high replication capability, with the burst size of ca. 1170 phage particles per infected cell, which is 5–10 times higher than that of most phages isolated before. Meanwhile, the latent period of this phage is relatively longer, which might ensure adequate time for phage replication. The phage has a double-stranded DNA genome comprising 62,286 bp with 102 ORFs, ca. 60% of which are functionally unknown. The expression products of 16 ORF genes, mainly structural proteins, were identified by LC-MS/MS analysis. Besides the general DNA metabolism and structure assembly genes in the phage genome, there is a cluster of auxiliary metabolic genes that may be involved in 7-cyano-7-deazaguanine (preQ0) biosynthesis. Meanwhile, a pyrophosphohydrolase (MazG) gene being considered as a regulator of programmed cell death or involving in host stringer responses is inserted in this gene cluster. Comparative genomic and phylogenetic analysis both revealed a great novelty of phage S0112. This study represents the first report of a benthic phage infecting Shewanella, which also sheds light on the phage–host interactions in coastal sediments.
Collapse
Affiliation(s)
- Zengmeng Wang
- Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao 266101, China; (Z.W.); (J.Z.); (L.W.); (C.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jiulong Zhao
- Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao 266101, China; (Z.W.); (J.Z.); (L.W.); (C.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Long Wang
- Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao 266101, China; (Z.W.); (J.Z.); (L.W.); (C.L.)
| | - Chengcheng Li
- Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao 266101, China; (Z.W.); (J.Z.); (L.W.); (C.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jianhui Liu
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China; (J.L.); (L.Z.)
| | - Lihua Zhang
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China; (J.L.); (L.Z.)
| | - Yongyu Zhang
- Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao 266101, China; (Z.W.); (J.Z.); (L.W.); (C.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China
- Correspondence: ; Tel.: +86-532-80662680
| |
Collapse
|