1
|
Riley R, Bowers RM, Camargo AP, Campbell A, Egan R, Eloe-Fadrosh EA, Foster B, Hofmeyr S, Huntemann M, Kellom M, Kimbrel JA, Oliker L, Yelick K, Pett-Ridge J, Salamov A, Varghese NJ, Clum A. Terabase-Scale Coassembly of a Tropical Soil Microbiome. Microbiol Spectr 2023; 11:e0020023. [PMID: 37310219 PMCID: PMC10434106 DOI: 10.1128/spectrum.00200-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 05/24/2023] [Indexed: 06/14/2023] Open
Abstract
Petabases of environmental metagenomic data are publicly available, presenting an opportunity to characterize complex environments and discover novel lineages of life. Metagenome coassembly, in which many metagenomic samples from an environment are simultaneously analyzed to infer the underlying genomes' sequences, is an essential tool for achieving this goal. We applied MetaHipMer2, a distributed metagenome assembler that runs on supercomputing clusters, to coassemble 3.4 terabases (Tbp) of metagenome data from a tropical soil in the Luquillo Experimental Forest (LEF), Puerto Rico. The resulting coassembly yielded 39 high-quality (>90% complete, <5% contaminated, with predicted 23S, 16S, and 5S rRNA genes and ≥18 tRNAs) metagenome-assembled genomes (MAGs), including two from the candidate phylum Eremiobacterota. Another 268 medium-quality (≥50% complete, <10% contaminated) MAGs were extracted, including the candidate phyla Dependentiae, Dormibacterota, and Methylomirabilota. In total, 307 medium- or higher-quality MAGs were assigned to 23 phyla, compared to 294 MAGs assigned to nine phyla in the same samples individually assembled. The low-quality (<50% complete, <10% contaminated) MAGs from the coassembly revealed a 49% complete rare biosphere microbe from the candidate phylum FCPU426 among other low-abundance microbes, an 81% complete fungal genome from the phylum Ascomycota, and 30 partial eukaryotic MAGs with ≥10% completeness, possibly representing protist lineages. A total of 22,254 viruses, many of them low abundance, were identified. Estimation of metagenome coverage and diversity indicates that we may have characterized ≥87.5% of the sequence diversity in this humid tropical soil and indicates the value of future terabase-scale sequencing and coassembly of complex environments. IMPORTANCE Petabases of reads are being produced by environmental metagenome sequencing. An essential step in analyzing these data is metagenome assembly, the computational reconstruction of genome sequences from microbial communities. "Coassembly" of metagenomic sequence data, in which multiple samples are assembled together, enables more complete detection of microbial genomes in an environment than "multiassembly," in which samples are assembled individually. To demonstrate the potential for coassembling terabases of metagenome data to drive biological discovery, we applied MetaHipMer2, a distributed metagenome assembler that runs on supercomputing clusters, to coassemble 3.4 Tbp of reads from a humid tropical soil environment. The resulting coassembly, its functional annotation, and analysis are presented here. The coassembly yielded more, and phylogenetically more diverse, microbial, eukaryotic, and viral genomes than the multiassembly of the same data. Our resource may facilitate the discovery of novel microbial biology in tropical soils and demonstrates the value of terabase-scale metagenome sequencing.
Collapse
Affiliation(s)
- Robert Riley
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Robert M. Bowers
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Antonio Pedro Camargo
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Ashley Campbell
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, USA
| | - Rob Egan
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | | | - Brian Foster
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Steven Hofmeyr
- Applied Math and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Marcel Huntemann
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Matthew Kellom
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Jeffrey A. Kimbrel
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, USA
| | - Leonid Oliker
- Applied Math and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Katherine Yelick
- Applied Math and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, California, USA
| | - Jennifer Pett-Ridge
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, USA
- Life & Environmental Sciences Department, University of California Merced, Merced, California, USA
| | - Asaf Salamov
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Neha J. Varghese
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Alicia Clum
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| |
Collapse
|
2
|
Seshadri R, Roux S, Huber KJ, Wu D, Yu S, Udwary D, Call L, Nayfach S, Hahnke RL, Pukall R, White JR, Varghese NJ, Webb C, Palaniappan K, Reimer LC, Sardà J, Bertsch J, Mukherjee S, Reddy T, Hajek PP, Huntemann M, Chen IMA, Spunde A, Clum A, Shapiro N, Wu ZY, Zhao Z, Zhou Y, Evtushenko L, Thijs S, Stevens V, Eloe-Fadrosh EA, Mouncey NJ, Yoshikuni Y, Whitman WB, Klenk HP, Woyke T, Göker M, Kyrpides NC, Ivanova NN. Expanding the genomic encyclopedia of Actinobacteria with 824 isolate reference genomes. Cell Genom 2022; 2:100213. [PMID: 36778052 PMCID: PMC9903846 DOI: 10.1016/j.xgen.2022.100213] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Revised: 07/19/2022] [Accepted: 10/16/2022] [Indexed: 11/13/2022]
Abstract
The phylum Actinobacteria includes important human pathogens like Mycobacterium tuberculosis and Corynebacterium diphtheriae and renowned producers of secondary metabolites of commercial interest, yet only a small part of its diversity is represented by sequenced genomes. Here, we present 824 actinobacterial isolate genomes in the context of a phylum-wide analysis of 6,700 genomes including public isolates and metagenome-assembled genomes (MAGs). We estimate that only 30%-50% of projected actinobacterial phylogenetic diversity possesses genomic representation via isolates and MAGs. A comparison of gene functions reveals novel determinants of host-microbe interaction as well as environment-specific adaptations such as potential antimicrobial peptides. We identify plasmids and prophages across isolates and uncover extensive prophage diversity structured mainly by host taxonomy. Analysis of >80,000 biosynthetic gene clusters reveals that horizontal gene transfer and gene loss shape secondary metabolite repertoire across taxa. Our observations illustrate the essential role of and need for high-quality isolate genome sequences.
Collapse
Affiliation(s)
- Rekha Seshadri
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Corresponding author
| | - Simon Roux
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Katharina J. Huber
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Dongying Wu
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Sora Yu
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Dan Udwary
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Lee Call
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Stephen Nayfach
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Richard L. Hahnke
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Rüdiger Pukall
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | | | - Neha J. Varghese
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Cody Webb
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - Lorenz C. Reimer
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Joaquim Sardà
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Jonathon Bertsch
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - T.B.K. Reddy
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Patrick P. Hajek
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Marcel Huntemann
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - I-Min A. Chen
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Alex Spunde
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Alicia Clum
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Nicole Shapiro
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Zong-Yen Wu
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Zhiying Zhao
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Yuguang Zhou
- China General Microbiological Culture Collection Center, Beijing, China
| | - Lyudmila Evtushenko
- Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, All-Russian Collection of Microorganisms (VKM), Pushchino, Russia
| | - Sofie Thijs
- Center for Environmental Sciences, Environmental Biology, Hasselt University, Diepenbeek, Belgium
| | - Vincent Stevens
- Center for Environmental Sciences, Environmental Biology, Hasselt University, Diepenbeek, Belgium
| | - Emiley A. Eloe-Fadrosh
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nigel J. Mouncey
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Yasuo Yoshikuni
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA,Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA,Center for Advanced Bioenergy and Bioproducts Innovation, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA,Global Institution for Collaborative Research and Education, Hokkaido University, Hokkaido 060-8589, Japan
| | | | - Hans-Peter Klenk
- School of Biology, Newcastle University, Newcastle upon Tyne, UK
| | - Tanja Woyke
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Markus Göker
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany,Corresponding author
| | - Nikos C. Kyrpides
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Natalia N. Ivanova
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA,Corresponding author
| |
Collapse
|
3
|
Ostrowski MP, La Rosa SL, Kunath BJ, Robertson A, Pereira G, Hagen LH, Varghese NJ, Qiu L, Yao T, Flint G, Li J, McDonald SP, Buttner D, Pudlo NA, Schnizlein MK, Young VB, Brumer H, Schmidt TM, Terrapon N, Lombard V, Henrissat B, Hamaker B, Eloe-Fadrosh EA, Tripathi A, Pope PB, Martens EC. Mechanistic insights into consumption of the food additive xanthan gum by the human gut microbiota. Nat Microbiol 2022; 7:556-569. [PMID: 35365790 DOI: 10.1038/s41564-022-01093-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Accepted: 02/24/2022] [Indexed: 12/13/2022]
Abstract
Processed foods often include food additives such as xanthan gum, a complex polysaccharide with unique rheological properties, that has established widespread use as a stabilizer and thickening agent. Xanthan gum's chemical structure is distinct from those of host and dietary polysaccharides that are more commonly expected to transit the gastrointestinal tract, and little is known about its direct interaction with the gut microbiota, which plays a central role in digestion of other dietary fibre polysaccharides. Here we show that the ability to digest xanthan gum is common in human gut microbiomes from industrialized countries and appears contingent on a single uncultured bacterium in the family Ruminococcaceae. Our data reveal that this primary degrader cleaves the xanthan gum backbone before processing the released oligosaccharides using additional enzymes. Some individuals harbour Bacteroides intestinalis that is incapable of consuming polymeric xanthan gum but grows on oligosaccharide products generated by the Ruminococcaceae. Feeding xanthan gum to germfree mice colonized with a human microbiota containing the uncultured Ruminococcaceae supports the idea that the additive xanthan gum can drive expansion of the primary degrader Ruminococcaceae, along with exogenously introduced B. intestinalis. Our work demonstrates the existence of a potential xanthan gum food chain involving at least two members of different phyla of gut bacteria and provides an initial framework for understanding how widespread consumption of a recently introduced food additive influences human microbiomes.
Collapse
Affiliation(s)
- Matthew P Ostrowski
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Sabina Leanti La Rosa
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway.,Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway
| | - Benoit J Kunath
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | - Andrew Robertson
- Life Sciences Institute: Natural Products Discovery Core, University of Michigan, Ann Arbor, MI, USA
| | - Gabriel Pereira
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Live H Hagen
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | | | - Ling Qiu
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Tianming Yao
- Department of Food Science and Whistler Center for Carbohydrate Research, Purdue University, West Lafayette, IN, USA
| | - Gabrielle Flint
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - James Li
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Sean P McDonald
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Duna Buttner
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Nicholas A Pudlo
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Matthew K Schnizlein
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Vincent B Young
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA.,Department of Internal Medicine, Infectious Diseases Division, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Harry Brumer
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Thomas M Schmidt
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Nicolas Terrapon
- Centre National de la Recherche Scientifique, Aix-Marseille Univ, Marseille, France.,Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Marseille, France
| | - Vincent Lombard
- Centre National de la Recherche Scientifique, Aix-Marseille Univ, Marseille, France.,Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Marseille, France
| | - Bernard Henrissat
- Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia.,Technical University of Denmark, DTU Bioengineering, Lyngby, Denmark
| | - Bruce Hamaker
- Department of Food Science and Whistler Center for Carbohydrate Research, Purdue University, West Lafayette, IN, USA
| | | | - Ashootosh Tripathi
- Life Sciences Institute: Natural Products Discovery Core, University of Michigan, Ann Arbor, MI, USA
| | - Phillip B Pope
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway. .,Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway.
| | - Eric C Martens
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
4
|
Pudlo NA, Pereira GV, Parnami J, Cid M, Markert S, Tingley JP, Unfried F, Ali A, Varghese NJ, Kim KS, Campbell A, Urs K, Xiao Y, Adams R, Martin D, Bolam DN, Becher D, Eloe-Fadrosh EA, Schmidt TM, Abbott DW, Schweder T, Hehemann JH, Martens EC. Diverse events have transferred genes for edible seaweed digestion from marine to human gut bacteria. Cell Host Microbe 2022; 30:314-328.e11. [PMID: 35240043 PMCID: PMC9096808 DOI: 10.1016/j.chom.2022.02.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2020] [Revised: 11/03/2021] [Accepted: 02/02/2022] [Indexed: 12/16/2022]
Abstract
Humans harbor numerous species of colonic bacteria that digest fiber polysaccharides in commonly consumed terrestrial plants. More recently in history, regional populations have consumed edible macroalgae seaweeds containing unique polysaccharides. It remains unclear how extensively gut bacteria have adapted to digest these nutrients. Here, we show that the ability of gut bacteria to digest seaweed polysaccharides is more pervasive than previously appreciated. Enrichment-cultured Bacteroides harbor previously discovered genes for seaweed degradation, which have mobilized into several members of this genus. Additionally, other examples of marine bacteria-derived genes, and their mobile DNA elements, are involved in gut microbial degradation of seaweed polysaccharides, including genes in gut-resident Firmicutes. Collectively, these results uncover multiple separate events that have mobilized the genes encoding seaweed-degrading-enzymes into gut bacteria. This work further underscores the metabolic plasticity of the human gut microbiome and global exchange of genes in the context of dietary selective pressures.
Collapse
Affiliation(s)
- Nicholas A Pudlo
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| | | | - Jaagni Parnami
- Max Planck Institute for Marine Biology, Bremen, Germany
| | - Melissa Cid
- Max Planck Institute for Marine Biology, Bremen, Germany
| | - Stephanie Markert
- Pharmaceutical Biotechnology, University of Greifswald, 17487 Greifswald, Germany; Institute of Marine Biotechnology, 17489 Greifswald, Germany
| | - Jeffrey P Tingley
- Agriculture and Agri-Food Canada, Lethbridge Research and Development Centre, Lethbridge, AB, Canada
| | - Frank Unfried
- Institute of Marine Biotechnology, 17489 Greifswald, Germany
| | - Ahmed Ali
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| | | | - Kwi S Kim
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Austin Campbell
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Karthik Urs
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yao Xiao
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Ryan Adams
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Duña Martin
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| | - David N Bolam
- Institute for Cell and Molecular Biosciences, Newcastle University, Newcastle upon Tyne, UK
| | | | | | - Thomas M Schmidt
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA; Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - D Wade Abbott
- Agriculture and Agri-Food Canada, Lethbridge Research and Development Centre, Lethbridge, AB, Canada
| | - Thomas Schweder
- Pharmaceutical Biotechnology, University of Greifswald, 17487 Greifswald, Germany; Institute of Marine Biotechnology, 17489 Greifswald, Germany
| | - Jan Hendrik Hehemann
- Max Planck Institute for Marine Biology, Bremen, Germany; University of Bremen, Center for Marine Environmental Sciences (MARUM), 28359 Bremen, Germany.
| | - Eric C Martens
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
5
|
Mukherjee S, Seshadri R, Varghese NJ, Eloe-Fadrosh EA, Meier-Kolthoff JP, Göker M, Coates RC, Hadjithomas M, Pavlopoulos GA, Paez-Espino D, Yoshikuni Y, Visel A, Whitman WB, Garrity GM, Eisen JA, Hugenholtz P, Pati A, Ivanova NN, Woyke T, Klenk HP, Kyrpides NC. Erratum: 1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life. Nat Biotechnol 2018. [DOI: 10.1038/nbt0418-368c] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
6
|
Varghese NJ, Mukherjee S, Ivanova N, Konstantinidis KT, Mavrommatis K, Kyrpides NC, Pati A. Microbial species delineation using whole genome sequences. Nucleic Acids Res 2015; 43:6761-71. [PMID: 26150420 PMCID: PMC4538840 DOI: 10.1093/nar/gkv657] [Citation(s) in RCA: 444] [Impact Index Per Article: 49.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2014] [Accepted: 06/12/2015] [Indexed: 01/25/2023] Open
Abstract
Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wide Average Nucleotide Identity (gANI) metric, which is widely acknowledged as a robust measure of genomic relatedness. In this work, we demonstrate that the combination of gANI and the alignment fraction (AF) between two genomes accurately reflects their genomic relatedness. We introduce an efficient implementation of AF,gANI and discuss its successful application to 86.5M genome pairs between 13,151 prokaryotic genomes assigned to 3032 species. Subsequently, by comparing the genome clusters obtained from complete linkage clustering of these pairs to existing taxonomy, we observed that nearly 18% of all prokaryotic species suffer from anomalies in species definition. Our results can be used to explore central questions such as whether microorganisms form a continuum of genetic diversity or distinct species represented by distinct genetic signatures. We propose that this precise and objective AF,gANI-based species definition: the MiSI (Microbial Species Identifier) method, be used to address previous inconsistencies in species classification and as the primary guide for new taxonomic species assignment, supplemented by the traditional polyphasic approach, as required.
Collapse
Affiliation(s)
- Neha J Varghese
- Microbial and Metagenome Superprogram, DOE Joint Genomic Institute, Walnut Creek, CA 94598, USA
| | - Supratim Mukherjee
- Microbial and Metagenome Superprogram, DOE Joint Genomic Institute, Walnut Creek, CA 94598, USA
| | - Natalia Ivanova
- Microbial and Metagenome Superprogram, DOE Joint Genomic Institute, Walnut Creek, CA 94598, USA
| | | | | | - Nikos C Kyrpides
- Microbial and Metagenome Superprogram, DOE Joint Genomic Institute, Walnut Creek, CA 94598, USA
| | - Amrita Pati
- Microbial and Metagenome Superprogram, DOE Joint Genomic Institute, Walnut Creek, CA 94598, USA
| |
Collapse
|