1
|
Liu R, Qiao X, Shi Y, Peterson CB, Bush WS, Cominelli F, Wang M, Zhang L. Constructing phylogenetic trees for microbiome data analysis: A mini-review. Comput Struct Biotechnol J 2024; 23:3859-3868. [PMID: 39554614 PMCID: PMC11564040 DOI: 10.1016/j.csbj.2024.10.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2024] [Revised: 10/20/2024] [Accepted: 10/20/2024] [Indexed: 11/19/2024] Open
Abstract
As next-generation sequencing technologies advance rapidly and the cost of metagenomic sequencing continues to decrease, researchers now face an unprecedented volume of microbiome data. This surge has stimulated the development of scalable microbiome data analysis methods and necessitated the incorporation of phylogenetic information into microbiome analysis for improved accuracy. Tools for constructing phylogenetic trees from 16S rRNA sequencing data are well-established, as the highly conserved regions of the 16S gene are limited, simplifying the identification of marker genes. In contrast, metagenomic and whole genome shotgun (WGS) sequencing involve sequencing from random fragments of the entire gene, making identification of consistent marker genes challenging owing to the vast diversity of genomic regions, resulting in a scarcity of robust tools for constructing phylogenetic trees. Although bacterial sequence tree construction tools exist for upstream bioinformatics, many downstream researchers-those integrating these trees into statistical models or machine learning-are either unaware of these tools or find them difficult to use due to the steep learning curve of processing raw sequences. This is compounded by the fact that public datasets often lack phylogenetic trees, providing only abundance tables and taxonomic classifications. To address this, we present a comprehensive review of phylogenetic tree construction techniques for microbiome data (16S rRNA or whole-genome shotgun sequencing). We outline the strengths and limitations of current methods, offering expert insights and step-by-step guidance to make these tools more accessible and widely applicable in quantitative microbiome data analysis.
Collapse
Affiliation(s)
- Ruitao Liu
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, 44106, OH, United States
| | - Xi Qiao
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, 44106, OH, United States
| | - Yushu Shi
- Weill Cornell Medicine, Cornell University, 1300 York Ave, New York, 10065, NY, United States
| | - Christine B. Peterson
- The University of Texas MD Anderson Cancer Center, 1515 Holcombe Blvd, Houston, 77030, TX, United States
| | - William S. Bush
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, 44106, OH, United States
| | - Fabio Cominelli
- Department of Pathology, School of Medicine, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, 44106, OH, United States
- Case Digestive Health Research Institute, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, 44106, OH, United States
| | - Ming Wang
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, 44106, OH, United States
| | - Liangliang Zhang
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, 44106, OH, United States
- Case Comprehensive Cancer Center, 10900 Euclid Avenue, Cleveland, 44106, OH, United States
| |
Collapse
|
2
|
Pu L, Shamir R. 4CAC: 4-class classifier of metagenome contigs using machine learning and assembly graphs. Nucleic Acids Res 2024; 52:e94. [PMID: 39287139 PMCID: PMC11514454 DOI: 10.1093/nar/gkae799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 07/13/2024] [Accepted: 09/02/2024] [Indexed: 09/19/2024] Open
Abstract
Microbial communities usually harbor a mix of bacteria, archaea, plasmids, viruses and microeukaryotes. Within these communities, viruses, plasmids, and microeukaryotes coexist in relatively low abundance, yet they engage in intricate interactions with bacteria. Moreover, viruses and plasmids, as mobile genetic elements, play important roles in horizontal gene transfer and the development of antibiotic resistance within microbial populations. However, due to the difficulty of identifying viruses, plasmids, and microeukaryotes in microbial communities, our understanding of these minor classes lags behind that of bacteria and archaea. Recently, several classifiers have been developed to separate one or more minor classes from bacteria and archaea in metagenome assemblies. However, these classifiers often overlook the issue of class imbalance, leading to low precision in identifying the minor classes. Here, we developed a classifier called 4CAC that is able to identify viruses, plasmids, microeukaryotes, and prokaryotes simultaneously from metagenome assemblies. 4CAC generates an initial four-way classification using several sequence length-adjusted XGBoost models and further improves the classification using the assembly graph. Evaluation on simulated and real metagenome datasets demonstrates that 4CAC substantially outperforms existing classifiers and combinations thereof on short reads. On long reads, it also shows an advantage unless the abundance of the minor classes is very low. 4CAC runs 1-2 orders of magnitude faster than the other classifiers. The 4CAC software is available at https://github.com/Shamir-Lab/4CAC.
Collapse
Affiliation(s)
- Lianrong Pu
- The Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel
- School of Computer Science and Technology, Shandong University, Qingdao, China
| | - Ron Shamir
- The Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
3
|
Balestrini VP, Pinto OHB, Simmons BA, Gladden JM, Krüger RH, Quirino BF. Analysis of novel bacterial metagenome-assembled genomes from lignin-degrading microbial consortia. CURRENT RESEARCH IN MICROBIAL SCIENCES 2024; 7:100302. [PMID: 39558935 PMCID: PMC11570740 DOI: 10.1016/j.crmicr.2024.100302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2024] Open
Abstract
Despite recent progress, bacterial degradation of lignin is not completely understood. To address the mechanisms that bacteria from unknown taxonomic groups use to perform lignin-monomer degradation, functional analysis of bacterial metagenome-assembled genomes from soil-derived consortia enriched for microorganisms capable of degrading lignin was performed. A total of 232 metagenome-assembled genomes were recovered. After applying quality criteria of at least 70 % genome completeness and contamination less than or equal to 10 %, 39 genomes were obtained. From these, a total of 14 genomes from bacteria of unknown classification at lower taxonomic levels (i.e., only classified to the order level or higher) were chosen for further functional analysis. A global analysis of the potential ecological functions of these bacteria was performed, followed by a detailed analysis of monolignol degradation pathways. The phylum with the highest number of genomes was Proteobacteria. The genomes presented functions consistent with soil-derived bacteria, like denitrification, with different metabolic capacities related to the sulfur, chlorine, arsenic and carbon cycles, in addition to the degradation of plant cell wall components like cellulose, hemicellulose, and lignin. The Sphingomonadales_OP 08 genome showed the greatest potential to degrade cellulose and hemicellulose, although it does not appear to be able to degrade lignin. The Actinobacteria_BY 70 genome presented the highest number of enzymes and pathways related to the degradation of monolignols; furthermore, it showed the greatest potential for aromatic ring breakage by different fission pathways. The genomes of the two Actinobacteria showed the caffeic acid pathway, an important phenolic compound presenting several biological properties, such as antimicrobial and antioxidant. To our knowledge, this is the first time this pathway has been reported in this class of bacteria.
Collapse
Affiliation(s)
- Vitória Pinheiro Balestrini
- Genetics and Biotechnology Laboratory, Embrapa Agroenergy, Brasília, DF, 70770-901, Brazil
- Microbial Biology Graduate Program, University of Brasília, Brasília, DF, 70790-900, Brazil
| | | | - Blake A. Simmons
- Joint BioEnergy Institute, Emeryville, CA, USA
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - John M. Gladden
- Joint BioEnergy Institute, Emeryville, CA, USA
- Department of Biomaterials and Biomanufacturing, Sandia National Laboratories, Livermore, CA, USA
| | - Ricardo Henrique Krüger
- Department of Cell Biology, Institute of Biological Sciences, University of Brasília, Brasília 70790-900, Brazil
| | - Betania Ferraz Quirino
- Genetics and Biotechnology Laboratory, Embrapa Agroenergy, Brasília, DF, 70770-901, Brazil
- Microbial Biology Graduate Program, University of Brasília, Brasília, DF, 70790-900, Brazil
| |
Collapse
|
4
|
Figueroa-Gonzalez PA, Bornemann TLV, Hinzke T, Maaß S, Trautwein-Schult A, Starke J, Moore CJ, Esser SP, Plewka J, Hesse T, Schmidt TC, Schreiber U, Bor B, Becher D, Probst AJ. Metaproteogenomics resolution of a high-CO 2 aquifer community reveals a complex cellular adaptation of groundwater Gracilibacteria to a host-dependent lifestyle. MICROBIOME 2024; 12:194. [PMID: 39369255 PMCID: PMC11452946 DOI: 10.1186/s40168-024-01889-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 07/29/2024] [Indexed: 10/07/2024]
Abstract
BACKGROUND Bacteria of the candidate phyla radiation (CPR), constituting about 25% of the bacterial biodiversity, are characterized by small cell size and patchy genomes without complete key metabolic pathways, suggesting a symbiotic lifestyle. Gracilibacteria (BD1-5), which are part of the CPR branch, possess alternate coded genomes and have not yet been cultivated. The lifestyle of Gracilibacteria, their temporal dynamics, and activity in natural ecosystems, particularly in groundwater, has remained largely unexplored. Here, we aimed to investigate Gracilibacteria activity in situ and to discern their lifestyle based on expressed genes, using the metaproteogenome of Gracilibacteria as a function of time in the cold-water geyser Wallender Born in the Volcanic Eifel region in Germany. RESULTS We coupled genome-resolved metagenomics and metaproteomics to investigate a cold-water geyser microbial community enriched in Gracilibacteria across a 12-day time-series. Groundwater was collected and sequentially filtered to fraction CPR and other bacteria. Based on 725 Gbps of metagenomic data, 1129 different ribosomal protein S3 marker genes, and 751 high-quality genomes (123 population genomes after dereplication), we identified dominant bacteria belonging to Gallionellales and Gracilibacteria along with keystone microbes, which were low in genomic abundance but substantially contributing to proteomic abundance. Seven high-quality Gracilibacteria genomes showed typical limitations, such as limited amino acid or nucleotide synthesis, in their central metabolism but no co-occurrence with potential hosts. The genomes of these Gracilibacteria were encoded for a high number of proteins involved in cell to cell interaction, supporting the previously surmised host-dependent lifestyle, e.g., type IV and type II secretion system subunits, transporters, and features related to cell motility, which were also detected on protein level. CONCLUSIONS We here identified microbial keystone taxa in a high-CO2 aquifer, and revealed microbial dynamics of Gracilibacteria. Although Gracilibacteria in this ecosystem did not appear to target specific organisms in this ecosystem due to lack of co-occurrence despite enrichment on 0.2-µm filter fraction, we provide proteomic evidence for the complex machinery behind the host-dependent lifestyle of groundwater Gracilibacteria. Video Abstract.
Collapse
Affiliation(s)
- Perla Abigail Figueroa-Gonzalez
- Environmental Metagenomics, Faculty of Chemistry, Research Center One Health of the University Alliance Ruhr, University of Duisburg-Essen, 45151, Essen, Germany
| | - Till L V Bornemann
- Environmental Metagenomics, Faculty of Chemistry, Research Center One Health of the University Alliance Ruhr, University of Duisburg-Essen, 45151, Essen, Germany
- Centre of Water and Environmental Research (ZWU), University of Duisburg-Essen, 45141, Essen, Germany
| | - Tjorven Hinzke
- Microbial Proteomics, Institute of Microbiology, University of Greifswald, 17489, Greifswald, Germany
- Department of Pathogen Evolution, Helmholtz Institute for One Health, 17489, Greifswald, Germany
- Microbial Physiology and Molecular Biology, Institute of Microbiology, University of Greifswald, Greifswald, 17489, Germany
| | - Sandra Maaß
- Microbial Proteomics, Institute of Microbiology, University of Greifswald, 17489, Greifswald, Germany
| | - Anke Trautwein-Schult
- Microbial Proteomics, Institute of Microbiology, University of Greifswald, 17489, Greifswald, Germany
| | - Joern Starke
- Environmental Metagenomics, Faculty of Chemistry, Research Center One Health of the University Alliance Ruhr, University of Duisburg-Essen, 45151, Essen, Germany
| | - Carrie J Moore
- Environmental Metagenomics, Faculty of Chemistry, Research Center One Health of the University Alliance Ruhr, University of Duisburg-Essen, 45151, Essen, Germany
| | - Sarah P Esser
- Environmental Metagenomics, Faculty of Chemistry, Research Center One Health of the University Alliance Ruhr, University of Duisburg-Essen, 45151, Essen, Germany
| | - Julia Plewka
- Environmental Metagenomics, Faculty of Chemistry, Research Center One Health of the University Alliance Ruhr, University of Duisburg-Essen, 45151, Essen, Germany
| | - Tobias Hesse
- Instrumental Analytical Chemistry and Centre for Water and Environmental Research (ZWU), University of Duisburg-Essen, Essen, 45141, Germany
| | - Torsten C Schmidt
- Centre of Water and Environmental Research (ZWU), University of Duisburg-Essen, 45141, Essen, Germany
- Instrumental Analytical Chemistry and Centre for Water and Environmental Research (ZWU), University of Duisburg-Essen, Essen, 45141, Germany
| | - Ulrich Schreiber
- Department of Geology, University of Duisburg-Essen, 45141, Essen, Germany
| | - Batbileg Bor
- Microbiology, The Forsyth Institute, Cambridge, MA, 02142, USA
| | - Dörte Becher
- Microbial Proteomics, Institute of Microbiology, University of Greifswald, 17489, Greifswald, Germany
| | - Alexander J Probst
- Environmental Metagenomics, Faculty of Chemistry, Research Center One Health of the University Alliance Ruhr, University of Duisburg-Essen, 45151, Essen, Germany.
- Centre of Water and Environmental Research (ZWU), University of Duisburg-Essen, 45141, Essen, Germany.
| |
Collapse
|
5
|
Pavelescu LA, Profir M, Enache RM, Roşu OA, Creţoiu SM, Gaspar BS. A Proteogenomic Approach to Unveiling the Complex Biology of the Microbiome. Int J Mol Sci 2024; 25:10467. [PMID: 39408795 PMCID: PMC11476728 DOI: 10.3390/ijms251910467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2024] [Revised: 09/26/2024] [Accepted: 09/26/2024] [Indexed: 10/20/2024] Open
Abstract
The complex biology of the microbiome was elucidated once the genomics era began. The proteogenomic approach analyzes and integrates genetic makeup (genomics) and microbial communities' expressed proteins (proteomics). Therefore, researchers gained insights into gene expression, protein functions, and metabolic pathways, understanding microbial dynamics and behavior, interactions with host cells, and responses to environmental stimuli. In this context, our work aims to bring together data regarding the application of genomics, proteomics, and bioinformatics in microbiome research and to provide new perspectives for applying microbiota modulation in clinical practice with maximum efficiency. This review also synthesizes data from the literature, shedding light on the potential biomarkers and therapeutic targets for various diseases influenced by the microbiome.
Collapse
Affiliation(s)
- Luciana Alexandra Pavelescu
- Department of Morphological Sciences, Cell and Molecular Biology and Histology, Carol Davila University of Medicine and Pharmacy, 050474 Bucharest, Romania; (L.A.P.); (M.P.); (O.A.R.)
| | - Monica Profir
- Department of Morphological Sciences, Cell and Molecular Biology and Histology, Carol Davila University of Medicine and Pharmacy, 050474 Bucharest, Romania; (L.A.P.); (M.P.); (O.A.R.)
- Department of Oncology, Elias University Emergency Hospital, 011461 Bucharest, Romania
| | - Robert Mihai Enache
- Department of Radiology and Medical Imaging, Fundeni Clinical Institute, 022328 Bucharest, Romania;
| | - Oana Alexandra Roşu
- Department of Morphological Sciences, Cell and Molecular Biology and Histology, Carol Davila University of Medicine and Pharmacy, 050474 Bucharest, Romania; (L.A.P.); (M.P.); (O.A.R.)
- Department of Oncology, Elias University Emergency Hospital, 011461 Bucharest, Romania
| | - Sanda Maria Creţoiu
- Department of Morphological Sciences, Cell and Molecular Biology and Histology, Carol Davila University of Medicine and Pharmacy, 050474 Bucharest, Romania; (L.A.P.); (M.P.); (O.A.R.)
| | - Bogdan Severus Gaspar
- Department of Surgery, Carol Davila University of Medicine and Pharmacy, 050474 Bucharest, Romania;
- Surgery Clinic, Bucharest Emergency Clinical Hospital, 014461 Bucharest, Romania
| |
Collapse
|
6
|
Ghezzi H, Fan YM, Ng KM, Burckhardt JC, Pepin DM, Lin X, Ziels RM, Tropini C. PUPpy: a primer design pipeline for substrain-level microbial detection and absolute quantification. mSphere 2024; 9:e0036024. [PMID: 38980072 PMCID: PMC11288016 DOI: 10.1128/msphere.00360-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 06/17/2024] [Indexed: 07/10/2024] Open
Abstract
Characterizing microbial communities at high resolution and with absolute quantification is crucial to unravel the complexity and diversity of microbial ecosystems. This can be achieved with PCR assays, which enable highly selective detection and absolute quantification of microbial DNA. However, a major challenge that has hindered PCR applications in microbiome research is the design of highly specific primer sets that exclusively amplify intended targets. Here, we introduce Phylogenetically Unique Primers in python (PUPpy), a fully automated pipeline to design microbe- and group-specific primers within a given microbial community. PUPpy can be executed from a user-friendly graphical user interface, or two simple terminal commands, and it only requires coding sequence files of the community members as input. PUPpy-designed primers enable the detection of individual microbes and quantification of absolute microbial abundance in defined communities below the strain level. We experimentally evaluated the performance of PUPpy-designed primers using two bacterial communities as benchmarks. Each community comprises 10 members, exhibiting a range of genetic similarities that spanned from different phyla to substrains. PUPpy-designed primers also enable the detection of groups of bacteria in an undefined community, such as the detection of a gut bacterial family in a complex stool microbiota sample. Taxon-specific primers designed with PUPpy showed 100% specificity to their intended targets, without unintended amplification, in each community tested. Lastly, we show the absolute quantification of microbial abundance using PUPpy-designed primers in droplet digital PCR, benchmarked against 16S rRNA and shotgun sequencing. Our data shows that PUPpy-designed microbe-specific primers can be used to quantify substrain-level absolute counts, providing more resolved and accurate quantification in defined communities than short-read 16S rRNA and shotgun sequencing. IMPORTANCE Profiling microbial communities at high resolution and with absolute quantification is essential to uncover hidden ecological interactions within microbial ecosystems. Nevertheless, achieving resolved and quantitative investigations has been elusive due to methodological limitations in distinguishing and quantifying highly related microbes. Here, we describe Phylogenetically Unique Primers in python (PUPpy), an automated computational pipeline to design taxon-specific primers within defined microbial communities. Taxon-specific primers can be used to selectively detect and quantify individual microbes and larger taxa within a microbial community. PUPpy achieves substrain-level specificity without the need for computationally intensive databases and prioritizes user-friendliness by enabling both terminal and graphical user interface applications. Altogether, PUPpy enables fast, inexpensive, and highly accurate perspectives into microbial ecosystems, supporting the characterization of bacterial communities in both in vitro and complex microbiota settings.
Collapse
Affiliation(s)
- Hans Ghezzi
- Department of Bioinformatics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Yiyun M. Fan
- Department of Cellular and Physiological Sciences, University of British Columbia, Vancouver, British Columbia, Canada
| | - Katharine M. Ng
- Department of Microbiology and Immunology, University of British Columbia, Vancouver, British Columbia, Canada
| | - Juan C. Burckhardt
- Department of Microbiology and Immunology, University of British Columbia, Vancouver, British Columbia, Canada
| | - Deanna M. Pepin
- Department of Microbiology and Immunology, University of British Columbia, Vancouver, British Columbia, Canada
| | - Xuan Lin
- Civil Engineering, The University of British Columbia, Vancouver, British Columbia, Canada
| | - Ryan M. Ziels
- Civil Engineering, The University of British Columbia, Vancouver, British Columbia, Canada
| | - Carolina Tropini
- Department of Bioinformatics, University of British Columbia, Vancouver, British Columbia, Canada
- Department of Microbiology and Immunology, University of British Columbia, Vancouver, British Columbia, Canada
- School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada
- Humans and the Microbiome Program, Canadian Institute for Advanced Research (CIFAR), Toronto, Ontario, Canada
| |
Collapse
|
7
|
Darabi A, Sobhani S, Aghdam R, Eslahchi C. AFITbin: a metagenomic contig binning method using aggregate l-mer frequency based on initial and terminal nucleotides. BMC Bioinformatics 2024; 25:241. [PMID: 39014300 PMCID: PMC11253361 DOI: 10.1186/s12859-024-05859-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2023] [Accepted: 07/09/2024] [Indexed: 07/18/2024] Open
Abstract
BACKGROUND Using next-generation sequencing technologies, scientists can sequence complex microbial communities directly from the environment. Significant insights into the structure, diversity, and ecology of microbial communities have resulted from the study of metagenomics. The assembly of reads into longer contigs, which are then binned into groups of contigs that correspond to different species in the metagenomic sample, is a crucial step in the analysis of metagenomics. It is necessary to organize these contigs into operational taxonomic units (OTUs) for further taxonomic profiling and functional analysis. For binning, which is synonymous with the clustering of OTUs, the tetra-nucleotide frequency (TNF) is typically utilized as a compositional feature for each OTU. RESULTS In this paper, we present AFIT, a new l-mer statistic vector for each contig, and AFITBin, a novel method for metagenomic binning based on AFIT and a matrix factorization method. To evaluate the performance of the AFIT vector, the t-SNE algorithm is used to compare species clustering based on AFIT and TNF information. In addition, the efficacy of AFITBin is demonstrated on both simulated and real datasets in comparison to state-of-the-art binning methods such as MetaBAT 2, MaxBin 2.0, CONCOT, MetaCon, SolidBin, BusyBee Web, and MetaBinner. To further analyze the performance of the purposed AFIT vector, we compare the barcodes of the AFIT vector and the TNF vector. CONCLUSION The results demonstrate that AFITBin shows superior performance in taxonomic identification compared to existing methods, leveraging the AFIT vector for improved results in metagenomic binning. This approach holds promise for advancing the analysis of metagenomic data, providing more reliable insights into microbial community composition and function. AVAILABILITY A python package is available at: https://github.com/SayehSobhani/AFITBin .
Collapse
Affiliation(s)
- Amin Darabi
- Department of Computer and Data Sciences, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran, Iran
| | - Sayeh Sobhani
- Department of Computer and Data Sciences, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran, Iran
- School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran
| | - Rosa Aghdam
- School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, 53715, USA
| | - Changiz Eslahchi
- Department of Computer and Data Sciences, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran, Iran.
- School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran.
| |
Collapse
|
8
|
Kang M, Kang M, Yoo J, Lee J, Lee S, Yun B, Song M, Kim JM, Kim HW, Yang J, Kim Y, Oh S. Dietary supplementation with Lacticaseibacillus rhamnosus IDCC3201 alleviates sarcopenia by modulating the gut microbiota and metabolites in dexamethasone-induced models. Food Funct 2024; 15:4936-4953. [PMID: 38602003 DOI: 10.1039/d3fo05420a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
Probiotics can exert direct or indirect influences on various aspects of health claims by altering the composition of the gut microbiome and producing bioactive metabolites. The aim of this study was to examine the effect of Lacticaseibacillus rhamnosus IDCC3201 on skeletal muscle atrophy in dexamethasone-induced C2C12 cells and a mouse animal model. Dexamethasone treatment significantly reduced C2C12 muscle cell viability, myotube diameter, and levels of muscle atrophic markers (Atrogin-1 and MuRF-1). These effects were alleviated by conditioned media (CM) and cell extract (EX) derived from L. rhamnosus IDCC3201. In addition, we assessed the in vivo therapeutic effect of L. rhamnosus IDCC3201 in a mouse model of dexamethasone (DEX)-induced muscle atrophy. Supplementation with IDCC3201 resulted in significant enhancements in body composition, particularly in lean mass, muscle strength, and myofibril size, in DEX-induced muscle atrophy mice. In comparison to the DEX-treatment group, the normal and DEX + L. rhamnosus IDCC3201 groups showed a higher transcriptional level of myosin heavy chain family genes (MHC1, MHC1b, MHC2A, 2bB, and 2X) and a reduction in atrophic muscle makers. These analyses revealed that L. rhamnosus IDCC3201 supplementation led to increased production of branched-chain amino acids (BCAAs) and improved the Allobaculum genus within the gut microbiota of muscle atrophy-induced groups. Taken together, our findings suggest that L. rhamnosus IDCC3201 represents a promising dietary supplement with the potential to alleviate sarcopenia by modulating the gut microbiome and metabolites.
Collapse
Affiliation(s)
- Minkyoung Kang
- Department of Food and Nutrition, Jeonju University, Jeonju 55069, Republic of Korea
| | - Minji Kang
- Department of Food and Nutrition, Jeonju University, Jeonju 55069, Republic of Korea
| | - Jiseon Yoo
- Department of Food and Nutrition, Jeonju University, Jeonju 55069, Republic of Korea
| | - Juyeon Lee
- Department of Food and Nutrition, Jeonju University, Jeonju 55069, Republic of Korea
| | - Sujeong Lee
- Department of Food and Nutrition, Jeonju University, Jeonju 55069, Republic of Korea
| | - Bohyun Yun
- Honam National Institute of Biological Resources, Mokpo 58762, Republic of Korea
| | - Minho Song
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Jun-Mo Kim
- Department of Animal Science and Technology, Chung-Ang University, Anseong 17546, Gyeonggi-do, Republic of Korea
| | - Hyung Wook Kim
- College of Life Sciences, Sejong University, Seoul 05006, Republic of Korea
| | - Jungwoo Yang
- Department of Microbiology, College of Medicine, Dongguk University, Gyeongju, 38066, Republic of Korea
| | - Younghoon Kim
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Science, Seoul National University, Seoul 08826, Republic of Korea
| | - Sangnam Oh
- Department of Food and Nutrition, Jeonju University, Jeonju 55069, Republic of Korea
| |
Collapse
|
9
|
Rocha U, Coelho Kasmanas J, Kallies R, Saraiva JP, Toscan RB, Štefanič P, Bicalho MF, Borim Correa F, Baştürk MN, Fousekis E, Viana Barbosa LM, Plewka J, Probst AJ, Baldrian P, Stadler PF. MuDoGeR: Multi-Domain Genome recovery from metagenomes made easy. Mol Ecol Resour 2024; 24:e13904. [PMID: 37994269 DOI: 10.1111/1755-0998.13904] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 10/18/2023] [Accepted: 11/13/2023] [Indexed: 11/24/2023]
Abstract
Several computational frameworks and workflows that recover genomes from prokaryotes, eukaryotes and viruses from metagenomes exist. Yet, it is difficult for scientists with little bioinformatics experience to evaluate quality, annotate genes, dereplicate, assign taxonomy and calculate relative abundance and coverage of genomes belonging to different domains. MuDoGeR is a user-friendly tool tailored for those familiar with Unix command-line environment that makes it easy to recover genomes of prokaryotes, eukaryotes and viruses from metagenomes, either alone or in combination. We tested MuDoGeR using 24 individual-isolated genomes and 574 metagenomes, demonstrating the applicability for a few samples and high throughput. While MuDoGeR can recover eukaryotic viral sequences, its characterization is predominantly skewed towards bacterial and archaeal viruses, reflecting the field's current state. However, acting as a dynamic wrapper, the MuDoGeR is designed to constantly incorporate updates and integrate new tools, ensuring its ongoing relevance in the rapidly evolving field. MuDoGeR is open-source software available at https://github.com/mdsufz/MuDoGeR. Additionally, MuDoGeR is also available as a Singularity container.
Collapse
Affiliation(s)
- Ulisses Rocha
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Jonas Coelho Kasmanas
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, Brazil
| | - René Kallies
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Joao Pedro Saraiva
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Rodolfo Brizola Toscan
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Polonca Štefanič
- Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Marcos Fleming Bicalho
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Felipe Borim Correa
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Merve Nida Baştürk
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Efthymios Fousekis
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Luiz Miguel Viana Barbosa
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Julia Plewka
- Environmental Microbiology and Biotechnology, Department of Chemistry, University of Duisburg-Essen, Essen, Germany
| | - Alexander J Probst
- Environmental Microbiology and Biotechnology, Department of Chemistry, University of Duisburg-Essen, Essen, Germany
| | - Petr Baldrian
- Laboratory of Environmental Microbiology, Institute of Microbiology of the Czech Academy of Sciences, Praha 4, Czech Republic
| | - Peter F Stadler
- Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany
- Institute for Theoretical Chemistry, University of Vienna, Vienna, Austria
- The Santa Fe Institute, Santa Fe, New Mexico, USA
| |
Collapse
|
10
|
Miao Y, Sun Z, Ma C, Lin C, Wang G, Yang C. VirGrapher: a graph-based viral identifier for long sequences from metagenomes. Brief Bioinform 2024; 25:bbae036. [PMID: 38343326 PMCID: PMC10859693 DOI: 10.1093/bib/bbae036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 01/15/2024] [Accepted: 01/18/2024] [Indexed: 02/15/2024] Open
Abstract
Viruses are the most abundant biological entities on earth and are important components of microbial communities. A metagenome contains all microorganisms from an environmental sample. Correctly identifying viruses from these mixed sequences is critical in viral analyses. It is common to identify long viral sequences, which has already been passed thought pipelines of assembly and binning. Existing deep learning-based methods divide these long sequences into short subsequences and identify them separately. This makes the relationships between them be omitted, leading to poor performance on identifying long viral sequences. In this paper, VirGrapher is proposed to improve the identification performance of long viral sequences by constructing relationships among short subsequences from long ones. VirGrapher see a long sequence as a graph and uses a Graph Convolutional Network (GCN) model to learn multilayer connections between nodes from sequences after a GCN-based node embedding model. VirGrapher achieves a better AUC value and accuracy on validation set, which is better than three benchmark methods.
Collapse
Affiliation(s)
- Yan Miao
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040, Heilongjiang Province, China
| | - Zhenyuan Sun
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040, Heilongjiang Province, China
| | - Chenjing Ma
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040, Heilongjiang Province, China
| | - Chen Lin
- National Institute for Data Science in Health and Medicine, Xiamen University, Xiangannan Road, 361104, Fujian Province, China
| | - Guohua Wang
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040, Heilongjiang Province, China
| | - Chunxue Yang
- College of Landscape Architecture, Northeast Forestry University, Hexing Road, 150040, Heilongjiang Province, China
| |
Collapse
|
11
|
Lou YC, Chen L, Borges AL, West-Roberts J, Firek BA, Morowitz MJ, Banfield JF. Infant gut DNA bacteriophage strain persistence during the first 3 years of life. Cell Host Microbe 2024; 32:35-47.e6. [PMID: 38096814 PMCID: PMC11156429 DOI: 10.1016/j.chom.2023.11.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 10/27/2023] [Accepted: 11/16/2023] [Indexed: 01/13/2024]
Abstract
Bacteriophages are key components of gut microbiomes, yet the phage colonization process in the infant gut remains uncertain. Here, we establish a large phage sequence database and use strain-resolved analyses to investigate DNA phage succession in infants throughout the first 3 years of life. Analysis of 819 fecal metagenomes collected from 28 full-term and 24 preterm infants and their mothers revealed that early-life phageome richness increases over time and reaches adult-like complexity by age 3. Approximately 9% of early phage colonizers, which are mostly maternally transmitted and infect Bacteroides, persist for 3 years and are more prevalent in full-term than in preterm infants. Although rare, phages with stop codon reassignment are more likely to persist than non-recoded phages and generally display an increase in in-frame reassigned stop codons over 3 years. Overall, maternal seeding, stop codon reassignment, host CRISPR-Cas locus prevalence, and diverse phage populations contribute to stable viral colonization.
Collapse
Affiliation(s)
- Yue Clare Lou
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA
| | - LinXing Chen
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA; Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94709, USA
| | - Adair L Borges
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Jacob West-Roberts
- Department of Environmental Science, Policy, and Management, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Brian A Firek
- Department of Surgery, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Michael J Morowitz
- Department of Surgery, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Jillian F Banfield
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA; Department of Environmental Science, Policy, and Management, University of California, Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
12
|
Gios E, Mosley OE, Hoggard M, Handley KM. High niche specificity and host genetic diversity of groundwater viruses. THE ISME JOURNAL 2024; 18:wrae035. [PMID: 38452204 PMCID: PMC10980836 DOI: 10.1093/ismejo/wrae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 02/14/2024] [Accepted: 02/29/2024] [Indexed: 03/09/2024]
Abstract
Viruses are key members of microbial communities that exert control over host abundance and metabolism, thereby influencing ecosystem processes and biogeochemical cycles. Aquifers are known to host taxonomically diverse microbial life, yet little is known about viruses infecting groundwater microbial communities. Here, we analysed 16 metagenomes from a broad range of groundwater physicochemistries. We recovered 1571 viral genomes that clustered into 468 high-quality viral operational taxonomic units. At least 15% were observed to be transcriptionally active, although lysis was likely constrained by the resource-limited groundwater environment. Most were unclassified (95%), and the remaining 5% were Caudoviricetes. Comparisons with viruses inhabiting other aquifers revealed no shared species, indicating substantial unexplored viral diversity. In silico predictions linked 22.4% of the viruses to microbial host populations, including to ultra-small prokaryotes, such as Patescibacteria and Nanoarchaeota. Many predicted hosts were associated with the biogeochemical cycling of carbon, nitrogen, and sulfur. Metabolic predictions revealed the presence of 205 putative auxiliary metabolic genes, involved in diverse processes associated with the utilization of the host's intracellular resources for biosynthesis and transformation reactions, including those involved in nucleotide sugar, glycan, cofactor, and vitamin metabolism. Viruses, prokaryotes overall, and predicted prokaryotic hosts exhibited narrow spatial distributions, and relative abundance correlations with the same groundwater parameters (e.g. dissolved oxygen, nitrate, and iron), consistent with host control over viral distributions. Results provide insights into underexplored groundwater viruses, and indicate the large extent to which viruses may manipulate microbial communities and biogeochemistry in the terrestrial subsurface.
Collapse
Affiliation(s)
- Emilie Gios
- School of Biological Sciences, The University of Auckland, Auckland 1010, New Zealand
- NINA, Norwegian Institute for Nature Research, Trondheim 7034, Norway
| | - Olivia E Mosley
- School of Biological Sciences, The University of Auckland, Auckland 1010, New Zealand
- NatureMetrics Ltd, Surrey Research Park, Guildford GU2 7HJ, United Kingdom
| | - Michael Hoggard
- School of Biological Sciences, The University of Auckland, Auckland 1010, New Zealand
| | - Kim M Handley
- School of Biological Sciences, The University of Auckland, Auckland 1010, New Zealand
| |
Collapse
|
13
|
Carroll-Portillo A, Lin DM, Lin HC. The Diversity of Bacteriophages in the Human Gut. Methods Mol Biol 2024; 2738:17-30. [PMID: 37966590 DOI: 10.1007/978-1-0716-3549-0_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2023]
Abstract
Bacteriophages, commonly referred to as phages, are viruses that infect bacteria and are among the most numerous microorganisms on the planet. They occur throughout nature occupying every habitat where their bacterial hosts can be found. Within these communities, phages are responsible for shaping the bacterial community structure and function through their interactions. Phages shape the community structure and function within the human gut but are also able to influence the human host. As such, there is increased interest in understanding the composition and activity of the gastrointestinal phages, although these studies have been hindered by the difficulties accompanying the study of the human gut. Here, we summarize the methods and findings pertaining to the diversity of the human gastrointestinal phages.
Collapse
Affiliation(s)
- Amanda Carroll-Portillo
- Division of Gastroenterology and Hepatology, University of New Mexico, Albuquerque, NM, USA.
| | - Derek M Lin
- Biomedical Research Institute of New Mexico, Albuquerque, NM, USA
| | - Henry C Lin
- Division of Gastroenterology and Hepatology, University of New Mexico, Albuquerque, NM, USA
- Medicine Service, New Mexico VA Health Care System, Albuquerque, NM, USA
| |
Collapse
|
14
|
Li Y, Rao G, Zhu G, Cheng C, Yuan L, Li C, Gao J, Tang J, Wang Z, Li W. Dysbiosis of lower respiratory tract microbiome are associated with proinflammatory states in non-small cell lung cancer patients. Thorac Cancer 2024; 15:111-121. [PMID: 38041547 PMCID: PMC10788479 DOI: 10.1111/1759-7714.15166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 11/02/2023] [Accepted: 11/03/2023] [Indexed: 12/03/2023] Open
Abstract
BACKGROUND The lung has a sophisticated microbiome, and respiratory illnesses are greatly influenced by the lung microbiota. Despite the fact that numerous studies have shown that lung cancer patients have a dysbiosis as compared to healthy people, more research is needed to explore the association between the microbiota dysbiosis and immune profile within the tumor microenvironment (TME). METHODS In this study, we performed metagenomic sequencing of tumor and normal tissues from 61 non-small cell lung cancer (NSCLC) patients and six patients with other lung diseases. In order to characterize the impact of the microbes in TME, the cytokine concentrations of 24 lung tumor and normal tissues were detected using a multiple cytokine panel. RESULTS Our results showed that tumors had lower microbiota diversity than the paired normal tissues, and the microbiota of NSCLC was enriched in Proteobacteria, Firmicutes, and Actinobacteria. In addition, proinflammatory cytokines such as IL-8, MIF, TNF- α, and so on, were significantly upregulated in tumor tissues. CONCLUSION We discovered a subset of bacteria linked to host inflammatory signaling pathways and, more precisely, to particular immune cells. We determined that lower airway microbiome dysbiosis may be linked to the disruption of the equilibrium of the immune system causing lung inflammation. The spread of lung cancer may be linked to specific bacteria.
Collapse
Affiliation(s)
- Yangqian Li
- Department of Respiratory and Critical Care Medicine, Institute of Respiratory Health, Frontiers Science Center for Disease‐related Molecular Network, West China HospitalSichuan UniversityChengduChina
| | - Guanhua Rao
- Genskey Medical Technology Co., LtdBeijingChina
| | - Guonian Zhu
- Department of Respiratory and Critical Care Medicine, Institute of Respiratory Health, Frontiers Science Center for Disease‐related Molecular Network, West China HospitalSichuan UniversityChengduChina
| | - Cheng Cheng
- Department of Respiratory and Critical Care Medicine, Institute of Respiratory Health, Frontiers Science Center for Disease‐related Molecular Network, West China HospitalSichuan UniversityChengduChina
| | - Lijuan Yuan
- Genskey Medical Technology Co., LtdBeijingChina
| | - Chengpin Li
- Department of Respiratory and Critical Care Medicine, Institute of Respiratory Health, Frontiers Science Center for Disease‐related Molecular Network, West China HospitalSichuan UniversityChengduChina
| | | | - Jun Tang
- Department of Respiratory and Critical Care Medicine, Institute of Respiratory Health, Frontiers Science Center for Disease‐related Molecular Network, West China HospitalSichuan UniversityChengduChina
| | - Zhoufeng Wang
- Department of Respiratory and Critical Care Medicine, Institute of Respiratory Health, Frontiers Science Center for Disease‐related Molecular Network, West China HospitalSichuan UniversityChengduChina
| | - Weimin Li
- Department of Respiratory and Critical Care Medicine, Institute of Respiratory Health, Frontiers Science Center for Disease‐related Molecular Network, West China HospitalSichuan UniversityChengduChina
| |
Collapse
|
15
|
Mahmud MR, Tamanna SK, Akter S, Mazumder L, Akter S, Hasan MR, Acharjee M, Esti IZ, Islam MS, Shihab MMR, Nahian M, Gulshan R, Naser S, Pirttilä AM. Role of bacteriophages in shaping gut microbial community. Gut Microbes 2024; 16:2390720. [PMID: 39167701 PMCID: PMC11340752 DOI: 10.1080/19490976.2024.2390720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 08/05/2024] [Accepted: 08/06/2024] [Indexed: 08/23/2024] Open
Abstract
Phages are the most diversified and dominant members of the gut virobiota. They play a crucial role in shaping the structure and function of the gut microbial community and consequently the health of humans and animals. Phages are found mainly in the mucus, from where they can translocate to the intestinal organs and act as a modulator of gut microbiota. Understanding the vital role of phages in regulating the composition of intestinal microbiota and influencing human and animal health is an emerging area of research. The relevance of phages in the gut ecosystem is supported by substantial evidence, but the importance of phages in shaping the gut microbiota remains unclear. Although information regarding general phage ecology and development has accumulated, detailed knowledge on phage-gut microbe and phage-human interactions is lacking, and the information on the effects of phage therapy in humans remains ambiguous. In this review, we systematically assess the existing data on the structure and ecology of phages in the human and animal gut environments, their development, possible interaction, and subsequent impact on the gut ecosystem dynamics. We discuss the potential mechanisms of prophage activation and the subsequent modulation of gut bacteria. We also review the link between phages and the immune system to collect evidence on the effect of phages on shaping the gut microbial composition. Our review will improve understanding on the influence of phages in regulating the gut microbiota and the immune system and facilitate the development of phage-based therapies for maintaining a healthy and balanced gut microbiota.
Collapse
Affiliation(s)
- Md. Rayhan Mahmud
- Department of Production Animal Medicine, Faculty of Veterinary Medicine, University of Helsinki, Helsinki, Finland
| | | | - Sharmin Akter
- Department of Microbiology, Jagannath University, Dhaka, Bangladesh
| | - Lincon Mazumder
- Department of Microbiology, Jagannath University, Dhaka, Bangladesh
- Department of Biology, Texas A&M University, College Station, TX, USA
| | - Sumona Akter
- Department of Microbiology, Jagannath University, Dhaka, Bangladesh
| | | | - Mrityunjoy Acharjee
- Department of Microbiology, Stamford University Bangladesh, Dhaka, Bangladesh
| | - Israt Zahan Esti
- Department of Microbiology, Jagannath University, Dhaka, Bangladesh
- Department of Molecular Systems Biology, Faculty of Technology, University of Turku, Turku, Finland
| | - Md. Saidul Islam
- Department of Microbiology, Jagannath University, Dhaka, Bangladesh
| | | | - Md. Nahian
- Department of Microbiology, Jagannath University, Dhaka, Bangladesh
| | - Rubaiya Gulshan
- Department of Microbiology, Jagannath University, Dhaka, Bangladesh
| | - Sadia Naser
- Department of Microbiology, Jagannath University, Dhaka, Bangladesh
| | | |
Collapse
|
16
|
Kim N, Kim CY, Ma J, Yang S, Park DJ, Ha SJ, Belenky P, Lee I. MRGM: an enhanced catalog of mouse gut microbial genomes substantially broadening taxonomic and functional landscapes. Gut Microbes 2024; 16:2393791. [PMID: 39230075 PMCID: PMC11376411 DOI: 10.1080/19490976.2024.2393791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Revised: 08/12/2024] [Accepted: 08/13/2024] [Indexed: 09/05/2024] Open
Abstract
Mouse gut microbiome research is pivotal for understanding the human gut microbiome, providing insights into disease modeling, host-microbe interactions, and the dietary influence on the gut microbiome. To enhance the translational value of mouse gut microbiome studies, we need detailed and high-quality catalogs of mouse gut microbial genomes. We introduce the Mouse Reference Gut Microbiome (MRGM), a comprehensive catalog with 42,245 non-redundant mouse gut bacterial genomes across 1,524 species. MRGM marks a 40% increase in the known taxonomic diversity of mouse gut microbes, capturing previously underrepresented lineages through refined genome quality assessment techniques. MRGM not only broadens the taxonomic landscape but also enriches the functional landscape of the mouse gut microbiome. Using deep learning, we have elevated the Gene Ontology annotation rate for mouse gut microbial proteins from 3.2% with orthology to 60%, marking an over 18-fold increase. MRGM supports both DNA- and marker-based taxonomic profiling by providing custom databases, surpassing previous catalogs in performance. Finally, taxonomic and functional comparisons between human and mouse gut microbiota reveal diet-driven divergences in their taxonomic composition and functional enrichment. Overall, our study highlights the value of high-quality microbial genome catalogs in advancing our understanding of the co-evolution between gut microbes and their host.
Collapse
Affiliation(s)
- Nayeon Kim
- Department of Biotechnology, College of Life Science & Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Chan Yeong Kim
- Department of Biotechnology, College of Life Science & Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Junyeong Ma
- Department of Biotechnology, College of Life Science & Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Sunmo Yang
- Department of Biotechnology, College of Life Science & Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Dong Jin Park
- Department of Biochemistry, College of Life Science & Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Sang-Jun Ha
- Department of Biochemistry, College of Life Science & Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Peter Belenky
- Department of Molecular Microbiology and Immunology, Brown University, Providence, RI, USA
| | - Insuk Lee
- Department of Biotechnology, College of Life Science & Biotechnology, Yonsei University, Seoul, Republic of Korea
- POSTECH Biotech Center, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| |
Collapse
|
17
|
Richie TG, Heeren L, Kamke A, Monk K, Pogranichniy S, Summers T, Wiechman H, Ran Q, Sarkar S, Plattner BL, Lee STM. Limitation of amino acid availability by bacterial populations during enhanced colitis in IBD mouse model. mSystems 2023; 8:e0070323. [PMID: 37909786 PMCID: PMC10746178 DOI: 10.1128/msystems.00703-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 09/27/2023] [Indexed: 11/03/2023] Open
Abstract
IMPORTANCE Inflammatory bowel disease is associated with an increase in Enterobacteriaceae and Enterococcus species; however, the specific mechanisms are unclear. Previous research has reported the associations between microbiota and inflammation, here we investigate potential pathways that specific bacteria populations use to drive gut inflammation. Richie et al. show that these bacterial populations utilize an alternate sulfur metabolism and are tolerant of host-derived immune-response products. These metabolic pathways drive host gut inflammation and fuel over colonization of these pathobionts in the dysbiotic colon. Cultured isolates from dysbiotic mice indicated faster growth supplemented with L-cysteine, showing these microbes can utilize essential host nutrients.
Collapse
Affiliation(s)
- Tanner G. Richie
- Division of Biology, Kansas State University, Manhattan, Kansas, USA
| | - Leah Heeren
- Division of Biology, Kansas State University, Manhattan, Kansas, USA
| | - Abigail Kamke
- Division of Biology, Kansas State University, Manhattan, Kansas, USA
| | - Kourtney Monk
- Division of Biology, Kansas State University, Manhattan, Kansas, USA
| | | | - Trey Summers
- Division of Biology, Kansas State University, Manhattan, Kansas, USA
| | - Hallie Wiechman
- Division of Biology, Kansas State University, Manhattan, Kansas, USA
| | - Qinghong Ran
- Division of Biology, Kansas State University, Manhattan, Kansas, USA
| | - Soumyadev Sarkar
- Division of Biology, Kansas State University, Manhattan, Kansas, USA
| | - Brandon L. Plattner
- Department of Diagnostic Medicine and Pathobiology, Kansas State University, Manhattan, Kansas, USA
| | - Sonny T. M. Lee
- Division of Biology, Kansas State University, Manhattan, Kansas, USA
| |
Collapse
|
18
|
de Oliveira Vieira KC, da Silva ABB, Felício SA, Lira FS, de Figueiredo C, Bezirtzoglou E, Pereira VC, Nakagaki WR, Nai GA, Winkelströter LK. Orange juice containing Pediococcus acidilactici CE51 modulates the intestinal microbiota and reduces induced inflammation in a murine model of colitis. Sci Rep 2023; 13:18513. [PMID: 37898635 PMCID: PMC10613252 DOI: 10.1038/s41598-023-45819-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Accepted: 10/24/2023] [Indexed: 10/30/2023] Open
Abstract
The management of inflammatory bowel diseases has been widely investigated, especially ulcerative colitis. Thus, studies with the application of new probiotic products are needed in the prevention/treatment of these clinical conditions. The objective of this work was to evaluate the effects of probiotic orange juice containing Pediococcus acidilactici CE51 in a murine model of colitis. 45 male Swiss lineage mice were used, divided into five groups (n = 9): control, colitis, colitis + probiotic (probiotic orange juice containing CE51), colitis + placebo (orange juice) and colitis + sulfasalazine (10 mg/kg/Weight). The induction of colitis was performed with dextran sodium sulfate (3%). The treatment time was 5 and 15 days after induction. Histopathological analysis, serum measurements of TNF-α and C-reactive protein and metagenomic analysis of feces were performed after euthanasia. Probiotic treatment reduced inflammation in the small intestine, large intestine and spleen. The probiotic did not alter the serum dosages of TNF-α and C-reactive protein. Their use maintained the quantitative ratio of the phylum Firmicutes/Bacteroidetes and increased Lactobacillus helveticus with 15 days of treatment (p < 0.05). The probiotic orange juice containing P. acidilactici CE51 positively modulated the gut microbiota composition and attenuated the inflammation induced in colitis.
Collapse
Affiliation(s)
- Karolinny Cristiny de Oliveira Vieira
- Health Sciences Faculty, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Cidade Universitária, Presidente Prudente, Sao Paulo, 19050-920, Brazil
| | - Ana Beatriz Batista da Silva
- Master in Health Science, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Presidente Prudente, Sao Paulo, 19050-920, Brazil
| | - Suelen Aparecida Felício
- Master in Health Science, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Presidente Prudente, Sao Paulo, 19050-920, Brazil
| | - Fábio Santos Lira
- Department of Physical Education, Faculdade de Ciências e Tecnologia, Universidade Estadual Paulista, UNESP, Rua Roberto Simonsen, 305, Presidente Prudente, Sao Paulo, 19060-900, Brazil
| | - Caíque de Figueiredo
- Department of Physical Education, Faculdade de Ciências e Tecnologia, Universidade Estadual Paulista, UNESP, Rua Roberto Simonsen, 305, Presidente Prudente, Sao Paulo, 19060-900, Brazil
| | - Eugenia Bezirtzoglou
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Dragana, 68100, Alexandroupolis, Greece
| | - Valéria Cataneli Pereira
- Health Sciences Faculty, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Cidade Universitária, Presidente Prudente, Sao Paulo, 19050-920, Brazil
- Master in Health Science, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Presidente Prudente, Sao Paulo, 19050-920, Brazil
| | - Wilson Romero Nakagaki
- Health Sciences Faculty, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Cidade Universitária, Presidente Prudente, Sao Paulo, 19050-920, Brazil
- Master in Health Science, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Presidente Prudente, Sao Paulo, 19050-920, Brazil
| | - Gisele Alborghetti Nai
- Health Sciences Faculty, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Cidade Universitária, Presidente Prudente, Sao Paulo, 19050-920, Brazil
| | - Lizziane Kretli Winkelströter
- Health Sciences Faculty, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Cidade Universitária, Presidente Prudente, Sao Paulo, 19050-920, Brazil.
- Master in Health Science, UNOESTE (University of Western Sao Paulo), 700, Jose Bongiovani St., Presidente Prudente, Sao Paulo, 19050-920, Brazil.
| |
Collapse
|
19
|
Wang Z, Guo M, Li J, Jiang C, Yang S, Zheng S, Li M, Ai X, Xu X, Zhang W, He X, Wang Y, Chen Y. Composition and functional profiles of gut microbiota reflect the treatment stage, severity, and etiology of acute pancreatitis. Microbiol Spectr 2023; 11:e0082923. [PMID: 37698429 PMCID: PMC10580821 DOI: 10.1128/spectrum.00829-23] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Accepted: 07/13/2023] [Indexed: 09/13/2023] Open
Abstract
Acute pancreatitis (AP) is a type of digestive system disease with high mortality. Previous studies have shown that gut microbiota can participate in developing and treating acute pancreatitis by affecting the host's metabolism. In this study, we followed 20 AP patients to generate longitudinal gut microbiota profiles and activity during disease (before treatment, on the third day of treatment, and 1 month after discharge). We analyzed species composition and metabolic pathways' changes across the treatment phase, severity, and etiology. The diversity of the gut microbiome of patients with AP did not show much variation with treatment. In contrast, the metabolic functions of the gut microbiota, such as the essential chemical reactions that produce energy and maintain life, were partially reinstated after treatment. The severe AP (SAP) patients contained less beneficial bacteria (i.e., Bacteroides xylanisolvens, Clostridium lavalense, and Roseburia inulinivorans) and weaker sugar degradation function than mild AP patients before treatment. Moreover, etiology was one of the drivers of gut microbiome composition and explained the 3.54% variation in species' relative abundance. The relative abundance of pathways related to lipid synthesis was higher in the gut of hyperlipidemia AP patients than in biliary AP patients. The composition and functional profiles of the gut microbiota reflect the severity and etiology of AP. Otherwise, we also identified bacterial species associated with SAP, i.e., Oscillibacter sp. 57_20, Parabacteroides johnsonii, Bacteroides stercoris, Methanobrevibacter smithii, Ruminococcus lactaris, Coprococcus comes, and Dorea formicigenerans, which have the potential to identify the SAP at an early stage. IMPORTANCE Acute pancreatitis (AP) is a type of digestive system disease with high mortality. Previous studies have shown that gut microbiota can participate in the development and treatment of acute pancreatitis by affecting the host's metabolism. However, fewer studies acquired metagenomic sequencing data to associate species to functions intuitively and performed longitudinal analysis to explore how gut microbiota influences the development of AP. We followed 20 AP patients to generate longitudinal gut microbiota profiles and activity during disease and studied the differences in intestinal flora under different severities and etiologies. We have two findings. First, the gut microbiota profile has the potential to identify the severity and etiology of AP at an early stage. Second, gut microbiota likely acts synergistically in the development of AP. This study provides a reference for characterizing the driver flora of severe AP to identify the severity of acute pancreatitis at an early stage.
Collapse
Affiliation(s)
- Zhenjiang Wang
- Department of Gastroenterology, Zhuhai Hospital Affiliated with Jinan University (Zhuhai People’s Hospital), Zhuhai, China
| | - Mingyi Guo
- Department of Gastroenterology, Zhuhai Hospital Affiliated with Jinan University (Zhuhai People’s Hospital), Zhuhai, China
| | - Jing Li
- School of Management, University of Science and Technology of China, Hefei, Anhui, China
- Department of Research and Development, Shenzhen Byoryn Technology Co., Ltd., Shenzhen, China
| | - Chuangming Jiang
- Department of Gastroenterology, Gaolangang Branch of Zhuhai People’s Hospital (Hospital of Gaolangang), Zhuhai, China
| | - Sen Yang
- Department of Gastroenterology, Zhuhai Hospital Affiliated with Jinan University (Zhuhai People’s Hospital), Zhuhai, China
| | - Shizhuo Zheng
- Department of Gastroenterology, Zhuhai Hospital Affiliated with Jinan University (Zhuhai People’s Hospital), Zhuhai, China
| | - Mingzhe Li
- College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, China
| | - Xinbo Ai
- Department of Gastroenterology, Zhuhai Hospital Affiliated with Jinan University (Zhuhai People’s Hospital), Zhuhai, China
| | - Xiaohong Xu
- Department of Gastroenterology, Zhuhai Hospital Affiliated with Jinan University (Zhuhai People’s Hospital), Zhuhai, China
| | - Wenbo Zhang
- College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, China
| | - Xingxiang He
- Department of Gastroenterology, The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Yinan Wang
- Department of Obstetrics and Gynecology, Peking University Shenzhen Hospital, Shenzhen, China
| | - Yuping Chen
- Department of Gastroenterology, Zhuhai Hospital Affiliated with Jinan University (Zhuhai People’s Hospital), Zhuhai, China
| |
Collapse
|
20
|
Ju NP, Liu J, He Q. SNP-Slice Resolves Mixed Infections: Simultaneously Unveiling Strain Haplotypes and Linking Them to Hosts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.29.551098. [PMID: 37546891 PMCID: PMC10402141 DOI: 10.1101/2023.07.29.551098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
Multi-strain infection is a common yet under-investigated phenomenon of many pathogens. Currently, biologists analyzing SNP information have to discard mixed infection samples, because existing downstream analyses require monogenomic inputs. Such a protocol impedes our understanding of the underlying genetic diversity, co-infection patterns, and genomic relatedness of pathogens. A reliable tool to learn and resolve the SNP haplotypes from polygenomic data is an urgent need in molecular epidemiology. In this work, we develop a slice sampling Markov Chain Monte Carlo algorithm, named SNP-Slice, to learn not only the SNP haplotypes of all strains in the populations but also which strains infect which hosts. Our method reconstructs SNP haplotypes and individual heterozygosities accurately without reference panels and outperforms the state of art methods at estimating the multiplicity of infections and allele frequencies. Thus, SNP-Slice introduces a novel approach to address polygenomic data and opens a new avenue for resolving complex infection patterns in molecular surveillance. We illustrate the performance of SNP-Slice on empirical malaria and HIV datasets and provide recommendations for the practical use of the method.
Collapse
|
21
|
Liao H, Ji Y, Sun Y. High-resolution strain-level microbiome composition analysis from short reads. MICROBIOME 2023; 11:183. [PMID: 37587527 PMCID: PMC10433603 DOI: 10.1186/s40168-023-01615-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 07/07/2023] [Indexed: 08/18/2023]
Abstract
BACKGROUND Bacterial strains under the same species can exhibit different biological properties, making strain-level composition analysis an important step in understanding the dynamics of microbial communities. Metagenomic sequencing has become the major means for probing the microbial composition in host-associated or environmental samples. Although there are a plethora of composition analysis tools, they are not optimized to address the challenges in strain-level analysis: highly similar strain genomes and the presence of multiple strains under one species in a sample. Thus, this work aims to provide a high-resolution and more accurate strain-level analysis tool for short reads. RESULTS In this work, we present a new strain-level composition analysis tool named StrainScan that employs a novel tree-based k-mers indexing structure to strike a balance between the strain identification accuracy and the computational complexity. We tested StrainScan extensively on a large number of simulated and real sequencing data and benchmarked StrainScan with popular strain-level analysis tools including Krakenuniq, StrainSeeker, Pathoscope2, Sigma, StrainGE, and StrainEst. The results show that StrainScan has higher accuracy and resolution than the state-of-the-art tools on strain-level composition analysis. It improves the F1 score by 20% in identifying multiple strains at the strain level. CONCLUSIONS By using a novel k-mer indexing structure, StrainScan is able to provide strain-level analysis with higher resolution than existing tools, enabling it to return more informative strain composition analysis in one sample or across multiple samples. StrainScan takes short reads and a set of reference strains as input and its source codes are freely available at https://github.com/liaoherui/StrainScan . Video Abstract.
Collapse
Affiliation(s)
- Herui Liao
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, China
| | - Yongxin Ji
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, China
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, China.
| |
Collapse
|
22
|
Miao Y, Bian J, Dong G, Dai T. DETIRE: a hybrid deep learning model for identifying viral sequences from metagenomes. Front Microbiol 2023; 14:1169791. [PMID: 37396369 PMCID: PMC10313334 DOI: 10.3389/fmicb.2023.1169791] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 05/18/2023] [Indexed: 07/04/2023] Open
Abstract
A metagenome contains all DNA sequences from an environmental sample, including viruses, bacteria, archaea, and eukaryotes. Since viruses are of huge abundance and have caused vast mortality and morbidity to human society in history as a type of major pathogens, detecting viruses from metagenomes plays a crucial role in analyzing the viral component of samples and is the very first step for clinical diagnosis. However, detecting viral fragments directly from the metagenomes is still a tough issue because of the existence of a huge number of short sequences. In this study a hybrid Deep lEarning model for idenTifying vIral sequences fRom mEtagenomes (DETIRE) is proposed to solve the problem. First, the graph-based nucleotide sequence embedding strategy is utilized to enrich the expression of DNA sequences by training an embedding matrix. Then, the spatial and sequential features are extracted by trained CNN and BiLSTM networks, respectively, to enrich the features of short sequences. Finally, the two sets of features are weighted combined for the final decision. Trained by 220,000 sequences of 500 bp subsampled from the Virus and Host RefSeq genomes, DETIRE identifies more short viral sequences (<1,000 bp) than the three latest methods, such as DeepVirFinder, PPR-Meta, and CHEER. DETIRE is freely available at Github (https://github.com/crazyinter/DETIRE).
Collapse
|
23
|
Watson AR, Füssel J, Veseli I, DeLongchamp JZ, Silva M, Trigodet F, Lolans K, Shaiber A, Fogarty E, Runde JM, Quince C, Yu MK, Söylev A, Morrison HG, Lee STM, Kao D, Rubin DT, Jabri B, Louie T, Eren AM. Metabolic independence drives gut microbial colonization and resilience in health and disease. Genome Biol 2023; 24:78. [PMID: 37069665 PMCID: PMC10108530 DOI: 10.1186/s13059-023-02924-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Accepted: 04/07/2023] [Indexed: 04/19/2023] Open
Abstract
BACKGROUND Changes in microbial community composition as a function of human health and disease states have sparked remarkable interest in the human gut microbiome. However, establishing reproducible insights into the determinants of microbial succession in disease has been a formidable challenge. RESULTS Here we use fecal microbiota transplantation (FMT) as an in natura experimental model to investigate the association between metabolic independence and resilience in stressed gut environments. Our genome-resolved metagenomics survey suggests that FMT serves as an environmental filter that favors populations with higher metabolic independence, the genomes of which encode complete metabolic modules to synthesize critical metabolites, including amino acids, nucleotides, and vitamins. Interestingly, we observe higher completion of the same biosynthetic pathways in microbes enriched in IBD patients. CONCLUSIONS These observations suggest a general mechanism that underlies changes in diversity in perturbed gut environments and reveal taxon-independent markers of "dysbiosis" that may explain why widespread yet typically low-abundance members of healthy gut microbiomes can dominate under inflammatory conditions without any causal association with disease.
Collapse
Affiliation(s)
- Andrea R Watson
- Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA
- Committee On Microbiology, The University of Chicago, Chicago, IL, 60637, USA
| | - Jessika Füssel
- Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, 26129, Oldenburg, Germany
| | - Iva Veseli
- Biophysical Sciences Program, The University of Chicago, Chicago, IL, 60637, USA
| | | | - Marisela Silva
- Department of Medicine, The University of Calgary, Calgary, AB, T2N 1N4, Canada
| | - Florian Trigodet
- Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA
| | - Karen Lolans
- Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA
| | - Alon Shaiber
- Biophysical Sciences Program, The University of Chicago, Chicago, IL, 60637, USA
| | - Emily Fogarty
- Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA
- Committee On Microbiology, The University of Chicago, Chicago, IL, 60637, USA
| | - Joseph M Runde
- Department of Pediatrics, Lurie Children's Hospital of Chicago, Chicago, IL, 60611, USA
| | - Christopher Quince
- Organisms and Ecosystems, Earlham Institute, Norwich, Norwich, NR4 7UZ, UK
- Gut Microbes and Health, Quadram Institute, Norwich, NR4 7UQ, UK
| | - Michael K Yu
- Toyota Technological Institute at Chicago, Chicago, IL, 60637, USA
| | - Arda Söylev
- Department of Computer Engineering, Konya Food and Agriculture University, Konya, Turkey
| | - Hilary G Morrison
- Marine Biological Laboratory, Josephine Bay Paul Center, Woods Hole, Falmouth, MA, 02543, USA
| | - Sonny T M Lee
- Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA
| | - Dina Kao
- Department of Medicine, University of Alberta, Edmonton, AB, T6G 2G3, Canada
| | - David T Rubin
- Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA
| | - Bana Jabri
- Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA
| | - Thomas Louie
- Department of Medicine, The University of Calgary, Calgary, AB, T2N 1N4, Canada
| | - A Murat Eren
- Department of Medicine, The University of Chicago, Chicago, IL, 60637, USA.
- Committee On Microbiology, The University of Chicago, Chicago, IL, 60637, USA.
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, 26129, Oldenburg, Germany.
- Marine Biological Laboratory, Josephine Bay Paul Center, Woods Hole, Falmouth, MA, 02543, USA.
- Helmholtz Institute for Functional Marine Biodiversity, 26129, Oldenburg, Germany.
| |
Collapse
|
24
|
Liu Q, Liu F, Miao Y, He J, Dong T, Hou T, Liu Y. Virsearcher: Identifying Bacteriophages from Metagenomes by Combining Convolutional Neural Network and Gene Information. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:763-774. [PMID: 35316191 DOI: 10.1109/tcbb.2022.3161135] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Metagenome sequencing provides an unprecedented opportunity for the discovery of unknown microbes and viruses. A large number of phages and prokaryotes are mixed together in metagenomes. To study the influence of phages on human bodies and environments, it is of great significance to isolate phages from metagenomes. However, it is difficult to identify novel phages because of the diversity of their sequences and the frequent presence of short contigs in metagenomes. Here, virSearcher is developed to identify phages from metagenomes by combining the convolutional neural network (CNN) and the gene information of input sequences. Firstly, an input sequence is encoded in accordance with the different functions of its coding and the non-coding regions and then is converted into word embedding code through a word embedding layer before a convolutional layer. Meanwhile, the hit ratio of the virus genes is combined with the output of the CNN to further improve the performance of the network. The genes used by virSearcher consist of complete and incomplete genes. Experiments on several metagenomes have showed that, compared with others, virSearcher can significantly improve the performance for the identification of short sequences, while maintaining the performance for long ones. The source code of virSearcher is freely available from http://github.com/DrJackson18/virSearcher.
Collapse
|
25
|
Affiliation(s)
- Mads Albertsen
- Center for Microbial Communities, Aalborg University, Aalborg, Denmark.
| |
Collapse
|
26
|
Genome-Centric Dynamics Shape the Diversity of Oral Bacterial Populations. mBio 2022; 13:e0241422. [PMID: 36214570 PMCID: PMC9765137 DOI: 10.1128/mbio.02414-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Two major viewpoints have been put forward for how microbial populations change, differing in whether adaptation is driven principally by gene-centric or genome-centric processes. Longitudinal sampling at microbially relevant timescales, i.e., days to weeks, is critical for distinguishing these mechanisms. Because of its significance for both microbial ecology and human health and its accessibility and high level of curation, we used the oral microbiota to study bacterial intrapopulation genome dynamics. Metagenomes were generated by shotgun sequencing of total community DNA from the healthy tongues of 17 volunteers at four to seven time points obtained over intervals of days to weeks. We obtained 390 high-quality metagenome-assembled genomes (MAGs) defining population genomes from 55 genera. The vast majority of genes in each MAG were tightly linked over the 2-week sampling window, indicating that the majority of the population's genomes were temporally stable at the MAG level. MAG-defined populations were composed of up to 5 strains, as determined by single-nucleotide-variant frequencies. Although most were stable over time, individual strains carrying over 100 distinct genes that rose from low abundance to dominance in a population over a period of days were detected. These results indicate a genome-wide as opposed to a gene-level process of population change. We infer that genome-wide selection of ecotypes is the dominant mode of adaptation in the oral populations over short timescales. IMPORTANCE The oral microbiome represents a microbial community of critical relevance to human health. Recent studies have documented the diversity and dynamics of different bacteria to reveal a rich, stable ecosystem characterized by strain-level dynamics. However, bacterial populations and their genomes are neither monolithic nor static; their genomes are constantly evolving to lose, gain, or alter their functional potential. To better understand how microbial genomes change in complex communities, we used culture-independent approaches to reconstruct the genomes (MAGs) for bacterial populations that approximated different species, in 17 healthy donors' mouths over a 2-week window. Our results underscored the importance of strain-level dynamics, which agrees with and expands on the conclusions of previous research. Altogether, these observations reveal patterns of genomic dynamics among strains of oral bacteria occurring over a matter of days.
Collapse
|
27
|
Jiang Z, Li X, Guo L. Binning Metagenomic Contigs Using Unsupervised Clustering and Reference Databases. Interdiscip Sci 2022; 14:795-803. [PMID: 35639335 DOI: 10.1007/s12539-022-00526-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 04/23/2022] [Accepted: 04/27/2022] [Indexed: 06/15/2023]
Abstract
Metagenomics can directly extract the genetic material of all microorganisms from the environment, and obtain metagenomic samples with a large number of unknown DNA sequences. Binning of metagenomic contigs is a hot topic in metagenomics research. There are two key challenges for the current unsupervised metagenomic clustering algorithms. First, unsupervised metagenomic clustering methods rarely use reference databases, causing a certain waste of resources. Second, unsupervised metagenomic clustering methods are restricted by the characteristics of the sequences and the clustering algorithms, and the binning effect is limited. Therefore, a new binning method for metagenomic contigs using unsupervised clustering methods and reference databases is proposed to address these challenges, to make full use of the advantages of unsupervised clustering methods and reference databases constructed by scientists to improve the overall binning effect. This method uses the integrated SVM classification model to further bin the unsupervised clustering parts that do not perform well. Our proposed method was tested on simulated datasets and a real dataset and compared with other state-of-the-art metagenomic clustering methods including CONCOCT, Metabin2.0, Autometa, and MetaBAT. The results show that our method can achieve higher precision rate and improve the binning effect.
Collapse
Affiliation(s)
- Zhongjun Jiang
- College of Information Science and Technology, Ningbo University, Ningbo, 315211, China
| | - Xiaobo Li
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, 321004, China.
| | - Lijun Guo
- College of Information Science and Technology, Ningbo University, Ningbo, 315211, China
| |
Collapse
|
28
|
Madival SD, Mishra DC, Sharma A, Kumar S, Maji AK, Budhlakoti N, Sinha D, Rai A. A Deep Clustering-based Novel Approach for Binning of Metagenomics Data. Curr Genomics 2022; 23:353-368. [PMID: 36778191 PMCID: PMC9878855 DOI: 10.2174/1389202923666220928150100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Revised: 08/30/2022] [Accepted: 09/02/2022] [Indexed: 11/22/2022] Open
Abstract
Background One major challenge in binning Metagenomics data is the limited availability of reference datasets, as only 1% of the total microbial population is yet cultured. This has given rise to the efficacy of unsupervised methods for binning in the absence of any reference datasets. Objective To develop a deep clustering-based binning approach for Metagenomics data and to evaluate results with suitable measures. Methods In this study, a deep learning-based approach has been taken for binning the Metagenomics data. The results are validated on different datasets by considering features such as Tetra-nucleotide frequency (TNF), Hexa-nucleotide frequency (HNF) and GC-Content. Convolutional Autoencoder is used for feature extraction and for binning; the K-means clustering method is used. Results In most cases, it has been found that evaluation parameters such as the Silhouette index and Rand index are more than 0.5 and 0.8, respectively, which indicates that the proposed approach is giving satisfactory results. The performance of the developed approach is compared with current methods and tools using benchmarked low complexity simulated and real metagenomic datasets. It is found better for unsupervised and at par with semi-supervised methods. Conclusion An unsupervised advanced learning-based approach for binning has been proposed, and the developed method shows promising results for various datasets. This is a novel approach for solving the lack of reference data problem of binning in metagenomics.
Collapse
Affiliation(s)
| | | | - Anu Sharma
- Division of Agriculture Bioinformatics, ICAR-IASRI, New Delhi- 110012, India
| | - Sanjeev Kumar
- Division of Agriculture Bioinformatics, ICAR-IASRI, New Delhi- 110012, India
| | - Arpan Kumar Maji
- Division of Computer Applications, ICAR-IASRI, New Delhi- 110012, India
| | - Neeraj Budhlakoti
- Division of Agriculture Bioinformatics, ICAR-IASRI, New Delhi- 110012, India
| | - Dipro Sinha
- Division of Agriculture Bioinformatics, ICAR-IASRI, New Delhi- 110012, India
| | - Anil Rai
- Division of Agriculture Bioinformatics, ICAR-IASRI, New Delhi- 110012, India
| |
Collapse
|
29
|
Mallawaarachchi V, Lin Y. Accurate Binning of Metagenomic Contigs Using Composition, Coverage, and Assembly Graphs. J Comput Biol 2022; 29:1357-1376. [DOI: 10.1089/cmb.2022.0262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Vijini Mallawaarachchi
- School of Computing, College of Engineering and Computer Science, Australian National University, Canberra, Australia
| | - Yu Lin
- School of Computing, College of Engineering and Computer Science, Australian National University, Canberra, Australia
| |
Collapse
|
30
|
Arredondo-Hernandez R, Siebe C, Castillo-Rojas G, Ponce de León S, López-Vidal Y. The synergistic interaction of systemic inflammation, dysbiosis and antimicrobial resistance promotes growth restriction in children with acute severe malnutrition: An emphasis on Escherichia coli. FRONTIERS IN ANTIBIOTICS 2022; 1:1001717. [PMID: 39816412 PMCID: PMC11732057 DOI: 10.3389/frabi.2022.1001717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/23/2022] [Accepted: 10/05/2022] [Indexed: 01/18/2025]
Abstract
A healthy development is denied to millions of children worldwide as harsh life conditions manifest themselves in an altered inflammation-prone microbiome crosstalk environment. Keynote of this tragedy is that insufficient nutritious amino acid blocks lipids-intake to sustain diverse microbiota, and promotes the generalist strategy followed by Escherichia coli -besides other proteobacteria- of shifting gut metabolism, subverting the site specificity of first immune reaction. Furthermore, it could be hypothesized that selective success lies in their ability to induce inflammation, since this phenomenon also fuels horizontal gene transfer (HGT). In this review, we dilucidate how immune mechanisms of environmental enteric dysfunction affect overgrowth restriction, infectious morbidity rate, and acquired lifelong risks among severe acute malnourished children. Also, despite acknowledging complexities of antimicrobial resistant enrichment, we explore and speculate over the links between virulence regulation and HGT as an indissociable part in the quest for new inflammatory niches by open genome bacteria, particularly when both collide in the most vulnerable.
Collapse
Affiliation(s)
- Rene Arredondo-Hernandez
- Laboratorio de Microbioma, División de Investigación y División de Posgrado, Facultad de Medicina, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Christina Siebe
- Instituto de Geología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Gonzalo Castillo-Rojas
- Programa de Inmunología Molecular Microbiana, Departamento de Microbiología y Parasitología, Facultad de Medicina, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Samuel Ponce de León
- Laboratorio de Microbioma, División de Investigación y División de Posgrado, Facultad de Medicina, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Yolanda López-Vidal
- Programa de Inmunología Molecular Microbiana, Departamento de Microbiología y Parasitología, Facultad de Medicina, Universidad Nacional Autónoma de México, Mexico City, Mexico
| |
Collapse
|
31
|
Baniel A, Petrullo L, Mercer A, Reitsema L, Sams S, Beehner JC, Bergman TJ, Snyder-Mackler N, Lu A. Maternal effects on early-life gut microbiota maturation in a wild nonhuman primate. Curr Biol 2022; 32:4508-4520.e6. [PMID: 36099914 DOI: 10.1016/j.cub.2022.08.037] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 06/14/2022] [Accepted: 08/15/2022] [Indexed: 11/25/2022]
Abstract
Early-life microbial colonization is an important process shaping host physiology,1-3 immunity,4-6 and long-term health outcomes7-10 in humans. However, our understanding of this dynamic process remains poorly investigated in wild animals,11-13 where developmental mechanisms can be better understood within ecological and evolutionarily relevant contexts.11,12 Using one of the largest developmental datasets on a wild primate-the gelada (Theropithecus gelada)-we used 16S rRNA amplicon sequencing to characterize gut microbiota maturation during the first 3 years of life and assessed the role of maternal effects in shaping offspring microbiota assembly. In contrast to recent data on chimpanzees, postnatal microbial colonization in geladas was highly similar to humans:14 microbial alpha diversity increased rapidly following birth, followed by gradual changes in composition until weaning. Dietary changes associated with weaning (from milk- to plant-based diet) were the main drivers of shifts in taxonomic composition and microbial predicted functional pathways. Maternal effects were also an important factor influencing the offspring gut microbiota. During nursing (<12 months), offspring of experienced (multi-time) mothers exhibited faster functional microbial maturation, likely reflecting the general faster developmental pace of infants born to these mothers. Following weaning (>18 months), the composition of the juvenile microbiota tended to be more similar to the maternal microbiota than to the microbiota of other adult females, highlighting that maternal effects may persist even after nursing cessation.15,16 Together, our findings highlight the dynamic nature of early-life gut colonization and the role of maternal effects in shaping this trajectory in a wild primate.
Collapse
Affiliation(s)
- Alice Baniel
- Center for Evolution and Medicine, Arizona State University, E Tyler Mall, Tempe, AZ 85281, USA; School of Life Sciences, Arizona State University, E Tyler Mall, Tempe, AZ 85287, USA.
| | - Lauren Petrullo
- Department of Psychology, University of Michigan, Church St., Ann Arbor, MI 48109, USA
| | - Arianne Mercer
- Department of Psychology, University of Washington, Okanogan Ln., Seattle, WA 98195, USA
| | - Laurie Reitsema
- Department of Anthropology, University of Georgia, Jackson St., Athens, GA 30602, USA
| | - Sierra Sams
- Department of Psychology, University of Washington, Okanogan Ln., Seattle, WA 98195, USA
| | - Jacinta C Beehner
- Department of Psychology, University of Michigan, Church St., Ann Arbor, MI 48109, USA; Department of Anthropology, University of Michigan, S University Ave., Ann Arbor, MI 48109, USA
| | - Thore J Bergman
- Department of Psychology, University of Michigan, Church St., Ann Arbor, MI 48109, USA; Department of Ecology and Evolutionary Biology, University of Michigan, N University Ave., Ann Arbor, MI 48109, USA
| | - Noah Snyder-Mackler
- Center for Evolution and Medicine, Arizona State University, E Tyler Mall, Tempe, AZ 85281, USA; School of Life Sciences, Arizona State University, E Tyler Mall, Tempe, AZ 85287, USA; Department of Psychology, University of Washington, Okanogan Ln., Seattle, WA 98195, USA; School for Human Evolution and Social Change, Arizona State University, Cady Mall, Tempe, AZ 85287, USA.
| | - Amy Lu
- Department of Anthropology, Stony Brook University, Circle Rd., Stony Brook, NY 11794, USA.
| |
Collapse
|
32
|
Tolstoganov I, Kamenev Y, Kruglikov R, Ochkalova S, Korobeynikov A. BinSPreader: Refine binning results for fuller MAG reconstruction. iScience 2022; 25:104770. [PMID: 35992057 PMCID: PMC9386100 DOI: 10.1016/j.isci.2022.104770] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Revised: 06/20/2022] [Accepted: 07/12/2022] [Indexed: 11/02/2022] Open
Abstract
Despite the recent advances in high-throughput sequencing, metagenome analysis of microbial populations still remains a challenge. In particular, the metagenome-assembled genomes (MAGs) are often fragmented due to interspecies repeats, uneven coverage, and varying strain abundance. MAGs are constructed via a binning process that uses features of input data in order to cluster long contigs presumably belonging to the same species. In this work, we present BinSPreader-a binning refiner tool that exploits the assembly graph topology and other connectivity information to refine binning, correct binning errors, and propagate binning to shorter contigs. We show that BinSPreader could increase the completeness of the bins without sacrificing the purity and could predict contigs belonging to several MAGs. BinSPreader is effective in binning shorter contigs that often contain important conservative sequences that might be of great interest to researchers.
Collapse
Affiliation(s)
- Ivan Tolstoganov
- Center for Algorithmic Biotechnology, Saint Petersburg State University, Saint Petersburg, 199004, Russia
| | - Yuri Kamenev
- ITMO University, Saint Petersburg 197101, Russia
| | | | - Sofia Ochkalova
- Applied Genomics Laboratory, SCAMT Institute, ITMO University, Saint Petersburg 197101, Russia
| | - Anton Korobeynikov
- Center for Algorithmic Biotechnology, Saint Petersburg State University, Saint Petersburg, 199004, Russia
- Department of Statistical Modelling, Saint Petersburg State University, Saint Petersburg, 198504, Russia
| |
Collapse
|
33
|
Lamurias A, Sereika M, Albertsen M, Hose K, Nielsen TD. Metagenomic binning with assembly graph embeddings. Bioinformatics 2022; 38:4481-4487. [PMID: 35972375 PMCID: PMC9525014 DOI: 10.1093/bioinformatics/btac557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 08/02/2022] [Accepted: 08/12/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Despite recent advancements in sequencing technologies and assembly methods, obtaining high-quality microbial genomes from metagenomic samples is still not a trivial task. Current metagenomic binners do not take full advantage of assembly graphs and are not optimized for long-read assemblies. Deep graph learning algorithms have been proposed in other fields to deal with complex graph data structures. The graph structure generated during the assembly process could be integrated with contig features to obtain better bins with deep learning. RESULTS We propose GraphMB, which uses graph neural networks to incorporate the assembly graph into the binning process. We test GraphMB on long-read datasets of different complexities, and compare the performance with other binners in terms of the number of High Quality (HQ) genome bins obtained. With our approach, we were able to obtain unique bins on all real datasets, and obtain more bins on most datasets. In particular, we obtained on average 17.5% more HQ bins when compared with state-of-the-art binners and 13.7% when aggregating the results of our binner with the others. These results indicate that a deep learning model can integrate contig-specific and graph-structure information to improve metagenomic binning. AVAILABILITY AND IMPLEMENTATION GraphMB is available from https://github.com/MicrobialDarkMatter/GraphMB. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Mantas Sereika
- Center for Microbial Communities, Department of Chemistry and Bioscience, Aalborg University, 9000 Aalborg, Denmark
| | | | | | | |
Collapse
|
34
|
Litos A, Intze E, Pavlidis P, Lagkouvardos I. Cronos: A Machine Learning Pipeline for Description and Predictive Modeling of Microbial Communities Over Time. FRONTIERS IN BIOINFORMATICS 2022; 2:866902. [PMID: 36304308 PMCID: PMC9580867 DOI: 10.3389/fbinf.2022.866902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 06/15/2022] [Indexed: 11/13/2022] Open
Abstract
Microbial time-series analysis, typically, examines the abundances of individual taxa over time and attempts to assign etiology to observed patterns. This approach assumes homogeneous groups in terms of profiles and response to external effectors. These assumptions are not always fulfilled, especially in complex natural systems, like the microbiome of the human gut. It is actually established that humans with otherwise the same demographic or dietary backgrounds can have distinct microbial profiles. We suggest an alternative approach to the analysis of microbial time-series, based on the following premises: 1) microbial communities are organized in distinct clusters of similar composition at any time point, 2) these intrinsic subsets of communities could have different responses to the same external effects, and 3) the fate of the communities is largely deterministic given the same external conditions. Therefore, tracking the transition of communities, rather than individual taxa, across these states, can enhance our understanding of the ecological processes and allow the prediction of future states, by incorporating applied effects. We implement these ideas into Cronos, an analytical pipeline written in R. Cronos’ inputs are a microbial composition table (e.g., OTU table), their phylogenetic relations as a tree, and the associated metadata. Cronos detects the intrinsic microbial profile clusters on all time points, describes them in terms of composition, and records the transitions between them. Cluster assignments, combined with the provided metadata, are used to model the transitions and predict samples’ fate under various effects. We applied Cronos to available data from growing infants’ gut microbiomes, and we observe two distinct trajectories corresponding to breastfed and formula-fed infants that eventually converge to profiles resembling those of mature individuals. Cronos is freely available at https://github.com/Lagkouvardos/Cronos.
Collapse
Affiliation(s)
- Aristeidis Litos
- School of Medicine, University of Crete, Heraklion, Greece
- Institute of Computer Science, Foundation of Research and Technology, Heraklion, Greece
| | - Evangelia Intze
- School of Science and Technology, Hellenic Open University, Patras, Greece
| | - Pavlos Pavlidis
- Institute of Computer Science, Foundation of Research and Technology, Heraklion, Greece
| | - Ilias Lagkouvardos
- Institute of Computer Science, Foundation of Research and Technology, Heraklion, Greece
- Core Facility Microbiome—ZIEL Institute for Food and Health, Technical University of Munich, Freising, Germany
- *Correspondence: Ilias Lagkouvardos,
| |
Collapse
|
35
|
Sereika M, Kirkegaard RH, Karst SM, Michaelsen TY, Sørensen EA, Wollenberg RD, Albertsen M. Oxford Nanopore R10.4 long-read sequencing enables the generation of near-finished bacterial genomes from pure cultures and metagenomes without short-read or reference polishing. Nat Methods 2022; 19:823-826. [PMID: 35789207 PMCID: PMC9262707 DOI: 10.1038/s41592-022-01539-7] [Citation(s) in RCA: 186] [Impact Index Per Article: 62.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 05/24/2022] [Indexed: 12/26/2022]
Abstract
Long-read Oxford Nanopore sequencing has democratized microbial genome sequencing and enables the recovery of highly contiguous microbial genomes from isolates or metagenomes. However, to obtain near-finished genomes it has been necessary to include short-read polishing to correct insertions and deletions derived from homopolymer regions. Here, we show that Oxford Nanopore R10.4 can be used to generate near-finished microbial genomes from isolates or metagenomes without short-read or reference polishing.
Collapse
Affiliation(s)
- Mantas Sereika
- Center for Microbial Communities, Aalborg University, Aalborg, Denmark
| | - Rasmus Hansen Kirkegaard
- Center for Microbial Communities, Aalborg University, Aalborg, Denmark.,Joint Microbiome Facility, University of Vienna, Vienna, Austria
| | | | | | | | | | - Mads Albertsen
- Center for Microbial Communities, Aalborg University, Aalborg, Denmark.
| |
Collapse
|
36
|
Chandrasiri S, Perera T, Dilhara A, Perera I, Mallawaarachchi V. CH-Bin: A Convex Hull Based Approach for Binning Metagenomic Contigs. Comput Biol Chem 2022; 100:107734. [DOI: 10.1016/j.compbiolchem.2022.107734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 07/12/2022] [Indexed: 11/30/2022]
|
37
|
Tiamani K, Luo S, Schulz S, Xue J, Costa R, Khan Mirzaei M, Deng L. The role of virome in the gastrointestinal tract and beyond. FEMS Microbiol Rev 2022; 46:6608358. [PMID: 35700129 PMCID: PMC9629487 DOI: 10.1093/femsre/fuac027] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Revised: 06/07/2022] [Accepted: 06/09/2022] [Indexed: 01/11/2023] Open
Abstract
The human gut virome is comprised of diverse commensal and pathogenic viruses. The colonization by these viruses begins right after birth through vaginal delivery, then continues through breastfeeding, and broader environmental exposure. Their constant interaction with their bacterial hosts in the body shapes not only our microbiomes but us. In addition, these viruses interact with the immune cells, trigger a broad range of immune responses, and influence different metabolic pathways. Besides its key role in regulating the human gut homeostasis, the intestinal virome contributes to disease development in distant organs, both directly and indirectly. In this review, we will describe the changes in the gut virome through life, health, and disease, followed by discussing the interactions between the virome, the microbiome, and the human host as well as providing an overview of their contribution to gut disease and disease of distant organs.
Collapse
Affiliation(s)
| | | | - Sarah Schulz
- Institute of Virology, Helmholtz Centre Munich — German Research Centre for Environmental Health, 85764 Neuherberg, Germany,Chair of Microbial Disease Prevention, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Jinling Xue
- Institute of Virology, Helmholtz Centre Munich — German Research Centre for Environmental Health, 85764 Neuherberg, Germany,Chair of Microbial Disease Prevention, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Rita Costa
- Institute of Virology, Helmholtz Centre Munich — German Research Centre for Environmental Health, 85764 Neuherberg, Germany,Chair of Microbial Disease Prevention, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Mohammadali Khan Mirzaei
- Institute of Virology, Helmholtz Centre Munich — German Research Centre for Environmental Health, 85764 Neuherberg, Germany,Chair of Microbial Disease Prevention, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Li Deng
- Corresponding author: Institute of Virology, Helmholtz Centre Munich — German Research Centre for Environmental Health, 85764 Neuherberg, Germany; Chair of Prevention of Microbial Diseases, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany. E-mail:
| |
Collapse
|
38
|
Sinha D, Sharma A, Mishra DC, Rai A, Lal SB, Kumar S, Farooqi MS, Chaturvedi KK. MetaConClust - Unsupervised Binning of Metagenomics Data using Consensus Clustering. Curr Genomics 2022; 23:137-146. [PMID: 36778980 PMCID: PMC9878838 DOI: 10.2174/1389202923666220413114659] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 01/18/2022] [Accepted: 02/21/2022] [Indexed: 11/22/2022] Open
Abstract
Background: Binning of metagenomic reads is an active area of research, and many unsupervised machine learning-based techniques have been used for taxonomic independent binning of metagenomic reads. Objective: It is important to find the optimum number of the cluster as well as develop an efficient pipeline for deciphering the complexity of the microbial genome. Methods: Applying unsupervised clustering techniques for binning requires finding the optimal number of clusters beforehand and is observed to be a difficult task. This paper describes a novel method, MetaConClust, using coverage information for grouping of contigs and automatically finding the optimal number of clusters for binning of metagenomics data using a consensus-based clustering approach. The coverage of contigs in a metagenomics sample has been observed to be directly proportional to the abundance of species in the sample and is used for grouping of data in the first phase by MetaConClust. The Partitioning Around Medoid (PAM) method is used for clustering in the second phase for generating bins with the initial number of clusters determined automatically through a consensus-based method. Results: Finally, the quality of the obtained bins is tested using silhouette index, rand Index, recall, precision, and accuracy. Performance of MetaConClust is compared with recent methods and tools using benchmarked low complexity simulated and real metagenomic datasets and is found better for unsupervised and comparable for hybrid methods. Conclusion: This is suggestive of the proposition that the consensus-based clustering approach is a promising method for automatically finding the number of bins for metagenomics data.
Collapse
Affiliation(s)
- Dipro Sinha
- These authors contributed equally to this work
| | - Anu Sharma
- Address correspondence to this author at the Division of Agriculture Bioinformatics, ICAR-IASRI, New Delhi- 110012, India; E-mail:
| | | | | | | | | | | | | |
Collapse
|
39
|
Liu F, Miao Y, Liu Y, Hou T. RNN-VirSeeker: A Deep Learning Method for Identification of Short Viral Sequences From Metagenomes. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1840-1849. [PMID: 33315571 DOI: 10.1109/tcbb.2020.3044575] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Viruses are the most abundant biological entities on earth, and play vital roles in many aspects of microbial communities. As major human pathogens, viruses have caused huge mortality and morbidity to human society in history. Metagenomic sequencing methods could capture all microorganisms from microbiota, with sequences of viruses mixed with these of other species. Therefore, it is necessary to identify viral sequences from metagenomes. However, existing methods perform poorly on identifying short viral sequences. To solve this problem, a deep learning based method, RNN-VirSeeker, is proposed in this paper. RNN-VirSeeker was trained by sequences of 500bp sampled from known Virus and Host RefSeq genomes. Experimental results on the testing set have shown that RNN-VirSeeker exhibited AUROC of 0.9175, recall of 0.8640 and precision of 0.9211 for sequences of 500bp, and outperformed three widely used methods, VirSorter, VirFinder, and DeepVirFinder, on identifying short viral sequences. RNN-VirSeeker was also used to identify viral sequences from a CAMI dataset and a human gut metagenome. Compared with DeepVirFinder, RNN-VirSeeker identified more viral sequences from these metagenomes and achieved greater values of AUPRC and AUROC. RNN-VirSeeker is freely available at https://github.com/crazyinter/RNN-VirSeeker.
Collapse
|
40
|
Early indicators of microbial strain dysbiosis in the human gastrointestinal microbial community of certain healthy humans and hospitalized COVID-19 patients. Sci Rep 2022; 12:6562. [PMID: 35449389 PMCID: PMC9022020 DOI: 10.1038/s41598-022-10472-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 04/06/2022] [Indexed: 11/08/2022] Open
Abstract
Dysbiosis in the human gastrointestinal microbial community could functionally impact microbial metabolism and colonization resistance to pathogens. To further elucidate the indicators of microbial strain dysbiosis, we have developed an analytic method that detects patterns of presence/absence of selected KEGG metabolic pathways for a selected strain (PKS). Using a metagenomic data set consisting of multiple high-density fecal samples from six normal individuals, we found three had unique PKS for important gut commensal microbes, Bacteroides vulgatus and Bacteroides uniformis, at all sample times examined. Two individuals had multiple shared PKS clusters of B. vulgatus or B. uniformis over time. Analysis of a data set of high-density fecal samples from eight COVID-19 hospitalized patients taken over a short period revealed that two patients had shared PKS clusters for B. vulgatus and one shared cluster for B. uniformis. Our analysis demonstrates that while the majority of normal individuals with no B. vulgatus or B. uniformis strain change over time have unique PKS, in some healthy humans and patients hospitalized with COVID-19, we detected shared PKS clusters at the different times suggesting a slowing down of the intrinsic rates of strain variation that could eventually lead to a dysbiosis in the microbial strain community.
Collapse
|
41
|
Severyn CJ, Siranosian BA, Kong STJ, Moreno A, Li MM, Chen N, Duncan CN, Margossian SP, Lehmann LE, Sun S, Andermann TM, Birbrayer O, Silverstein S, Reynolds CG, Kim S, Banaei N, Ritz J, Fodor AA, London WB, Bhatt AS, Whangbo JS. Microbiota dynamics in a randomized trial of gut decontamination during allogeneic hematopoietic cell transplantation. JCI Insight 2022; 7:e154344. [PMID: 35239511 PMCID: PMC9057614 DOI: 10.1172/jci.insight.154344] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 03/02/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUNDGut decontamination (GD) can decrease the incidence and severity of acute graft-versus-host disease (aGVHD) in murine models of allogeneic hematopoietic cell transplantation (HCT). In this pilot study, we examined the impact of GD on gut microbiome composition and the incidence of aGVHD in HCT patients.METHODSWe randomized 20 patients undergoing allogeneic HCT to receive (GD) or not receive (no-GD) oral vancomycin-polymyxin B from day -5 through neutrophil engraftment. We evaluated shotgun metagenomic sequencing of serial stool samples to compare the composition and diversity of the gut microbiome between study arms. We assessed clinical outcomes in the 2 arms and performed strain-specific analyses of pathogens that caused bloodstream infections (BSI).RESULTSThe 2 arms did not differ in the predefined primary outcome of Shannon diversity of the gut microbiome at 2 weeks post-HCT (genus, P = 0.8; species, P = 0.44) or aGVHD incidence (P = 0.58). Immune reconstitution of T cell and B cell subsets was similar between groups. Five patients in the no-GD arm had 8 BSI episodes versus 1 episode in the GD arm (P = 0.09). The BSI-causing pathogens were traceable to the gut in 7 of 8 BSI episodes in the no-GD arm, including Staphylococcus species.CONCLUSIONWhile GD did not differentially affect Shannon diversity or clinical outcomes, our findings suggest that GD may protect against gut-derived BSI in HCT patients by decreasing the prevalence or abundance of gut pathogens.TRIAL REGISTRATIONClinicalTrials.gov NCT02641236.FUNDINGNIH, Damon Runyon Cancer Research Foundation, V Foundation, Sloan Foundation, Emerson Collective, and Stanford Maternal & Child Health Research Institute.
Collapse
Affiliation(s)
- Christopher J. Severyn
- Department of Pediatrics, Division of Pediatric Hematology/Oncology and Division of Pediatric Stem Cell Transplant and Regenerative Medicine
| | | | | | - Angel Moreno
- Department of Pathology, Stanford University, Palo Alto, California, USA
| | - Michelle M. Li
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Nan Chen
- Dana-Farber/Boston Children’s Cancer and Blood Disorders Center, Boston, Massachusetts, USA
| | - Christine N. Duncan
- Dana-Farber/Boston Children’s Cancer and Blood Disorders Center, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
| | - Steven P. Margossian
- Dana-Farber/Boston Children’s Cancer and Blood Disorders Center, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
| | - Leslie E. Lehmann
- Dana-Farber/Boston Children’s Cancer and Blood Disorders Center, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
| | - Shan Sun
- Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, Charlotte, North Carolina, USA
| | - Tessa M. Andermann
- Department of Medicine, Division of Infectious Diseases, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Olga Birbrayer
- Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | | | - Carol G. Reynolds
- Harvard Medical School, Boston, Massachusetts, USA
- Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Soomin Kim
- Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Niaz Banaei
- Department of Pathology, Stanford University, Palo Alto, California, USA
- Department of Medicine, Division of Infectious Diseases, Stanford University, Palo Alto, California, USA
| | - Jerome Ritz
- Harvard Medical School, Boston, Massachusetts, USA
- Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Anthony A. Fodor
- Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, Charlotte, North Carolina, USA
| | - Wendy B. London
- Dana-Farber/Boston Children’s Cancer and Blood Disorders Center, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
- Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Ami S. Bhatt
- Departments of Genetics and Medicine, Division of Hematology
| | - Jennifer S. Whangbo
- Dana-Farber/Boston Children’s Cancer and Blood Disorders Center, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
42
|
Wang H, Li J, Wu G, Zhang F, Yin J, He Y. The effect of intrinsic factors and mechanisms in shaping human gut microbiota. MEDICINE IN MICROECOLOGY 2022. [DOI: 10.1016/j.medmic.2022.100054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
43
|
Zhou Y, Liu M, Yang J. Recovering metagenome-assembled genomes from shotgun metagenomic sequencing data: methods, applications, challenges, and opportunities. Microbiol Res 2022; 260:127023. [DOI: 10.1016/j.micres.2022.127023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 03/07/2022] [Accepted: 04/05/2022] [Indexed: 12/12/2022]
|
44
|
Adler A, Poirier S, Pagni M, Maillard J, Holliger C. Disentangle genus microdiversity within a complex microbial community by using a multi-distance long-read binning method: example of Candidatus Accumulibacter. Environ Microbiol 2022; 24:2136-2156. [PMID: 35315560 PMCID: PMC9311429 DOI: 10.1111/1462-2920.15947] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Accepted: 02/19/2022] [Indexed: 11/26/2022]
Abstract
Complete genomes can be recovered from metagenomes by assembling and binning DNA sequences into metagenome assembled genomes (MAGs). Yet, the presence of microdiversity can hamper the assembly and binning processes, possibly yielding chimeric, highly fragmented and incomplete genomes. Here, the metagenomes of four samples of aerobic granular sludge bioreactors containing Candidatus (Ca.) Accumulibacter, a phosphate-accumulating organism of interest for wastewater treatment, were sequenced with both PacBio and Illumina. Different strategies of genome assembly and binning were investigated, including published protocols and a binning procedure adapted to the binning of long contigs (MuLoBiSC). Multiple criteria were considered to select the best strategy for Ca. Accumulibacter, whose multiple strains in every sample represent a challenging microdiversity. In this case, the best strategy relies on long-read only assembly and a custom binning procedure including MuLoBiSC in metaWRAP. Several high-quality Ca. Accumulibacter MAGs, including a novel species, were obtained independently from different samples. Comparative genomic analysis showed that MAGs retrieved in different samples harbour genomic rearrangements in addition to accumulation of point mutations. The microdiversity of Ca. Accumulibacter, likely driven by mobile genetic elements, causes major difficulties in recovering MAGs, but it is also a hallmark of the panmictic lifestyle of these bacteria.
Collapse
Affiliation(s)
- Aline Adler
- Laboratory for Environmental Biotechnology, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Simon Poirier
- Laboratory for Environmental Biotechnology, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Marco Pagni
- Laboratory for Environmental Biotechnology, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.,Vital-IT Group, SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Julien Maillard
- Laboratory for Environmental Biotechnology, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.,IFP Energie nouvelles, 1 et 4 avenue de Bois-Préau, 92852, Rueil-Malmaison Cedex, France
| | - Christof Holliger
- Laboratory for Environmental Biotechnology, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| |
Collapse
|
45
|
Spencer L, Olawuni B, Singh P. Gut Virome: Role and Distribution in Health and Gastrointestinal Diseases. Front Cell Infect Microbiol 2022; 12:836706. [PMID: 35360104 PMCID: PMC8960297 DOI: 10.3389/fcimb.2022.836706] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 02/10/2022] [Indexed: 12/11/2022] Open
Abstract
The study of the intestinal microbiome is an evolving field of research that includes comprehensive analysis of the vast array of microbes – bacterial, archaeal, fungal, and viral. Various gastrointestinal (GI) diseases, such as Crohn’s disease and ulcerative colitis, have been associated with instability of the gut microbiota. Many studies have focused on importance of bacterial communities with relation to health and disease in humans. The role of viruses, specifically bacteriophages, have recently begin to emerge and have profound impact on the host. Here, we comprehensively review the importance of viruses in GI diseases and summarize their influence in the complex intestinal environment, including their biochemical and genetic activities. We also discuss the distribution of the gut virome as it relates with treatment and immunological advantages. In conclusion, we suggest the need for further studies on this critical component of the intestinal microbiome to decipher the role of the gut virome in human health and disease.
Collapse
|
46
|
Zioutis C, Seki D, Bauchinger F, Herbold C, Berger A, Wisgrill L, Berry D. Ecological Processes Shaping Microbiomes of Extremely Low Birthweight Infants. Front Microbiol 2022; 13:812136. [PMID: 35295290 PMCID: PMC8919028 DOI: 10.3389/fmicb.2022.812136] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 02/07/2022] [Indexed: 11/23/2022] Open
Abstract
The human microbiome has been implicated in affecting health outcomes in premature infants, but the ecological processes governing early life microbiome assembly remain poorly understood. Here, we investigated microbial community assembly and dynamics in extremely low birth weight infants (ELBWI) over the first 2 weeks of life. We profiled the gut, oral cavity and skin microbiomes over time using 16S rRNA gene amplicon sequencing and evaluated the ecological forces shaping these microbiomes. Though microbiomes at all three body sites were characterized by compositional instability over time and had low body-site specificity (PERMANOVA, r 2 = 0.09, p = 0.001), they could nonetheless be clustered into four discrete community states. Despite the volatility of these communities, deterministic assembly processes were detectable in this period of initial microbial colonization. To further explore these deterministic dynamics, we developed a probabilistic approach in which we modeled microbiome state transitions in each ELBWI as a Markov process, or a "memoryless" shift, from one community state to another. This analysis revealed that microbiomes from different body sites had distinctive dynamics as well as characteristic equilibrium frequencies. Time-resolved microbiome sampling of premature infants may help to refine and inform clinical practices. Additionally, this work provides an analysis framework for microbial community dynamics based on Markov modeling that can facilitate new insights, not only into neonatal microbiomes but also other human-associated or environmental microbiomes.
Collapse
Affiliation(s)
- Christos Zioutis
- Division of Microbial Ecology, Department of Microbiology and Ecosystem Science, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria
| | - David Seki
- Division of Microbial Ecology, Department of Microbiology and Ecosystem Science, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria
- Division of Neonatology, Department of Pediatrics and Adolescent Medicine, Pediatric Intensive Care and Neuropediatrics, Comprehensive Center for Pediatrics, Medical University of Vienna, Vienna, Austria
| | - Franziska Bauchinger
- Division of Microbial Ecology, Department of Microbiology and Ecosystem Science, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria
| | - Craig Herbold
- Division of Microbial Ecology, Department of Microbiology and Ecosystem Science, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria
| | - Angelika Berger
- Division of Neonatology, Department of Pediatrics and Adolescent Medicine, Pediatric Intensive Care and Neuropediatrics, Comprehensive Center for Pediatrics, Medical University of Vienna, Vienna, Austria
| | - Lukas Wisgrill
- Division of Neonatology, Department of Pediatrics and Adolescent Medicine, Pediatric Intensive Care and Neuropediatrics, Comprehensive Center for Pediatrics, Medical University of Vienna, Vienna, Austria
| | - David Berry
- Division of Microbial Ecology, Department of Microbiology and Ecosystem Science, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria
- Joint Microbiome Facility of the Medical University of Vienna, University of Vienna, Vienna, Austria
| |
Collapse
|
47
|
Ventolero MF, Wang S, Hu H, Li X. Computational analyses of bacterial strains from shotgun reads. Brief Bioinform 2022; 23:6524011. [PMID: 35136954 DOI: 10.1093/bib/bbac013] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 01/10/2022] [Accepted: 01/11/2022] [Indexed: 12/21/2022] Open
Abstract
Shotgun sequencing is routinely employed to study bacteria in microbial communities. With the vast amount of shotgun sequencing reads generated in a metagenomic project, it is crucial to determine the microbial composition at the strain level. This study investigated 20 computational tools that attempt to infer bacterial strain genomes from shotgun reads. For the first time, we discussed the methodology behind these tools. We also systematically evaluated six novel-strain-targeting tools on the same datasets and found that BHap, mixtureS and StrainFinder performed better than other tools. Because the performance of the best tools is still suboptimal, we discussed future directions that may address the limitations.
Collapse
Affiliation(s)
| | - Saidi Wang
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Haiyan Hu
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA.,Genomics and Bioinformatics Cluster, University of Central Florida, Orlando, FL 32816, USA
| | - Xiaoman Li
- Burnett School of Biomedical Science, University of Central Florida, Orlando, FL 32816, USA
| |
Collapse
|
48
|
Miao Y, Liu F, Hou T, Liu Y. Virtifier: a deep learning-based identifier for viral sequences from metagenomes. Bioinformatics 2022; 38:1216-1222. [PMID: 34908121 DOI: 10.1093/bioinformatics/btab845] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 11/13/2021] [Accepted: 12/13/2021] [Indexed: 01/05/2023] Open
Abstract
MOTIVATION Viruses, the most abundant biological entities on earth, are important components of microbial communities, and as major human pathogens, they are responsible for human mortality and morbidity. The identification of viral sequences from metagenomes is critical for viral analysis. As massive quantities of short sequences are generated by next-generation sequencing, most methods utilize discrete and sparse one-hot vectors to encode nucleotide sequences, which are usually ineffective in viral identification. RESULTS In this article, Virtifier, a deep learning-based viral identifier for sequences from metagenomic data is proposed. It includes a meaningful nucleotide sequence encoding method named Seq2Vec and a variant viral sequence predictor with an attention-based long short-term memory (LSTM) network. By utilizing a fully trained embedding matrix to encode codons, Seq2Vec can efficiently extract the relationships among those codons in a nucleotide sequence. Combined with an attention layer, the LSTM neural network can further analyze the codon relationships and sift the parts that contribute to the final features. Experimental results of three datasets have shown that Virtifier can accurately identify short viral sequences (<500 bp) from metagenomes, surpassing three widely used methods, VirFinder, DeepVirFinder and PPR-Meta. Meanwhile, a comparable performance was achieved by Virtifier at longer lengths (>5000 bp). AVAILABILITY AND IMPLEMENTATION A Python implementation of Virtifier and the Python code developed for this study have been provided on Github https://github.com/crazyinter/Seq2Vec. The RefSeq genomes in this article are available in VirFinder at https://dx.doi.org/10.1186/s40168-017-0283-5. The CAMI Challenge Dataset 3 CAMI_high dataset in this article is available in CAMI at https://data.cami-challenge.org/participate. The real human gut metagenomes in this article are available at https://dx.doi.org/10.1101/gr.142315.112. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yan Miao
- College of Communication Engineering, Jilin University, Changchun 130022, China
| | - Fu Liu
- College of Communication Engineering, Jilin University, Changchun 130022, China
| | - Tao Hou
- College of Communication Engineering, Jilin University, Changchun 130022, China
| | - Yun Liu
- College of Communication Engineering, Jilin University, Changchun 130022, China
| |
Collapse
|
49
|
Jiang Z, Li X, Guo L. MetaCRS: unsupervised clustering of contigs with the recursive strategy of reducing metagenomic dataset's complexity. BMC Bioinformatics 2022; 22:315. [PMID: 35045830 PMCID: PMC8772042 DOI: 10.1186/s12859-021-04227-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Accepted: 06/01/2021] [Indexed: 01/02/2023] Open
Abstract
Background Metagenomics technology can directly extract microbial genetic material from the environmental samples to obtain their sequencing reads, which can be further assembled into contigs through assembly tools. Clustering methods of contigs are subsequently applied to recover complete genomes from environmental samples. The main problems with current clustering methods are that they cannot recover more high-quality genes from complex environments. Firstly, there are multiple strains under the same species, resulting in assembly of chimeras. Secondly, different strains under the same species are difficult to be classified. Thirdly, it is difficult to determine the number of strains during the clustering process. Results In view of the shortcomings of current clustering methods, we propose an unsupervised clustering method which can improve the ability to recover genes from complex environments and a new method for selecting the number of sample’s strains in clustering process. The sequence composition characteristics (tetranucleotide frequency) and co-abundance are combined to train the probability model for clustering. A new recursive method that can continuously reduce the complexity of the samples is proposed to improve the ability to recover genes from complex environments. The new clustering method was tested on both simulated and real metagenomic datasets, and compared with five state-of-the-art methods including CONCOCT, Maxbin2.0, MetaBAT, MyCC and COCACOLA. In terms of the number and quality of recovered genes from metagenomic datasets, the results show that our proposed method is more effective. Conclusions A new contigs clustering method is proposed, which can recover more high-quality genes from complex environmental samples.
Collapse
Affiliation(s)
- Zhongjun Jiang
- College of Information Science and Technology, Ningbo University, Ningbo, 315211, China
| | - Xiaobo Li
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, 321004, China. .,College of Engineering, Lishui University, Lishui, 323000, China.
| | - Lijun Guo
- College of Information Science and Technology, Ningbo University, Ningbo, 315211, China
| |
Collapse
|
50
|
Look Who's Talking: Host and Pathogen Drivers of Staphylococcus epidermidis Virulence in Neonatal Sepsis. Int J Mol Sci 2022; 23:ijms23020860. [PMID: 35055041 PMCID: PMC8775791 DOI: 10.3390/ijms23020860] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 01/10/2022] [Accepted: 01/10/2022] [Indexed: 02/04/2023] Open
Abstract
Preterm infants are at increased risk for invasive neonatal bacterial infections. S. epidermidis, a ubiquitous skin commensal, is a major cause of late-onset neonatal sepsis, particularly in high-resource settings. The vulnerability of preterm infants to serious bacterial infections is commonly attributed to their distinct and developing immune system. While developmentally immature immune defences play a large role in facilitating bacterial invasion, this fails to explain why only a subset of infants develop infections with low-virulence organisms when exposed to similar risk factors in the neonatal ICU. Experimental research has explored potential virulence mechanisms contributing to the pathogenic shift of commensal S. epidermidis strains. Furthermore, comparative genomics studies have yielded insights into the emergence and spread of nosocomial S. epidermidis strains, and their genetic and functional characteristics implicated in invasive disease in neonates. These studies have highlighted the multifactorial nature of S. epidermidis traits relating to pathogenicity and commensalism. In this review, we discuss the known host and pathogen drivers of S. epidermidis virulence in neonatal sepsis and provide future perspectives to close the gap in our understanding of S. epidermidis as a cause of neonatal morbidity and mortality.
Collapse
|