1
|
Salazar-Jaramillo L, de la Cuesta-Zuluaga J, Chica LA, Cadavid M, Ley RE, Reyes A, Escobar JS. Gut microbiome diversity within Clostridia is negatively associated with human obesity. mSystems 2024:e0062724. [PMID: 39012154 DOI: 10.1128/msystems.00627-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 06/06/2024] [Indexed: 07/17/2024] Open
Abstract
Clostridia are abundant in the human gut and comprise families associated with host health such as Oscillospiraceae, which has been correlated with leanness. However, culturing bacteria within this family is challenging, leading to their detection primarily through 16S rRNA amplicon sequencing, which has a limited ability to unravel diversity at low taxonomic levels, or by shotgun metagenomics, which is hindered by its high costs and complexity. In this cross-sectional study involving 114 Colombian adults, we used an amplicon-based sequencing strategy with alternative markers-gyrase subunit B (gyrB) and DNA K chaperone heat protein 70 (dnaK)-that evolve faster than the 16S rRNA gene. Comparing the diversity and abundance observed with the three markers in our cohort, we found a reduction in the diversity of Clostridia, particularly within Lachnospiraceae and Oscillospiraceae among obese individuals [as measured by the body mass index (BMI)]. Within Lachnospiraceae, the diversity of Ruminococcus_A negatively correlated with BMI. Within Oscillospiraceae, the genera CAG-170 and Vescimonas also exhibited this negative correlation. In addition, the abundance of Vescimonas was negatively correlated with BMI. Leveraging shotgun metagenomic data, we conducted a phylogenetic and genomic characterization of 120 metagenome-assembled genomes from Vescimonas obtained from a larger sample of the same cohort. We identified 17 of the 72 reported species. The functional annotation of these genomes showed the presence of multiple carbohydrate-active enzymes, particularly glycosyl transferases and glycoside hydrolases, suggesting potential beneficial roles in fiber degradation, carbohydrate metabolism, and butyrate production. IMPORTANCE The gut microbiota is diverse across various taxonomic levels. At the intra-species level, it comprises multiple strains, some of which may be host-specific. However, our understanding of fine-grained diversity has been hindered by the use of the conserved 16S rRNA gene. While shotgun metagenomics offers higher resolution, it remains costly, may fail to identify specific microbes in complex samples, and requires extensive computational resources and expertise. To address this, we employed a simple and cost-effective analysis of alternative genetic markers to explore diversity within Clostridia, a crucial group within the human gut microbiota whose diversity may be underestimated. We found high intra-species diversity for certain groups and associations with obesity. Notably, we identified Vescimonas, an understudied group. Making use of metagenomic data, we inferred functionality, uncovering potential beneficial roles in dietary fiber and carbohydrate degradation, as well as in short-chain fatty acid production.
Collapse
Affiliation(s)
- Laura Salazar-Jaramillo
- Vidarium-Nutrition, Health and Wellness Research Center, Grupo Empresarial Nutresa, Medellin, Colombia
| | | | - Luis A Chica
- Department of Biological Sciences, Max Planck Tandem Group in Computational Biology, Research Group in Computational Biology and Microbial Ecology (BCEM), Universidad de los Andes, Bogota, Colombia
| | - María Cadavid
- Vidarium-Nutrition, Health and Wellness Research Center, Grupo Empresarial Nutresa, Medellin, Colombia
| | - Ruth E Ley
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Alejandro Reyes
- Department of Biological Sciences, Max Planck Tandem Group in Computational Biology, Research Group in Computational Biology and Microbial Ecology (BCEM), Universidad de los Andes, Bogota, Colombia
- Department of Pathology and Immunology, Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Juan S Escobar
- Vidarium-Nutrition, Health and Wellness Research Center, Grupo Empresarial Nutresa, Medellin, Colombia
| |
Collapse
|
2
|
Pinto Y, Bhatt AS. Sequencing-based analysis of microbiomes. Nat Rev Genet 2024:10.1038/s41576-024-00746-6. [PMID: 38918544 DOI: 10.1038/s41576-024-00746-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/15/2024] [Indexed: 06/27/2024]
Abstract
Microbiomes occupy a range of niches and, in addition to having diverse compositions, they have varied functional roles that have an impact on agriculture, environmental sciences, and human health and disease. The study of microbiomes has been facilitated by recent technological and analytical advances, such as cheaper and higher-throughput DNA and RNA sequencing, improved long-read sequencing and innovative computational analysis methods. These advances are providing a deeper understanding of microbiomes at the genomic, transcriptional and translational level, generating insights into their function and composition at resolutions beyond the species level.
Collapse
Affiliation(s)
- Yishay Pinto
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Medicine, Divisions of Hematology and Blood & Marrow Transplantation, Stanford University, Stanford, CA, USA
| | - Ami S Bhatt
- Department of Genetics, Stanford University, Stanford, CA, USA.
- Department of Medicine, Divisions of Hematology and Blood & Marrow Transplantation, Stanford University, Stanford, CA, USA.
| |
Collapse
|
3
|
Enav H, Paz I, Ley RE. Strain tracking in complex microbiomes using synteny analysis reveals per-species modes of evolution. Nat Biotechnol 2024:10.1038/s41587-024-02276-2. [PMID: 38898177 DOI: 10.1038/s41587-024-02276-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 05/10/2024] [Indexed: 06/21/2024]
Abstract
Microbial species diversify into strains through single-nucleotide mutations and structural changes, such as recombination, insertions and deletions. Most strain-comparison methods quantify differences in single-nucleotide polymorphisms (SNPs) and are insensitive to structural changes. However, recombination is an important driver of phenotypic diversification in many species, including human pathogens. We introduce SynTracker, a tool that compares microbial strains using genome synteny-the order of sequence blocks in homologous genomic regions-in pairs of metagenomic assemblies or genomes. Genome synteny is a rich source of genomic information untapped by current strain-comparison tools. SynTracker has low sensitivity to SNPs, has no database requirement and is robust to sequencing errors. It outperforms existing tools when tracking strains in metagenomic data and is particularly suited for phages, plasmids and other low-data contexts. Applied to single-species datasets and human gut metagenomes, SynTracker, combined with an SNP-based tool, detects strains enriched in either point mutations or structural changes, providing insights into microbial evolution in situ.
Collapse
Affiliation(s)
- Hagay Enav
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Inbal Paz
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Ruth E Ley
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany.
- Cluster of Excellence EXC 2124: Controlling Microbes to Fight Infections (CMFI), University of Tübingen, Tübingen, Germany.
| |
Collapse
|
4
|
Coelho LP, Santos-Júnior CD, de la Fuente-Nunez C. Challenges in computational discovery of bioactive peptides in 'omics data. Proteomics 2024; 24:e2300105. [PMID: 38458994 DOI: 10.1002/pmic.202300105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 02/06/2024] [Accepted: 02/06/2024] [Indexed: 03/10/2024]
Abstract
Peptides have a plethora of activities in biological systems that can potentially be exploited biotechnologically. Several peptides are used clinically, as well as in industry and agriculture. The increase in available 'omics data has recently provided a large opportunity for mining novel enzymes, biosynthetic gene clusters, and molecules. While these data primarily consist of DNA sequences, other types of data provide important complementary information. Due to their size, the approaches proven successful at discovering novel proteins of canonical size cannot be naïvely applied to the discovery of peptides. Peptides can be encoded directly in the genome as short open reading frames (smORFs), or they can be derived from larger proteins by proteolysis. Both of these peptide classes pose challenges as simple methods for their prediction result in large numbers of false positives. Similarly, functional annotation of larger proteins, traditionally based on sequence similarity to infer orthology and then transferring functions between characterized proteins and uncharacterized ones, cannot be applied for short sequences. The use of these techniques is much more limited and alternative approaches based on machine learning are used instead. Here, we review the limitations of traditional methods as well as the alternative methods that have recently been developed for discovering novel bioactive peptides with a focus on prokaryotic genomes and metagenomes.
Collapse
Affiliation(s)
- Luis Pedro Coelho
- Centre for Microbiome Research, School of Biomedical Sciences, Queensland University of Technology, Woolloongabba, Queensland, Australia
- Institute of Science and Technology for Brain-Inspired Intelligence - ISTBI, Fudan University, Shanghai, China
| | - Célio Dias Santos-Júnior
- Institute of Science and Technology for Brain-Inspired Intelligence - ISTBI, Fudan University, Shanghai, China
- Laboratory of Microbial Processes & Biodiversity - LMPB, Hydrobiology Department, Federal University of São Carlos - UFSCar, São Paulo, Brazil
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
5
|
Durrant MG, Perry NT, Pai JJ, Jangid AR, Athukoralage JS, Hiraizumi M, McSpedon JP, Pawluk A, Nishimasu H, Konermann S, Hsu PD. Bridge RNAs direct programmable recombination of target and donor DNA. Nature 2024; 630:984-993. [PMID: 38926615 PMCID: PMC11208160 DOI: 10.1038/s41586-024-07552-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 05/09/2024] [Indexed: 06/28/2024]
Abstract
Genomic rearrangements, encompassing mutational changes in the genome such as insertions, deletions or inversions, are essential for genetic diversity. These rearrangements are typically orchestrated by enzymes that are involved in fundamental DNA repair processes, such as homologous recombination, or in the transposition of foreign genetic material by viruses and mobile genetic elements1,2. Here we report that IS110 insertion sequences, a family of minimal and autonomous mobile genetic elements, express a structured non-coding RNA that binds specifically to their encoded recombinase. This bridge RNA contains two internal loops encoding nucleotide stretches that base-pair with the target DNA and the donor DNA, which is the IS110 element itself. We demonstrate that the target-binding and donor-binding loops can be independently reprogrammed to direct sequence-specific recombination between two DNA molecules. This modularity enables the insertion of DNA into genomic target sites, as well as programmable DNA excision and inversion. The IS110 bridge recombination system expands the diversity of nucleic-acid-guided systems beyond CRISPR and RNA interference, offering a unified mechanism for the three fundamental DNA rearrangements-insertion, excision and inversion-that are required for genome design.
Collapse
Affiliation(s)
- Matthew G Durrant
- Arc Institute, Palo Alto, CA, USA
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA
| | - Nicholas T Perry
- Arc Institute, Palo Alto, CA, USA
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA
- University of California, Berkeley-University of California, San Francisco Graduate Program in Bioengineering, Berkeley, CA, USA
| | | | - Aditya R Jangid
- Arc Institute, Palo Alto, CA, USA
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA
| | | | - Masahiro Hiraizumi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Tokyo, Japan
| | | | | | - Hiroshi Nishimasu
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Tokyo, Japan
- Structural Biology Division, Research Center for Advanced Science and Technology, University of Tokyo, Tokyo, Japan
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan
- Inamori Research Institute for Science, Kyoto, Japan
- Japan Science and Technology Agency, Core Research for Evolutional Science and Technology, Saitama, Japan
| | - Silvana Konermann
- Arc Institute, Palo Alto, CA, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Patrick D Hsu
- Arc Institute, Palo Alto, CA, USA.
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA.
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
6
|
Gorman ED, Lladser ME. Interpretable metric learning in comparative metagenomics: The adaptive Haar-like distance. PLoS Comput Biol 2024; 20:e1011543. [PMID: 38768195 PMCID: PMC11142682 DOI: 10.1371/journal.pcbi.1011543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 05/31/2024] [Accepted: 04/25/2024] [Indexed: 05/22/2024] Open
Abstract
Random forests have emerged as a promising tool in comparative metagenomics because they can predict environmental characteristics based on microbial composition in datasets where β-diversity metrics fall short of revealing meaningful relationships between samples. Nevertheless, despite this efficacy, they lack biological insight in tandem with their predictions, potentially hindering scientific advancement. To overcome this limitation, we leverage a geometric characterization of random forests to introduce a data-driven phylogenetic β-diversity metric, the adaptive Haar-like distance. This new metric assigns a weight to each internal node (i.e., split or bifurcation) of a reference phylogeny, indicating the relative importance of that node in discerning environmental samples based on their microbial composition. Alongside this, a weighted nearest-neighbors classifier, constructed using the adaptive metric, can be used as a proxy for the random forest while maintaining accuracy on par with that of the original forest and another state-of-the-art classifier, CoDaCoRe. As shown in datasets from diverse microbial environments, however, the new metric and classifier significantly enhance the biological interpretability and visualization of high-dimensional metagenomic samples.
Collapse
Affiliation(s)
- Evan D. Gorman
- Department of Applied Mathematics, University of Colorado, Boulder, Colorado, United States of America
| | - Manuel E. Lladser
- Department of Applied Mathematics, University of Colorado, Boulder, Colorado, United States of America
| |
Collapse
|
7
|
Lin X, Hu T, Wu Z, Li L, Wang Y, Wen D, Liu X, Li W, Liang H, Jin X, Xu X, Wang J, Yang H, Kristiansen K, Xiao L, Zou Y. Isolation of potentially novel species expands the genomic and functional diversity of Lachnospiraceae. IMETA 2024; 3:e174. [PMID: 38882499 PMCID: PMC11170972 DOI: 10.1002/imt2.174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 12/06/2023] [Indexed: 06/18/2024]
Abstract
The Lachnospiraceae family holds promise as a source of next-generation probiotics, yet a comprehensive delineation of its diversity is lacking, hampering the identification of suitable strains for future applications. To address this knowledge gap, we conducted an in-depth genomic and functional analysis of 1868 high-quality genomes, combining data from public databases with our new isolates. This data set represented 387 colonization-selective species-level clusters, of which eight genera represented multilineage clusters. Pan-genome analysis, single-nucleotide polymorphism (SNP) identification, and probiotic functional predictions revealed that species taxonomy, habitats, and geography together shape the functional diversity of Lachnospiraceae. Moreover, analyses of associations with atherosclerotic cardiovascular disease (ACVD) and inflammatory bowel disease (IBD) indicated that several strains of potentially novel Lachnospiraceae species possess the capacity to reduce the abundance of opportunistic pathogens, thereby imparting potential health benefits. Our findings shed light on the untapped potential of novel species enabling knowledge-based selection of strains for the development of next-generation probiotics holding promise for improving human health and disease management.
Collapse
Affiliation(s)
- Xiaoqian Lin
- BGI Research Shenzhen China
- School of Bioscience and Biotechnology South China University of Technology Guangzhou China
| | | | - Zhinan Wu
- BGI Research Shenzhen China
- College of Life Sciences University of Chinese Academy of Sciences Beijing China
| | | | | | | | - Xudong Liu
- BGI Research Shenzhen China
- College of Life Sciences University of Chinese Academy of Sciences Beijing China
| | - Wenxi Li
- BGI Research Shenzhen China
- School of Bioscience and Biotechnology South China University of Technology Guangzhou China
| | | | | | - Xun Xu
- BGI Research Shenzhen China
| | - Jian Wang
- BGI Research Shenzhen China
- James D. Watson Institute of Genome Sciences Hangzhou China
| | - Huanming Yang
- BGI Research Shenzhen China
- James D. Watson Institute of Genome Sciences Hangzhou China
| | - Karsten Kristiansen
- BGI Research Shenzhen China
- Laboratory of Genomics and Molecular Biomedicine University of Copenhagen Copenhagen Denmark
| | - Liang Xiao
- BGI Research Shenzhen China
- College of Life Sciences University of Chinese Academy of Sciences Beijing China
- Shenzhen Engineering Laboratory of Detection and Intervention of human intestinal microbiome, BGI-Shenzhen Shenzhen China
| | - Yuanqiang Zou
- BGI Research Shenzhen China
- Laboratory of Genomics and Molecular Biomedicine University of Copenhagen Copenhagen Denmark
- Shenzhen Engineering Laboratory of Detection and Intervention of human intestinal microbiome, BGI-Shenzhen Shenzhen China
| |
Collapse
|
8
|
Blake KS, Kumar H, Loganathan A, Williford EE, Diorio-Toth L, Xue YP, Tang WK, Campbell TP, Chong DD, Angtuaco S, Wencewicz TA, Tolia NH, Dantas G. Sequence-structure-function characterization of the emerging tetracycline destructase family of antibiotic resistance enzymes. Commun Biol 2024; 7:336. [PMID: 38493211 PMCID: PMC10944477 DOI: 10.1038/s42003-024-06023-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 03/07/2024] [Indexed: 03/18/2024] Open
Abstract
Tetracycline destructases (TDases) are flavin monooxygenases which can confer resistance to all generations of tetracycline antibiotics. The recent increase in the number and diversity of reported TDase sequences enables a deep investigation of the TDase sequence-structure-function landscape. Here, we evaluate the sequence determinants of TDase function through two complementary approaches: (1) constructing profile hidden Markov models to predict new TDases, and (2) using multiple sequence alignments to identify conserved positions important to protein function. Using the HMM-based approach we screened 50 high-scoring candidate sequences in Escherichia coli, leading to the discovery of 13 new TDases. The X-ray crystal structures of two new enzymes from Legionella species were determined, and the ability of anhydrotetracycline to inhibit their tetracycline-inactivating activity was confirmed. Using the MSA-based approach we identified 31 amino acid positions 100% conserved across all known TDase sequences. The roles of these positions were analyzed by alanine-scanning mutagenesis in two TDases, to study the impact on cell and in vitro activity, structure, and stability. These results expand the diversity of TDase sequences and provide valuable insights into the roles of important residues in TDases, and flavin monooxygenases more broadly.
Collapse
Affiliation(s)
- Kevin S Blake
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Hirdesh Kumar
- Host-Pathogen Interactions and Structural Vaccinology section (HPISV), National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Anisha Loganathan
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Emily E Williford
- Department of Chemistry, Washington University in St. Louis, St. Louis, MO, USA
| | - Luke Diorio-Toth
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Yao-Peng Xue
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Wai Kwan Tang
- Host-Pathogen Interactions and Structural Vaccinology section (HPISV), National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Tayte P Campbell
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - David D Chong
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Steven Angtuaco
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Timothy A Wencewicz
- Department of Chemistry, Washington University in St. Louis, St. Louis, MO, USA.
| | - Niraj H Tolia
- Host-Pathogen Interactions and Structural Vaccinology section (HPISV), National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Bethesda, MD, USA.
| | - Gautam Dantas
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA.
- Department of Pathology and Immunology, Division of Laboratory and Genomic Medicine, Washington University School of Medicine, St. Louis, MO, USA.
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, MO, USA.
- Department of Biomedical Engineering, Washington University School of Medicine, St. Louis, MO, USA.
- Department of Pediatrics, Washington University School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
9
|
Shen J, Yu Q, Chen S, Tan Q, Li J, Li Y. Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language model. NATURE COMPUTATIONAL SCIENCE 2024; 4:29-42. [PMID: 38177492 DOI: 10.1038/s43588-023-00576-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 11/22/2023] [Indexed: 01/06/2024]
Abstract
Signal peptides (SPs) are essential to target and transfer transmembrane and secreted proteins to the correct positions. Many existing computational tools for predicting SPs disregard the extreme data imbalance problem and rely on additional group information of proteins. Here we introduce Unbiased Organism-agnostic Signal Peptide Network (USPNet), an SP classification and cleavage-site prediction deep learning method. Extensive experimental results show that USPNet substantially outperforms previous methods on classification performance by 10%. An SP-discovering pipeline with USPNet is designed to explore unprecedented SPs from metagenomic data. It reveals 347 SP candidates, with the lowest sequence identity between our candidates and the closest SP in the training dataset at only 13%. In addition, the template modeling scores between candidates and SPs in the training set are mostly above 0.8. The results showcase that USPNet has learnt the SP structure with raw amino acid sequences and the large protein language model, thereby enabling the discovery of unknown SPs.
Collapse
Affiliation(s)
- Junbo Shen
- Department of Computer Science and Engineering, CUHK, Hong Kong SAR, China
- Department of Computer Science and Engineering, Washington University, St. Louis, MO, US
| | - Qinze Yu
- Department of Computer Science and Engineering, CUHK, Hong Kong SAR, China
| | - Shenyang Chen
- Department of Computer Science and Engineering, CUHK, Hong Kong SAR, China
- The CUHK Shenzhen Research Institute, Shenzhen, China
- Georgia Institute of Technology, Atlanta, GA, US
| | - Qingxiong Tan
- Department of Computer Science and Engineering, CUHK, Hong Kong SAR, China
| | - Jingchen Li
- Department of Computer Science and Engineering, CUHK, Hong Kong SAR, China
| | - Yu Li
- Department of Computer Science and Engineering, CUHK, Hong Kong SAR, China.
- The CUHK Shenzhen Research Institute, Shenzhen, China.
- Shanghai Artificial Intelligence Laboratory, Shanghai, China.
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
10
|
Wei J, Lotfy P, Faizi K, Baungaard S, Gibson E, Wang E, Slabodkin H, Kinnaman E, Chandrasekaran S, Kitano H, Durrant MG, Duffy CV, Pawluk A, Hsu PD, Konermann S. Deep learning and CRISPR-Cas13d ortholog discovery for optimized RNA targeting. Cell Syst 2023; 14:1087-1102.e13. [PMID: 38091991 DOI: 10.1016/j.cels.2023.11.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 05/03/2023] [Accepted: 11/20/2023] [Indexed: 12/23/2023]
Abstract
Effective and precise mammalian transcriptome engineering technologies are needed to accelerate biological discovery and RNA therapeutics. Despite the promise of programmable CRISPR-Cas13 ribonucleases, their utility has been hampered by an incomplete understanding of guide RNA design rules and cellular toxicity resulting from off-target or collateral RNA cleavage. Here, we quantified the performance of over 127,000 RfxCas13d (CasRx) guide RNAs and systematically evaluated seven machine learning models to build a guide efficiency prediction algorithm orthogonally validated across multiple human cell types. Deep learning model interpretation revealed preferred sequence motifs and secondary features for highly efficient guides. We next identified and screened 46 novel Cas13d orthologs, finding that DjCas13d achieves low cellular toxicity and high specificity-even when targeting abundant transcripts in sensitive cell types, including stem cells and neurons. Our Cas13d guide efficiency model was successfully generalized to DjCas13d, illustrating the power of combining machine learning with ortholog discovery to advance RNA targeting in human cells.
Collapse
Affiliation(s)
- Jingyi Wei
- Department of Bioengineering, Stanford University, Stanford, CA, USA; Department of Biochemistry, Stanford University, Stanford, CA, USA; Arc Institute, Palo Alto, CA, USA
| | - Peter Lotfy
- Laboratory of Molecular and Cell Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Kian Faizi
- Laboratory of Molecular and Cell Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | | | | | - Eleanor Wang
- Laboratory of Molecular and Cell Biology, Salk Institute for Biological Studies, La Jolla, CA, USA; Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Hannah Slabodkin
- Department of Biochemistry, Stanford University, Stanford, CA, USA; Arc Institute, Palo Alto, CA, USA
| | - Emily Kinnaman
- Department of Biochemistry, Stanford University, Stanford, CA, USA; Arc Institute, Palo Alto, CA, USA
| | - Sita Chandrasekaran
- Arc Institute, Palo Alto, CA, USA; Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Hugo Kitano
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Matthew G Durrant
- Arc Institute, Palo Alto, CA, USA; Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Connor V Duffy
- Arc Institute, Palo Alto, CA, USA; Department of Genetics, Stanford University, Stanford, CA, USA
| | | | - Patrick D Hsu
- Arc Institute, Palo Alto, CA, USA; Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA.
| | - Silvana Konermann
- Department of Biochemistry, Stanford University, Stanford, CA, USA; Arc Institute, Palo Alto, CA, USA.
| |
Collapse
|
11
|
Khairunisa BH, Heryakusuma C, Ike K, Mukhopadhyay B, Susanti D. Evolving understanding of rumen methanogen ecophysiology. Front Microbiol 2023; 14:1296008. [PMID: 38029083 PMCID: PMC10658910 DOI: 10.3389/fmicb.2023.1296008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Accepted: 10/12/2023] [Indexed: 12/01/2023] Open
Abstract
Production of methane by methanogenic archaea, or methanogens, in the rumen of ruminants is a thermodynamic necessity for microbial conversion of feed to volatile fatty acids, which are essential nutrients for the animals. On the other hand, methane is a greenhouse gas and its production causes energy loss for the animal. Accordingly, there are ongoing efforts toward developing effective strategies for mitigating methane emissions from ruminant livestock that require a detailed understanding of the diversity and ecophysiology of rumen methanogens. Rumen methanogens evolved from free-living autotrophic ancestors through genome streamlining involving gene loss and acquisition. The process yielded an oligotrophic lifestyle, and metabolically efficient and ecologically adapted descendants. This specialization poses serious challenges to the efforts of obtaining axenic cultures of rumen methanogens, and consequently, the information on their physiological properties remains in most part inferred from those of their non-rumen representatives. This review presents the current knowledge of rumen methanogens and their metabolic contributions to enteric methane production. It also identifies the respective critical gaps that need to be filled for aiding the efforts to mitigate methane emission from livestock operations and at the same time increasing the productivity in this critical agriculture sector.
Collapse
Affiliation(s)
| | - Christian Heryakusuma
- Genetics, Bioinformatics, and Computational Biology, Virginia Tech, Blacksburg, VA, United States
- Department of Biochemistry, Virginia Tech, Blacksburg, VA, United States
| | - Kelechi Ike
- Department of Biology, North Carolina Agricultural and Technical State University, Greensboro, NC, United States
| | - Biswarup Mukhopadhyay
- Genetics, Bioinformatics, and Computational Biology, Virginia Tech, Blacksburg, VA, United States
- Department of Biochemistry, Virginia Tech, Blacksburg, VA, United States
- Virginia Tech Carilion School of Medicine, Virginia Tech, Blacksburg, VA, United States
| | - Dwi Susanti
- Microbial Discovery Research, BiomEdit, Greenfield, IN, United States
| |
Collapse
|
12
|
Heinken A, Hulshof TO, Nap B, Martinelli F, Basile A, O'Brolchain A, O’Sullivan NF, Gallagher C, Magee E, McDonagh F, Lalor I, Bergin M, Evans P, Daly R, Farrell R, Delaney RM, Hill S, McAuliffe SR, Kilgannon T, Fleming RM, Thinnes CC, Thiele I. APOLLO: A genome-scale metabolic reconstruction resource of 247,092 diverse human microbes spanning multiple continents, age groups, and body sites. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.02.560573. [PMID: 37873072 PMCID: PMC10592896 DOI: 10.1101/2023.10.02.560573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Computational modelling of microbiome metabolism has proved instrumental to catalyse our understanding of diet-host-microbiome-disease interactions through the interrogation of mechanistic, strain- and molecule-resolved metabolic models. We present APOLLO, a resource of 247,092 human microbial genome-scale metabolic reconstructions spanning 19 phyla and accounting for microbial genomes from 34 countries, all age groups, and five body sites. We explored the metabolic potential of the reconstructed strains and developed a machine learning classifier able to predict with high accuracy the taxonomic strain assignments. We also built 14,451 sample-specific microbial community models, which could be stratified by body site, age, and disease states. Finally, we predicted faecal metabolites enriched or depleted in gut microbiomes of people with Crohn's disease, Parkinson disease, and undernourished children. APOLLO is compatible with the human whole-body models, and thus, provide unprecedented opportunities for systems-level modelling of personalised host-microbiome co-metabolism. APOLLO will be freely available under https://www.vmh.life/.
Collapse
Affiliation(s)
- Almut Heinken
- School of Medicine, University of Galway, Galway, Ireland
- Ryan Institute, University of Galway, Galway, Ireland
- Inserm UMRS 1256 NGERE, University of Lorraine, Nancy, France
| | - Timothy Otto Hulshof
- School of Medicine, University of Galway, Galway, Ireland
- Ryan Institute, University of Galway, Galway, Ireland
| | - Bram Nap
- School of Medicine, University of Galway, Galway, Ireland
- Ryan Institute, University of Galway, Galway, Ireland
| | - Filippo Martinelli
- School of Medicine, University of Galway, Galway, Ireland
- Ryan Institute, University of Galway, Galway, Ireland
| | - Arianna Basile
- School of Medicine, University of Galway, Galway, Ireland
- Department of Biology, University of Padova, Padova, Italy
| | | | | | | | | | | | - Ian Lalor
- University of Galway, Galway, Ireland
| | | | | | | | | | | | | | | | | | | | - Cyrille C. Thinnes
- School of Medicine, University of Galway, Galway, Ireland
- Ryan Institute, University of Galway, Galway, Ireland
| | - Ines Thiele
- School of Medicine, University of Galway, Galway, Ireland
- Ryan Institute, University of Galway, Galway, Ireland
- Division of Microbiology, University of Galway, Galway, Ireland
- APC Microbiome Ireland, Cork, Ireland
| |
Collapse
|
13
|
González D, Morales-Olavarria M, Vidal-Veuthey B, Cárdenas JP. Insights into early evolutionary adaptations of the Akkermansia genus to the vertebrate gut. Front Microbiol 2023; 14:1238580. [PMID: 37779688 PMCID: PMC10540074 DOI: 10.3389/fmicb.2023.1238580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 08/21/2023] [Indexed: 10/03/2023] Open
Abstract
Akkermansia, a relevant mucin degrader from the vertebrate gut microbiota, is a member of the deeply branched Verrucomicrobiota, as well as the only known member of this phylum to be described as inhabitants of the gut. Only a few Akkermansia species have been officially described so far, although there is genomic evidence addressing the existence of more species-level variants for this genus. This niche specialization makes Akkermansia an interesting model for studying the evolution of microorganisms to their adaptation to the gastrointestinal tract environment, including which kind of functions were gained when the Akkermansia genus originated or how the evolutionary pressure functions over those genes. In order to gain more insight into Akkermansia adaptations to the gastrointestinal tract niche, we performed a phylogenomic analysis of 367 high-quality Akkermansia isolates and metagenome-assembled genomes, in addition to other members of Verrucomicrobiota. This work was focused on three aspects: the definition of Akkermansia genomic species clusters and the calculation and functional characterization of the pangenome for the most represented species; the evolutionary relationship between Akkermansia and their closest relatives from Verrucomicrobiota, defining the gene families which were gained or lost during the emergence of the last Akkermansia common ancestor (LAkkCA) and; the evaluation of the evolutionary pressure metrics for each relevant gene family of main Akkermansia species. This analysis found 25 Akkermansia genomic species clusters distributed in two main clades, divergent from their non-Akkermansia relatives. Pangenome analyses suggest that Akkermansia species have open pangenomes, and the gene gain/loss model indicates that genes associated with mucin degradation (both glycoside hydrolases and peptidases), (micro)aerobic metabolism, surface interaction, and adhesion were part of LAkkCA. Specifically, mucin degradation is a very ancestral innovation involved in the origin of Akkermansia. Horizontal gene transfer detection suggests that Akkermansia could receive genes mostly from unknown sources or from other Gram-negative gut bacteria. Evolutionary metrics suggest that Akkemansia species evolved differently, and even some conserved genes suffered different evolutionary pressures among clades. These results suggest a complex evolutionary landscape of the genus and indicate that mucin degradation could be an essential feature in Akkermansia evolution as a symbiotic species.
Collapse
Affiliation(s)
- Dámariz González
- Centro de Genómica y Bioinformática, Facultad de Ciencias, Ingeniería y Tecnología, Universidad Mayor, Santiago, Chile
| | - Mauricio Morales-Olavarria
- Centro de Genómica y Bioinformática, Facultad de Ciencias, Ingeniería y Tecnología, Universidad Mayor, Santiago, Chile
| | - Boris Vidal-Veuthey
- Centro de Genómica y Bioinformática, Facultad de Ciencias, Ingeniería y Tecnología, Universidad Mayor, Santiago, Chile
| | - Juan P. Cárdenas
- Centro de Genómica y Bioinformática, Facultad de Ciencias, Ingeniería y Tecnología, Universidad Mayor, Santiago, Chile
- Escuela de Biotecnología, Facultad de Ciencias, Ingeniería y Tecnología, Universidad Mayor, Santiago, Chile
| |
Collapse
|
14
|
Mineeva O, Danciu D, Schölkopf B, Ley RE, Rätsch G, Youngblut ND. ResMiCo: Increasing the quality of metagenome-assembled genomes with deep learning. PLoS Comput Biol 2023; 19:e1011001. [PMID: 37126495 PMCID: PMC10174551 DOI: 10.1371/journal.pcbi.1011001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 05/11/2023] [Accepted: 03/06/2023] [Indexed: 05/02/2023] Open
Abstract
The number of published metagenome assemblies is rapidly growing due to advances in sequencing technologies. However, sequencing errors, variable coverage, repetitive genomic regions, and other factors can produce misassemblies, which are challenging to detect for taxonomically novel genomic data. Assembly errors can affect all downstream analyses of the assemblies. Accuracy for the state of the art in reference-free misassembly prediction does not exceed an AUPRC of 0.57, and it is not clear how well these models generalize to real-world data. Here, we present the Residual neural network for Misassembled Contig identification (ResMiCo), a deep learning approach for reference-free identification of misassembled contigs. To develop ResMiCo, we first generated a training dataset of unprecedented size and complexity that can be used for further benchmarking and developments in the field. Through rigorous validation, we show that ResMiCo is substantially more accurate than the state of the art, and the model is robust to novel taxonomic diversity and varying assembly methods. ResMiCo estimated 7% misassembled contigs per metagenome across multiple real-world datasets. We demonstrate how ResMiCo can be used to optimize metagenome assembly hyperparameters to improve accuracy, instead of optimizing solely for contiguity. The accuracy, robustness, and ease-of-use of ResMiCo make the tool suitable for general quality control of metagenome assemblies and assembly methodology optimization.
Collapse
Affiliation(s)
- Olga Mineeva
- Department of Computer Science, ETH Zürich, Zürich, Switzerland
- Department of Empirical Inference, Max Planck Institute for Intelligent Systems, Tübingen, Germany
- Swiss Institute for Bioinformatics, Lausanne, Switzerland
| | - Daniel Danciu
- Department of Computer Science, ETH Zürich, Zürich, Switzerland
| | - Bernhard Schölkopf
- Department of Computer Science, ETH Zürich, Zürich, Switzerland
- Department of Empirical Inference, Max Planck Institute for Intelligent Systems, Tübingen, Germany
- ETH AI center, ETH Zürich, Zürich, Switzerland
| | - Ruth E Ley
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Gunnar Rätsch
- Department of Computer Science, ETH Zürich, Zürich, Switzerland
- Swiss Institute for Bioinformatics, Lausanne, Switzerland
- ETH AI center, ETH Zürich, Zürich, Switzerland
- Department of Biology, ETH Zürich, Zürich, Switzerland
- Medical Informatics Unit, Zürich University Hospital, Zürich, Switzerland
| | - Nicholas D Youngblut
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| |
Collapse
|
15
|
Taxonomic, Genomic, and Functional Variation in the Gut Microbiomes of Wild Spotted Hyenas Across 2 Decades of Study. mSystems 2023; 8:e0096522. [PMID: 36533929 PMCID: PMC9948708 DOI: 10.1128/msystems.00965-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The gut microbiome provides vital functions for mammalian hosts, yet research on its variability and function across adult life spans and multiple generations is limited in large mammalian carnivores. Here, we used 16S rRNA gene and metagenomic high-throughput sequencing to profile the bacterial taxonomic composition, genomic diversity, and metabolic function of fecal samples collected from 12 wild spotted hyenas (Crocuta crocuta) residing in the Masai Mara National Reserve, Kenya, over a 23-year period spanning three generations. The metagenomic data came from four of these hyenas and spanned two 2-year periods. With these data, we determined the extent to which host factors predicted variation in the gut microbiome and identified the core microbes present in the guts of hyenas. We also investigated novel genomic diversity in the mammalian gut by reporting the first metagenome-assembled genomes (MAGs) for hyenas. We found that gut microbiome taxonomic composition varied temporally, but despite this, a core set of 14 bacterial genera were identified. The strongest predictors of the microbiome were host identity and age, suggesting that hyenas possess individualized microbiomes and that these may change with age during adulthood. The gut microbiome functional profiles of the four adult hyenas were also individual specific and were associated with prey abundance, indicating that the functions of the gut microbiome vary with host diet. We recovered 149 high-quality MAGs from the hyenas' guts; some MAGs were classified as taxa previously reported for other carnivores, but many were novel and lacked species-level matches to genomes in existing reference databases. IMPORTANCE There is a gap in knowledge regarding the genomic diversity and variation of the gut microbiome across a host's life span and across multiple generations of hosts in wild mammals. Using two types of sequencing approaches, we found that although gut microbiomes were individualized and temporally variable among hyenas, they correlated similarly to large-scale changes in the ecological conditions experienced by their hosts. We also recovered 149 high-quality MAGs from the hyena gut, greatly expanding the microbial genome repertoire known for hyenas, carnivores, and wild mammals in general. Some MAGs came from genera abundant in the gastrointestinal tracts of canid species and other carnivores, but over 80% of MAGs were novel and from species not previously represented in genome databases. Collectively, our novel body of work illustrates the importance of surveying the gut microbiome of nonmodel wild hosts, using multiple sequencing methods and computational approaches and at distinct scales of analysis.
Collapse
|
16
|
Garritano AN, Song W, Thomas T. Carbon fixation pathways across the bacterial and archaeal tree of life. PNAS NEXUS 2022; 1:pgac226. [PMID: 36712370 PMCID: PMC9802188 DOI: 10.1093/pnasnexus/pgac226] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 10/01/2022] [Indexed: 11/17/2022]
Abstract
Carbon fixation is a critical process for our planet; however, its distribution across the bacterial and archaeal domains of life has not been comprehensively studied. Here, we performed an analysis of 52,515 metagenome-assembled genomes and discover carbon fixation pathways in 1,007 bacteria and archaea. We reveal the genomic potential for carbon fixation through the reverse tricarboxylic acid cycle in previously unrecognized archaeal and bacterial phyla (i.e. Thermoplasmatota and Elusimicrobiota) and show that the 3-hydroxypropionate bi-cycle is not, as previously thought, restricted to the phylum Chloroflexota. The data also substantially expand the phylogenetic breadth for autotrophy through the dicarboxylate/4-hydroxybutyrate cycle and the Calvin-Benson-Bassham cycle. Finally, the genomic potential for carbon fixation through the 3-hydroxypropionate/4-hydroxybutyrate cycle, previously exclusively found in Archaea, was also detected in the Bacteria. Carbon fixation thus appears to be much more widespread than previously known, and this study lays the foundation to better understand the role of archaea and bacteria in global primary production and how they contribute to microbial carbon sinks.
Collapse
Affiliation(s)
- Alessandro N Garritano
- Centre for Marine Science and Innovation, School of Biological, Earth and Environmental Sciences, Faculty of Science, The University of New South Wales, Kensington, NSW 2052, Australia
| | - Weizhi Song
- Centre for Marine Science and Innovation, School of Biological, Earth and Environmental Sciences, Faculty of Science, The University of New South Wales, Kensington, NSW 2052, Australia
| | | |
Collapse
|
17
|
Mach N, Midoux C, Leclercq S, Pennarun S, Le Moyec L, Rué O, Robert C, Sallé G, Barrey E. Mining the equine gut metagenome: poorly-characterized taxa associated with cardiovascular fitness in endurance athletes. Commun Biol 2022; 5:1032. [PMID: 36192523 PMCID: PMC9529974 DOI: 10.1038/s42003-022-03977-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 09/12/2022] [Indexed: 12/01/2022] Open
Abstract
Emerging evidence indicates that the gut microbiome contributes to endurance exercise performance. Still, the extent of its functional and metabolic potential remains unknown. Using elite endurance horses as a model system for exercise responsiveness, we built an integrated horse gut gene catalog comprising ~25 million unique genes and 372 metagenome-assembled genomes. This catalog represents 4179 genera spanning 95 phyla and functional capacities primed to exploit energy from dietary, microbial, and host resources. The holo-omics approach shows that gut microbiomes enriched in Lachnospiraceae taxa are negatively associated with cardiovascular capacity. Conversely, more complex and functionally diverse microbiomes are associated with higher glucose concentrations and reduced accumulation of long-chain acylcarnitines and non-esterified fatty acids in plasma, suggesting increased ß-oxidation capacity in the mitochondria. In line with this hypothesis, more fit athletes show upregulation of mitochondrial-related genes involved in energy metabolism, biogenesis, and Ca2+ cytosolic transport, all of which are necessary to improve aerobic work power, spare glycogen usage, and enhance cardiovascular capacity. The results identify an associative link between endurance performance and gut microbiome composition and gene function, laying the basis for nutritional interventions that could benefit horse athletes. An integrated gene catalog of the gut microbiome in elite endurance horses is build. The holo-omics analyses identify an associative link between endurance performance and gut microbiome composition and gene function.
Collapse
Affiliation(s)
- Núria Mach
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, Jouy-en-Josas, France. .,Université de Toulouse, INRAE, ENVT, IHAP, Toulouse, France.
| | - Cédric Midoux
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France.,Université Paris-Saclay, INRAE, BioinfOmics, MIGALE bioinformatics facility, Jouy-en-Josas, France.,Université Paris-Saclay, INRAE, PROSE, Antony, France
| | | | | | - Laurence Le Moyec
- Université d'Évry Val d'Essonne, Université Paris-Saclay, Évry, France.,Muséum National d'Histoire Naturelle, CNRS, MCAM, Paris, France
| | - Olivier Rué
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France.,Université Paris-Saclay, INRAE, BioinfOmics, MIGALE bioinformatics facility, Jouy-en-Josas, France
| | - Céline Robert
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, Jouy-en-Josas, France.,École Nationale Vétérinaire d'Alfort, Maisons-Alfort, France
| | - Guillaume Sallé
- Université François Rabelais de Tours, INRAE, ISP, Nouzilly, France
| | - Eric Barrey
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, Jouy-en-Josas, France
| |
Collapse
|
18
|
Suzuki TA, Fitzstevens JL, Schmidt VT, Enav H, Huus KE, Ngwese MM, Grießhammer A, Pfleiderer A, Adegbite BR, Zinsou JF, Esen M, Velavan TP, Adegnika AA, Song LH, Spector TD, Muehlbauer AL, Marchi N, Kang H, Maier L, Blekhman R, Ségurel L, Ko G, Youngblut ND, Kremsner P, Ley RE. Codiversification of gut microbiota with humans. Science 2022; 377:1328-1332. [PMID: 36108023 PMCID: PMC10777373 DOI: 10.1126/science.abm7759] [Citation(s) in RCA: 62] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2023]
Abstract
The gut microbiomes of human populations worldwide have many core microbial species in common. However, within a species, some strains can show remarkable population specificity. The question is whether such specificity arises from a shared evolutionary history (codiversification) between humans and their microbes. To test for codiversification of host and microbiota, we analyzed paired gut metagenomes and human genomes for 1225 individuals in Europe, Asia, and Africa, including mothers and their children. Between and within countries, a parallel evolutionary history was evident for humans and their gut microbes. Moreover, species displaying the strongest codiversification independently evolved traits characteristic of host dependency, including reduced genomes and oxygen and temperature sensitivity. These findings all point to the importance of understanding the potential role of population-specific microbial strains in microbiome-mediated disease phenotypes.
Collapse
Affiliation(s)
- Taichi A. Suzuki
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - J. Liam Fitzstevens
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Victor T. Schmidt
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Hagay Enav
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Kelsey E. Huus
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Mirabeau Mbong Ngwese
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Anne Grießhammer
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tübingen, Tübingen, Germany
| | - Anne Pfleiderer
- Institute for Tropical Medicine, University of Tübingen, Tübingen, Germany
| | - Bayode R. Adegbite
- Institute for Tropical Medicine, University of Tübingen, Tübingen, Germany
- Centre de Recherches Médicales de Lambaréné, Lambaréné, Gabon
| | - Jeannot F. Zinsou
- Institute for Tropical Medicine, University of Tübingen, Tübingen, Germany
- Centre de Recherches Médicales de Lambaréné, Lambaréné, Gabon
| | - Meral Esen
- Institute for Tropical Medicine, University of Tübingen, Tübingen, Germany
- German Center for Infection Research, Tübingen, Germany
- Cluster of Excellence EXC 2124 Controlling Microbes to Fight Infections, University of Tübingen, Tübingen, Germany
| | - Thirumalaisamy P. Velavan
- Institute for Tropical Medicine, University of Tübingen, Tübingen, Germany
- Vietnamese German Center for Medical Research, Hanoi, Vietnam
| | - Ayola A. Adegnika
- Institute for Tropical Medicine, University of Tübingen, Tübingen, Germany
- Centre de Recherches Médicales de Lambaréné, Lambaréné, Gabon
- German Center for Infection Research, Tübingen, Germany
- Fondation pour la Recherche Scientifique, Cotonou, Bénin
| | - Le Huu Song
- Vietnamese German Center for Medical Research, Hanoi, Vietnam
- 108 Military Central Hospital, Hanoi, Vietnam
| | - Timothy D. Spector
- Department of Twin Research and Genetic Epidemiology, King’s College London, London, UK
| | - Amanda L. Muehlbauer
- Department of Ecology, Evolution, and Behavior, University of Minnesota, Minneapolis, MN, USA
| | - Nina Marchi
- Eco-anthropologie, Muséum National d’Histoire Naturelle, CNRS, Université de Paris, Paris, France
| | - Hyena Kang
- Department of Environmental Health Sciences, Graduate School of Public Health, Seoul National University, Seoul, Republic of Korea
| | - Lisa Maier
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tübingen, Tübingen, Germany
- Cluster of Excellence EXC 2124 Controlling Microbes to Fight Infections, University of Tübingen, Tübingen, Germany
| | - Ran Blekhman
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, MN, USA
| | - Laure Ségurel
- Eco-anthropologie, Muséum National d’Histoire Naturelle, CNRS, Université de Paris, Paris, France
- Laboratoire de Biométrie et Biologie Evolutive, CNRS, Université Lyon 1, Villeurbanne, France
| | - GwangPyo Ko
- Department of Environmental Health Sciences, Graduate School of Public Health, Seoul National University, Seoul, Republic of Korea
| | - Nicholas D. Youngblut
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
| | - Peter Kremsner
- Institute for Tropical Medicine, University of Tübingen, Tübingen, Germany
- Centre de Recherches Médicales de Lambaréné, Lambaréné, Gabon
- German Center for Infection Research, Tübingen, Germany
- Cluster of Excellence EXC 2124 Controlling Microbes to Fight Infections, University of Tübingen, Tübingen, Germany
| | - Ruth E. Ley
- Department of Microbiome Science, Max Planck Institute for Biology, Tübingen, Germany
- Cluster of Excellence EXC 2124 Controlling Microbes to Fight Infections, University of Tübingen, Tübingen, Germany
| |
Collapse
|
19
|
Escudeiro P, Henry CS, Dias RP. Functional characterization of prokaryotic dark matter: the road so far and what lies ahead. CURRENT RESEARCH IN MICROBIAL SCIENCES 2022; 3:100159. [PMID: 36561390 PMCID: PMC9764257 DOI: 10.1016/j.crmicr.2022.100159] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 07/18/2022] [Accepted: 08/05/2022] [Indexed: 12/25/2022] Open
Abstract
Eight-hundred thousand to one trillion prokaryotic species may inhabit our planet. Yet, fewer than two-hundred thousand prokaryotic species have been described. This uncharted fraction of microbial diversity, and its undisclosed coding potential, is known as the "microbial dark matter" (MDM). Next-generation sequencing has allowed to collect a massive amount of genome sequence data, leading to unprecedented advances in the field of genomics. Still, harnessing new functional information from the genomes of uncultured prokaryotes is often limited by standard classification methods. These methods often rely on sequence similarity searches against reference genomes from cultured species. This hinders the discovery of unique genetic elements that are missing from the cultivated realm. It also contributes to the accumulation of prokaryotic gene products of unknown function among public sequence data repositories, highlighting the need for new approaches for sequencing data analysis and classification. Increasing evidence indicates that these proteins of unknown function might be a treasure trove of biotechnological potential. Here, we outline the challenges, opportunities, and the potential hidden within the functional dark matter (FDM) of prokaryotes. We also discuss the pitfalls surrounding molecular and computational approaches currently used to probe these uncharted waters, and discuss future opportunities for research and applications.
Collapse
Affiliation(s)
- Pedro Escudeiro
- BioISI - Instituto de Biosistemas e Ciências Integrativas, Faculdade de Ciências, Universidade de Lisboa, Lisboa 1749-016, Portugal
| | - Christopher S. Henry
- Argonne National Laboratory, Lemont, Illinois, USA,University of Chicago, Chicago, Illinois, USA
| | - Ricardo P.M. Dias
- BioISI - Instituto de Biosistemas e Ciências Integrativas, Faculdade de Ciências, Universidade de Lisboa, Lisboa 1749-016, Portugal,iXLab - Innovation for National Biological Resilience, Faculdade de Ciências, Universidade de Lisboa, Lisboa 1749-016, Portugal,Corresponding author.
| |
Collapse
|
20
|
Stevenson SJR, Lee KC, Handley KM, Angert ER, White WL, Clements KD. Substrate degradation pathways, conserved functions and community composition of the hindgut microbiota in the herbivorous marine fish Kyphosus sydneyanus. Comp Biochem Physiol A Mol Integr Physiol 2022; 272:111283. [PMID: 35907589 DOI: 10.1016/j.cbpa.2022.111283] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 07/23/2022] [Accepted: 07/24/2022] [Indexed: 02/07/2023]
Abstract
Symbiotic gut microbiota in the herbivorous marine fish Kyphosus sydneyanus play an important role in digestion by converting refractory algal carbohydrate into short-chain fatty acids. Here we characterised community composition using both 16S rRNA gene amplicon sequencing and shotgun-metagenome sequencing. Sequencing was carried out on lumen and mucosa samples (radial sections) from three axial sections taken from the hindgut of wild-caught fish. Both lumen and mucosa communities displayed distinct distributions along the hindgut, likely an effect of the differing selection pressures within these hindgut locations, as well as considerable variation among individual fish. In contrast, metagenomic sequences displayed a high level of functional similarity between individual fish and gut sections in the relative abundance of genes (based on sequencing depth) that encoded enzymes involved in algal-derived substrate degradation. These results suggest that the host gut environment selects for functional capacity in symbionts rather than taxonomic identity. Functional annotation of the enzymes encoded by the gut microbiota was carried out to infer the metabolic pathways used by the gut microbiota for the degradation of important dietary substrates: mannitol, alginate, laminarin, fucoidan and galactan (e.g. agar and carrageenan). This work provides the first evidence of the genomic potential of K. sydneyanus hindgut microbiota to convert highly refractory algal carbohydrates into metabolically useful short-chain fatty acids.
Collapse
Affiliation(s)
- Sam J R Stevenson
- School of Biological Sciences, University of Auckland, Auckland, New Zealand.
| | - Kevin C Lee
- School of Science, Auckland University of Technology, Auckland, New Zealand
| | - Kim M Handley
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | - Esther R Angert
- Department of Microbiology, Cornell University, Ithaca, NY 14853, USA
| | - W Lindsey White
- School of Science, Faculty of Health and Environmental Sciences, Auckland University of Technology, Auckland, New Zealand
| | - Kendall D Clements
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| |
Collapse
|
21
|
Özçam M, Oh JH, Tocmo R, Acharya D, Zhang S, Astmann TJ, Heggen M, Ruiz-Ramírez S, Li F, Cheng CC, Vivas E, Rey FE, Claesen J, Bugni TS, Walter J, van Pijkeren JP. A secondary metabolite drives intraspecies antagonism in a gut symbiont that is inhibited by cell-wall acetylation. Cell Host Microbe 2022; 30:824-835.e6. [PMID: 35443156 DOI: 10.1016/j.chom.2022.03.033] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 12/16/2021] [Accepted: 03/25/2022] [Indexed: 11/03/2022]
Abstract
The mammalian microbiome encodes numerous secondary metabolite biosynthetic gene clusters; yet, their role in microbe-microbe interactions is unclear. Here, we characterized two polyketide synthase gene clusters (fun and pks) in the gut symbiont Limosilactobacillus reuteri. The pks, but not the fun, cluster encodes antimicrobial activity. Forty-one of 51 L. reuteri strains tested are sensitive to Pks products; this finding was independent of strains' host origin. Sensitivity to Pks was also established in intraspecies competition experiments in gnotobiotic mice. Comparative genome analyses between Pks-resistant and -sensitive strains identified an acyltransferase gene (act) unique to Pks-resistant strains. Subsequent cell-wall analysis of wild-type and act mutant strains showed that Act acetylates cell-wall components, providing resistance to Pks-mediated killing. Additionally, pks mutants lost their competitive advantage, while act mutants lost their Pks resistance in in vivo competition assays. These findings provide insight into how closely related gut symbionts can compete and co-exist in the gastrointestinal tract.
Collapse
Affiliation(s)
- Mustafa Özçam
- Department of Food Science, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Jee-Hwan Oh
- Department of Food Science, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Restituto Tocmo
- Department of Food Science, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Deepa Acharya
- Pharmaceutical Sciences Division, University of Wisconsin-Madison, Madison, WI 53705, USA
| | - Shenwei Zhang
- Department of Food Science, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Theresa J Astmann
- Department of Food Science, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Mark Heggen
- Department of Food Science, University of Wisconsin-Madison, Madison, WI 53706, USA
| | | | - Fuyong Li
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB T6G 2P5, Canada
| | - Christopher C Cheng
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2P5, Canada
| | - Eugenio Vivas
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Federico E Rey
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Jan Claesen
- Department of Cardiovascular and Metabolic Sciences and Center for Microbiome and Human Health, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Tim S Bugni
- Pharmaceutical Sciences Division, University of Wisconsin-Madison, Madison, WI 53705, USA
| | - Jens Walter
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB T6G 2P5, Canada; Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2P5, Canada; Department of Medicine and APC Microbiome Ireland, University College Cork, Cork T12 K8AF, Ireland; School of Microbiology, University College Cork, Cork T12 YT20, Ireland
| | - Jan-Peter van Pijkeren
- Department of Food Science, University of Wisconsin-Madison, Madison, WI 53706, USA; Food Research Institute, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
22
|
Sim M, Lee J, Wy S, Park N, Lee D, Kwon D, Kim J. Generation and application of pseudo-long reads for metagenome assembly. Gigascience 2022; 11:giac044. [PMID: 35579554 PMCID: PMC9112764 DOI: 10.1093/gigascience/giac044] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 03/10/2022] [Accepted: 04/03/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Metagenomic assembly using high-throughput sequencing data is a powerful method to construct microbial genomes in environmental samples without cultivation. However, metagenomic assembly, especially when only short reads are available, is a complex and challenging task because mixed genomes of multiple microorganisms constitute the metagenome. Although long read sequencing technologies have been developed and have begun to be used for metagenomic assembly, many metagenomic studies have been performed based on short reads because the generation of long reads requires higher sequencing cost than short reads. RESULTS In this study, we present a new method called PLR-GEN. It creates pseudo-long reads from metagenomic short reads based on given reference genome sequences by considering small sequence variations existing in individual genomes of the same or different species. When applied to a mock community data set in the Human Microbiome Project, PLR-GEN dramatically extended short reads in length of 101 bp to pseudo-long reads with N50 of 33 Kbp and 0.4% error rate. The use of these pseudo-long reads generated by PLR-GEN resulted in an obvious improvement of metagenomic assembly in terms of the number of sequences, assembly contiguity, and prediction of species and genes. CONCLUSIONS PLR-GEN can be used to generate artificial long read sequences without spending extra sequencing cost, thus aiding various studies using metagenomes.
Collapse
Affiliation(s)
- Mikang Sim
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea
| | - Jongin Lee
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea
| | - Suyeon Wy
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea
| | - Nayoung Park
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea
| | - Daehwan Lee
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea
| | - Daehong Kwon
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea
| | - Jaebum Kim
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea
| |
Collapse
|
23
|
Saenz C, Nigro E, Gunalan V, Arumugam M. MIntO: A Modular and Scalable Pipeline For Microbiome Metagenomic and Metatranscriptomic Data Integration. FRONTIERS IN BIOINFORMATICS 2022; 2:846922. [PMID: 36304282 PMCID: PMC9580859 DOI: 10.3389/fbinf.2022.846922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Accepted: 04/11/2022] [Indexed: 11/24/2022] Open
Abstract
Omics technologies have revolutionized microbiome research allowing the characterization of complex microbial communities in different biomes without requiring their cultivation. As a consequence, there has been a great increase in the generation of omics data from metagenomes and metatranscriptomes. However, pre-processing and analysis of these data have been limited by the availability of computational resources, bioinformatics expertise and standardized computational workflows to obtain consistent results that are comparable across different studies. Here, we introduce MIntO (Microbiome Integrated meta-Omics), a highly versatile pipeline that integrates metagenomic and metatranscriptomic data in a scalable way. The distinctive feature of this pipeline is the computation of gene expression profile through integrating metagenomic and metatranscriptomic data taking into account the community turnover and gene expression variations to disentangle the mechanisms that shape the metatranscriptome across time and between conditions. The modular design of MIntO enables users to run the pipeline using three available modes based on the input data and the experimental design, including de novo assembly leading to metagenome-assembled genomes. The integrated pipeline will be relevant to provide unique biochemical insights into microbial ecology by linking functions to retrieved genomes and to examine gene expression variation. Functional characterization of community members will be crucial to increase our knowledge of the microbiome’s contribution to human health and environment. MIntO v1.0.1 is available at https://github.com/arumugamlab/MIntO.
Collapse
|
24
|
González D, Robas M, Fernández V, Bárcena M, Probanza A, Jiménez PA. Comparative Metagenomic Study of Rhizospheric and Bulk Mercury-Contaminated Soils in the Mining District of Almadén. Front Microbiol 2022; 13:797444. [PMID: 35330761 PMCID: PMC8940170 DOI: 10.3389/fmicb.2022.797444] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 01/17/2022] [Indexed: 12/22/2022] Open
Abstract
Soil contamination by heavy metals, particularly mercury (Hg), is a problem that can seriously affect the environment, animals, and human health. Hg has the capacity to biomagnify in the food chain. That fact can lead to pathologies, of those which affect the central nervous system being the most severe. It is convenient to know the biological environmental indicators that alert of the effects of Hg contamination as well as the biological mechanisms that can help in its remediation. To contribute to this knowledge, this study conducted comparative analysis by the use of Shotgun metagenomics of the microbial communities in rhizospheric soils and bulk soil of the mining region of Almadén (Ciudad Real, Spain), one of the most affected areas by Hg in the world The sequences obtained was analyzed with MetaPhlAn2 tool and SUPER-FOCUS. The most abundant taxa in the taxonomic analysis in bulk soil were those of Actinobateria and Alphaproteobacteria. On the contrary, in the rhizospheric soil microorganisms belonging to the phylum Proteobacteria were abundant, evidencing that roots have a selective effect on the rhizospheric communities. In order to analyze possible indicators of biological contamination, a functional potential analysis was performed. The results point to a co-selection of the mechanisms of resistance to Hg and the mechanisms of resistance to antibiotics or other toxic compounds in environments contaminated by Hg. Likewise, the finding of antibiotic resistance mechanisms typical of the human clinic, such as resistance to beta-lactams and glycopeptics (vancomycin), suggests that these environments can behave as reservoirs. The sequences involved in Hg resistance (operon mer and efflux pumps) have a similar abundance in both soil types. However, the response to abiotic stress (salinity, desiccation, and contaminants) is more prevalent in rhizospheric soil. Finally, sequences involved in nitrogen fixation and metabolism and plant growth promotion (PGP genes) were identified, with higher relative abundances in rhizospheric soils. These findings can be the starting point for the targeted search for microorganisms suitable for further use in bioremediation processes in Hg-contaminated environments.
Collapse
Affiliation(s)
- Daniel González
- Department of Pharmaceutical Science and Health, CEU Universities, Boadilla del Monte, Spain
| | - Marina Robas
- Department of Pharmaceutical Science and Health, CEU Universities, Boadilla del Monte, Spain
| | - Vanesa Fernández
- Department of Pharmaceutical Science and Health, CEU Universities, Boadilla del Monte, Spain
| | - Marta Bárcena
- Department of Pharmaceutical Science and Health, CEU Universities, Boadilla del Monte, Spain
| | - Agustín Probanza
- Department of Pharmaceutical Science and Health, CEU Universities, Boadilla del Monte, Spain
| | - Pedro A Jiménez
- Department of Pharmaceutical Science and Health, CEU Universities, Boadilla del Monte, Spain
| |
Collapse
|
25
|
Cuscó A, Pérez D, Viñes J, Fàbregas N, Francino O. Novel canine high-quality metagenome-assembled genomes, prophages and host-associated plasmids provided by long-read metagenomics together with Hi-C proximity ligation. Microb Genom 2022; 8. [PMID: 35298370 PMCID: PMC9176287 DOI: 10.1099/mgen.0.000802] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The human gut microbiome has been extensively studied, yet the canine gut microbiome is still largely unknown. The availability of high-quality genomes is essential in the fields of veterinary medicine and nutrition to unravel the biological role of key microbial members in the canine gut environment. Our aim was to evaluate nanopore long-read metagenomics and Hi-C (high-throughput chromosome conformation capture) proximity ligation to provide high-quality metagenome-assembled genomes (HQ MAGs) of the canine gut environment. By combining nanopore long-read metagenomics and Hi-C proximity ligation, we retrieved 27 HQ MAGs and 7 medium-quality MAGs of a faecal sample of a healthy dog. Canine MAGs (CanMAGs) improved genome contiguity of representatives from the animal and human MAG catalogues – short-read MAGs from public datasets – for the species they represented: they were more contiguous with complete ribosomal operons and at least 18 canonical tRNAs. Both canine-specific bacterial species and gut generalists inhabit the dog’s gastrointestinal environment. Most of them belonged to Firmicutes, followed by Bacteroidota and Proteobacteria. We also assembled one Actinobacteriota and one Fusobacteriota MAG. CanMAGs harboured antimicrobial-resistance genes (ARGs) and prophages and were linked to plasmids. ARGs conferring resistance to tetracycline were most predominant within CanMAGs, followed by lincosamide and macrolide ones. At the functional level, carbohydrate transport and metabolism was the most variable within the CanMAGs, and mobilome function was abundant in some MAGs. Specifically, we assigned the mobilome functions and the associated mobile genetic elements to the bacterial host. The CanMAGs harboured 50 bacteriophages, providing novel bacterial-host information for eight viral clusters, and Hi-C proximity ligation data linked the six potential plasmids to their bacterial host. Long-read metagenomics and Hi-C proximity ligation are likely to become a comprehensive approach to HQ MAG discovery and assignment of extra-chromosomal elements to their bacterial host. This will provide essential information for studying the canine gut microbiome in veterinary medicine and animal nutrition.
Collapse
Affiliation(s)
- Anna Cuscó
- Vetgenomics, Edificio Eureka, Parc de Recerca UAB, Barcelona, Spain.,Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, PR China
| | - Daniel Pérez
- Molecular Genetics Veterinary Service (SVGM), Veterinary School, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Joaquim Viñes
- Vetgenomics, Edificio Eureka, Parc de Recerca UAB, Barcelona, Spain.,Molecular Genetics Veterinary Service (SVGM), Veterinary School, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Norma Fàbregas
- Vetgenomics, Edificio Eureka, Parc de Recerca UAB, Barcelona, Spain
| | - Olga Francino
- Molecular Genetics Veterinary Service (SVGM), Veterinary School, Universitat Autònoma de Barcelona, Barcelona, Spain
| |
Collapse
|
26
|
Ryu EP, Davenport ER. Host Genetic Determinants of the Microbiome Across Animals: From Caenorhabditis elegans to Cattle. Annu Rev Anim Biosci 2022; 10:203-226. [PMID: 35167316 PMCID: PMC11000414 DOI: 10.1146/annurev-animal-020420-032054] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Animals harbor diverse communities of microbes within their gastrointestinal tracts. Phylogenetic relationship, diet, gut morphology, host physiology, and ecology all influence microbiome composition within and between animal clades. Emerging evidence points to host genetics as also playing a role in determining gut microbial composition within species. Here, we discuss recent advances in the study of microbiome heritability across a variety of animal species. Candidate gene and discovery-based studies in humans, mice, Drosophila, Caenorhabditis elegans, cattle, swine, poultry, and baboons reveal trends in the types of microbes that are heritable and the host genes and pathways involved in shaping the microbiome. Heritable gut microbes within a host species tend to be phylogenetically restricted. Host genetic variation in immune- and growth-related genes drives the abundances of these heritable bacteria within the gut. With only a small slice of the metazoan branch of the tree of life explored to date, this is an area rife with opportunities to shed light into the mechanisms governing host-microbe relationships.
Collapse
Affiliation(s)
- Erica P Ryu
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, USA; ,
| | - Emily R Davenport
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, USA; ,
- Huck Institutes of the Life Sciences and Institute for Computational and Data Sciences, Pennsylvania State University, University Park, Pennsylvania, USA
| |
Collapse
|
27
|
Rasmussen JA, Villumsen KR, Ernst M, Hansen M, Forberg T, Gopalakrishnan S, Gilbert MTP, Bojesen AM, Kristiansen K, Limborg MT. A multi-omics approach unravels metagenomic and metabolic alterations of a probiotic and synbiotic additive in rainbow trout (Oncorhynchus mykiss). MICROBIOME 2022; 10:21. [PMID: 35094708 PMCID: PMC8802455 DOI: 10.1186/s40168-021-01221-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 12/27/2021] [Indexed: 05/02/2023]
Abstract
BACKGROUND Animal protein production is increasingly looking towards microbiome-associated services such as the design of new and better probiotic solutions to further improve gut health and production sustainability. Here, we investigate the functional effects of bacteria-based pro- and synbiotic feed additives on microbiome-associated functions in relation to growth performance in the commercially important rainbow trout (Oncorhynchus mykiss). We combine complementary insights from multiple omics datasets from gut content samples, including 16S bacterial profiling, whole metagenomes, and untargeted metabolomics, to investigate bacterial metagenome-assembled genomes (MAGs) and their molecular interactions with host metabolism. RESULTS Our findings reveal that (I) feed additives changed the microbiome and that rainbow trout reared with feed additives had a significantly reduced relative abundance of the salmonid related Candidatus Mycoplasma salmoninae in both the mid and distal gut content, (II) genome resolved metagenomics revealed that alterations of microbial arginine biosynthesis and terpenoid backbone synthesis pathways were directly associated with the presence of Candidatus Mycoplasma salmoninae, and (III) differences in the composition of intestinal microbiota among feed types were directly associated with significant changes of the metabolomic landscape, including lipids and lipid-like metabolites, amino acids, bile acids, and steroid-related metabolites. CONCLUSION Our results demonstrate how the use of multi-omics to investigate complex host-microbiome interactions enable us to better evaluate the functional potential of probiotics compared to studies that only measure overall growth performance or that only characterise the microbial composition in intestinal environments. Video Abstract.
Collapse
Affiliation(s)
- Jacob Agerbo Rasmussen
- Laboratory of Genomics and Molecular Medicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
- Center for Evolutionary Hologenomics, GLOBE Institute, Faculty of Health and Medical Sciences, Copenhagen, Denmark.
| | - Kasper Rømer Villumsen
- Department of Veterinary and Animal Sciences, University of Copenhagen, Veterinary Clinical Microbiology, Copenhagen, Denmark
| | - Madeleine Ernst
- Section for Clinical Mass Spectrometry, Danish Center for Neonatal Screening, Department of Congenital Disorders, Statens Serum Institut, 2300, Copenhagen, Denmark
| | - Martin Hansen
- Department of Environmental Science, Aarhus University, Aarhus, Denmark
| | | | - Shyam Gopalakrishnan
- Center for Evolutionary Hologenomics, GLOBE Institute, Faculty of Health and Medical Sciences, Copenhagen, Denmark
| | - M Thomas P Gilbert
- Center for Evolutionary Hologenomics, GLOBE Institute, Faculty of Health and Medical Sciences, Copenhagen, Denmark
- University Museum NTNU, Trondheim, Norway
| | - Anders Miki Bojesen
- Department of Veterinary and Animal Sciences, University of Copenhagen, Veterinary Clinical Microbiology, Copenhagen, Denmark
| | - Karsten Kristiansen
- Laboratory of Genomics and Molecular Medicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Institute of Metagenomics, Qingdao-Europe Advanced Institute for Life Sciences, Qingdao, China
| | - Morten Tønsberg Limborg
- Laboratory of Genomics and Molecular Medicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
- Center for Evolutionary Hologenomics, GLOBE Institute, Faculty of Health and Medical Sciences, Copenhagen, Denmark.
| |
Collapse
|
28
|
Martin S, Heavens D, Lan Y, Horsfield S, Clark MD, Leggett RM. Nanopore adaptive sampling: a tool for enrichment of low abundance species in metagenomic samples. Genome Biol 2022; 23:11. [PMID: 35067223 PMCID: PMC8785595 DOI: 10.1186/s13059-021-02582-x] [Citation(s) in RCA: 64] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2021] [Accepted: 12/20/2021] [Indexed: 12/13/2022] Open
Abstract
Adaptive sampling is a method of software-controlled enrichment unique to nanopore sequencing platforms. To test its potential for enrichment of rarer species within metagenomic samples, we create a synthetic mock community and construct sequencing libraries with a range of mean read lengths. Enrichment is up to 13.87-fold for the least abundant species in the longest read length library; factoring in reduced yields from rejecting molecules the calculated efficiency raises this to 4.93-fold. Finally, we introduce a mathematical model of enrichment based on molecule length and relative abundance, whose predictions correlate strongly with mock and complex real-world microbial communities.
Collapse
Affiliation(s)
- Samuel Martin
- Earlham Institute, Norwich Research Park, Norwich, NR4 7UZ, UK
| | - Darren Heavens
- Earlham Institute, Norwich Research Park, Norwich, NR4 7UZ, UK
| | - Yuxuan Lan
- Earlham Institute, Norwich Research Park, Norwich, NR4 7UZ, UK
| | | | | | | |
Collapse
|
29
|
Youngblut ND, de la Cuesta-Zuluaga J, Ley RE. Incorporating genome-based phylogeny and functional similarity into diversity assessments helps to resolve a global collection of human gut metagenomes. Environ Microbiol 2022; 24:3966-3984. [PMID: 35049120 DOI: 10.1111/1462-2920.15910] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 01/15/2022] [Indexed: 11/29/2022]
Abstract
Tree-based diversity measures incorporate phylogenetic or functional relatedness into comparisons of microbial communities. This can improve the identification of explanatory factors compared to tree-agnostic diversity measures. However, applying tree-based diversity measures to metagenome data is more challenging than for single-locus sequencing (e.g., 16S rRNA gene). Utilizing the Genome Taxonomy Database (GTDB) for species-level metagenome profiling allows for functional diversity measures based on genomic content or traits inferred from it. Still, it is unclear how metagenome-based assessments of microbiome diversity benefit from incorporating phylogeny or function into measures of diversity. We assessed this by measuring phylogeny-based, function-based, and tree-agnostic diversity measures from a large, global collection of human gut metagenomes composed of 30 studies and 2943 samples. We found tree-based measures to explain phenotypic variation (e.g., westernization, disease status, and gender) better or equivalent to tree-agnostic measures. Ecophylogenetic and functional diversity measures provided unique insight into how microbiome diversity was partitioned by phenotype. Tree-based measures greatly improved machine learning model performance for predicting westernization, disease status, and gender, relative to models trained solely on tree-agnostic measures. Our findings illustrate the usefulness of tree- and function-based measures for metagenomic assessments of microbial diversity, which is a fundamental component of microbiome science. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Nicholas D Youngblut
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Max Planck Ring 5, 72076, Tübingen, Germany
| | - Jacobo de la Cuesta-Zuluaga
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Max Planck Ring 5, 72076, Tübingen, Germany
| | - Ruth E Ley
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Max Planck Ring 5, 72076, Tübingen, Germany
| |
Collapse
|
30
|
Chibani CM, Mahnert A, Borrel G, Almeida A, Werner A, Brugère JF, Gribaldo S, Finn RD, Schmitz RA, Moissl-Eichinger C. A catalogue of 1,167 genomes from the human gut archaeome. Nat Microbiol 2022; 7:48-61. [PMID: 34969981 PMCID: PMC8727293 DOI: 10.1038/s41564-021-01020-9] [Citation(s) in RCA: 59] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 11/10/2021] [Indexed: 12/19/2022]
Abstract
The human gut microbiome plays an important role in health, but its archaeal diversity remains largely unexplored. In the present study, we report the analysis of 1,167 nonredundant archaeal genomes (608 high-quality genomes) recovered from human gastrointestinal tract, sampled across 24 countries and rural and urban populations. We identified previously undescribed taxa including 3 genera, 15 species and 52 strains. Based on distinct genomic features, we justify the split of the Methanobrevibacter smithii clade into two separate species, with one represented by the previously undescribed 'Candidatus Methanobrevibacter intestini'. Patterns derived from 28,581 protein clusters showed significant associations with sociodemographic characteristics such as age groups and lifestyle. We additionally show that archaea are characterized by specific genomic and functional adaptations to the host and carry a complex virome. Our work expands our current understanding of the human archaeome and provides a large genome catalogue for future analyses to decipher its impact on human physiology.
Collapse
Affiliation(s)
- Cynthia Maria Chibani
- grid.9764.c0000 0001 2153 9986Institute for Microbiology, Christian-Albrechts-University Kiel, Kiel, Germany
| | - Alexander Mahnert
- grid.11598.340000 0000 8988 2476Diagnostic & Research Institute of Hygiene, Microbiology and Environmental Medicine, Medical University Graz, Graz, Austria
| | - Guillaume Borrel
- grid.428999.70000 0001 2353 6535Department of Microbiology, Unit of Evolutionary Biology of the Microbial Cell, Institut Pasteur, Paris, France
| | - Alexandre Almeida
- grid.225360.00000 0000 9709 7726European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK ,grid.10306.340000 0004 0606 5382Wellcome Sanger Institute, Cambridge, UK
| | - Almut Werner
- grid.9764.c0000 0001 2153 9986Institute for Microbiology, Christian-Albrechts-University Kiel, Kiel, Germany
| | - Jean-François Brugère
- grid.494717.80000000115480420Institut Universitaire de Technologie Clermont Auvergne, Université Clermont Auvergne, CNRS, UMR 6023 Laboratoire Microorganismes: Genome et Environnement, Clermont-Ferrand, France
| | - Simonetta Gribaldo
- grid.428999.70000 0001 2353 6535Department of Microbiology, Unit of Evolutionary Biology of the Microbial Cell, Institut Pasteur, Paris, France
| | - Robert D. Finn
- grid.225360.00000 0000 9709 7726European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Ruth A. Schmitz
- grid.9764.c0000 0001 2153 9986Institute for Microbiology, Christian-Albrechts-University Kiel, Kiel, Germany
| | - Christine Moissl-Eichinger
- Diagnostic & Research Institute of Hygiene, Microbiology and Environmental Medicine, Medical University Graz, Graz, Austria. .,BioTechMed, Graz, Austria.
| |
Collapse
|
31
|
Biogeography of Bacterial Communities and Specialized Metabolism in Human Aerodigestive Tract Microbiomes. Microbiol Spectr 2021; 9:e0166921. [PMID: 34704787 PMCID: PMC8549736 DOI: 10.1128/spectrum.01669-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
The aerodigestive tract (ADT) is the primary portal through which pathogens and other invading microbes enter the body. As the direct interface with the environment, we hypothesize that the ADT microbiota possess biosynthetic gene clusters (BGCs) for antibiotics and other specialized metabolites to compete with both endogenous and exogenous microbes. From 1,214 bacterial genomes, representing 136 genera and 387 species that colonize the ADT, we identified 3,895 BGCs. To determine the distribution of BGCs and bacteria in different ADT sites, we aligned 1,424 metagenomes, from nine different ADT sites, onto the predicted BGCs. We show that alpha diversity varies across the ADT and that each site is associated with distinct bacterial communities and BGCs. We identify specific BGC families enriched in the buccal mucosa, external naris, gingiva, and tongue dorsum despite these sites harboring closely related bacteria. We reveal BGC enrichment patterns indicative of the ecology at each site. For instance, aryl polyene and resorcinol BGCs are enriched in the gingiva and tongue, which are colonized by many anaerobes. In addition, we find that streptococci colonizing the tongue and cheek possess different ribosomally synthesized and posttranslationally modified peptide BGCs. Finally, we highlight bacterial genera with BGCs but are underexplored for specialized metabolism and demonstrate the bioactivity of Actinomyces against other bacteria, including human pathogens. Together, our results demonstrate that specialized metabolism in the ADT is extensive and that by exploring these microbiomes further, we will better understand the ecology and biogeography of this system and identify new bioactive natural products. IMPORTANCE Bacteria produce specialized metabolites to compete with other microbes. Though the biological activities of many specialized metabolites have been determined, our understanding of their ecology is limited, particularly within the human microbiome. As the aerodigestive tract (ADT) faces the external environment, bacteria colonizing this tract must compete both among themselves and with invading microbes, including human pathogens. We analyzed the genomes of ADT bacteria to identify biosynthetic gene clusters (BGCs) for specialized metabolites. We found that the majority of ADT BGCs are uncharacterized and the metabolites they encode are unknown. We mapped the distribution of BGCs across the ADT and determined that each site is associated with its own distinct bacterial community and BGCs. By further characterizing these BGCs, we will inform our understanding of ecology and biogeography across the ADT, and we may uncover new specialized metabolites, including antibiotics.
Collapse
|
32
|
Youngblut ND, Reischer GH, Dauser S, Maisch S, Walzer C, Stalder G, Farnleitner AH, Ley RE. Vertebrate host phylogeny influences gut archaeal diversity. Nat Microbiol 2021; 6:1443-1454. [PMID: 34702978 PMCID: PMC8556154 DOI: 10.1038/s41564-021-00980-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Accepted: 09/16/2021] [Indexed: 01/04/2023]
Abstract
Commonly used 16S rRNA gene primers do not detect the full range of archaeal diversity present in the vertebrate gut. As a result, several questions regarding the archaeal component of the gut microbiota remain, including which Archaea are host-associated, the specificities of such associations and the major factors influencing archaeal diversity. Using 16S rRNA gene amplicon sequencing with primers that specifically target Archaea, we obtained sufficient sequence data from 185 gastrointestinal samples collected from 110 vertebrate species that span five taxonomic classes (Mammalia, Aves, Reptilia, Amphibia and Actinopterygii), of which the majority were wild. We provide evidence for previously undescribed Archaea-host associations, including Bathyarchaeia and Methanothermobacter, the latter of which was prevalent among Aves and relatively abundant in species with higher body temperatures, although this association could not be decoupled from host phylogeny. Host phylogeny explained archaeal diversity more strongly than diet, while specific taxa were associated with both factors, and cophylogeny was significant and strongest for mammalian herbivores. Methanobacteria was the only class predicted to be present in the last common ancestors of mammals and all host species. Further analysis indicated that Archaea-Bacteria interactions have a limited effect on archaeal diversity. These findings expand our current understanding of Archaea-vertebrate associations.
Collapse
Affiliation(s)
- Nicholas D Youngblut
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Tübingen, Germany.
| | - Georg H Reischer
- TU Wien, Institute of Chemical, Environmental and Bioscience Engineering, Research Group for Environmental Microbiology and Molecular Diagnostics 166/5/3, Vienna, Austria.,ICC Interuniversity Cooperation Centre Water & Health, Vienna, Austria
| | - Silke Dauser
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Sophie Maisch
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Chris Walzer
- Wildlife Conservation Society, Bronx, NY, USA.,Research Institute of Wildlife Ecology, University of Veterinary Medicine, Vienna, Austria
| | - Gabrielle Stalder
- Research Institute of Wildlife Ecology, University of Veterinary Medicine, Vienna, Austria
| | - Andreas H Farnleitner
- TU Wien, Institute of Chemical, Environmental and Bioscience Engineering, Research Group for Environmental Microbiology and Molecular Diagnostics 166/5/3, Vienna, Austria.,ICC Interuniversity Cooperation Centre Water & Health, Vienna, Austria.,Research Division Water Quality and Health, Karl Landsteiner University for Health Sciences, Krems an der Donau, Austria
| | - Ruth E Ley
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Tübingen, Germany.,Cluster of Excellence EXC 2124 Controlling Microbes to Fight Infections, University of Tübingen, Tübingen, Germany
| |
Collapse
|
33
|
Abstract
Conservation research has historically been conducted at the macro level, focusing on animals and plants and their role in the wider ecosystem. However, there is a growing appreciation of the importance of microbial communities in conservation. Most microbiome research in conservation thus far has used amplicon sequencing methods to assess the taxonomic composition of microbial communities and inferred functional capabilities from these data. However, as manipulation of the microbiome as a conservation tool becomes more and more feasible, there is a growing need to understand the direct functional consequences of shifts in microbiome composition. This review outlines the latest advances in microbiome research from a functional perspective and how these data can be used to inform conservation strategies. This review will also consider some of the challenges faced when studying the microbiomes of wild animals and how they can be overcome by careful study design and sampling methods. Environmental changes brought about by climate change or direct human actions have the potential to alter the taxonomic composition of microbiomes in wild populations. Understanding how taxonomic shifts affect the function of microbial communities is important for identifying species most threatened by potential disruption to their microbiome. Preservation or even restoration of these functions has the potential to be a powerful tool in conservation biology and a shift towards functional characterisation of gut microbiome diversity will be an important first step.
Collapse
|
34
|
Youngblut ND, Ley RE. Struo2: efficient metagenome profiling database construction for ever-expanding microbial genome datasets. PeerJ 2021; 9:e12198. [PMID: 34616633 PMCID: PMC8450008 DOI: 10.7717/peerj.12198] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 08/31/2021] [Indexed: 11/20/2022] Open
Abstract
Mapping metagenome reads to reference databases is the standard approach for assessing microbial taxonomic and functional diversity from metagenomic data. However, public reference databases often lack recently generated genomic data such as metagenome-assembled genomes (MAGs), which can limit the sensitivity of read-mapping approaches. We previously developed the Struo pipeline in order to provide a straight-forward method for constructing custom databases; however, the pipeline does not scale well enough to cope with the ever-increasing number of publicly available microbial genomes. Moreover, the pipeline does not allow for efficient database updating as new data are generated. To address these issues, we developed Struo2, which is >3.5 fold faster than Struo at database generation and can also efficiently update existing databases. We also provide custom Kraken2, Bracken, and HUMAnN3 databases that can be easily updated with new genomes and/or individual gene sequences. Efficient database updating, coupled with our pre-generated databases, enables “assembly-enhanced” profiling, which increases database comprehensiveness via inclusion of native genomic content. Inclusion of newly generated genomic content can greatly increase database comprehensiveness, especially for understudied biomes, which will enable more accurate assessments of microbiome diversity.
Collapse
Affiliation(s)
- Nicholas D Youngblut
- Microbiome Science, Max Planck Institute for Developmental Biology, Tuebingen, Baden Wurttemberg, Germany
| | - Ruth E Ley
- Microbiome Science, Max Planck Institute for Developmental Biology, Tuebingen, Baden Wurttemberg, Germany
| |
Collapse
|
35
|
Rämä T, Quandt CA. Improving Fungal Cultivability for Natural Products Discovery. Front Microbiol 2021; 12:706044. [PMID: 34603232 PMCID: PMC8481835 DOI: 10.3389/fmicb.2021.706044] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Accepted: 08/23/2021] [Indexed: 11/13/2022] Open
Abstract
The pool of fungal secondary metabolites can be extended by activating silent gene clusters of cultured strains or by using sensitive biological assays that detect metabolites missed by analytical methods. Alternatively, or in parallel with the first approach, one can increase the diversity of existing culture collections to improve the access to new natural products. This review focuses on the latter approach of screening previously uncultured fungi for chemodiversity. Both strategies have been practiced since the early days of fungal biodiscovery, yet relatively little has been done to overcome the challenge of cultivability of as-yet-uncultivated fungi. Whereas earlier cultivability studies using media formulations and biological assays to scrutinize fungal growth and associated factors were actively conducted, the application of modern omics methods remains limited to test how to culture the fungal dark matter and recalcitrant groups of described fungi. This review discusses the development of techniques to increase the cultivability of filamentous fungi that include culture media formulations and the utilization of known chemical growth factors, in situ culturing and current synthetic biology approaches that build upon knowledge from sequenced genomes. We list more than 100 growth factors, i.e., molecules, biological or physical factors that have been demonstrated to induce spore germination as well as tens of inducers of mycelial growth. We review culturing conditions that can be successfully manipulated for growth of fungi and visit recent information from omics methods to discuss the metabolic basis of cultivability. Earlier work has demonstrated the power of co-culturing fungi with their host, other microorganisms or their exudates to increase their cultivability. Co-culturing of two or more organisms is also a strategy used today for increasing cultivability. However, fungi possess an increased risk for cross-contaminations between isolates in existing in situ or microfluidics culturing devices. Technological improvements for culturing fungi are discussed in the review. We emphasize that improving the cultivability of fungi remains a relevant strategy in drug discovery and underline the importance of ecological and taxonomic knowledge in culture-dependent drug discovery. Combining traditional and omics techniques such as single cell or metagenome sequencing opens up a new era in the study of growth factors of hundreds of thousands of fungal species with high drug discovery potential.
Collapse
Affiliation(s)
- Teppo Rämä
- Marbio, Norwegian College of Fishery Science, University of Tromsø – The Arctic University of Norway, Tromsø, Norway
| | - C. Alisha Quandt
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, Boulder, CO, United States
| |
Collapse
|
36
|
Abstract
Opportunistic feeding and multiple other environment factors can modulate the gut microbiome, and bias conclusions, when wild animals are used for studying the influence of phylogeny and diet on their gut microbiomes. Here, we controlled for these other confounding factors in our investigation of the magnitude of the effect of diet on the gut microbiome assemblies of nonpasserine birds. We collected fecal samples, at one point in time, from 35 species of birds in a single zoo as well as 6 species of domestic poultry from farms in Guangzhou city to minimize the influences from interfering factors. Specifically, we describe 16S rRNA amplicon data from 129 fecal samples obtained from 41 species of birds, with additional shotgun metagenomic sequencing data generated from 16 of these individuals. Our data show that diets containing native starch increase the abundance of Lactobacillus in the gut microbiome, while those containing plant-derived fiber mainly enrich the level of Clostridium Greater numbers of Fusobacteria and Proteobacteria are detected in carnivorous birds, while in birds fed a commercial corn-soybean basal diet, a stronger inner-connected microbial community containing Clostridia and Bacteroidia was enriched. Furthermore, the metagenome functions of the microbes (such as lipid metabolism and amino acid synthesis) were adapted to the different food types to achieve a beneficial state for the host. In conclusion, the covariation of diet and gut microbiome identified in our study demonstrates a modulation of the gut microbiome by dietary diversity and helps us better understand how birds live based on diet-microbiome-host interactions.IMPORTANCE Our study identified food source, rather than host phylogeny, as the main factor modulating the gut microbiome diversity of nonpasserine birds, after minimizing the effects of other complex interfering factors such as weather, season, and geography. Adaptive evolution of microbes to food types formed a dietary-microbiome-host interaction reciprocal state. The covariation of diet and gut microbiome, including the response of microbiota assembly to diet in structure and function, is important for health and nutrition in animals. Our findings help resolve the major modulators of gut microbiome diversity in nonpasserine birds, which had not previously been well studied. The diet-microbe interactions and cooccurrence patterns identified in our study may be of special interest for future health assessment and conservation in birds.
Collapse
|
37
|
Cuscó A, Pérez D, Viñes J, Fàbregas N, Francino O. Long-read metagenomics retrieves complete single-contig bacterial genomes from canine feces. BMC Genomics 2021; 22:330. [PMID: 33957869 PMCID: PMC8103633 DOI: 10.1186/s12864-021-07607-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2020] [Accepted: 04/12/2021] [Indexed: 12/12/2022] Open
Abstract
Background Long-read sequencing in metagenomics facilitates the assembly of complete genomes out of complex microbial communities. These genomes include essential biologic information such as the ribosomal genes or the mobile genetic elements, which are usually missed with short-reads. We applied long-read metagenomics with Nanopore sequencing to retrieve high-quality metagenome-assembled genomes (HQ MAGs) from a dog fecal sample. Results We used nanopore long-read metagenomics and frameshift aware correction on a canine fecal sample and retrieved eight single-contig HQ MAGs, which were > 90% complete with < 5% contamination, and contained most ribosomal genes and tRNAs. At the technical level, we demonstrated that a high-molecular-weight DNA extraction improved the metagenomics assembly contiguity, the recovery of the rRNA operons, and the retrieval of longer and circular contigs that are potential HQ MAGs. These HQ MAGs corresponded to Succinivibrio, Sutterella, Prevotellamassilia, Phascolarctobacterium, Catenibacterium, Blautia, and Enterococcus genera. Linking our results to previous gastrointestinal microbiome reports (metagenome or 16S rRNA-based), we found that some bacterial species on the gastrointestinal tract seem to be more canid-specific –Succinivibrio, Prevotellamassilia, Phascolarctobacterium, Blautia_A sp900541345–, whereas others are more broadly distributed among animal and human microbiomes –Sutterella, Catenibacterium, Enterococcus, and Blautia sp003287895. Sutterella HQ MAG is potentially the first reported genome assembly for Sutterella stercoricanis, as assigned by 16S rRNA gene similarity. Moreover, we show that long reads are essential to detect mobilome functions, usually missed in short-read MAGs. Conclusions We recovered eight single-contig HQ MAGs from canine feces of a healthy dog with nanopore long-reads. We also retrieved relevant biological insights from these specific bacterial species previously missed in public databases, such as complete ribosomal operons and mobilome functions. The high-molecular-weight DNA extraction improved the assembly’s contiguity, whereas the high-accuracy basecalling, the raw read error correction, the assembly polishing, and the frameshift correction reduced the insertion and deletion errors. Both experimental and analytical steps ensured the retrieval of complete bacterial genomes. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07607-0.
Collapse
Affiliation(s)
- Anna Cuscó
- Vetgenomics, Ed Eureka, Parc de Recerca UAB, Barcelona, Spain.
| | - Daniel Pérez
- Molecular Genetics Veterinary Service (SVGM), Veterinary School, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Joaquim Viñes
- Vetgenomics, Ed Eureka, Parc de Recerca UAB, Barcelona, Spain.,Molecular Genetics Veterinary Service (SVGM), Veterinary School, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Norma Fàbregas
- Vetgenomics, Ed Eureka, Parc de Recerca UAB, Barcelona, Spain
| | - Olga Francino
- Molecular Genetics Veterinary Service (SVGM), Veterinary School, Universitat Autònoma de Barcelona, Barcelona, Spain
| |
Collapse
|
38
|
Abstract
Host-adapted microorganisms are generally assumed to have evolved from free-living, environmental microorganisms, as examples of the reverse process are rare. In the phylum Gammaproteobacteria, family Moraxellaceae, the genus Psychrobacter includes strains from a broad ecological distribution including animal bodies as well as sea ice and other nonhost environments. To elucidate the relationship between these ecological niches and Psychrobacter's evolutionary history, we performed tandem genomic analyses with phenotyping of 85 Psychrobacter accessions. Phylogenomic analysis of the family Moraxellaceae reveals that basal members of the Psychrobacter clade are Moraxella spp., a group of often-pathogenic organisms. Psychrobacter exhibited two broad growth patterns in our phenotypic screen: one group that we called the "flexible ecotype" (FE) had the ability to grow between 4 and 37°C, and the other, which we called the "restricted ecotype" (RE), could grow between 4 and 25°C. The FE group includes phylogenetically basal strains, and FE strains exhibit increased transposon copy numbers, smaller genomes, and a higher likelihood to be bile salt resistant. The RE group contains only phylogenetically derived strains and has increased proportions of lipid metabolism and biofilm formation genes, functions that are adaptive to cold stress. In a 16S rRNA gene survey of polar bear fecal samples, we detect both FE and RE strains, but in in vivo colonizations of gnotobiotic mice, only FE strains persist. Our results indicate the ability to grow at 37°C, seemingly necessary for mammalian gut colonization, is an ancestral trait for Psychrobacter, which likely evolved from a pathobiont.IMPORTANCE Host-associated microbes are generally assumed to have evolved from free-living ones. The evolutionary transition of microbes in the opposite direction, from host associated toward free living, has been predicted based on phylogenetic data but not studied in depth. Here, we provide evidence that the genus Psychrobacter, particularly well known for inhabiting low-temperature, high-salt environments such as sea ice, permafrost soils, and frozen foodstuffs, has evolved from a mammalian-associated ancestor. We show that some Psychrobacter strains retain seemingly ancestral genomic and phenotypic traits that correspond with host association while others have diverged to psychrotrophic or psychrophilic lifestyles.
Collapse
|
39
|
Sagita R, Quax WJ, Haslinger K. Current State and Future Directions of Genetics and Genomics of Endophytic Fungi for Bioprospecting Efforts. Front Bioeng Biotechnol 2021; 9:649906. [PMID: 33791289 PMCID: PMC8005728 DOI: 10.3389/fbioe.2021.649906] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 02/16/2021] [Indexed: 12/16/2022] Open
Abstract
The bioprospecting of secondary metabolites from endophytic fungi received great attention in the 1990s and 2000s, when the controversy around taxol production from Taxus spp. endophytes was at its height. Since then, hundreds of reports have described the isolation and characterization of putative secondary metabolites from endophytic fungi. However, only very few studies also report the genetic basis for these phenotypic observations. With low sequencing cost and fast sample turnaround, genetics- and genomics-based approaches have risen to become comprehensive approaches to study natural products from a wide-range of organisms, especially to elucidate underlying biosynthetic pathways. However, in the field of fungal endophyte biology, elucidation of biosynthetic pathways is still a major challenge. As a relatively poorly investigated group of microorganisms, even in the light of recent efforts to sequence more fungal genomes, such as the 1000 Fungal Genomes Project at the Joint Genome Institute (JGI), the basis for bioprospecting of enzymes and pathways from endophytic fungi is still rather slim. In this review we want to discuss the current approaches and tools used to associate phenotype and genotype to elucidate biosynthetic pathways of secondary metabolites in endophytic fungi through the lens of bioprospecting. This review will point out the reported successes and shortcomings, and discuss future directions in sampling, and genetics and genomics of endophytic fungi. Identifying responsible biosynthetic genes for the numerous secondary metabolites isolated from endophytic fungi opens the opportunity to explore the genetic potential of producer strains to discover novel secondary metabolites and enhance secondary metabolite production by metabolic engineering resulting in novel and more affordable medicines and food additives.
Collapse
Affiliation(s)
| | | | - Kristina Haslinger
- Groningen Institute of Pharmacy, Chemical and Pharmaceutical Biology, University of Groningen, Groningen, Netherlands
| |
Collapse
|
40
|
Kautsar SA, van der Hooft JJJ, de Ridder D, Medema MH. BiG-SLiCE: A highly scalable tool maps the diversity of 1.2 million biosynthetic gene clusters. Gigascience 2021; 10:giaa154. [PMID: 33438731 PMCID: PMC7804863 DOI: 10.1093/gigascience/giaa154] [Citation(s) in RCA: 76] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 10/29/2020] [Accepted: 11/29/2020] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Genome mining for biosynthetic gene clusters (BGCs) has become an integral part of natural product discovery. The >200,000 microbial genomes now publicly available hold information on abundant novel chemistry. One way to navigate this vast genomic diversity is through comparative analysis of homologous BGCs, which allows identification of cross-species patterns that can be matched to the presence of metabolites or biological activities. However, current tools are hindered by a bottleneck caused by the expensive network-based approach used to group these BGCs into gene cluster families (GCFs). RESULTS Here, we introduce BiG-SLiCE, a tool designed to cluster massive numbers of BGCs. By representing them in Euclidean space, BiG-SLiCE can group BGCs into GCFs in a non-pairwise, near-linear fashion. We used BiG-SLiCE to analyze 1,225,071 BGCs collected from 209,206 publicly available microbial genomes and metagenome-assembled genomes within 10 days on a typical 36-core CPU server. We demonstrate the utility of such analyses by reconstructing a global map of secondary metabolic diversity across taxonomy to identify uncharted biosynthetic potential. BiG-SLiCE also provides a "query mode" that can efficiently place newly sequenced BGCs into previously computed GCFs, plus a powerful output visualization engine that facilitates user-friendly data exploration. CONCLUSIONS BiG-SLiCE opens up new possibilities to accelerate natural product discovery and offers a first step towards constructing a global and searchable interconnected network of BGCs. As more genomes are sequenced from understudied taxa, more information can be mined to highlight their potentially novel chemistry. BiG-SLiCE is available via https://github.com/medema-group/bigslice.
Collapse
Affiliation(s)
- Satria A Kautsar
- Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, 6708PB, Wageningen, The Netherlands
| | - Justin J J van der Hooft
- Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, 6708PB, Wageningen, sThe Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, 6708PB, Wageningen, The Netherlands
| | - Marnix H Medema
- Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, 6708PB, Wageningen, The Netherlands
| |
Collapse
|
41
|
Ruiz-Perez CA, Conrad RE, Konstantinidis KT. MicrobeAnnotator: a user-friendly, comprehensive functional annotation pipeline for microbial genomes. BMC Bioinformatics 2021; 22:11. [PMID: 33407081 PMCID: PMC7789693 DOI: 10.1186/s12859-020-03940-5] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 12/15/2020] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND High-throughput sequencing has increased the number of available microbial genomes recovered from isolates, single cells, and metagenomes. Accordingly, fast and comprehensive functional gene annotation pipelines are needed to analyze and compare these genomes. Although several approaches exist for genome annotation, these are typically not designed for easy incorporation into analysis pipelines, do not combine results from different annotation databases or offer easy-to-use summaries of metabolic reconstructions, and typically require large amounts of computing power for high-throughput analysis not available to the average user. RESULTS Here, we introduce MicrobeAnnotator, a fully automated, easy-to-use pipeline for the comprehensive functional annotation of microbial genomes that combines results from several reference protein databases and returns the matching annotations together with key metadata such as the interlinked identifiers of matching reference proteins from multiple databases [KEGG Orthology (KO), Enzyme Commission (E.C.), Gene Ontology (GO), Pfam, and InterPro]. Further, the functional annotations are summarized into Kyoto Encyclopedia of Genes and Genomes (KEGG) modules as part of a graphical output (heatmap) that allows the user to quickly detect differences among (multiple) query genomes and cluster the genomes based on their metabolic similarity. MicrobeAnnotator is implemented in Python 3 and is freely available under an open-source Artistic License 2.0 from https://github.com/cruizperez/MicrobeAnnotator . CONCLUSIONS We demonstrated the capabilities of MicrobeAnnotator by annotating 100 Escherichia coli and 78 environmental Candidate Phyla Radiation (CPR) bacterial genomes and comparing the results to those of other popular tools. We showed that the use of multiple annotation databases allows MicrobeAnnotator to recover more annotations per genome compared to faster tools that use reduced databases and is computationally efficient for use in personal computers. The output of MicrobeAnnotator can be easily incorporated into other analysis pipelines while the results of other annotation tools can be seemingly incorporated into MicrobeAnnotator to generate summary plots.
Collapse
Affiliation(s)
- Carlos A. Ruiz-Perez
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332 USA
| | - Roth E. Conrad
- Ocean Science and Engineering, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332 USA
| | - Konstantinos T. Konstantinidis
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332 USA
- Ocean Science and Engineering, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332 USA
- School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332 USA
- Center for Bioinformatics and Computational Genomics, Georgia Institute of Technology, Atlanta, GA 30332 USA
| |
Collapse
|
42
|
Large-Scale Metagenome Assembly Reveals Novel Animal-Associated Microbial Genomes, Biosynthetic Gene Clusters, and Other Genetic Diversity. mSystems 2020; 5:5/6/e01045-20. [PMID: 33144315 PMCID: PMC7646530 DOI: 10.1128/msystems.01045-20] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Large-scale metagenome assemblies of human microbiomes have produced a vast catalogue of previously unseen microbial genomes; however, comparatively few microbial genomes derive from other vertebrates. Here, we generated 5,596 metagenome-assembled genomes (MAGs) from the gut metagenomes of 180 predominantly wild animal species representing 5 classes, in addition to 14 existing animal gut metagenome data sets. The MAGs comprised 1,522 species-level genome bins (SGBs), most of which were novel at the species, genus, or family level, and the majority were enriched in host versus environment metagenomes. Many traits distinguished SGBs enriched in host or environmental biomes, including the number of antimicrobial resistance genes. We identified 1,986 diverse biosynthetic gene clusters; only 23 clustered with any MIBiG database references. Gene-based assembly revealed tremendous gene diversity, much of it host or environment specific. Our MAG and gene data sets greatly expand the microbial genome repertoire and provide a broad view of microbial adaptations to the vertebrate gut.IMPORTANCE Microbiome studies on a select few mammalian species (e.g., humans, mice, and cattle) have revealed a great deal of novel genomic diversity in the gut microbiome. However, little is known of the microbial diversity in the gut of other vertebrates. We studied the gut microbiomes of a large set of mostly wild animal species consisting of mammals, birds, reptiles, amphibians, and fish. Unfortunately, we found that existing reference databases commonly used for metagenomic analyses failed to capture the microbiome diversity among vertebrates. To increase database representation, we applied advanced metagenome assembly methods to our animal gut data and to many public gut metagenome data sets that had not been used to obtain microbial genomes. Our resulting genome and gene cluster collections comprised a great deal of novel taxonomic and genomic diversity, which we extensively characterized. Our findings substantially expand what is known of microbial genomic diversity in the vertebrate gut.
Collapse
|