1
|
Yepes-García J, Falquet L. Metagenome quality metrics and taxonomical annotation visualization through the integration of MAGFlow and BIgMAG. F1000Res 2024; 13:640. [PMID: 39360247 PMCID: PMC11445639 DOI: 10.12688/f1000research.152290.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/03/2024] [Indexed: 10/04/2024] Open
Abstract
Background Building Metagenome-Assembled Genomes (MAGs) from highly complex metagenomics datasets encompasses a series of steps covering from cleaning the sequences, assembling them to finally group them into bins. Along the process, multiple tools aimed to assess the quality and integrity of each MAG are implemented. Nonetheless, even when incorporated within end-to-end pipelines, the outputs of these pieces of software must be visualized and analyzed manually lacking integration in a complete framework. Methods We developed a Nextflow pipeline (MAGFlow) for estimating the quality of MAGs through a wide variety of approaches (BUSCO, CheckM2, GUNC and QUAST), as well as for annotating taxonomically the metagenomes using GTDB-Tk2. MAGFlow is coupled to a Python-Dash application (BIgMAG) that displays the concatenated outcomes from the tools included by MAGFlow, highlighting the most important metrics in a single interactive environment along with a comparison/clustering of the input data. Results By using MAGFlow/BIgMAG, the user will be able to benchmark the MAGs obtained through different workflows or establish the quality of the MAGs belonging to different samples following the divide and rule methodology. Conclusions MAGFlow/BIgMAG represents a unique tool that integrates state-of-the-art tools to study different quality metrics and extract visually as much information as possible from a wide range of genome features.
Collapse
Affiliation(s)
- Jeferyd Yepes-García
- Swiss Institute of Bioinformatics, Lausanne, Vaud, 1015, Switzerland
- Department of Biology, University of Fribourg, Fribourg, Canton of Fribourg, 1700, Switzerland
| | - Laurent Falquet
- Swiss Institute of Bioinformatics, Lausanne, Vaud, 1015, Switzerland
- Department of Biology, University of Fribourg, Fribourg, Canton of Fribourg, 1700, Switzerland
| |
Collapse
|
2
|
Dowdell KS, Potgieter SC, Olsen K, Lee S, Vedrin M, Caverly LJ, LiPuma JJ, Raskin L. Source-to-tap investigation of the occurrence of nontuberculous mycobacteria in a full-scale chloraminated drinking water system. Appl Environ Microbiol 2024; 90:e0060924. [PMID: 39109876 PMCID: PMC11409651 DOI: 10.1128/aem.00609-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Accepted: 07/08/2024] [Indexed: 09/19/2024] Open
Abstract
Nontuberculous mycobacteria (NTM) in drinking water are a significant public health concern. However, an incomplete understanding of the factors that influence the occurrence of NTM in drinking water limits our ability to characterize risk and prevent infection. This study sought to evaluate the influence of season and water treatment, distribution, and stagnation on NTM in drinking water. Samples were collected source-to-tap in a full-scale, chloraminated drinking water system approximately monthly from December 2019 to November 2020. NTM were characterized using culture-dependent (plate culture with matrix-assisted laser desorption ionization-time-of-flight mass spectrometry [MALDI-TOF MS] isolate analysis) and culture-independent methods (quantitative PCR and genome-resolved metagenomics). Sampling locations included source waters, three locations within the treatment plant, and five buildings receiving water from the distribution system. Building plumbing samples consisted of first draw, 5-min flush, and full flush cold-water samples. As the study took place during the COVID-19 pandemic, the influence of reduced water usage in three of the five buildings was also investigated. The highest NTM densities source-to-tap were found in the summer first draw building water samples (107 gene copies/L), which also had the lowest monochloramine concentrations. Flushing was found to be effective for reducing NTM and restoring disinfectant residuals, though flush times necessary to improve water quality varied by building. Clinically relevant NTM species, including Mycobacterium avium, were recovered via plate culture, with increased occurrence observed in buildings with higher water age. Four of five NTM metagenome-assembled genomes were identified to the species level and matched identified isolates.IMPORTANCENTM infections are increasing in prevalence, difficult to treat, and associated with high morbidity and mortality rates. Our lack of understanding of the factors that influence NTM occurrence in drinking water limits our ability to prevent infections, accurately characterize risk, and focus remediation efforts. In this study, we comprehensively evaluated NTM in a full-scale drinking water system, showing that various steps in treatment and distribution influence NTM presence. Stagnant building water contained the highest NTM densities source-to-tap and was associated with low disinfectant residuals. We illustrated the differences in NTM detection and characterization obtained from culture-based and culture-independent methods, highlighting the complementarity between these approaches. We demonstrated that focusing NTM mitigation efforts in building plumbing systems, which have the highest NTM densities source-to-tap, has potential for immediate positive effects. We also identified steps during treatment that increase NTM levels, which provides beneficial information for utilities seeking to reduce NTM in finished water.
Collapse
Affiliation(s)
- Katherine S. Dowdell
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, Michigan, USA
| | - Sarah C. Potgieter
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, Michigan, USA
| | - Kirk Olsen
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, Michigan, USA
| | - Soojung Lee
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, Michigan, USA
| | - Matthew Vedrin
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, Michigan, USA
| | - Lindsay J. Caverly
- Department of Pediatrics, University of Michigan Medical School, Ann Arbor, Michigan, USA
| | - John J. LiPuma
- Department of Pediatrics, University of Michigan Medical School, Ann Arbor, Michigan, USA
| | - Lutgarde Raskin
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
3
|
Sudarshan AS, Dai Z, Gabrielli M, Oosthuizen-Vosloo S, Konstantinidis KT, Pinto AJ. New Drinking Water Genome Catalog Identifies a Globally Distributed Bacterial Genus Adapted to Disinfected Drinking Water Systems. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58:16475-16487. [PMID: 39235268 PMCID: PMC11411728 DOI: 10.1021/acs.est.4c05086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/06/2024]
Abstract
Genome-resolved insights into the structure and function of the drinking water microbiome can advance the effective management of drinking water quality. To enable this, we constructed and curated thousands of metagenome-assembled and isolate genomes from drinking water distribution systems globally to develop a Drinking Water Genome Catalog (DWGC). The current DWGC disproportionately represents disinfected drinking water systems due to a paucity of metagenomes from nondisinfected systems. Using the DWGC, we identify core genera of the drinking water microbiome including a genus (UBA4765) within the order Rhizobiales that is frequently detected and highly abundant in disinfected drinking water systems. We demonstrate that this genus has been widely detected but incorrectly classified in previous amplicon sequencing-based investigations of the drinking water microbiome. Further, we show that a single genome variant (genomovar) within this genus is detected in 75% of drinking water systems included in this study. We propose a name for this uncultured bacterium as "Raskinella chloraquaticus" and describe the genus as "Raskinella" (endorsed by SeqCode). Metabolic annotation and modeling-based predictions indicate that this bacterium is capable of necrotrophic growth, is able to metabolize halogenated compounds, proliferates in a biofilm-based environment, and shows clear indications of disinfection-mediated selection.
Collapse
Affiliation(s)
- Ashwin S Sudarshan
- School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Zihan Dai
- School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Marco Gabrielli
- Department of Environmental Microbiology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dubendorf CH-8600, Switzerland
| | - Solize Oosthuizen-Vosloo
- Institute for Cellular and Molecular Medicine, Department of Immunology, Faculty of Health Sciences, University of Pretoria, Pretoria 0084, South Africa
| | - Konstantinos T Konstantinidis
- School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Ameet J Pinto
- School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
- School of Earth and Atmospheric Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
4
|
Krinos AI, Bowers RM, Rohwer RR, McMahon KD, Woyke T, Schulz F. Time-series metagenomics reveals changing protistan ecology of a temperate dimictic lake. MICROBIOME 2024; 12:133. [PMID: 39030632 PMCID: PMC11265017 DOI: 10.1186/s40168-024-01831-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 05/06/2024] [Indexed: 07/21/2024]
Abstract
BACKGROUND Protists, single-celled eukaryotic organisms, are critical to food web ecology, contributing to primary productivity and connecting small bacteria and archaea to higher trophic levels. Lake Mendota is a large, eutrophic natural lake that is a Long-Term Ecological Research site and among the world's best-studied freshwater systems. Metagenomic samples have been collected and shotgun sequenced from Lake Mendota for the last 20 years. Here, we analyze this comprehensive time series to infer changes to the structure and function of the protistan community and to hypothesize about their interactions with bacteria. RESULTS Based on small subunit rRNA genes extracted from the metagenomes and metagenome-assembled genomes of microeukaryotes, we identify shifts in the eukaryotic phytoplankton community over time, which we predict to be a consequence of reduced zooplankton grazing pressures after the invasion of a invasive predator (the spiny water flea) to the lake. The metagenomic data also reveal the presence of the spiny water flea and the zebra mussel, a second invasive species to Lake Mendota, prior to their visual identification during routine monitoring. Furthermore, we use species co-occurrence and co-abundance analysis to connect the protistan community with bacterial taxa. Correlation analysis suggests that protists and bacteria may interact or respond similarly to environmental conditions. Cryptophytes declined in the second decade of the timeseries, while many alveolate groups (e.g., ciliates and dinoflagellates) and diatoms increased in abundance, changes that have implications for food web efficiency in Lake Mendota. CONCLUSIONS We demonstrate that metagenomic sequence-based community analysis can complement existing efforts to monitor protists in Lake Mendota based on microscopy-based count surveys. We observed patterns of seasonal abundance in microeukaryotes in Lake Mendota that corroborated expectations from other systems, including high abundance of cryptophytes in winter and diatoms in fall and spring, but with much higher resolution than previous surveys. Our study identified long-term changes in the abundance of eukaryotic microbes and provided context for the known establishment of an invasive species that catalyzes a trophic cascade involving protists. Our findings are important for decoding potential long-term consequences of human interventions, including invasive species introduction. Video Abstract.
Collapse
Affiliation(s)
- Arianna I Krinos
- Department of Biology, Woods Hole Oceanographic Institution, Woods Hole, MA, USA.
- Department of Earth, Atmospheric, and Planetary Science, Massachusetts Institute of Technology, Cambridge, MA, USA.
- MIT-WHOI Joint Program in Oceanography/Applied Ocean Science and Engineering, Cambridge, Woods Hole, MA, USA.
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Robert M Bowers
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Robin R Rohwer
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
| | - Katherine D McMahon
- Department of Bacteriology, University of Wisconsin at Madison, Madison, WI, USA
| | - Tanja Woyke
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Frederik Schulz
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
5
|
Cantin LJ, Dunning Hotopp JC, Foster JM. Improved metagenome assemblies through selective enrichment of bacterial genomic DNA from eukaryotic host genomic DNA using ATAC-seq. Front Microbiol 2024; 15:1352378. [PMID: 38426058 PMCID: PMC10902005 DOI: 10.3389/fmicb.2024.1352378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Accepted: 02/05/2024] [Indexed: 03/02/2024] Open
Abstract
Genomics can be used to study the complex relationships between hosts and their microbiota. Many bacteria cannot be cultured in the laboratory, making it difficult to obtain adequate amounts of bacterial DNA and to limit host DNA contamination for the construction of metagenome-assembled genomes (MAGs). For example, Wolbachia is a genus of exclusively obligate intracellular bacteria that live in a wide range of arthropods and some nematodes. While Wolbachia endosymbionts are frequently described as facultative reproductive parasites in arthropods, the bacteria are obligate mutualistic endosymbionts of filarial worms. Here, we achieve 50-fold enrichment of bacterial sequences using ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing) with Brugia malayi nematodes, containing Wolbachia (wBm). ATAC-seq uses the Tn5 transposase to cut and attach Illumina sequencing adapters to accessible DNA lacking histones, typically thought to be open chromatin. Bacterial and mitochondrial DNA in the lysates are also cut preferentially since they lack histones, leading to the enrichment of these sequences. The benefits of this include minimal tissue input (<1 mg of tissue), a quick protocol (<4 h), low sequencing costs, less bias, correct assembly of lateral gene transfers and no prior sequence knowledge required. We assembled the wBm genome with as few as 1 million Illumina short paired-end reads with >97% coverage of the published genome, compared to only 12% coverage with the standard gDNA libraries. We found significant bacterial sequence enrichment that facilitated genome assembly in previously published ATAC-seq data sets from human cells infected with Mycobacterium tuberculosis and C. elegans contaminated with their food source, the OP50 strain of E. coli. These results demonstrate the feasibility and benefits of using ATAC-seq to easily obtain bacterial genomes to aid in symbiosis, infectious disease, and microbiome research.
Collapse
Affiliation(s)
- Lindsey J. Cantin
- Biochemistry and Microbiology Division, New England BioLabs, Ipswich, MA, United States
| | - Julie C. Dunning Hotopp
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, United States
| | - Jeremy M. Foster
- Biochemistry and Microbiology Division, New England BioLabs, Ipswich, MA, United States
| |
Collapse
|
6
|
Duarte VDS, Porcellato D. Host DNA depletion methods and genome-centric metagenomics of bovine hindmilk microbiome. mSphere 2024; 9:e0047023. [PMID: 38054728 PMCID: PMC10826364 DOI: 10.1128/msphere.00470-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 10/20/2023] [Indexed: 12/07/2023] Open
Abstract
Bovine mastitis is a multi-etiological and complex disease, resulting in serious economic consequences for dairy farmers and industry. In recent years, the microbiological evaluation of raw milk has been investigated in-depth using next-generation sequencing approaches such as metataxonomic analysis. Despite this, host DNA is a major concern in the shotgun metagenomic sequencing of microbial communities in milk samples, and it represents a big challenge. In this study, we aimed to evaluate different methods for host DNA depletion and/or microbial DNA enrichment and assess the use of PCR-based whole genome amplification in milk samples with high somatic cell count (SCC) by using short- and long-read sequencing technologies. Our results evidenced that DNA extraction performed differently in terms of host DNA removal, impacting metagenome composition and functional profiles.. Moreover, the ratio of SCC/bacteria ultimately impacts microbial DNA yield, and samples with low SCC (SCC below 100,000 cells/mL) are the most problematic. When milk samples with high SCC (SCC above 200,000 cells/mL) underwent multiple-displacement amplification (MDA), we successfully recovered high-quality metagenome-assembled genomes (MAGs), and long-read sequencing was feasible even for samples with low DNA concentration. By associating MDA and short-read sequencing, we recovered two times more MAGs than in untreated samples, and an ongoing co-infection not reported by traditional methods was detected for mastitis pathogen. Overall, this new approach will improve the detection of mastitis-associated microorganisms and make it possible to examine host-microbiome interactions in bovine mastitis.IMPORTANCENext-generation sequencing technologies have been widely used to gain new insights into the diversity of the microbial community of milk samples and dairy products for different purposes such as microbial safety, profiling of starter cultures, and host-microbiome interactions. Milk is a complex food matrix, and additionally, the presence of host nucleic acid sequences is considered a contaminant in untargeted high-throughput sequencing studies. Therefore, genomic-centric metagenomic studies of milk samples focusing on the health-disease status in dairy cattle are still scarce, which makes it difficult to evaluate the microbial ecophysiology of bovine hindmilk. This study provides an alternative method for genome-centric metagenome studies applied to hindmilk samples with high somatic cell content, which is indispensable to examining host-microbiome interactions in bovine mastitis.
Collapse
Affiliation(s)
- Vinícius da Silva Duarte
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | - Davide Porcellato
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| |
Collapse
|
7
|
Zhang RM, Lian XL, Shi LW, Jiang L, Chen SS, Haung WQ, Wu JE, Wu FJ, Sun J, Liao XP, Chong YX, Liu YH, Jiang C. Dynamic human exposure to airborne bacteria-associated antibiotic resistomes revealed by longitudinal personal monitoring data. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 904:166799. [PMID: 37673270 DOI: 10.1016/j.scitotenv.2023.166799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 08/26/2023] [Accepted: 09/01/2023] [Indexed: 09/08/2023]
Abstract
Airborne antibiotic-resistant bacteria (ARB) can critically impact human health. We performed resistome profiling of 283 personal airborne exposure samples from 15 participants spanning 890 days and 66 locations. We found a greater diversity and abundance of airborne bacteria community and antibiotic resistomes in spring than in winter, and temperature contributed largely to the difference. A total of 1123 bacterial genera were detected, with 16 genera dominating. Of which, 7/16 were annotated as major antibiotic resistance gene (ARG) hosts. The participants were exposed to a highly dynamic collection of ARGs, including 322 subtypes conferring resistance to 18 antibiotic classes dominated by multidrug, macrolide-lincosamide-streptogramin, β-lactam, and fosfomycin. Unlike the overall community-level bacteria exposure, an extremely high abundance of specific ARG subtypes, including lunA and qacG, were found in some samples. Staphylococcus was the predominant genus in the bacterial community, serving as a primary bacterial host for the ARGs. The annotation of ARG-carrying contigs indicated that humans and companion animals were major reservoirs for ARG-carrying Staphylococcus. This study contextualized airborne antibiotic resistomes in the precision medicine framework through longitudinal personal monitoring, which can have broad implications for human health.
Collapse
Affiliation(s)
- Rong-Min Zhang
- Guangdong Provincial Key Laboratory of Veterinary Pharmaceutics Development and Safety Evaluation, South China Agricultural University, China; Guangdong Laboratory for Lingnan Modern Agriculture, National Risk Assessment Laboratory for Antimicrobial Resistance of Animal Original Bacteria, College of Veterinary Medicine, South China Agricultural University, Guangzhou, China
| | - Xin-Lei Lian
- Guangdong Provincial Key Laboratory of Veterinary Pharmaceutics Development and Safety Evaluation, South China Agricultural University, China; Guangdong Laboratory for Lingnan Modern Agriculture, National Risk Assessment Laboratory for Antimicrobial Resistance of Animal Original Bacteria, College of Veterinary Medicine, South China Agricultural University, Guangzhou, China
| | - Li-Wei Shi
- Guangdong Provincial Key Laboratory of Veterinary Pharmaceutics Development and Safety Evaluation, South China Agricultural University, China; Guangdong Laboratory for Lingnan Modern Agriculture, National Risk Assessment Laboratory for Antimicrobial Resistance of Animal Original Bacteria, College of Veterinary Medicine, South China Agricultural University, Guangzhou, China
| | - Liuyiqi Jiang
- Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang, China
| | - Shan-Shan Chen
- Guangdong Provincial Key Laboratory of Veterinary Pharmaceutics Development and Safety Evaluation, South China Agricultural University, China; Guangdong Laboratory for Lingnan Modern Agriculture, National Risk Assessment Laboratory for Antimicrobial Resistance of Animal Original Bacteria, College of Veterinary Medicine, South China Agricultural University, Guangzhou, China
| | - Wen-Qing Haung
- Guangdong Provincial Key Laboratory of Veterinary Pharmaceutics Development and Safety Evaluation, South China Agricultural University, China; Guangdong Laboratory for Lingnan Modern Agriculture, National Risk Assessment Laboratory for Antimicrobial Resistance of Animal Original Bacteria, College of Veterinary Medicine, South China Agricultural University, Guangzhou, China
| | - Jia-En Wu
- Guangdong Provincial Key Laboratory of Veterinary Pharmaceutics Development and Safety Evaluation, South China Agricultural University, China; Guangdong Laboratory for Lingnan Modern Agriculture, National Risk Assessment Laboratory for Antimicrobial Resistance of Animal Original Bacteria, College of Veterinary Medicine, South China Agricultural University, Guangzhou, China
| | - Fei-Jing Wu
- School of Life Sciences, South China Normal University, Guangzhou 510642, China
| | - Jian Sun
- Guangdong Provincial Key Laboratory of Veterinary Pharmaceutics Development and Safety Evaluation, South China Agricultural University, China; Guangdong Laboratory for Lingnan Modern Agriculture, National Risk Assessment Laboratory for Antimicrobial Resistance of Animal Original Bacteria, College of Veterinary Medicine, South China Agricultural University, Guangzhou, China
| | - Xiao-Ping Liao
- Guangdong Provincial Key Laboratory of Veterinary Pharmaceutics Development and Safety Evaluation, South China Agricultural University, China; Guangdong Laboratory for Lingnan Modern Agriculture, National Risk Assessment Laboratory for Antimicrobial Resistance of Animal Original Bacteria, College of Veterinary Medicine, South China Agricultural University, Guangzhou, China
| | - Yun-Xiao Chong
- Guangdong Provincial Key Laboratory of Agricultural & Rural Pollution Abatement and Environmental Safety, College of Natural Resources and Environment, South China Agricultural University, Guangzhou 510642, China
| | - Ya-Hong Liu
- Guangdong Provincial Key Laboratory of Veterinary Pharmaceutics Development and Safety Evaluation, South China Agricultural University, China; Guangdong Laboratory for Lingnan Modern Agriculture, National Risk Assessment Laboratory for Antimicrobial Resistance of Animal Original Bacteria, College of Veterinary Medicine, South China Agricultural University, Guangzhou, China
| | - Chao Jiang
- Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang, China.
| |
Collapse
|
8
|
Davison HR, Hurst GDD. Hidden from plain sight: Novel Simkaniaceae and Rhabdochlamydiaceae diversity emerging from screening genomic and metagenomic data. Syst Appl Microbiol 2023; 46:126468. [PMID: 37847957 DOI: 10.1016/j.syapm.2023.126468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 09/21/2023] [Accepted: 09/22/2023] [Indexed: 10/19/2023]
Abstract
Chlamydiota are an ancient and hyperdiverse phylum of obligate intracellular bacteria. The best characterized representatives are pathogens or parasites of mammals, but it is thought that their most common hosts are microeukaryotes like Amoebozoa. The diversity in taxonomy, evolution, and function of non-pathogenic Chlamydiota are slowly being described. Here we use data mining techniques and genomic analysis to extend our current knowledge of Chlamydiota diversity and its hosts, in particular the Order Parachlamydiales. We extract one Rhabdochlamydiaceae and three Simkaniaceae Metagenome-Assembled Genomes (MAGs) from NCBI Short Read Archive deposits of ciliate and algal genome sequencing projects. We then use these to identify a further 14 and 8 MAGs respectively amongst existing, unidentified environmental assemblies. From these data we identify two novel clades with host associated data, for which we propose the names "Sacchlamyda saccharinae" (Family Rhabdochlamydiaceae) and "Amphrikana amoebophyrae" (Family Simkaniaceae), as well as a third new clade of environmental MAGs "Acheromyda pituitae" (Family Rhabdochlamydiaceae). The extent of uncharacterized diversity within the Rhabdochlamydiaceae and Simkaniaceae is indicated by 16 of the 22 MAGs being evolutionarily distant from currently characterised genera. Within our limited data, there was great predicted diversity in Parachlamydiales metabolism and evolution, including the potential for metabolic and defensive symbioses as well as pathogenicity. These data provide an imperative to link genomic diversity in metagenomics data to their associated eukaryotic host, and to develop onward understanding of the functional significance of symbiosis with this hyperdiverse clade.
Collapse
Affiliation(s)
- Helen R Davison
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Crown Street, Liverpool L69 7ZB UK.
| | - Gregory D D Hurst
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Crown Street, Liverpool L69 7ZB UK
| |
Collapse
|
9
|
Schiml VC, Delogu F, Kumar P, Kunath B, Batut B, Mehta S, Johnson JE, Grüning B, Pope PB, Jagtap PD, Griffin TJ, Arntzen MØ. Integrative meta-omics in Galaxy and beyond. ENVIRONMENTAL MICROBIOME 2023; 18:56. [PMID: 37420292 PMCID: PMC10329324 DOI: 10.1186/s40793-023-00514-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 07/05/2023] [Indexed: 07/09/2023]
Abstract
BACKGROUND 'Omics methods have empowered scientists to tackle the complexity of microbial communities on a scale not attainable before. Individually, omics analyses can provide great insight; while combined as "meta-omics", they enhance the understanding of which organisms occupy specific metabolic niches, how they interact, and how they utilize environmental nutrients. Here we present three integrative meta-omics workflows, developed in Galaxy, for enhanced analysis and integration of metagenomics, metatranscriptomics, and metaproteomics, combined with our newly developed web-application, ViMO (Visualizer for Meta-Omics) to analyse metabolisms in complex microbial communities. RESULTS In this study, we applied the workflows on a highly efficient cellulose-degrading minimal consortium enriched from a biogas reactor to analyse the key roles of uncultured microorganisms in complex biomass degradation processes. Metagenomic analysis recovered metagenome-assembled genomes (MAGs) for several constituent populations including Hungateiclostridium thermocellum, Thermoclostridium stercorarium and multiple heterogenic strains affiliated to Coprothermobacter proteolyticus. The metagenomics workflow was developed as two modules, one standard, and one optimized for improving the MAG quality in complex samples by implementing a combination of single- and co-assembly, and dereplication after binning. The exploration of the active pathways within the recovered MAGs can be visualized in ViMO, which also provides an overview of the MAG taxonomy and quality (contamination and completeness), and information about carbohydrate-active enzymes (CAZymes), as well as KEGG annotations and pathways, with counts and abundances at both mRNA and protein level. To achieve this, the metatranscriptomic reads and metaproteomic mass-spectrometry spectra are mapped onto predicted genes from the metagenome to analyse the functional potential of MAGs, as well as the actual expressed proteins and functions of the microbiome, all visualized in ViMO. CONCLUSION Our three workflows for integrative meta-omics in combination with ViMO presents a progression in the analysis of 'omics data, particularly within Galaxy, but also beyond. The optimized metagenomics workflow allows for detailed reconstruction of microbial community consisting of MAGs with high quality, and thus improves analyses of the metabolism of the microbiome, using the metatranscriptomics and metaproteomics workflows.
Collapse
Affiliation(s)
- Valerie C Schiml
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), P.O. Box 5003, 1432, Ås, Norway
| | - Francesco Delogu
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), P.O. Box 5003, 1432, Ås, Norway
| | - Praveen Kumar
- Department of Biochemistry, Biophysics and Molecular Biology, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Benoit Kunath
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), P.O. Box 5003, 1432, Ås, Norway
| | - Bérénice Batut
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg, Germany
| | - Subina Mehta
- Department of Biochemistry, Biophysics and Molecular Biology, University of Minnesota, Minneapolis, MN, 55455, USA
| | - James E Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Björn Grüning
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg, Germany
| | - Phillip B Pope
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), P.O. Box 5003, 1432, Ås, Norway
- Faculty of Biosciences, Norwegian University of Life Sciences (NMBU), P.O. Box 5003, 1432, Ås, Norway
| | - Pratik D Jagtap
- Department of Biochemistry, Biophysics and Molecular Biology, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Timothy J Griffin
- Department of Biochemistry, Biophysics and Molecular Biology, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Magnus Ø Arntzen
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), P.O. Box 5003, 1432, Ås, Norway.
| |
Collapse
|
10
|
Li B, Yan T. Metagenomic next generation sequencing for studying antibiotic resistance genes in the environment. ADVANCES IN APPLIED MICROBIOLOGY 2023; 123:41-89. [PMID: 37400174 DOI: 10.1016/bs.aambs.2023.05.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/05/2023]
Abstract
Bacterial antimicrobial resistance (AMR) is a persisting and growing threat to human health. Characterization of antibiotic resistance genes (ARGs) in the environment is important to understand and control ARG-associated microbial risks. Numerous challenges exist in monitoring ARGs in the environment, due to the extraordinary diversity of ARGs, low abundance of ARGs with respect to the complex environmental microbiomes, difficulties in linking ARGs with bacterial hosts by molecular methods, difficulties in achieving quantification and high throughput simultaneously, difficulties in assessing mobility potential of ARGs, and difficulties in determining the specific AMR determinant genes. Advances in the next generation sequencing (NGS) technologies and related computational and bioinformatic tools are facilitating rapid identification and characterization ARGs in genomes and metagenomes from environmental samples. This chapter discusses NGS-based strategies, including amplicon-based sequencing, whole genome sequencing, bacterial population-targeted metagenome sequencing, metagenomic NGS, quantitative metagenomic sequencing, and functional/phenotypic metagenomic sequencing. Current bioinformatic tools for analyzing sequencing data for studying environmental ARGs are also discussed.
Collapse
Affiliation(s)
- Bo Li
- Department of Civil and Environmental Engineering, University of Hawaii at Manoa, Honolulu, HI, United States
| | - Tao Yan
- Department of Civil and Environmental Engineering, University of Hawaii at Manoa, Honolulu, HI, United States.
| |
Collapse
|
11
|
Gabrielli M, Dai Z, Delafont V, Timmers PHA, van der Wielen PWJJ, Antonelli M, Pinto AJ. Identifying Eukaryotes and Factors Influencing Their Biogeography in Drinking Water Metagenomes. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:3645-3660. [PMID: 36827617 PMCID: PMC9996835 DOI: 10.1021/acs.est.2c09010] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/13/2023] [Accepted: 02/13/2023] [Indexed: 06/18/2023]
Abstract
The biogeography of eukaryotes in drinking water systems is poorly understood relative to that of prokaryotes or viruses, limiting the understanding of their role and management. A challenge with studying complex eukaryotic communities is that metagenomic analysis workflows are currently not as mature as those that focus on prokaryotes or viruses. In this study, we benchmarked different strategies to recover eukaryotic sequences and genomes from metagenomic data and applied the best-performing workflow to explore the factors affecting the relative abundance and diversity of eukaryotic communities in drinking water distribution systems (DWDSs). We developed an ensemble approach exploiting k-mer- and reference-based strategies to improve eukaryotic sequence identification and identified MetaBAT2 as the best-performing binning approach for their clustering. Applying this workflow to the DWDS metagenomes showed that eukaryotic sequences typically constituted small proportions (i.e., <1%) of the overall metagenomic data with higher relative abundances in surface water-fed or chlorinated systems with high residuals. The α and β diversities of eukaryotes were correlated with those of prokaryotic and viral communities, highlighting the common role of environmental/management factors. Finally, a co-occurrence analysis highlighted clusters of eukaryotes whose members' presence and abundance in DWDSs were affected by disinfection strategies, climate conditions, and source water types.
Collapse
Affiliation(s)
- Marco Gabrielli
- Dipartimento
di Ingegneria Civile e Ambientale—Sezione Ambientale, Politecnico di Milano, Milan 20133, Italy
| | - Zihan Dai
- Research
Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| | - Vincent Delafont
- Laboratoire
Ecologie et Biologie des Interactions (EBI), Equipe Microorganismes,
Hôtes, Environnements, Université
de Poitiers, Poitiers 86073, France
| | - Peer H. A. Timmers
- KWR
Watercycle Research Institute, 3433 PE Nieuwegein, The Netherlands
- Department
of Microbiology, Radboud University, Heyendaalseweg 135, 6525 AJ Nijmegen, The Netherlands
| | - Paul W. J. J. van der Wielen
- KWR
Watercycle Research Institute, 3433 PE Nieuwegein, The Netherlands
- Laboratory
of Microbiology, Wageningen University, 6700 HB Wageningen, The Netherlands
| | - Manuela Antonelli
- Dipartimento
di Ingegneria Civile e Ambientale—Sezione Ambientale, Politecnico di Milano, Milan 20133, Italy
| | - Ameet J. Pinto
- School
of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
12
|
Vosloo S, Huo L, Chauhan U, Cotto I, Gincley B, Vilardi KJ, Yoon B, Bian K, Gabrielli M, Pieper KJ, Stubbins A, Pinto AJ. Gradual Recovery of Building Plumbing-Associated Microbial Communities after Extended Periods of Altered Water Demand during the COVID-19 Pandemic. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:3248-3259. [PMID: 36795589 PMCID: PMC9969676 DOI: 10.1021/acs.est.2c07333] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 02/06/2023] [Accepted: 02/06/2023] [Indexed: 06/18/2023]
Abstract
COVID-19 pandemic-related building restrictions heightened drinking water microbiological safety concerns post-reopening due to the unprecedented nature of commercial building closures. Starting with phased reopening (i.e., June 2020), we sampled drinking water for 6 months from three commercial buildings with reduced water usage and four occupied residential households. Samples were analyzed using flow cytometry and full-length 16S rRNA gene sequencing along with comprehensive water chemistry characterization. Prolonged building closures resulted in 10-fold higher microbial cell counts in the commercial buildings [(2.95 ± 3.67) × 105 cells mL-1] than in residential households [(1.11 ± 0.58) × 104 cells mL-1] with majority intact cells. While flushing reduced cell counts and increased disinfection residuals, microbial communities in commercial buildings remained distinct from those in residential households on the basis of flow cytometric fingerprinting [Bray-Curtis dissimilarity (dBC) = 0.33 ± 0.07] and 16S rRNA gene sequencing (dBC = 0.72 ± 0.20). An increase in water demand post-reopening resulted in gradual convergence in microbial communities in water samples collected from commercial buildings and residential households. Overall, we find that the gradual recovery of water demand played a key role in the recovery of building plumbing-associated microbial communities as compared to short-term flushing after extended periods of reduced water demand.
Collapse
Affiliation(s)
- Solize Vosloo
- Department
of Civil and Environmental Engineering, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 021115, United States
| | - Linxuan Huo
- School
of Civil and Environmental Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30318, United States
| | - Umang Chauhan
- Department
of Civil and Environmental Engineering, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 021115, United States
| | - Irmarie Cotto
- Department
of Civil and Environmental Engineering, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 021115, United States
| | - Benjamin Gincley
- School
of Civil and Environmental Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30318, United States
| | - Katherine J. Vilardi
- Department
of Civil and Environmental Engineering, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 021115, United States
| | - Bryan Yoon
- Department
of Civil and Environmental Engineering, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 021115, United States
| | - Kaiqin Bian
- School
of Civil and Environmental Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30318, United States
| | - Marco Gabrielli
- Dipartimento
di Ingegneria Civile e Ambientale - Sezione Ambientale, Politecnico di Milano, 20133 Milan, Italy
| | - Kelsey J. Pieper
- Department
of Civil and Environmental Engineering, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 021115, United States
| | - Aron Stubbins
- Department
of Civil and Environmental Engineering, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 021115, United States
| | - Ameet J. Pinto
- School
of Civil and Environmental Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30318, United States
| |
Collapse
|
13
|
Pillay S, Calderón-Franco D, Urhan A, Abeel T. Metagenomic-based surveillance systems for antibiotic resistance in non-clinical settings. Front Microbiol 2022; 13:1066995. [PMID: 36532424 PMCID: PMC9755710 DOI: 10.3389/fmicb.2022.1066995] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 11/09/2022] [Indexed: 08/12/2023] Open
Abstract
The success of antibiotics as a therapeutic agent has led to their ineffectiveness. The continuous use and misuse in clinical and non-clinical areas have led to the emergence and spread of antibiotic-resistant bacteria and its genetic determinants. This is a multi-dimensional problem that has now become a global health crisis. Antibiotic resistance research has primarily focused on the clinical healthcare sectors while overlooking the non-clinical sectors. The increasing antibiotic usage in the environment - including animals, plants, soil, and water - are drivers of antibiotic resistance and function as a transmission route for antibiotic resistant pathogens and is a source for resistance genes. These natural compartments are interconnected with each other and humans, allowing the spread of antibiotic resistance via horizontal gene transfer between commensal and pathogenic bacteria. Identifying and understanding genetic exchange within and between natural compartments can provide insight into the transmission, dissemination, and emergence mechanisms. The development of high-throughput DNA sequencing technologies has made antibiotic resistance research more accessible and feasible. In particular, the combination of metagenomics and powerful bioinformatic tools and platforms have facilitated the identification of microbial communities and has allowed access to genomic data by bypassing the need for isolating and culturing microorganisms. This review aimed to reflect on the different sequencing techniques, metagenomic approaches, and bioinformatics tools and pipelines with their respective advantages and limitations for antibiotic resistance research. These approaches can provide insight into resistance mechanisms, the microbial population, emerging pathogens, resistance genes, and their dissemination. This information can influence policies, develop preventative measures and alleviate the burden caused by antibiotic resistance.
Collapse
Affiliation(s)
- Stephanie Pillay
- Delft Bioinformatics Lab, Delft University of Technology, Delft, Netherlands
| | | | - Aysun Urhan
- Delft Bioinformatics Lab, Delft University of Technology, Delft, Netherlands
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, United States
| | - Thomas Abeel
- Delft Bioinformatics Lab, Delft University of Technology, Delft, Netherlands
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, United States
| |
Collapse
|
14
|
Lugli GA, Longhi G, Mancabelli L, Alessandri G, Tarracchini C, Fontana F, Turroni F, Milani C, van Sinderen D, Ventura M. Tap water as a natural vehicle for microorganisms shaping the human gut microbiome. Environ Microbiol 2022; 24:3912-3923. [PMID: 35355372 PMCID: PMC9790288 DOI: 10.1111/1462-2920.15988] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 03/24/2022] [Accepted: 03/25/2022] [Indexed: 12/30/2022]
Abstract
Fresh potable water is an indispensable drink which humans consume daily in substantial amounts. Nonetheless, very little is known about the composition of the microbial community inhabiting drinking water or its impact on our gut microbiota. In the current study, an exhaustive shotgun metagenomics analysis of the tap water microbiome highlighted the occurrence of a highly genetic biodiversity of the microbial communities residing in fresh water and the existence of a conserved core tap water microbiota largely represented by novel microbial species, representing microbial dark matter. Furthermore, genome reconstruction of this microbial dark matter from water samples unveiled homologous sequences present in the faecal microbiome of humans from various geographical locations. Accordingly, investigation of the faecal microbiota content of a subject that daily consumed tap water for 3 years provides proof for horizontal transmission and colonization of water bacteria in the human gut.
Collapse
Affiliation(s)
- Gabriele Andrea Lugli
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental SustainabilityUniversity of ParmaParmaItaly
| | - Giulia Longhi
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental SustainabilityUniversity of ParmaParmaItaly,GenProbio SrlParmaItaly
| | - Leonardo Mancabelli
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental SustainabilityUniversity of ParmaParmaItaly
| | - Giulia Alessandri
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental SustainabilityUniversity of ParmaParmaItaly
| | - Chiara Tarracchini
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental SustainabilityUniversity of ParmaParmaItaly
| | - Federico Fontana
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental SustainabilityUniversity of ParmaParmaItaly,GenProbio SrlParmaItaly
| | - Francesca Turroni
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental SustainabilityUniversity of ParmaParmaItaly,Microbiome Research HubUniversity of ParmaParmaItaly
| | - Christian Milani
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental SustainabilityUniversity of ParmaParmaItaly,Microbiome Research HubUniversity of ParmaParmaItaly
| | - Douwe van Sinderen
- APC Microbiome Institute and School of Microbiology, Bioscience Institute, National University of IrelandCorkIreland
| | - Marco Ventura
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental SustainabilityUniversity of ParmaParmaItaly,Microbiome Research HubUniversity of ParmaParmaItaly
| |
Collapse
|
15
|
Churcheward B, Millet M, Bihouée A, Fertin G, Chaffron S. MAGNETO: An Automated Workflow for Genome-Resolved Metagenomics. mSystems 2022; 7:e0043222. [PMID: 35703559 PMCID: PMC9426564 DOI: 10.1128/msystems.00432-22] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 05/06/2022] [Indexed: 12/24/2022] Open
Abstract
Metagenome-assembled genomes (MAGs) represent individual genomes recovered from metagenomic data. MAGs are extremely useful to analyze uncultured microbial genomic diversity, as well as to characterize associated functional and metabolic potential in natural environments. Recent computational developments have considerably improved MAG reconstruction but also emphasized several limitations, such as the nonbinning of sequence regions with repetitions or distinct nucleotidic composition. Different assembly and binning strategies are often used; however, it still remains unclear which assembly strategy, in combination with which binning approach, offers the best performance for MAG recovery. Several workflows have been proposed in order to reconstruct MAGs, but users are usually limited to single-metagenome assembly or need to manually define sets of metagenomes to coassemble prior to genome binning. Here, we present MAGNETO, an automated workflow dedicated to MAG reconstruction, which includes a fully-automated coassembly step informed by optimal clustering of metagenomic distances, and implements complementary genome binning strategies, for improving MAG recovery. MAGNETO is implemented as a Snakemake workflow and is available at: https://gitlab.univ-nantes.fr/bird_pipeline_registry/magneto. IMPORTANCE Genome-resolved metagenomics has led to the discovery of previously untapped biodiversity within the microbial world. As the development of computational methods for the recovery of genomes from metagenomes continues, existing strategies need to be evaluated and compared to eventually lead to standardized computational workflows. In this study, we compared commonly used assembly and binning strategies and assessed their performance using both simulated and real metagenomic data sets. We propose a novel approach to automate coassembly, avoiding the requirement for a priori knowledge to combine metagenomic information. The comparison against a previous coassembly approach demonstrates a strong impact of this step on genome binning results, but also the benefits of informing coassembly for improving the quality of recovered genomes. MAGNETO integrates complementary assembly-binning strategies to optimize genome reconstruction and provides a complete reads-to-genomes workflow for the growing microbiome research community.
Collapse
Affiliation(s)
| | - Maxime Millet
- Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, Nantes, France
| | - Audrey Bihouée
- Nantes Université, CNRS, INSERM, l’institut du thorax, F-44000 Nantes, France
- Nantes Université, CHU Nantes, SFR Bonamy, F-44000 Nantes, France
| | - Guillaume Fertin
- Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, Nantes, France
| | - Samuel Chaffron
- Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, Nantes, France
- Research Federation for the study of Global Ocean Systems Ecology and Evolution, FR2022/Tara Oceans, Paris, France
| |
Collapse
|
16
|
Genome-Resolved Metaproteomics Decodes the Microbial and Viral Contributions to Coupled Carbon and Nitrogen Cycling in River Sediments. mSystems 2022; 7:e0051622. [PMID: 35861508 PMCID: PMC9426555 DOI: 10.1128/msystems.00516-22] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Rivers have a significant role in global carbon and nitrogen cycles, serving as a nexus for nutrient transport between terrestrial and marine ecosystems. Although rivers have a small global surface area, they contribute substantially to worldwide greenhouse gas emissions through microbially mediated processes within the river hyporheic zone. Despite this importance, research linking microbial and viral communities to specific biogeochemical reactions is still nascent in these sediment environments. To survey the metabolic potential and gene expression underpinning carbon and nitrogen biogeochemical cycling in river sediments, we collected an integrated data set of 33 metagenomes, metaproteomes, and paired metabolomes. We reconstructed over 500 microbial metagenome-assembled genomes (MAGs), which we dereplicated into 55 unique, nearly complete medium- and high-quality MAGs spanning 12 bacterial and archaeal phyla. We also reconstructed 2,482 viral genomic contigs, which were dereplicated into 111 viral MAGs (vMAGs) of >10 kb in size. As a result of integrating gene expression data with geochemical and metabolite data, we created a conceptual model that uncovered new roles for microorganisms in organic matter decomposition, carbon sequestration, nitrogen mineralization, nitrification, and denitrification. We show how these metabolic pathways, integrated through shared resource pools of ammonium, carbon dioxide, and inorganic nitrogen, could ultimately contribute to carbon dioxide and nitrous oxide fluxes from hyporheic sediments. Further, by linking viral MAGs to these active microbial hosts, we provide some of the first insights into viral modulation of river sediment carbon and nitrogen cycling. IMPORTANCE Here we created HUM-V (hyporheic uncultured microbial and viral), an annotated microbial and viral MAG catalog that captures strain and functional diversity encoded in these Columbia River sediment samples. Demonstrating its utility, this genomic inventory encompasses multiple representatives of dominant microbial and archaeal phyla reported in other river sediments and provides novel viral MAGs that can putatively infect these. Furthermore, we used HUM-V to recruit gene expression data to decipher the functional activities of these MAGs and reconstruct their active roles in Columbia River sediment biogeochemical cycling. Ultimately, we show the power of MAG-resolved multi-omics to uncover interactions and chemical handoffs in river sediments that shape an intertwined carbon and nitrogen metabolic network. The accessible microbial and viral MAGs in HUM-V will serve as a community resource to further advance more untargeted, activity-based measurements in these, and related, freshwater terrestrial-aquatic ecosystems.
Collapse
|
17
|
Lamurias A, Sereika M, Albertsen M, Hose K, Nielsen TD. Metagenomic binning with assembly graph embeddings. Bioinformatics 2022; 38:4481-4487. [PMID: 35972375 PMCID: PMC9525014 DOI: 10.1093/bioinformatics/btac557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 08/02/2022] [Accepted: 08/12/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Despite recent advancements in sequencing technologies and assembly methods, obtaining high-quality microbial genomes from metagenomic samples is still not a trivial task. Current metagenomic binners do not take full advantage of assembly graphs and are not optimized for long-read assemblies. Deep graph learning algorithms have been proposed in other fields to deal with complex graph data structures. The graph structure generated during the assembly process could be integrated with contig features to obtain better bins with deep learning. RESULTS We propose GraphMB, which uses graph neural networks to incorporate the assembly graph into the binning process. We test GraphMB on long-read datasets of different complexities, and compare the performance with other binners in terms of the number of High Quality (HQ) genome bins obtained. With our approach, we were able to obtain unique bins on all real datasets, and obtain more bins on most datasets. In particular, we obtained on average 17.5% more HQ bins when compared with state-of-the-art binners and 13.7% when aggregating the results of our binner with the others. These results indicate that a deep learning model can integrate contig-specific and graph-structure information to improve metagenomic binning. AVAILABILITY AND IMPLEMENTATION GraphMB is available from https://github.com/MicrobialDarkMatter/GraphMB. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Mantas Sereika
- Center for Microbial Communities, Department of Chemistry and Bioscience, Aalborg University, 9000 Aalborg, Denmark
| | | | | | | |
Collapse
|