Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sedlar K, Kupkova K, Provaznik I. Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics. Comput Struct Biotechnol J 2016;15:48-55. [PMID: 27980708 PMCID: PMC5148923 DOI: 10.1016/j.csbj.2016.11.005] [Citation(s) in RCA: 70] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Revised: 11/24/2016] [Accepted: 11/26/2016] [Indexed: 12/11/2022] Open

For:	Sedlar K, Kupkova K, Provaznik I. Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics. Comput Struct Biotechnol J 2016;15:48-55. [PMID: 27980708 PMCID: PMC5148923 DOI: 10.1016/j.csbj.2016.11.005] [Citation(s) in RCA: 70] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Revised: 11/24/2016] [Accepted: 11/26/2016] [Indexed: 12/11/2022] Open

Number

Cited by Other Article(s)

Mallawaarachchi V, Wickramarachchi A, Xue H, Papudeshi B, Grigson SR, Bouras G, Prahl RE, Kaphle A, Verich A, Talamantes-Becerra B, Dinsdale EA, Edwards RA. Solving genomic puzzles: computational methods for metagenomic binning. Brief Bioinform 2024;25:bbae372. [PMID: 39082646 PMCID: PMC11289683 DOI: 10.1093/bib/bbae372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 06/05/2024] [Accepted: 07/15/2024] [Indexed: 08/03/2024] Open

Affiliation(s)

Vijini Mallawaarachchi Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
Anuradha Wickramarachchi Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
Hansheng Xue School of Computing, National University of Singapore, Singapore 119077, Singapore
Bhavya Papudeshi Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
Susanna R Grigson Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
George Bouras Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, SA 5005, Australia The Department of Surgery—Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, SA 5011, Australia
Rosa E Prahl Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
Anubhav Kaphle Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
Andrey Verich Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia The Kirby Institute, The University of New South Wales, Randwick, Sydney, NSW 2052, Australia
Berenice Talamantes-Becerra Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
Elizabeth A Dinsdale Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
Robert A Edwards Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia

Collapse

Gnimpieba EZ, Hartman TW, Do T, Zylla J, Aryal S, Haas SJ, Agany DDM, Gurung BDS, Doe V, Yosufzai Z, Pan D, Campbell R, Huber VC, Sani R, Gadhamshetty V, Lushbough C. Biofilm marker discovery with cloud-based dockerized metagenomics analysis of microbial communities. Brief Bioinform 2024;25:bbae429. [PMID: 39266450 PMCID: PMC11392556 DOI: 10.1093/bib/bbae429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 08/04/2024] [Accepted: 08/16/2024] [Indexed: 09/14/2024] Open

Abstract

In an environment, microbes often work in communities to achieve most of their essential functions, including the production of essential nutrients. Microbial biofilms are communities of microbes that attach to a nonliving or living surface by embedding themselves into a self-secreted matrix of extracellular polymeric substances. These communities work together to enhance their colonization of surfaces, produce essential nutrients, and achieve their essential functions for growth and survival. They often consist of diverse microbes including bacteria, viruses, and fungi. Biofilms play a critical role in influencing plant phenotypes and human microbial infections. Understanding how these biofilms impact plant health, human health, and the environment is important for analyzing genotype-phenotype-driven rule-of-life functions. Such fundamental knowledge can be used to precisely control the growth of biofilms on a given surface. Metagenomics is a powerful tool for analyzing biofilm genomes through function-based gene and protein sequence identification (functional metagenomics) and sequence-based function identification (sequence metagenomics). Metagenomic sequencing enables a comprehensive sampling of all genes in all organisms present within a biofilm sample. However, the complexity of biofilm metagenomic study warrants the increasing need to follow the Findability, Accessibility, Interoperability, and Reusable (FAIR) Guiding Principles for scientific data management. This will ensure that scientific findings can be more easily validated by the research community. This study proposes a dockerized, self-learning bioinformatics workflow to increase the community adoption of metagenomics toolkits in a metagenomics and meta-transcriptomics investigation. Our biofilm metagenomics workflow self-learning module includes integrated learning resources with an interactive dockerized workflow. This module will allow learners to analyze resources that are beneficial for aggregating knowledge about biofilm marker genes, proteins, and metabolic pathways as they define the composition of specific microbial communities. Cloud and dockerized technology can allow novice learners-even those with minimal knowledge in computer science-to use complicated bioinformatics tools. Our cloud-based, dockerized workflow splits biofilm microbiome metagenomics analyses into four easy-to-follow submodules. A variety of tools are built into each submodule. As students navigate these submodules, they learn about each tool used to accomplish the task. The downstream analysis is conducted using processed data obtained from online resources or raw data processed via Nextflow pipelines. This analysis takes place within Vertex AI's Jupyter notebook instance with R and Python kernels. Subsequently, results are stored and visualized in Google Cloud storage buckets, alleviating the computational burden on local resources. The result is a comprehensive tutorial that guides bioinformaticians of any skill level through the entire workflow. It enables them to comprehend and implement the necessary processes involved in this integrated workflow from start to finish. This manuscript describes the development of a resource module that is part of a learning platform named "NIGMS Sandbox for Cloud-based Learning" https://github.com/NIGMS/NIGMS-Sandbox. The overall genesis of the Sandbox is described in the editorial NIGMS Sandbox [1] at the beginning of this Supplement. This module delivers learning materials on the analysis of bulk and single-cell ATAC-seq data in an interactive format that uses appropriate cloud resources for data access and analyses.

Collapse

Affiliation(s)

Etienne Z Gnimpieba Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States
Timothy W Hartman Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States
Tuyen Do Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States
Jessica Zylla Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States
Shiva Aryal Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States
Samuel J Haas Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States
Diing D M Agany Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States
Bichar Dip Shrestha Gurung Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States
Valena Doe Google Cloud, 1900 Reston Metro Plaza, Reston, Virginia, 20190, United States
Zelaikha Yosufzai Health Data and AI, Deloitte Consulting LLP, 1919 N Lynn St., Suite 1500, Arlington, Virginia, 22209, United States
Daniel Pan Health Data and AI, Deloitte Consulting LLP, 1919 N Lynn St., Suite 1500, Arlington, Virginia, 22209, United States
Ross Campbell Health Data and AI, Deloitte Consulting LLP, 1919 N Lynn St., Suite 1500, Arlington, Virginia, 22209, United States
Victor C Huber Basic Biomedical Sciences Division, University of South Dakota, 414 E. Clark St, Vermillion, South Dakota, 57069, United States
Rajesh Sani South Dakota School of Mines & Technology, 501 E. Saint Joseph St., Rapid City, South Dakota, 57701, United States
Venkataramana Gadhamshetty South Dakota School of Mines & Technology, 501 E. Saint Joseph St., Rapid City, South Dakota, 57701, United States
Carol Lushbough Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States

Collapse

Darabi A, Sobhani S, Aghdam R, Eslahchi C. AFITbin: a metagenomic contig binning method using aggregate l-mer frequency based on initial and terminal nucleotides. BMC Bioinformatics 2024;25:241. [PMID: 39014300 PMCID: PMC11253361 DOI: 10.1186/s12859-024-05859-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2023] [Accepted: 07/09/2024] [Indexed: 07/18/2024] Open

Abstract

BACKGROUND

Using next-generation sequencing technologies, scientists can sequence complex microbial communities directly from the environment. Significant insights into the structure, diversity, and ecology of microbial communities have resulted from the study of metagenomics. The assembly of reads into longer contigs, which are then binned into groups of contigs that correspond to different species in the metagenomic sample, is a crucial step in the analysis of metagenomics. It is necessary to organize these contigs into operational taxonomic units (OTUs) for further taxonomic profiling and functional analysis. For binning, which is synonymous with the clustering of OTUs, the tetra-nucleotide frequency (TNF) is typically utilized as a compositional feature for each OTU.

RESULTS

In this paper, we present AFIT, a new l-mer statistic vector for each contig, and AFITBin, a novel method for metagenomic binning based on AFIT and a matrix factorization method. To evaluate the performance of the AFIT vector, the t-SNE algorithm is used to compare species clustering based on AFIT and TNF information. In addition, the efficacy of AFITBin is demonstrated on both simulated and real datasets in comparison to state-of-the-art binning methods such as MetaBAT 2, MaxBin 2.0, CONCOT, MetaCon, SolidBin, BusyBee Web, and MetaBinner. To further analyze the performance of the purposed AFIT vector, we compare the barcodes of the AFIT vector and the TNF vector.

CONCLUSION

The results demonstrate that AFITBin shows superior performance in taxonomic identification compared to existing methods, leveraging the AFIT vector for improved results in metagenomic binning. This approach holds promise for advancing the analysis of metagenomic data, providing more reliable insights into microbial community composition and function.

AVAILABILITY

A python package is available at: https://github.com/SayehSobhani/AFITBin .

Collapse

Reynolds G, Mumey B, Strnadova‐Neeley V, Lachowiec J. Hijacking a rapid and scalable metagenomic method reveals subgenome dynamics and evolution in polyploid plants. APPLICATIONS IN PLANT SCIENCES 2024;12:e11581. [PMID: 39184200 PMCID: PMC11342227 DOI: 10.1002/aps3.11581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 11/26/2023] [Accepted: 12/20/2023] [Indexed: 08/27/2024]

Abstract

Premise

The genomes of polyploid plants archive the evolutionary events leading to their present forms. However, plant polyploid genomes present numerous hurdles to the genome comparison algorithms for classification of polyploid types and exploring genome dynamics.

Methods

Here, the problem of intra- and inter-genome comparison for examining polyploid genomes is reframed as a metagenomic problem, enabling the use of the rapid and scalable MinHashing approach. To determine how types of polyploidy are described by this metagenomic approach, plant genomes were examined from across the polyploid spectrum for both k-mer composition and frequency with a range of k-mer sizes. In this approach, no subgenome-specific k-mers are identified; rather, whole-chromosome k-mer subspaces were utilized.

Results

Given chromosome-scale genome assemblies with sufficient subgenome-specific repetitive element content, literature-verified subgenomic and genomic evolutionary relationships were revealed, including distinguishing auto- from allopolyploidy and putative progenitor genome assignment. The sequences responsible were the rapidly evolving landscape of transposable elements. An investigation into the MinHashing parameters revealed that the downsampled k-mer space (genomic signatures) produced excellent approximations of sequence similarity. Furthermore, the clustering approach used for comparison of the genomic signatures is scrutinized to ensure applicability of the metagenomics-based method.

Discussion

The easily implementable and highly computationally efficient MinHashing-based sequence comparison strategy enables comparative subgenomics and genomics for large and complex polyploid plant genomes. Such comparisons provide evidence for polyploidy-type subgenomic assignments. In cases where subgenome-specific repeat signal may not be adequate given a chromosomes' global k-mer profile, alternative methods that are more specific but more computationally complex outperform this approach.

Collapse

Hou S, Tang T, Cheng S, Liu Y, Xia T, Chen T, Fuhrman J, Sun F. DeepMicroClass sorts metagenomic contigs into prokaryotes, eukaryotes and viruses. NAR Genom Bioinform 2024;6:lqae044. [PMID: 38711860 PMCID: PMC11071121 DOI: 10.1093/nargab/lqae044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 03/18/2024] [Accepted: 04/18/2024] [Indexed: 05/08/2024] Open

Trecarten S, Fongang B, Liss M. Current Trends and Challenges of Microbiome Research in Prostate Cancer. Curr Oncol Rep 2024;26:477-487. [PMID: 38573440 DOI: 10.1007/s11912-024-01520-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/18/2024] [Indexed: 04/05/2024]

Kim C, Pongpanich M, Porntaveetus T. Unraveling metagenomics through long-read sequencing: a comprehensive review. J Transl Med 2024;22:111. [PMID: 38282030 PMCID: PMC10823668 DOI: 10.1186/s12967-024-04917-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 01/21/2024] [Indexed: 01/30/2024] Open

Wang Z, You R, Han H, Liu W, Sun F, Zhu S. Effective binning of metagenomic contigs using contrastive multi-view representation learning. Nat Commun 2024;15:585. [PMID: 38233391 PMCID: PMC10794208 DOI: 10.1038/s41467-023-44290-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 12/07/2023] [Indexed: 01/19/2024] Open

Feng T, Wu S, Zhou H, Fang Z. MOBFinder: a tool for mobilization typing of plasmid metagenomic fragments based on a language model. Gigascience 2024;13:giae047. [PMID: 39101782 PMCID: PMC11299106 DOI: 10.1093/gigascience/giae047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 05/31/2024] [Accepted: 06/24/2024] [Indexed: 08/06/2024] Open

Kleikamp HBC, Grouzdev D, Schaasberg P, van Valderen R, van der Zwaan R, Wijgaart RVD, Lin Y, Abbas B, Pronk M, van Loosdrecht MCM, Pabst M. Metaproteomics, metagenomics and 16S rRNA sequencing provide different perspectives on the aerobic granular sludge microbiome. WATER RESEARCH 2023;246:120700. [PMID: 37866247 DOI: 10.1016/j.watres.2023.120700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 09/29/2023] [Accepted: 10/04/2023] [Indexed: 10/24/2023]

Walsh LH, Coakley M, Walsh AM, O'Toole PW, Cotter PD. Bioinformatic approaches for studying the microbiome of fermented food. Crit Rev Microbiol 2023;49:693-725. [PMID: 36287644 DOI: 10.1080/1040841x.2022.2132850] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 08/11/2022] [Accepted: 09/28/2022] [Indexed: 11/03/2022]

Khan MA, Rahman AU, Khan B, Al-Mijalli SH, Alswat AS, Amin A, Eid RA, Zaki MSA, Butt S, Ahmad J, Fayad E, Ullah A. Antibiotic Resistance Profiling and Phylogenicity of Uropathogenic Bacteria Isolated from Patients with Urinary Tract Infections. Antibiotics (Basel) 2023;12:1508. [PMID: 37887209 PMCID: PMC10603882 DOI: 10.3390/antibiotics12101508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 09/16/2023] [Accepted: 09/20/2023] [Indexed: 10/28/2023] Open

Abstract

Urinary tract infections (UTIs) are healthcare problems that commonly involve bacterial and, in some rare instances, fungal or viral infections. The irrational prescription and use of antibiotics in UTI treatment have led to an increase in antibiotic resistance. Urine samples (145) were collected from male and female patients from Lower Dir, Khyber Pakhtunkhwa (KP), Pakistan. Biochemical analyses were carried out to identify uropathogens. Molecular analysis for the identification of 16S ribosomal RNA in samples was performed via Sanger sequencing. Evolutionary linkage was determined using Molecular Evolutionary Genetics Analysis-7 (MEGA-7). The study observed significant growth in 52% of the samples (83/145). Gram-negative bacteria were identified in 85.5% of samples, while Gram-positive bacteria were reported in 14.5%. The UTI prevalence was 67.5% in females and 32.5% in males. The most prevalent uropathogenic bacteria were Klebsiella pneumoniae (39.7%, 33/83), followed by Escherichia coli (27.7%, 23/83), Pseudomonas aeruginosa (10.8%, 9/83), Staphylococcus aureus (9.6%, 8/83), Proteus mirabilis (7.2%, 6/83) and Staphylococcus saprophyticus (4.8%, 4/83). Phylogenetic analysis was performed using the neighbor-joining method, further confirming the relation of the isolates in our study with previously reported uropathogenic isolates. Antibiotic susceptibility tests identified K. pneumonia as being sensitive to imipenem (100%) and fosfomycin (78.7%) and resistant to cefuroxime (100%) and ciprofloxacin (94%). Similarly, E. coli showed high susceptibility to imipenem (100%), fosfomycin (78.2%) and nitrofurantoin (78.2%), and resistance to ciprofloxacin (100%) and cefuroxime (100%). Imipenem was identified as the most effective antibiotic, while cefuroxime and ciprofloxacin were the least. The phylogenetic tree analysis indicated that K. pneumoniae, E. coli, P. aeruginosa, S. aureus and P. mirabilis clustered with each other and the reference sequences, indicating high similarity (based on 16S rRNA sequencing). It can be concluded that genetically varied uropathogenic organisms are commonly present within the KP population. Our findings demonstrate the need to optimize antibiotic use in treating UTIs and the prevention of antibiotic resistance in the KP population.

Collapse

Kishore D, Birzu G, Hu Z, DeLisi C, Korolev KS, Segrè D. Inferring microbial co-occurrence networks from amplicon data: a systematic evaluation. mSystems 2023;8:e0096122. [PMID: 37338270 PMCID: PMC10469762 DOI: 10.1128/msystems.00961-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 04/14/2023] [Indexed: 06/21/2023] Open

Abstract

Microbes commonly organize into communities consisting of hundreds of species involved in complex interactions with each other. 16S ribosomal RNA (16S rRNA) amplicon profiling provides snapshots that reveal the phylogenies and abundance profiles of these microbial communities. These snapshots, when collected from multiple samples, can reveal the co-occurrence of microbes, providing a glimpse into the network of associations in these communities. However, the inference of networks from 16S data involves numerous steps, each requiring specific tools and parameter choices. Moreover, the extent to which these steps affect the final network is still unclear. In this study, we perform a meticulous analysis of each step of a pipeline that can convert 16S sequencing data into a network of microbial associations. Through this process, we map how different choices of algorithms and parameters affect the co-occurrence network and identify the steps that contribute substantially to the variance. We further determine the tools and parameters that generate robust co-occurrence networks and develop consensus network algorithms based on benchmarks with mock and synthetic data sets. The Microbial Co-occurrence Network Explorer, or MiCoNE (available at https://github.com/segrelab/MiCoNE) follows these default tools and parameters and can help explore the outcome of these combinations of choices on the inferred networks. We envisage that this pipeline could be used for integrating multiple data sets and generating comparative analyses and consensus networks that can guide our understanding of microbial community assembly in different biomes. IMPORTANCE Mapping the interrelationships between different species in a microbial community is important for understanding and controlling their structure and function. The surge in the high-throughput sequencing of microbial communities has led to the creation of thousands of data sets containing information about microbial abundances. These abundances can be transformed into co-occurrence networks, providing a glimpse into the associations within microbiomes. However, processing these data sets to obtain co-occurrence information relies on several complex steps, each of which involves numerous choices of tools and corresponding parameters. These multiple options pose questions about the robustness and uniqueness of the inferred networks. In this study, we address this workflow and provide a systematic analysis of how these choices of tools affect the final network and guidelines on appropriate tool selection for a particular data set. We also develop a consensus network algorithm that helps generate more robust co-occurrence networks based on benchmark synthetic data sets.

Collapse

Pavia MJ, Chede A, Wu Z, Cadillo-Quiroz H, Zhu Q. BinaRena: a dedicated interactive platform for human-guided exploration and binning of metagenomes. MICROBIOME 2023;11:186. [PMID: 37596696 PMCID: PMC10439608 DOI: 10.1186/s40168-023-01625-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 07/16/2023] [Indexed: 08/20/2023]

Abstract

BACKGROUND

Exploring metagenomic contigs and "binning" them into metagenome-assembled genomes (MAGs) are essential for the delineation of functional and evolutionary guilds within microbial communities. Despite the advances in automated binning algorithms, their capabilities in recovering MAGs with accuracy and biological relevance are so far limited. Researchers often find that human involvement is necessary to achieve representative binning results. This manual process however is expertise demanding and labor intensive, and it deserves to be supported by software infrastructure.

RESULTS

We present BinaRena, a comprehensive and versatile graphic interface dedicated to aiding human operators to explore metagenome assemblies via customizable visualization and to associate contigs with bins. Contigs are rendered as an interactive scatter plot based on various data types, including sequence metrics, coverage profiles, taxonomic assignments, and functional annotations. Various contig-level operations are permitted, such as selection, masking, highlighting, focusing, and searching. Binning plans can be conveniently edited, inspected, and compared visually or using metrics including silhouette coefficient and adjusted Rand index. Completeness and contamination of user-selected contigs can be calculated in real time. In demonstration of BinaRena's usability, we show that it facilitated biological pattern discovery, hypothesis generation, and bin refinement in a complex tropical peatland metagenome. It enabled isolation of pathogenic genomes within closely related populations from the gut microbiota of diarrheal human subjects. It significantly improved overall binning quality after curating results of automated binners using a simulated marine dataset.

CONCLUSIONS

BinaRena is an installation-free, dependency-free, client-end web application that operates directly in any modern web browser, facilitating ease of deployment and accessibility for researchers of all skill levels. The program is hosted at https://github.com/qiyunlab/binarena , together with documentation, tutorials, example data, and a live demo. It effectively supports human researchers in intuitive interpretation and fine tuning of metagenomic data. Video Abstract.

Collapse

Wang J, Xu S, Zhao K, Song G, Zhao S, Liu R. Risk control of antibiotics, antibiotic resistance genes (ARGs) and antibiotic resistant bacteria (ARB) during sewage sludge treatment and disposal: A review. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023;877:162772. [PMID: 36933744 DOI: 10.1016/j.scitotenv.2023.162772] [Citation(s) in RCA: 34] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/14/2023] [Accepted: 03/06/2023] [Indexed: 05/06/2023]

Pu J, Yang J, Lu S, Jin D, Luo X, Xiong Y, Bai X, Zhu W, Huang Y, Wu S, Niu L, Liu L, Xu J. Species-Level Taxonomic Characterization of Uncultured Core Gut Microbiota of Plateau Pika. Microbiol Spectr 2023;11:e0349522. [PMID: 37067438 PMCID: PMC10269723 DOI: 10.1128/spectrum.03495-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 02/13/2023] [Indexed: 04/18/2023] Open

Abstract

Rarely has the vast diversity of bacteria on Earth been profiled, particularly on inaccessible plateaus. These uncultured microbes, which are also known as "microbial dark matter," may play crucial roles in maintaining the ecosystem and are linked to human health, regarding pathogenicity and prebioticity. The plateau pika (Ochotona curzoniae) is a small burrowing steppe lagomorph that is endemic to the Qinghai-Tibetan Plateau and is a keystone species in the maintenance of ecological balance. We used a combination of full-length 16S rRNA amplicon sequencing, shotgun metagenomics, and metabolomics to elucidate the species-level community structure and the metabolic potential of the gut microbiota of the plateau pika. Using a full-length 16S rRNA metataxonomic approach, we clustered 618 (166 ± 35 per sample) operational phylogenetic units (OPUs) from 105 plateau pika samples and assigned them to 215 known species, 226 potentially new species, and 177 higher hierarchical taxa. Notably, 39 abundant OPUs (over 60% total relative abundance) are found in over 90% of the samples, thereby representing a "core microbiota." They are all classified as novel microbial lineages, from the class to the species level. Using metagenomic reads, we independently assembled and binned 109 high-quality, species-level genome bins (SGBs). Then, a precise taxonomic assignment was performed to clarify the phylogenetic consistency of the SGBs and the 16S rRNA amplicons. Thus, the majority of the core microbes possess their genomes. SGBs belonging to the genus Treponema, the families Muribaculaceae, Lachnospiraceae, and Oscillospiraceae, and the order Eubacteriales are abundant in the metagenomic samples. In addition, multiple CAZymes are detected in these SGBs, indicating their efficient utilization of plant biomass. As the most widely connected metabolite with the core microbiota, tryptophan may relate to host environmental adaptation. Our investigation allows for a greater comprehension of the composition and functional capacity of the gut microbiota of the plateau pika. IMPORTANCE The great majority of microbial species remain uncultured, severely limiting their taxonomic characterization and biological understanding. The plateau pika (Ochotona curzoniae) is a small burrowing steppe lagomorph that is endemic to the Qinghai-Tibetan Plateau and is considered to be the keystone species in the maintenance of ecological stability. We comprehensively investigated the gut microbiota of the plateau pika via a multiomics endeavor. Combining full-length 16S rRNA metataxonomics, shotgun metagenomics, and metabolomics, we elucidated the species-level taxonomic assignment of the core uncultured intestinal microbiota of the plateau pika and revealed their correlation to host nutritional metabolism and adaptation. Our findings provide insights into the microbial diversity and biological significance of alpine animals.

Collapse

Affiliation(s)

Ji Pu State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
Jing Yang State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China Research Units of Discovery of Unknown Bacteria and Function, Chinese Academy of Medical Sciences, Beijing, China
Shan Lu State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China Research Units of Discovery of Unknown Bacteria and Function, Chinese Academy of Medical Sciences, Beijing, China
Dong Jin State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China Research Units of Discovery of Unknown Bacteria and Function, Chinese Academy of Medical Sciences, Beijing, China
Xuelian Luo State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
Yanwen Xiong State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
Xiangning Bai State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
Wentao Zhu State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
Yuyuan Huang State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
Shusheng Wu Yushu Prefecture Center for Disease Control and Prevention, Yushu, China
Lina Niu Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, China
Liyun Liu State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
Jianguo Xu State Key Laboratory of Infectious Disease Prevention and Control and National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China Research Units of Discovery of Unknown Bacteria and Function, Chinese Academy of Medical Sciences, Beijing, China Institute of Public Health, Nankai University, Tianjing, China

Collapse

Tadrent N, Dedeine F, Hervé V. SnakeMAGs: a simple, efficient, flexible and scalable workflow to reconstruct prokaryotic genomes from metagenomes. F1000Res 2022;11:1522. [PMID: 36875992 PMCID: PMC9978240 DOI: 10.12688/f1000research.128091.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/23/2023] [Indexed: 03/02/2023] Open

Abstract

Background: Over the last decade, we have observed in microbial ecology a transition from gene-centric to genome-centric analyses. Indeed, the advent of metagenomics combined with binning methods, single-cell genome sequencing as well as high-throughput cultivation methods have contributed to the continuing and exponential increase of available prokaryotic genomes, which in turn has favored the exploration of microbial metabolisms. In the case of metagenomics, data processing, from raw reads to genome reconstruction, involves various steps and software which can represent a major technical obstacle. Methods: To overcome this challenge, we developed SnakeMAGs, a simple workflow that can process Illumina data, from raw reads to metagenome-assembled genomes (MAGs) classification and relative abundance estimate. It integrates state-of-the-art bioinformatic tools to sequentially perform: quality control of the reads (illumina-utils, Trimmomatic), host sequence removal (optional step, using Bowtie2), assembly (MEGAHIT), binning (MetaBAT2), quality filtering of the bins (CheckM, GUNC), classification of the MAGs (GTDB-Tk) and estimate of their relative abundance (CoverM). Developed with the popular Snakemake workflow management system, it can be deployed on various architectures, from single to multicore and from workstation to computer clusters and grids. It is also flexible since users can easily change parameters and/or add new rules. Results: Using termite gut metagenomic datasets, we showed that SnakeMAGs is slower but allowed the recovery of more MAGs encompassing more diverse phyla compared to another similar workflow named ATLAS. Importantly, these additional MAGs showed no significant difference compared to the other ones in terms of completeness, contamination, genome size nor relative abundance. Conclusions: Overall, it should make the reconstruction of MAGs more accessible to microbiologists. SnakeMAGs as well as test files and an extended tutorial are available at https://github.com/Nachida08/SnakeMAGs.

Collapse

Tadrent N, Dedeine F, Hervé V. SnakeMAGs: a simple, efficient, flexible and scalable workflow to reconstruct prokaryotic genomes from metagenomes. F1000Res 2022;11:1522. [PMID: 36875992 PMCID: PMC9978240 DOI: 10.12688/f1000research.128091.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 12/01/2022] [Indexed: 01/05/2024] Open

Abstract

Background: Over the last decade, we have observed in microbial ecology a transition from gene-centric to genome-centric analyses. Indeed, the advent of metagenomics combined with binning methods, single-cell genome sequencing as well as high-throughput cultivation methods have contributed to the continuing and exponential increase of available prokaryotic genomes, which in turn has favored the exploration of microbial metabolisms. In the case of metagenomics, data processing, from raw reads to genome reconstruction, involves various steps and software which can represent a major technical obstacle. Methods: To overcome this challenge, we developed SnakeMAGs, a simple workflow that can process Illumina data, from raw reads to metagenome-assembled genomes (MAGs) classification and relative abundance estimate. It integrates state-of-the-art bioinformatic tools to sequentially perform: quality control of the reads (illumina-utils, Trimmomatic), host sequence removal (optional step, using Bowtie2), assembly (MEGAHIT), binning (MetaBAT2), quality filtering of the bins (CheckM), classification of the MAGs (GTDB-Tk) and estimate of their relative abundance (CoverM). Developed with the popular Snakemake workflow management system, it can be deployed on various architectures, from single to multicore and from workstation to computer clusters and grids. It is also flexible since users can easily change parameters and/or add new rules. Results: Using termite gut metagenomic datasets, we showed that SnakeMAGs is slower but allowed the recovery of more MAGs encompassing more diverse phyla compared to another similar workflow named ATLAS. Conclusions: Overall, it should make the reconstruction of MAGs more accessible to microbiologists. SnakeMAGs as well as test files and an extended tutorial are available at https://github.com/Nachida08/SnakeMAGs.

Collapse

Mallawaarachchi V, Lin Y. Accurate Binning of Metagenomic Contigs Using Composition, Coverage, and Assembly Graphs. J Comput Biol 2022;29:1357-1376. [DOI: 10.1089/cmb.2022.0262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Vollmers J, Wiegand S, Lenk F, Kaster AK. How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner. Nucleic Acids Res 2022;50:e76. [PMID: 35536293 PMCID: PMC9303271 DOI: 10.1093/nar/gkac294] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 04/11/2022] [Accepted: 04/13/2022] [Indexed: 11/12/2022] Open

Chandrasiri S, Perera T, Dilhara A, Perera I, Mallawaarachchi V. CH-Bin: A Convex Hull Based Approach for Binning Metagenomic Contigs. Comput Biol Chem 2022;100:107734. [DOI: 10.1016/j.compbiolchem.2022.107734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 07/12/2022] [Indexed: 11/30/2022]

Nishimura L, Fujito N, Sugimoto R, Inoue I. Detection of Ancient Viruses and Long-Term Viral Evolution. Viruses 2022;14:v14061336. [PMID: 35746807 PMCID: PMC9230872 DOI: 10.3390/v14061336] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Revised: 06/15/2022] [Accepted: 06/16/2022] [Indexed: 12/22/2022] Open

Sinha D, Sharma A, Mishra DC, Rai A, Lal SB, Kumar S, Farooqi MS, Chaturvedi KK. MetaConClust - Unsupervised Binning of Metagenomics Data using Consensus Clustering. Curr Genomics 2022;23:137-146. [PMID: 36778980 PMCID: PMC9878838 DOI: 10.2174/1389202923666220413114659] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 01/18/2022] [Accepted: 02/21/2022] [Indexed: 11/22/2022] Open

Ko KKK, Chng KR, Nagarajan N. Metagenomics-enabled microbial surveillance. Nat Microbiol 2022;7:486-496. [PMID: 35365786 DOI: 10.1038/s41564-022-01089-w] [Citation(s) in RCA: 71] [Impact Index Per Article: 35.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Accepted: 02/22/2022] [Indexed: 12/13/2022]

Assessment of Hydrocarbon Degradation Potential in Microbial Communities in Arctic Sea Ice. Microorganisms 2022;10:microorganisms10020328. [PMID: 35208784 PMCID: PMC8879337 DOI: 10.3390/microorganisms10020328] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 01/27/2022] [Accepted: 01/28/2022] [Indexed: 02/04/2023] Open

Abstract

The anthropogenic release of oil hydrocarbons into the cold marine environment is an increasing concern due to the elevated usage of sea routes and the exploration of new oil drilling sites in Arctic areas. The aim of this study was to evaluate prokaryotic community structures and the genetic potential of hydrocarbon degradation in the metagenomes of seawater, sea ice, and crude oil encapsulating the sea ice of the Norwegian fjord, Ofotfjorden. Although the results indicated substantial differences between the structure of prokaryotic communities in seawater and sea ice, the crude oil encapsulating sea ice (SIO) showed increased abundances of many genera-containing hydrocarbon-degrading organisms, including Bermanella, Colwellia, and Glaciecola. Although the metagenome of seawater was rich in a variety of hydrocarbon degradation-related functional genes (HDGs) associated with the metabolism of n-alkanes, and mono- and polyaromatic hydrocarbons, most of the normalized gene counts were highest in the clean sea ice metagenome, whereas in SIO, these counts were the lowest. The long-chain alkane degradation gene almA was detected from all the studied metagenomes and its counts exceeded ladA and alkB counts in both sea ice metagenomes. In addition, almA was related to the most diverse group of prokaryotic genera. Almost all 18 good- and high-quality metagenome-assembled genomes (MAGs) had diverse HDGs profiles. The MAGs recovered from the SIO metagenome belonged to the abundant taxa, such as Glaciecola, Bermanella, and Rhodobacteracea, in this environment. The genera associated with HDGs were often previously known as hydrocarbon-degrading genera. However, a substantial number of new associations, either between already known hydrocarbon-degrading genera and new HDGs or between genera not known to contain hydrocarbon degraders and multiple HDGs, were found. The superimposition of the results of comparing HDG associations with taxonomy, the HDG profiles of MAGs, and the full genomes of organisms in the KEGG database suggest that the found relationships need further investigation and verification.

Collapse

Boeri L, Donnaloja F, Campanile M, Sardelli L, Tunesi M, Fusco F, Giordano C, Albani D. Using integrated meta-omics to appreciate the role of the gut microbiota in epilepsy. Neurobiol Dis 2022;164:105614. [PMID: 35017031 DOI: 10.1016/j.nbd.2022.105614] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 12/31/2021] [Accepted: 01/05/2022] [Indexed: 12/16/2022] Open

Choudhari J, Choubey J, Verma M, Chatterjee T, Sahariah B. Metagenomics: the boon for microbial world knowledge and current challenges. Bioinformatics 2022. [DOI: 10.1016/b978-0-323-89775-4.00022-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open

Voigt B, Fischer O, Krumnow C, Herta C, Dabrowski PW. NGS read classification using AI. PLoS One 2021;16:e0261548. [PMID: 34936673 PMCID: PMC8694450 DOI: 10.1371/journal.pone.0261548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 12/03/2021] [Indexed: 11/19/2022] Open

Wan XH. Artificial intelligence reveals roles of gut microbiota in driving human colorectal cancer evolution. Artif Intell Cancer 2021;2:69-78. [DOI: 10.35713/aic.v2.i5.69] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 10/24/2021] [Accepted: 10/27/2021] [Indexed: 02/06/2023] Open

Mining the Microbiome and Microbiota-Derived Molecules in Inflammatory Bowel Disease. Int J Mol Sci 2021;22:ijms222011243. [PMID: 34681902 PMCID: PMC8540913 DOI: 10.3390/ijms222011243] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 10/12/2021] [Accepted: 10/13/2021] [Indexed: 12/12/2022] Open

Liu L, Wang Y, Yang Y, Wang D, Cheng SH, Zheng C, Zhang T. Charting the complexity of the activated sludge microbiome through a hybrid sequencing strategy. MICROBIOME 2021;9:205. [PMID: 34649602 PMCID: PMC8518188 DOI: 10.1186/s40168-021-01155-1] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 09/01/2021] [Indexed: 06/01/2023]

Abstract

BACKGROUND

Long-read sequencing has shown its tremendous potential to address genome assembly challenges, e.g., achieving the first telomere-to-telomere assembly of a gapless human chromosome. However, many issues remain unresolved when leveraging error-prone long reads to characterize high-complexity metagenomes, for instance, complete/high-quality genome reconstruction from highly complex systems.

RESULTS

Here, we developed an iterative haplotype-resolved hierarchical clustering-based hybrid assembly (HCBHA) approach that capitalizes on a hybrid (error-prone long reads and high-accuracy short reads) sequencing strategy to reconstruct (near-) complete genomes from highly complex metagenomes. Using the HCBHA approach, we first phase short and long reads from the highly complex metagenomic dataset into different candidate bacterial haplotypes, then perform hybrid assembly of each bacterial genome individually. We reconstructed 557 metagenome-assembled genomes (MAGs) with an average N50 of 574 Kb from a deeply sequenced, highly complex activated sludge (AS) metagenome. These high-contiguity MAGs contained 14 closed genomes and 111 high-quality (HQ) MAGs including full-length rRNA operons, which accounted for 61.1% of the microbial community. Leveraging the near-complete genomes, we also profiled the metabolic potential of the AS microbiome and identified 2153 biosynthetic gene clusters (BGCs) encoded within the recovered AS MAGs.

CONCLUSION

Our results established the feasibility of an iterative haplotype-resolved HCBHA approach to reconstruct (near-) complete genomes from highly complex ecosystems, providing new insights into "complete metagenomics". The retrieved high-contiguity MAGs illustrated that various biosynthetic gene clusters (BGCs) were harbored in the AS microbiome. The high diversity of BGCs highlights the potential to discover new natural products biosynthesized by the AS microbial community, aside from the traditional function (e.g., organic carbon and nitrogen removal) in wastewater treatment. Video Abstract.

Collapse

Dextro RB, Delbaje E, Cotta SR, Zehr JP, Fiore MF. Trends in Free-access Genomic Data Accelerate Advances in Cyanobacteria Taxonomy. JOURNAL OF PHYCOLOGY 2021;57:1392-1402. [PMID: 34291461 DOI: 10.1111/jpy.13200] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 07/16/2021] [Indexed: 06/13/2023]

Zacho CM, Bager MA, Margaryan A, Gravlund P, Galatius A, Rasmussen AR, Allentoft ME. Uncovering the genomic and metagenomic research potential in old ethanol-preserved snakes. PLoS One 2021;16:e0256353. [PMID: 34424926 PMCID: PMC8382189 DOI: 10.1371/journal.pone.0256353] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 08/04/2021] [Indexed: 11/19/2022] Open

Cahn JKB, Piel J. Anwendungen von Einzelzellmethoden in der mikrobiellen Naturstoffforschung. Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.201900532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, Raizada MK, Triplett EW. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry 2021;26:4277-4287. [PMID: 31988436 DOI: 10.1038/s41380-020-0652-5] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 01/13/2020] [Accepted: 01/16/2020] [Indexed: 12/15/2022]

Nathani NM, Dave KJ, Vatsa PP, Mahajan MS, Sharma P, Mootapally C. 309 metagenome assembled microbial genomes from deep sediment samples in the Gulfs of Kathiawar Peninsula. Sci Data 2021;8:194. [PMID: 34321485 PMCID: PMC8319310 DOI: 10.1038/s41597-021-00957-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 05/24/2021] [Indexed: 11/23/2022] Open

Mallawaarachchi VG, Wickramarachchi AS, Lin Y. Improving metagenomic binning results with overlapped bins using assembly graphs. Algorithms Mol Biol 2021;16:3. [PMID: 33947431 PMCID: PMC8097841 DOI: 10.1186/s13015-021-00185-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 04/20/2021] [Indexed: 11/18/2022] Open

Recovering prokaryotic genomes from host-associated, short-read shotgun metagenomic sequencing data. Nat Protoc 2021;16:2520-2541. [PMID: 33864056 DOI: 10.1038/s41596-021-00508-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Accepted: 01/12/2021] [Indexed: 02/02/2023]

Cahn JKB, Piel J. Opening up the Single-Cell Toolbox for Microbial Natural Products Research. Angew Chem Int Ed Engl 2021;60:18412-18428. [PMID: 30748086 DOI: 10.1002/anie.201900532] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2019] [Indexed: 02/06/2023]

Borderes M, Gasc C, Prestat E, Galvão Ferrarini M, Vinga S, Boucinha L, Sagot MF. A comprehensive evaluation of binning methods to recover human gut microbial species from a non-redundant reference gene catalog. NAR Genom Bioinform 2021;3:lqab009. [PMID: 33709074 PMCID: PMC7936653 DOI: 10.1093/nargab/lqab009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 01/18/2021] [Accepted: 01/29/2021] [Indexed: 01/19/2023] Open

Bharti R, Grimm DG. Current challenges and best-practice protocols for microbiome analysis. Brief Bioinform 2021;22:178-193. [PMID: 31848574 PMCID: PMC7820839 DOI: 10.1093/bib/bbz155] [Citation(s) in RCA: 227] [Impact Index Per Article: 75.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 10/23/2019] [Accepted: 11/06/2019] [Indexed: 12/15/2022] Open

Zhao F, Zhang D, Ge C, Zhang L, Reinach PS, Tian X, Tao C, Zhao Z, Zhao C, Fu W, Zeng C, Chen W. Metagenomic Profiling of Ocular Surface Microbiome Changes in Meibomian Gland Dysfunction. Invest Ophthalmol Vis Sci 2021;61:22. [PMID: 32673387 PMCID: PMC7425691 DOI: 10.1167/iovs.61.8.22] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Laso-Jadart R, Ambroise C, Peterlongo P, Madoui MA. metaVaR: Introducing metavariant species models for reference-free metagenomic-based population genomics. PLoS One 2020;15:e0244637. [PMID: 33378381 PMCID: PMC7773188 DOI: 10.1371/journal.pone.0244637] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 12/14/2020] [Indexed: 11/18/2022] Open

Mallawaarachchi V, Wickramarachchi A, Lin Y. GraphBin: refined binning of metagenomic contigs using assembly graphs. Bioinformatics 2020;36:3307-3313. [PMID: 32167528 DOI: 10.1093/bioinformatics/btaa180] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Revised: 02/18/2020] [Accepted: 03/10/2020] [Indexed: 12/17/2022] Open

Pérez-Cobas AE, Gomez-Valero L, Buchrieser C. Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses. Microb Genom 2020;6:mgen000409. [PMID: 32706331 PMCID: PMC7641418 DOI: 10.1099/mgen.0.000409] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 06/30/2020] [Indexed: 12/23/2022] Open

Yue Y, Huang H, Qi Z, Dou HM, Liu XY, Han TF, Chen Y, Song XJ, Zhang YH, Tu J. Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets. BMC Bioinformatics 2020;21:334. [PMID: 32723290 PMCID: PMC7469296 DOI: 10.1186/s12859-020-03667-3] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2019] [Accepted: 07/16/2020] [Indexed: 12/13/2022] Open

Abstract

Background

Shotgun metagenomics based on untargeted sequencing can explore the taxonomic profile and the function of unknown microorganisms in samples, and complement the shortage of amplicon sequencing. Binning assembled sequences into individual groups, which represent microbial genomes, is the key step and a major challenge in metagenomic research. Both supervised and unsupervised machine learning methods have been employed in binning. Genome binning belonging to unsupervised method clusters contigs into individual genome bins by machine learning methods without the assistance of any reference databases. So far a lot of genome binning tools have emerged. Evaluating these genome tools is of great significance to microbiological research. In this study, we evaluate 15 genome binning tools containing 12 original binning tools and 3 refining binning tools by comparing the performance of these tools on chicken gut metagenomic datasets and the first CAMI challenge datasets.

Results

For chicken gut metagenomic datasets, original genome binner MetaBat, Groopm2 and Autometa performed better than other original binner, and MetaWrap combined the binning results of them generated the most high-quality genome bins. For CAMI datasets, Groopm2 achieved the highest purity (> 0.9) with good completeness (> 0.8), and reconstructed the most high-quality genome bins among original genome binners. Compared with Groopm2, MetaBat2 had similar performance with higher completeness and lower purity. Genome refining binners DASTool predicated the most high-quality genome bins among all genomes binners. Most genome binner performed well for unique strains. Nonetheless, reconstructing common strains still is a substantial challenge for all genome binner.

Conclusions

In conclusion, we tested a set of currently available, state-of-the-art metagenomics hybrid binning tools and provided a guide for selecting tools for metagenomic binning by comparing range of purity, completeness, adjusted rand index, and the number of high-quality reconstructed bins. Furthermore, available information for future binning strategy were concluded.

Collapse

Affiliation(s)

Yi Yue Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, Anhui Agricultural University, Hefei, 230036, China. .,School of Information & Computer, Anhui Agricultural University, Hefei, 230036, China. .,School of Life Sciences, Anhui Agricultural University, Hefei, 230036, China.
Hao Huang Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, Anhui Agricultural University, Hefei, 230036, China.,School of Life Sciences, Anhui Agricultural University, Hefei, 230036, China.,School of Animal Science and Technology, Anhui Agricultural University, Hefei, 230036, China
Zhao Qi Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, Anhui Agricultural University, Hefei, 230036, China.,School of Information & Computer, Anhui Agricultural University, Hefei, 230036, China
Hui-Min Dou School of Information & Computer, Anhui Agricultural University, Hefei, 230036, China
Xin-Yi Liu School of Information & Computer, Anhui Agricultural University, Hefei, 230036, China
Tian-Fei Han Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, Anhui Agricultural University, Hefei, 230036, China.,School of Animal Science and Technology, Anhui Agricultural University, Hefei, 230036, China
Yue Chen Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, Anhui Agricultural University, Hefei, 230036, China.,School of Animal Science and Technology, Anhui Agricultural University, Hefei, 230036, China
Xiang-Jun Song Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, Anhui Agricultural University, Hefei, 230036, China.,School of Animal Science and Technology, Anhui Agricultural University, Hefei, 230036, China
You-Hua Zhang Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, Anhui Agricultural University, Hefei, 230036, China. .,School of Information & Computer, Anhui Agricultural University, Hefei, 230036, China. .,School of Life Sciences, Anhui Agricultural University, Hefei, 230036, China.
Jian Tu Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, Anhui Agricultural University, Hefei, 230036, China. .,School of Information & Computer, Anhui Agricultural University, Hefei, 230036, China. .,School of Animal Science and Technology, Anhui Agricultural University, Hefei, 230036, China.

Collapse

Wang Z, Wang Z, Lu YY, Sun F, Zhu S. SolidBin: improving metagenome binning with semi-supervised normalized cut. Bioinformatics 2020;35:4229-4238. [PMID: 30977806 DOI: 10.1093/bioinformatics/btz253] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2018] [Revised: 03/14/2019] [Accepted: 04/05/2019] [Indexed: 12/19/2022] Open

Abstract

MOTIVATION

Metagenomic contig binning is an important computational problem in metagenomic research, which aims to cluster contigs from the same genome into the same group. Unlike classical clustering problem, contig binning can utilize known relationships among some of the contigs or the taxonomic identity of some contigs. However, the current state-of-the-art contig binning methods do not make full use of the additional biological information except the coverage and sequence composition of the contigs.

RESULTS

We developed a novel contig binning method, Semi-supervised Spectral Normalized Cut for Binning (SolidBin), based on semi-supervised spectral clustering. Using sequence feature similarity and/or additional biological information, such as the reliable taxonomy assignments of some contigs, SolidBin constructs two types of prior information: must-link and cannot-link constraints. Must-link constraints mean that the pair of contigs should be clustered into the same group, while cannot-link constraints mean that the pair of contigs should be clustered in different groups. These constraints are then integrated into a classical spectral clustering approach, normalized cut, for improved contig binning. The performance of SolidBin is compared with five state-of-the-art genome binners, CONCOCT, COCACOLA, MaxBin, MetaBAT and BMC3C on five next-generation sequencing benchmark datasets including simulated multi- and single-sample datasets and real multi-sample datasets. The experimental results show that, SolidBin has achieved the best performance in terms of F-score, Adjusted Rand Index and Normalized Mutual Information, especially while using the real datasets and the single-sample dataset.

AVAILABILITY AND IMPLEMENTATION

https://github.com/sufforest/SolidBin.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Linard B, Swenson K, Pardi F. Rapid alignment-free phylogenetic identification of metagenomic sequences. Bioinformatics 2020;35:3303-3312. [PMID: 30698645 DOI: 10.1093/bioinformatics/btz068] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Revised: 01/18/2019] [Accepted: 01/29/2019] [Indexed: 12/20/2022] Open

Shang J, Sun Y. CHEER: HierarCHical taxonomic classification for viral mEtagEnomic data via deep leaRning. Methods 2020;189:95-103. [PMID: 32454212 PMCID: PMC7255349 DOI: 10.1016/j.ymeth.2020.05.018] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Revised: 05/05/2020] [Accepted: 05/17/2020] [Indexed: 02/07/2023] Open

Levy Karin E, Mirdita M, Söding J. MetaEuk-sensitive, high-throughput gene discovery, and annotation for large-scale eukaryotic metagenomics. MICROBIOME 2020;8:48. [PMID: 32245390 PMCID: PMC7126354 DOI: 10.1186/s40168-020-00808-x] [Citation(s) in RCA: 103] [Impact Index Per Article: 25.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Accepted: 02/14/2020] [Indexed: 05/10/2023]

Abstract

BACKGROUND

Metagenomics is revolutionizing the study of microorganisms and their involvement in biological, biomedical, and geochemical processes, allowing us to investigate by direct sequencing a tremendous diversity of organisms without the need for prior cultivation. Unicellular eukaryotes play essential roles in most microbial communities as chief predators, decomposers, phototrophs, bacterial hosts, symbionts, and parasites to plants and animals. Investigating their roles is therefore of great interest to ecology, biotechnology, human health, and evolution. However, the generally lower sequencing coverage, their more complex gene and genome architectures, and a lack of eukaryote-specific experimental and computational procedures have kept them on the sidelines of metagenomics.

RESULTS

MetaEuk is a toolkit for high-throughput, reference-based discovery, and annotation of protein-coding genes in eukaryotic metagenomic contigs. It performs fast searches with 6-frame-translated fragments covering all possible exons and optimally combines matches into multi-exon proteins. We used a benchmark of seven diverse, annotated genomes to show that MetaEuk is highly sensitive even under conditions of low sequence similarity to the reference database. To demonstrate MetaEuk's power to discover novel eukaryotic proteins in large-scale metagenomic data, we assembled contigs from 912 samples of the Tara Oceans project. MetaEuk predicted >12,000,000 protein-coding genes in 8 days on ten 16-core servers. Most of the discovered proteins are highly diverged from known proteins and originate from very sparsely sampled eukaryotic supergroups.

CONCLUSION

The open-source (GPLv3) MetaEuk software (https://github.com/soedinglab/metaeuk) enables large-scale eukaryotic metagenomics through reference-based, sensitive taxonomic and functional annotation. Video abstract.

Collapse