1
|
Ejaz MR, Badr K, Hassan ZU, Al-Thani R, Jaoua S. Metagenomic approaches and opportunities in arid soil research. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 953:176173. [PMID: 39260494 DOI: 10.1016/j.scitotenv.2024.176173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 09/04/2024] [Accepted: 09/07/2024] [Indexed: 09/13/2024]
Abstract
Arid soils present unique challenges and opportunities for studying microbial diversity and bioactive potential due to the extreme environmental conditions they bear. This review article investigates soil metagenomics as an emerging tool to explore complex microbial dynamics and unexplored bioactive potential in harsh environments. Utilizing advanced metagenomic techniques, diverse microbial populations that grow under extreme conditions such as high temperatures, salinity, high pH levels, and exposure to metals and radiation can be studied. The use of extremophiles to discover novel natural products and biocatalysts emphasizes the role of functional metagenomics in identifying enzymes and secondary metabolites for industrial and pharmaceutical purposes. Metagenomic sequencing uncovers a complex network of microbial diversity, offering significant potential for discovering new bioactive compounds. Functional metagenomics, connecting taxonomic diversity to genetic capabilities, provides a pathway to identify microbes' mechanisms to synthesize valuable secondary metabolites and other bioactive substances. Contrary to the common perception of desert soil as barren land, the metagenomic analysis reveals a rich diversity of life forms adept at extreme survival. It provides valuable findings into their resilience and potential applications in biotechnology. Moreover, the challenges associated with metagenomics in arid soils, such as low microbial biomass, high DNA degradation rates, and DNA extraction inhibitors and strategies to overcome these issues, outline the latest advancements in extraction methods, high-throughput sequencing, and bioinformatics. The importance of metagenomics for investigating diverse environments opens the way for future research to develop sustainable solutions in agriculture, industry, and medicine. Extensive studies are necessary to utilize the full potential of these powerful microbial communities. This research will significantly improve our understanding of microbial ecology and biotechnology in arid environments.
Collapse
Affiliation(s)
- Muhammad Riaz Ejaz
- Environmental Science Program, Department of Biological and Environmental Sciences, College of Arts and Science, Qatar University, P.O. Box 2713, Doha, Qatar
| | - Kareem Badr
- Environmental Science Program, Department of Biological and Environmental Sciences, College of Arts and Science, Qatar University, P.O. Box 2713, Doha, Qatar
| | - Zahoor Ul Hassan
- Environmental Science Program, Department of Biological and Environmental Sciences, College of Arts and Science, Qatar University, P.O. Box 2713, Doha, Qatar
| | - Roda Al-Thani
- Environmental Science Program, Department of Biological and Environmental Sciences, College of Arts and Science, Qatar University, P.O. Box 2713, Doha, Qatar
| | - Samir Jaoua
- Environmental Science Program, Department of Biological and Environmental Sciences, College of Arts and Science, Qatar University, P.O. Box 2713, Doha, Qatar.
| |
Collapse
|
2
|
Moeller AH, Dillard BA, Goldman SL, Real MVF, Sprockett DD. Removal of sequencing adapter contamination improves microbial genome databases. BMC Genomics 2024; 25:1033. [PMID: 39497067 PMCID: PMC11536531 DOI: 10.1186/s12864-024-10956-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2024] [Accepted: 10/24/2024] [Indexed: 11/06/2024] Open
Abstract
Advances in assembling microbial genomes have led to growth of reference genome databases, which have been transformative for applied and basic microbiome research. Here we show that published microbial genome databases from humans, mice, cows, pigs, fish, honeybees, and marine environments contain significant sequencing-adapter contamination that systematically reduces assembly accuracy and contiguousness. By removing the adapter-contaminated ends of contiguous sequences and reassembling MGnify reference genomes, we improve the quality of assemblies in these databases.
Collapse
Affiliation(s)
- Andrew H Moeller
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, 08540, USA.
| | - Brian A Dillard
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, 14853, USA
| | - Samantha L Goldman
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, 14853, USA
| | - Madalena V F Real
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, 14853, USA
| | - Daniel D Sprockett
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, 14853, USA
| |
Collapse
|
3
|
Kazantseva E, Donmez A, Frolova M, Pop M, Kolmogorov M. Strainy: phasing and assembly of strain haplotypes from long-read metagenome sequencing. Nat Methods 2024; 21:2034-2043. [PMID: 39327484 DOI: 10.1038/s41592-024-02424-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 08/22/2024] [Indexed: 09/28/2024]
Abstract
Bacterial species in microbial communities are often represented by mixtures of strains, distinguished by small variations in their genomes. Short-read approaches can be used to detect small-scale variation between strains but fail to phase these variants into contiguous haplotypes. Long-read metagenome assemblers can generate contiguous bacterial chromosomes but often suppress strain-level variation in favor of species-level consensus. Here we present Strainy, an algorithm for strain-level metagenome assembly and phasing from Nanopore and PacBio reads. Strainy takes a de novo metagenomic assembly as input and identifies strain variants, which are then phased and assembled into contiguous haplotypes. Using simulated and mock Nanopore and PacBio metagenome data, we show that Strainy assembles accurate and complete strain haplotypes, outperforming current Nanopore-based methods and comparable with PacBio-based algorithms in completeness and accuracy. We then use Strainy to assemble strain haplotypes of a complex environmental metagenome, revealing distinct strain distribution and mutational patterns in bacterial species.
Collapse
Affiliation(s)
- Ekaterina Kazantseva
- Bioinformatics and Systems Biology Program, ITMO University, St. Petersburg, Russia
| | - Ataberk Donmez
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Maria Frolova
- Functional Genomics of Prokaryotes Laboratory, Institute of Cell Biophysics, RAS, Pushchino, Russia
| | - Mihai Pop
- Department of Computer Science, University of Maryland, College Park, MD, USA.
| | - Mikhail Kolmogorov
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
4
|
Shen H, Zhang Q, Li M, Tan X, Dong X, Wang H. Research on intensive nitrogen removal of municipal sewage by mainstream anaerobic ammonia oxidation process. CHEMOSPHERE 2024; 367:143622. [PMID: 39461438 DOI: 10.1016/j.chemosphere.2024.143622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2024] [Revised: 10/19/2024] [Accepted: 10/23/2024] [Indexed: 10/29/2024]
Abstract
The anaerobic ammonia oxidation (anammox) process is a pivotal nitrogen removal technique, playing a significant role in the field of wastewater treatment. The paper commences by delineating the merits of the anammox process in comparison to conventional nitrification-denitrification techniques. Subsequently, it delves into the characteristics of different sludge morphologies process of the behavior of anammox bacteria and their reactions to environmental factors. Revising the issues associated with managing urban sewage in mainstream areas., it discusses the issues faced by the anammox process under reduced nitrogen loads, such as restricted activity due to decreased the levels of ammonia nitrogen and nitrite concentrations, as well as the impact of environmental factors like low temperature, organic matter, and sulfur ions. Following this, a comprehensive review of various types of coupled anammox processes is provided, highlighting the advantages and characteristics of partial nitrification (PN), partial denitrification (PD), methane-dependent nitrite/nitrate reduction (DAMO), sulfur-driven autotrophic denitrification (SAD), iron ammonia oxidation (feammox) and algae photoautotrophy coupling techniques, emphasizing their significance in system stability and resource utilization efficiency. Future research directions include exploring the applicability of the anammox process under various temperature conditions and addressing NO3--N issues in effluent. The findings from these studies will offer valuable insights for further enhancing the optimization of the anammox process in mainstream urban wastewater treatment.
Collapse
Affiliation(s)
- Haonan Shen
- School of Civil Engineering & Architecture, Wuhan University of Technology, Wuhan, 430070, China
| | - Qian Zhang
- School of Civil Engineering & Architecture, Wuhan University of Technology, Wuhan, 430070, China.
| | - Meng Li
- School of Civil Engineering & Architecture, Wuhan University of Technology, Wuhan, 430070, China
| | - Xibei Tan
- School of Civil Engineering & Architecture, Wuhan University of Technology, Wuhan, 430070, China
| | - Xiaoqian Dong
- School of Civil Engineering & Architecture, Wuhan University of Technology, Wuhan, 430070, China
| | - Hongyu Wang
- School of Civil Engineering, Wuhan University, Wuhan, 430072, China
| |
Collapse
|
5
|
Derelle R, von Wachsmann J, Mäklin T, Hellewell J, Russell T, Lalvani A, Chindelevitch L, Croucher NJ, Harris SR, Lees JA. Seamless, rapid, and accurate analyses of outbreak genomic data using split k-mer analysis. Genome Res 2024; 34:1661-1673. [PMID: 39406504 PMCID: PMC11529842 DOI: 10.1101/gr.279449.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 09/16/2024] [Indexed: 11/01/2024]
Abstract
Sequence variation observed in populations of pathogens can be used for important public health and evolutionary genomic analyses, especially outbreak analysis and transmission reconstruction. Identifying this variation is typically achieved by aligning sequence reads to a reference genome, but this approach is susceptible to reference biases and requires careful filtering of called genotypes. There is a need for tools that can process this growing volume of bacterial genome data, providing rapid results, but that remain simple so they can be used without highly trained bioinformaticians, expensive data analysis, and long-term storage and processing of large files. Here we describe split k-mer analysis (SKA2), a method that supports both reference-free and reference-based mapping to quickly and accurately genotype populations of bacteria using sequencing reads or genome assemblies. SKA2 is highly accurate for closely related samples, and in outbreak simulations, we show superior variant recall compared with reference-based methods, with no false positives. SKA2 can also accurately map variants to a reference and be used with recombination detection methods to rapidly reconstruct vertical evolutionary history. SKA2 is many times faster than comparable methods and can be used to add new genomes to an existing call set, allowing sequential use without the need to reanalyze entire collections. With an inherent absence of reference bias, high accuracy, and a robust implementation, SKA2 has the potential to become the tool of choice for genotyping bacteria. SKA2 is implemented in Rust and is freely available as open-source software.
Collapse
Affiliation(s)
- Romain Derelle
- NIHR Health Protection Research Unit in Respiratory Infections, National Heart and Lung Institute, Imperial College London, London W21PG, United Kingdom
| | - Johanna von Wachsmann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Tommi Mäklin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
- Department of Mathematics and Statistics, University of Helsinki, Helsinki 00014, Finland
| | - Joel Hellewell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Timothy Russell
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London WC1E 7HT, United Kingdom
| | - Ajit Lalvani
- NIHR Health Protection Research Unit in Respiratory Infections, National Heart and Lung Institute, Imperial College London, London W21PG, United Kingdom
| | - Leonid Chindelevitch
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London W12 0BZ, United Kingdom
| | - Nicholas J Croucher
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London W12 0BZ, United Kingdom
| | - Simon R Harris
- Bill and Melinda Gates Foundation, Westminster, London SW1E 6AJ, United Kingdom
| | - John A Lees
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom;
| |
Collapse
|
6
|
Zhi Q, Zheng B, Teng K, Xiao Y, Zhou X, Tang Q, Li J, Yin H, Meng D, Liu T. Metagenomic approach reveals the role of bioagents in the environmental dissemination risk of rhizosphere soil antibiotic resistance genes pollution. ENVIRONMENTAL RESEARCH 2024; 263:120090. [PMID: 39374754 DOI: 10.1016/j.envres.2024.120090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Revised: 09/26/2024] [Accepted: 09/28/2024] [Indexed: 10/09/2024]
Abstract
Antibiotic resistance genes (ARGs) have been identified as emerging contaminants, raising concerns around the world. As environmentally friendly bioagents (BA), plant growth-promoting rhizobacteria (PGPR) have been used in agricultural systems. The introduction of BA will lead to the turnover of the microbial communities structure. Nevertheless, it is still unclear how the colonization of the invaded microorganisms could affects the rhizosphere resistome. Consequently, 190 ARGs and 25 integrative and conjugative elements (ICEs) were annotated using the metagenomic approach in 18 samples from the Solanaceae crop rhizosphere soil under BA and conventional treatment (CK) groups. Our study found that, after 90 days of treatment, ARG abundance was lower in the CK group than in the BA group. The results showed that aminoglycoside antibiotic resistance (OprZ), phenicol antibiotic resistance (OprN), aminoglycoside antibiotic resistance (ceoA/B), aminocoumarin antibiotic resistance (mdtB) and phenicol antibiotic resistance (MexW) syntenic with ICEs. Moreover, in 11 sequences, OprN (phenicol antibiotic resistance) was observed to have synteny with ICEPaeLESB58-1, indicating that the ICEs could contribute to the spread of ARGs. Additionally, the binning result showed that the potential bacterial hosts of the ARGs were beneficial bacteria which could promote the nutrition cycle, such as Haliangium, Nitrospira, Sideroxydans, Burkholderia, etc, suggesting that bacterial hosts have a great influence on ARG profiles. According to the findings, considering the dissemination of ARGs, BA should be applied with caution, especially the use of beneficial bacteria in BA. In a nutshell, this study offers valuable insights into ARGs pollution control from the perspective of the development and application of BA, to make effective strategies for blocking pollution risk migration in the ecological environment.
Collapse
Affiliation(s)
- Qiqi Zhi
- School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China; Key Laboratory of Biometallurgy, Ministry of Education, Changsha, 410083, China
| | - Bufan Zheng
- College of Agronomy, Hunan Agricultural University, Changsha, 410127, China
| | - Kai Teng
- Hunan Province Xiangxi Autonomous Prefecture Tobacco Company, Jishou, 416000, China
| | - Yansong Xiao
- Chenzhou Tobacco Company of Hunan Province, Chenzhou, 423000, China
| | - Xiangping Zhou
- Yongzhou Tobacco Company of Hunan Province, Yongzhou, 425000, China
| | - Qianjun Tang
- College of Plant Protection, Hunan Agricultural University, Changsha, 410127, China
| | - Juan Li
- College of Agronomy, Hunan Agricultural University, Changsha, 410127, China
| | - Huaqun Yin
- School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China; Key Laboratory of Biometallurgy, Ministry of Education, Changsha, 410083, China
| | - Delong Meng
- School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China; Key Laboratory of Biometallurgy, Ministry of Education, Changsha, 410083, China.
| | - Tianbo Liu
- Tobacco Research Institute of Hunan Province, Changsha, 410004, China.
| |
Collapse
|
7
|
Du Y, Zuo W, Sun F. Imputing Metagenomic Hi-C Contacts Facilitates the Integrative Contig Binning Through Constrained Random Walk with Restart. J Comput Biol 2024; 31:1008-1021. [PMID: 39246231 DOI: 10.1089/cmb.2024.0663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/10/2024] Open
Abstract
Metagenomic Hi-C (metaHi-C) has shown remarkable potential for retrieving high-quality metagenome-assembled genomes from complex microbial communities. Nevertheless, existing metaHi-C-based contig binning methods solely rely on Hi-C interactions between contigs, disregarding crucial biological information such as the presence of single-copy marker genes. To overcome this limitation, we introduce ImputeCC, an integrative contig binning tool optimized for metaHi-C datasets. ImputeCC integrates both Hi-C interactions and the discriminative power of single-copy marker genes to group marker-gene-containing contigs into preliminary bins. It also introduces a novel constrained random walk with restart algorithm to enhance Hi-C connectivity among contigs. Comprehensive assessments using both mock and real metaHi-C datasets from diverse environments demonstrate that ImputeCC consistently outperforms other Hi-C-based contig binning tools. A genus-level analysis of the sheep gut microbiota reconstructed by ImputeCC underlines its capability to recover key species from dominant genera and identify previously unknown genera.
Collapse
Affiliation(s)
- Yuxuan Du
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, USA
| | - Wenxuan Zuo
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, USA
| | - Fengzhu Sun
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
8
|
Lu Y, Yang J, Li C, Tian Y, Chang R, Kong D, Yang S, Wang Y, Zhang Y, Zhu X, Pan W, Kong S. Efficient and easy-to-use capturing three-dimensional metagenome interactions with GutHi-C. IMETA 2024; 3:e227. [PMID: 39429879 PMCID: PMC11487548 DOI: 10.1002/imt2.227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/02/2024] [Revised: 07/07/2024] [Accepted: 07/08/2024] [Indexed: 10/22/2024]
Abstract
Hi-C can obtain three-dimensional chromatin structure information and is widely used for genome assembly. We constructed the GutHi-C technology. As shown in the graphical abstract, it is a highly efficient and quick-to-operate method and can be widely used for human, livestock, and poultry gut microorganisms. It provides a reference for the Hi-C methodology of the microbial metagenome. DPBS, Dulbecco's phosphate-buffered saline; Hi-C, high-through chromatin conformation capture; LB, Luria-Bertani; NGS, next-generation sequencing; PCR, polymerase chain reaction; QC, quality control.
Collapse
Affiliation(s)
- Yu‐Xi Lu
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi‐Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
- School of Life SciencesHenan UniversityKaifengChina
- Shenzhen Research Institute of Henan UniversityShenzhenChina
| | - Jin‐Bao Yang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi‐Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
- College of InformaticsHuazhong Agricultural UniversityWuhanChina
| | - Chen‐Ying Li
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi‐Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
- College of Animal Science and TechnologyQingdao Agricultural UniversityQingdaoChina
| | - Yun‐Han Tian
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi‐Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
- College of Animal Science and TechnologyQingdao Agricultural UniversityQingdaoChina
| | - Rong‐Rong Chang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi‐Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
- School of Life SciencesHenan UniversityKaifengChina
- Shenzhen Research Institute of Henan UniversityShenzhenChina
| | - Da‐Shuai Kong
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi‐Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
- School of Life SciencesHenan UniversityKaifengChina
- Shenzhen Research Institute of Henan UniversityShenzhenChina
| | - Shu‐Lin Yang
- State Key Laboratory of Animal Biotech Breeding, Institute of Animal ScienceChinese Academy of Agricultural SciencesBeijingChina
| | - Yan‐Fang Wang
- State Key Laboratory of Animal Biotech Breeding, Institute of Animal ScienceChinese Academy of Agricultural SciencesBeijingChina
| | - Yu‐Bo Zhang
- Frederick National Laboratory for Cancer ResearchFrederickMarylandUSA
| | - Xiu‐Sheng Zhu
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi‐Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
| | - Wei‐Hua Pan
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi‐Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
| | - Si‐Yuan Kong
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi‐Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
| |
Collapse
|
9
|
Wei G, Wu N, Zhao K, Yang S, Wang L, Liu Y. DeepCheck: multitask learning aids in assessing microbial genome quality. Brief Bioinform 2024; 25:bbae539. [PMID: 39438078 PMCID: PMC11495869 DOI: 10.1093/bib/bbae539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 08/26/2024] [Accepted: 10/09/2024] [Indexed: 10/25/2024] Open
Abstract
Metagenomic analyses facilitate the exploration of the microbial world, advancing our understanding of microbial roles in ecological and biological processes. A pivotal aspect of metagenomic analysis involves assessing the quality of metagenome-assembled genomes (MAGs), crucial for accurate biological insights. Current machine learning-based methods often treat completeness and contamination prediction as separate tasks, overlooking their inherent relationship and limiting models' generalization. In this study, we present DeepCheck, a multitasking deep learning framework for simultaneous prediction of MAG completeness and contamination. DeepCheck consistently outperforms existing tools in accuracy across various experimental settings and demonstrates comparable speed while maintaining high predictive accuracy even for new lineages. Additionally, we employ interpretable machine learning techniques to identify specific genes and pathways that drive the model's predictions, enabling independent investigation and assessment of these biological elements for deeper insights.
Collapse
Affiliation(s)
- Guo Wei
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Nannan Wu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Kunyang Zhao
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, 159 Panlong road, Xuanwu District, Nanjing 210000, China
| | - Long Wang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Yan Liu
- Department of Computer Science, Yangzhou University, 196 Huaxi Road, Hanjiang District, Yangzhou 225100, China
| |
Collapse
|
10
|
Dong W, Fan X, Guo Y, Wang S, Jia S, Lv N, Yuan T, Pan Y, Xue Y, Chen X, Xiong Q, Yang R, Zhao W, Zhu B. An expanded database and analytical toolkit for identifying bacterial virulence factors and their associations with chronic diseases. Nat Commun 2024; 15:8084. [PMID: 39278950 PMCID: PMC11402979 DOI: 10.1038/s41467-024-51864-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 08/16/2024] [Indexed: 09/18/2024] Open
Abstract
Virulence factor genes (VFGs) play pivotal roles in bacterial infections and have been identified within the human gut microbiota. However, their involvement in chronic diseases remains poorly understood. Here, we establish an expanded VFG database (VFDB 2.0) consisting of 62,332 nonredundant orthologues and alleles of VFGs using species-specific average nucleotide identity ( https://github.com/Wanting-Dong/MetaVF_toolkit/tree/main/databases ). We further develop the MetaVF toolkit, facilitating the precise identification of pathobiont-carried VFGs at the species level. A thorough characterization of VFGs for 5452 commensal isolates from healthy individuals reveals that only 11 of 301 species harbour these factors. Further analyses of VFGs within the gut microbiomes of nine chronic diseases reveal both common and disease-specific VFG features. Notably, in type 2 diabetes patients, long HiFi sequencing confirms that shared VF features are carried by pathobiont strains of Escherichia coli and Klebsiella pneumoniae. These findings underscore the critical importance of identifying and understanding VFGs in microbiome-associated diseases.
Collapse
Affiliation(s)
- Wanting Dong
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Xinyue Fan
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yaqiong Guo
- Institute of Biotechnology and Health, Beijing Academy of Science and Technology, Beijing, 100089, China
| | - Siyi Wang
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Shulei Jia
- School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Na Lv
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Tao Yuan
- Department of Endocrinology, Key Laboratory of Endocrinology of Ministry of Health, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
| | - Yuanlong Pan
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yong Xue
- National Engineering and Technology Research Center for Fruits and Vegetables, College of Food Science and Nutritional Engineering, China Agricultural University, Beijing, 100083, China
| | - Xi Chen
- College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, 210095, China
| | - Qian Xiong
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Ruifu Yang
- Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.
| | - Weigang Zhao
- Department of Endocrinology, Key Laboratory of Endocrinology of Ministry of Health, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China.
| | - Baoli Zhu
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
- Department of Pathogenic Biology, School of Basic Medical Sciences, Southwest Medical University, Luzhou, 646000, China.
- Jinan Microecological Biomedicine Shandong Laboratory, Jinan, 250117, China.
- Beijing Key Laboratory of Antimicrobial Resistance and Pathogen Genomics, Beijing, 100101, China.
| |
Collapse
|
11
|
Benoit G, Raguideau S, James R, Phillippy AM, Chikhi R, Quince C. High-quality metagenome assembly from long accurate reads with metaMDBG. Nat Biotechnol 2024; 42:1378-1383. [PMID: 38168989 PMCID: PMC11392814 DOI: 10.1038/s41587-023-01983-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 09/08/2023] [Indexed: 01/05/2024]
Abstract
We introduce metaMDBG, a metagenomics assembler for PacBio HiFi reads. MetaMDBG combines a de Bruijn graph assembly in a minimizer space with an iterative assembly over sequences of minimizers to address variations in genome coverage depth and an abundance-based filtering strategy to simplify strain complexity. For complex communities, we obtained up to twice as many high-quality circularized prokaryotic metagenome-assembled genomes as existing methods and had better recovery of viruses and plasmids.
Collapse
Affiliation(s)
- Gaëtan Benoit
- Organisms and Ecosystems, Earlham Institute, Norwich, UK
| | | | - Robert James
- Gut Microbes and Health, Quadram Institute, Norwich, UK
| | - Adam M Phillippy
- Genome Informatics Section, National Human Genome Research Institute, Bethesda, MD, USA
| | - Rayan Chikhi
- Sequence Bioinformatics, Department of Computational Biology, Institut Pasteur, Paris, France
| | - Christopher Quince
- Organisms and Ecosystems, Earlham Institute, Norwich, UK.
- Gut Microbes and Health, Quadram Institute, Norwich, UK.
- School of Biological Sciences, University of East Anglia, Norwich, UK.
- Warwick Medical School, University of Warwick, Coventry, UK.
| |
Collapse
|
12
|
Tian F, Wainaina JM, Howard-Varona C, Domínguez-Huerta G, Bolduc B, Gazitúa MC, Smith G, Gittrich MR, Zablocki O, Cronin DR, Eveillard D, Hallam SJ, Sullivan MB. Prokaryotic-virus-encoded auxiliary metabolic genes throughout the global oceans. MICROBIOME 2024; 12:159. [PMID: 39198891 PMCID: PMC11360552 DOI: 10.1186/s40168-024-01876-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Accepted: 07/16/2024] [Indexed: 09/01/2024]
Abstract
BACKGROUND Prokaryotic microbes have impacted marine biogeochemical cycles for billions of years. Viruses also impact these cycles, through lysis, horizontal gene transfer, and encoding and expressing genes that contribute to metabolic reprogramming of prokaryotic cells. While this impact is difficult to quantify in nature, we hypothesized that it can be examined by surveying virus-encoded auxiliary metabolic genes (AMGs) and assessing their ecological context. RESULTS We systematically developed a global ocean AMG catalog by integrating previously described and newly identified AMGs and then placed this catalog into ecological and metabolic contexts relevant to ocean biogeochemistry. From 7.6 terabases of Tara Oceans paired prokaryote- and virus-enriched metagenomic sequence data, we increased known ocean virus populations to 579,904 (up 16%). From these virus populations, we then conservatively identified 86,913 AMGs that grouped into 22,779 sequence-based gene clusters, 7248 (~ 32%) of which were not previously reported. Using our catalog and modeled data from mock communities, we estimate that ~ 19% of ocean virus populations carry at least one AMG. To understand AMGs in their metabolic context, we identified 340 metabolic pathways encoded by ocean microbes and showed that AMGs map to 128 of them. Furthermore, we identified metabolic "hot spots" targeted by virus AMGs, including nine pathways where most steps (≥ 0.75) were AMG-targeted (involved in carbohydrate, amino acid, fatty acid, and nucleotide metabolism), as well as other pathways where virus-encoded AMGs outnumbered cellular homologs (involved in lipid A phosphates, phosphatidylethanolamine, creatine biosynthesis, phosphoribosylamine-glycine ligase, and carbamoyl-phosphate synthase pathways). CONCLUSIONS Together, this systematically curated, global ocean AMG catalog and analyses provide a valuable resource and foundational observations to understand the role of viruses in modulating global ocean metabolisms and their biogeochemical implications. Video Abstract.
Collapse
Affiliation(s)
- Funing Tian
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA
- Department of Medicine, The University of Chicago, Chicago, IL, USA
| | - James M Wainaina
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA, USA
| | - Cristina Howard-Varona
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA
| | - Guillermo Domínguez-Huerta
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA
- EMERGE Biology Integration Institute, Ohio State University, Columbus, OH, 43210, USA
- Centro Oceanográfico de Málaga (IEO-CSIC), Puerto Pesquero S/N, 29640, Fuengirola (Málaga), Spain
| | - Benjamin Bolduc
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA
- EMERGE Biology Integration Institute, Ohio State University, Columbus, OH, 43210, USA
| | | | - Garrett Smith
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA
| | - Marissa R Gittrich
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA
| | - Olivier Zablocki
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA
| | - Dylan R Cronin
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA
- EMERGE Biology Integration Institute, Ohio State University, Columbus, OH, 43210, USA
| | - Damien Eveillard
- Université de Nantes, CNRS, LS2N, Nantes, France
- Research Federation for the Study of Global Ocean Systems Ecology and Evolution, R2022/Tara GO-SEE, Paris, France
| | - Steven J Hallam
- Department of Microbiology & Immunology, University of British Columbia, Vancouver, BC, V6T 1Z1, Canada
- Graduate Program in Bioinformatics, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
- Genome Science and Technology Program, University of British Columbia, 2329 West Mall, Vancouver, BC, V6T 1Z4, Canada
- Life Sciences Institute, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
- ECOSCOPE Training Program, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
| | - Matthew B Sullivan
- Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA.
- Center of Microbiome Science, Ohio State University, Columbus, OH, 43210, USA.
- EMERGE Biology Integration Institute, Ohio State University, Columbus, OH, 43210, USA.
- Department of Civil, Environmental, and Geodetic Engineering, Ohio State University, Columbus, OH, 43210, USA.
| |
Collapse
|
13
|
Lekbua A, Thiruppathy D, Coker J, Weng Y, Askarian F, Kousha A, Marotz C, Hauw A, Nizet V, Zengler K. SkinCom, a synthetic skin microbial community, enables reproducible investigations of the human skin microbiome. CELL REPORTS METHODS 2024; 4:100832. [PMID: 39111313 PMCID: PMC11384088 DOI: 10.1016/j.crmeth.2024.100832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 05/30/2024] [Accepted: 07/12/2024] [Indexed: 08/22/2024]
Abstract
Existing models of the human skin have aided our understanding of skin health and disease. However, they currently lack a microbial component, despite microbes' demonstrated connections to various skin diseases. Here, we present a robust, standardized model of the skin microbial community (SkinCom) to support in vitro and in vivo investigations. Our methods lead to the formation of an accurate, reproducible, and diverse community of aerobic and anaerobic bacteria. Subsequent testing of SkinCom on the dorsal skin of mice allowed for DNA and RNA recovery from both the applied SkinCom and the dorsal skin, highlighting its practicality for in vivo studies and -omics analyses. Furthermore, 66% of the responses to common cosmetic chemicals in vitro were in agreement with a human trial. Therefore, SkinCom represents a valuable, standardized tool for investigating microbe-metabolite interactions and facilitates the experimental design of in vivo studies targeting host-microbe relationships.
Collapse
Affiliation(s)
- Asama Lekbua
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA; School of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Deepan Thiruppathy
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Joanna Coker
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Yuhan Weng
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA; Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA 92093, USA
| | - Fatemeh Askarian
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Armin Kousha
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Clarisse Marotz
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Amber Hauw
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Victor Nizet
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA; Glycobiology Research and Training Center, University of California, San Diego, La Jolla, CA 92093, USA; Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Karsten Zengler
- Division of Host-Microbe Systems & Therapeutics, Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA; Center for Microbiome Innovation, University of California, San Diego, La Jolla, CA 92093, USA; Program in Materials Science and Engineering, University of California, San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
14
|
Whitener MR, Mangelson H, Sweigart AL. Patterns of genomic variation reveal a single evolutionary origin of the wild allotetraploid Mimulus sookensis. Evolution 2024; 78:1464-1477. [PMID: 38766685 DOI: 10.1093/evolut/qpae079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 03/12/2024] [Accepted: 05/17/2024] [Indexed: 05/22/2024]
Abstract
Polyploidy occurs across the tree of life and is especially common in plants. Because newly formed cytotypes are often incompatible with their progenitors, polyploidy is also said to trigger "instantaneous" speciation. If a polyploid can self-fertilize or reproduce asexually, it is even possible for one individual to produce an entirely new lineage, but how often this scenario occurs is unclear. Here, we investigate the evolutionary history of the wild allotetraploid Mimulus sookensis, which was formed through hybridization between self-compatible, diploid species in the Mimulus guttatus complex. We generate a chromosome-scale reference assembly for M. sookensis and define its distinct subgenomes. Despite previous reports suggesting multiple origins of this highly selfing polyploid, we discover patterns of population genomic variation that provide unambiguous support for a single origin. One M. sookensis subgenome is clearly derived from the selfer Mimulus nasutus, which organellar variation suggests is the maternal progenitor. The ancestor of the other subgenome is less certain, but it shares variation with both Mimulus decorus and M. guttatus, two outcrossing diploids with geographic ranges that overlap broadly with M. sookensis. This study establishes M. sookensis as an example of instantaneous speciation, likely facilitated by the polyploid's predisposition to self-fertilize.
Collapse
Affiliation(s)
- Makenzie R Whitener
- Department of Genetics, University of Georgia, Athens, GA 30602, United States
| | | | - Andrea L Sweigart
- Department of Genetics, University of Georgia, Athens, GA 30602, United States
| |
Collapse
|
15
|
Yang W, Luyten Y, Reister E, Mangelson H, Sisson Z, Auch B, Liachko I, Roberts RJ, Ettwiller L. Proxi-RIMS-seq2 applied to native microbiomes uncovers hundreds of known and novel m5C methyltransferase specificities. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.15.603628. [PMID: 39071437 PMCID: PMC11275837 DOI: 10.1101/2024.07.15.603628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Methylation patterns in bacteria can be used to study Restriction-Modification (RM) or other defense systems with novel properties. While m4C and m6A methylation is well characterized mainly through PacBio sequencing, the landscape of m5C methylation is under-characterized. To bridge this gap, we performed RIMS-seq2 on microbiomes composed of resolved assemblies of distinct genomes through proximity ligation. This high-throughput approach enables the identification of m5C methylated motifs and links them to cognate methyltransferases directly on native microbiomes without the need to isolate bacterial strains. Methylation patterns can also be identified on viral DNA and compared to host DNA, strengthening evidence for virus-host interaction. Applied to three different microbiomes, the method unveils over 1900 motifs that were deposited in REBASE. The motifs include a novel 8-base recognition site (CATm5CGATG) that was experimentally validated by characterizing its cognate methyltransferase. Our findings suggest that microbiomes harbor arrays of untapped m5C methyltransferase specificities, providing insights to bacterial biology and biotechnological applications.
Collapse
Affiliation(s)
- Weiwei Yang
- New England Biolabs Inc., 240 County Road, Ipswich, MA 01938, United States
| | - Yvette Luyten
- New England Biolabs Inc., 240 County Road, Ipswich, MA 01938, United States
| | - Emily Reister
- Phase Genomics Inc, 1617 8th Ave N Seattle, WA 98109, United States
| | - Hayley Mangelson
- Phase Genomics Inc, 1617 8th Ave N Seattle, WA 98109, United States
| | - Zach Sisson
- Phase Genomics Inc, 1617 8th Ave N Seattle, WA 98109, United States
| | - Benjamin Auch
- Phase Genomics Inc, 1617 8th Ave N Seattle, WA 98109, United States
| | - Ivan Liachko
- Phase Genomics Inc, 1617 8th Ave N Seattle, WA 98109, United States
| | - Richard J. Roberts
- New England Biolabs Inc., 240 County Road, Ipswich, MA 01938, United States
| | - Laurence Ettwiller
- New England Biolabs Inc., 240 County Road, Ipswich, MA 01938, United States
| |
Collapse
|
16
|
Chang T, Gavelis GS, Brown JM, Stepanauskas R. Genomic representativeness and chimerism in large collections of SAGs and MAGs of marine prokaryoplankton. MICROBIOME 2024; 12:126. [PMID: 39010229 PMCID: PMC11247762 DOI: 10.1186/s40168-024-01848-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 05/28/2024] [Indexed: 07/17/2024]
Abstract
BACKGROUND Single amplified genomes (SAGs) and metagenome-assembled genomes (MAGs) are the predominant sources of information about the coding potential of uncultured microbial lineages, but their strengths and limitations remain poorly understood. Here, we performed a direct comparison of two previously published collections of thousands of SAGs and MAGs obtained from the same, global environment. RESULTS We found that SAGs were less prone to chimerism and more accurately reflected the relative abundance and the pangenome content of microbial lineages inhabiting the epipelagic of the tropical and subtropical ocean, as compared to MAGs. SAGs were also better suited to link genome information with taxa discovered through 16S rRNA amplicon analyses. Meanwhile, MAGs had the advantage of more readily recovering genomes of rare lineages. CONCLUSIONS Our analyses revealed the relative strengths and weaknesses of the two most commonly used genome recovery approaches in environmental microbiology. These considerations, as well as the need for better tools for genome quality assessment, should be taken into account when designing studies and interpreting data that involve SAGs or MAGs. Video Abstract.
Collapse
Affiliation(s)
- Tianyi Chang
- Bigelow Laboratory for Ocean Sciences, East Boothbay, Maine, 04544, USA
| | - Gregory S Gavelis
- Bigelow Laboratory for Ocean Sciences, East Boothbay, Maine, 04544, USA
| | - Julia M Brown
- Bigelow Laboratory for Ocean Sciences, East Boothbay, Maine, 04544, USA
| | | |
Collapse
|
17
|
He C, Fucich D, Sosa A, Wang H, Kan J, Liu J, Xu Y, Jiao N, Gonsior M, Chen F. Deep metagenomic sequencing unveils novel SAR202 lineages and their vertical adaptation in the ocean. Commun Biol 2024; 7:853. [PMID: 38997445 PMCID: PMC11245477 DOI: 10.1038/s42003-024-06535-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 07/02/2024] [Indexed: 07/14/2024] Open
Abstract
SAR202 bacteria in the Chloroflexota phylum are abundant and widely distributed in the ocean. Their genome coding capacities indicate their potential roles in degrading complex and recalcitrant organic compounds in the ocean. However, our understanding of their genomic diversity, vertical distribution, and depth-related metabolisms is still limited by the number of assembled SAR202 genomes. In this study, we apply deep metagenomic sequencing (180 Gb per sample) to investigate microbial communities collected from six representative depths at the Bermuda Atlantic Time Series (BATS) station. We obtain 173 SAR202 metagenome-assembled genomes (MAGs). Intriguingly, 154 new species and 104 new genera are found based on these 173 SAR202 genomes. We add 12 new subgroups to the current SAR202 lineages. The vertical distribution of 20 SAR202 subgroups shows their niche partitioning in the euphotic, mesopelagic, and bathypelagic oceans, respectively. Deep-ocean SAR202 bacteria contain more genes and exhibit more metabolic potential for degrading complex organic substrates than those from the euphotic zone. With deep metagenomic sequencing, we uncover many new lineages of SAR202 bacteria and their potential functions which greatly deepen our understanding of their diversity, vertical profile, and contribution to the ocean's carbon cycling, especially in the deep ocean.
Collapse
Affiliation(s)
- Changfei He
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Carbon Neutral Innovation Research Center and Fujian Key Laboratory of Marine Carbon Sequestration, Xiamen University, Xiamen, 361102, PR China
- Institute of Marine Science and Technology, Shandong University, Qingdao, 266237, China
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, MD, 21202, USA
| | - Daniel Fucich
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, MD, 21202, USA
| | - Ana Sosa
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, MD, 21202, USA
| | - Hualong Wang
- College of Marine Life Sciences, Frontiers Science Center for Deep Ocean Multispheres and Earth System, and Key Lab of Polar Oceanography and Global Ocean Change, Ocean University of China, Qingdao, China
| | - Jinjun Kan
- Microbiology Division, Stroud Water Research Center, Avondale, PA, 19311, USA
| | - Jihua Liu
- Institute of Marine Science and Technology, Shandong University, Qingdao, 266237, China
| | - Yongle Xu
- Institute of Marine Science and Technology, Shandong University, Qingdao, 266237, China
| | - Nianzhi Jiao
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Carbon Neutral Innovation Research Center and Fujian Key Laboratory of Marine Carbon Sequestration, Xiamen University, Xiamen, 361102, PR China
| | - Michael Gonsior
- Chesapeake Biological Laboratory, University of Maryland Center for Environmental Science, Solomons, MD, 20783, USA
| | - Feng Chen
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, MD, 21202, USA.
| |
Collapse
|
18
|
Luan T, Commichaux S, Hoffmann M, Jayeola V, Jang JH, Pop M, Rand H, Luo Y. Benchmarking short and long read polishing tools for nanopore assemblies: achieving near-perfect genomes for outbreak isolates. BMC Genomics 2024; 25:679. [PMID: 38978005 PMCID: PMC11232133 DOI: 10.1186/s12864-024-10582-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 07/01/2024] [Indexed: 07/10/2024] Open
Abstract
BACKGROUND Oxford Nanopore provides high throughput sequencing platforms able to reconstruct complete bacterial genomes with 99.95% accuracy. However, even small levels of error can obscure the phylogenetic relationships between closely related isolates. Polishing tools have been developed to correct these errors, but it is uncertain if they obtain the accuracy needed for the high-resolution source tracking of foodborne illness outbreaks. RESULTS We tested 132 combinations of assembly and short- and long-read polishing tools to assess their accuracy for reconstructing the genome sequences of 15 highly similar Salmonella enterica serovar Newport isolates from a 2020 onion outbreak. While long-read polishing alone improved accuracy, near perfect accuracy (99.9999% accuracy or ~ 5 nucleotide errors across the 4.8 Mbp genome, excluding low confidence regions) was only obtained by pipelines that combined both long- and short-read polishing tools. Notably, medaka was a more accurate and efficient long-read polisher than Racon. Among short-read polishers, NextPolish showed the highest accuracy, but Pilon, Polypolish, and POLCA performed similarly. Among the 5 best performing pipelines, polishing with medaka followed by NextPolish was the most common combination. Importantly, the order of polishing tools mattered i.e., using less accurate tools after more accurate ones introduced errors. Indels in homopolymers and repetitive regions, where the short reads could not be uniquely mapped, remained the most challenging errors to correct. CONCLUSIONS Short reads are still needed to correct errors in nanopore sequenced assemblies to obtain the accuracy required for source tracking investigations. Our granular assessment of the performance of the polishing pipelines allowed us to suggest best practices for tool users and areas for improvement for tool developers.
Collapse
Affiliation(s)
- Tu Luan
- Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
| | - Seth Commichaux
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, Laurel, MD, 20708, USA.
| | - Maria Hoffmann
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Victor Jayeola
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Jae Hee Jang
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Mihai Pop
- Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
| | - Hugh Rand
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Yan Luo
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| |
Collapse
|
19
|
Babajanyan SG, Garushyants SK, Wolf YI, Koonin EV. Microbial diversity and ecological complexity emerging from environmental variation and horizontal gene transfer in a simple mathematical model. BMC Biol 2024; 22:148. [PMID: 38965531 PMCID: PMC11225191 DOI: 10.1186/s12915-024-01937-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Accepted: 06/13/2024] [Indexed: 07/06/2024] Open
Abstract
BACKGROUND Microbiomes are generally characterized by high diversity of coexisting microbial species and strains, and microbiome composition typically remains stable across a broad range of conditions. However, under fixed conditions, microbial ecology conforms with the exclusion principle under which two populations competing for the same resource within the same niche cannot coexist because the less fit population inevitably goes extinct. Therefore, the long-term persistence of microbiome diversity calls for an explanation. RESULTS To explore the conditions for stabilization of microbial diversity, we developed a simple mathematical model consisting of two competing populations that could exchange a single gene allele via horizontal gene transfer (HGT). We found that, although in a fixed environment, with unbiased HGT, the system obeyed the exclusion principle, in an oscillating environment, within large regions of the phase space bounded by the rates of reproduction and HGT, the two populations coexist. Moreover, depending on the parameter combination, all three major types of symbiosis were obtained, namely, pure competition, host-parasite relationship, and mutualism. In each of these regimes, certain parameter combinations provided for synergy, that is, a greater total abundance of both populations compared to the abundance of the winning population in the fixed environment. CONCLUSIONS The results of this modeling study show that basic phenomena that are universal in microbial communities, namely, environmental variation and HGT, provide for stabilization and persistence of microbial diversity, and emergence of ecological complexity.
Collapse
Affiliation(s)
- Sanasar G Babajanyan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, 20894, MD, USA.
| | - Sofya K Garushyants
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, 20894, MD, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, 20894, MD, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, 20894, MD, USA.
| |
Collapse
|
20
|
Qiu Z, Zhu Y, Zhang Q, Qiao X, Mu R, Xu Z, Yan Y, Wang F, Zhang T, Zhuang WQ, Yu K. Unravelling biosynthesis and biodegradation potentials of microbial dark matters in hypersaline lakes. ENVIRONMENTAL SCIENCE AND ECOTECHNOLOGY 2024; 20:100359. [PMID: 39221074 PMCID: PMC11361885 DOI: 10.1016/j.ese.2023.100359] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 11/26/2023] [Accepted: 11/26/2023] [Indexed: 09/04/2024]
Abstract
Biosynthesis and biodegradation of microorganisms critically underpin the development of biotechnology, new drugs and therapies, and environmental remediation. However, most uncultured microbial species along with their metabolic capacities in extreme environments, remain obscured. Here we unravel the metabolic potential of microbial dark matters (MDMs) in four deep-inland hypersaline lakes in Xinjiang, China. Utilizing metagenomic binning, we uncovered a rich diversity of 3030 metagenome-assembled genomes (MAGs) across 82 phyla, revealing a substantial portion, 2363 MAGs, as previously unclassified at the genus level. These unknown MAGs displayed unique distribution patterns across different lakes, indicating a strong correlation with varied physicochemical conditions. Our analysis revealed an extensive array of 9635 biosynthesis gene clusters (BGCs), with a remarkable 9403 being novel, suggesting untapped biotechnological potential. Notably, some MAGs from potentially new phyla exhibited a high density of these BGCs. Beyond biosynthesis, our study also identified novel biodegradation pathways, including dehalogenation, anaerobic ammonium oxidation (Anammox), and degradation of polycyclic aromatic hydrocarbons (PAHs) and plastics, in previously unknown microbial clades. These findings significantly enrich our understanding of biosynthesis and biodegradation processes and open new avenues for biotechnological innovation, emphasizing the untapped potential of microbial diversity in hypersaline environments.
Collapse
Affiliation(s)
- Zhiguang Qiu
- School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen, 518055, China
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, 518055, China
| | - Yuanyuan Zhu
- School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen, 518055, China
| | - Qing Zhang
- School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen, 518055, China
| | - Xuejiao Qiao
- School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen, 518055, China
| | - Rong Mu
- School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen, 518055, China
| | - Zheng Xu
- Southern University of Sciences and Technology Yantian Hospital, Shenzhen, 518081, China
- Institute of Biomedicine and Biotechnology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China
| | - Yan Yan
- State Key Laboratory of Isotope Geochemistry, CAS Center for Excellence in Deep Earth Science, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou, 510640, China
| | - Fan Wang
- School of Atmospheric Sciences, Sun Yat-sen University, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519082, China
| | - Tong Zhang
- Department of Civil Engineering, University of Hong Kong, 999077, Hong Kong, China
| | - Wei-Qin Zhuang
- Department of Civil and Environmental Engineering, Faculty of Engineering, University of Auckland, New Zealand
| | - Ke Yu
- School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen, 518055, China
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, 518055, China
| |
Collapse
|
21
|
Yamaguchi M, Uchihashi T, Kawabata S. Hybrid sequence-based analysis reveals the distribution of bacterial species and genes in the oral microbiome at a high resolution. Biochem Biophys Rep 2024; 38:101717. [PMID: 38708423 PMCID: PMC11066573 DOI: 10.1016/j.bbrep.2024.101717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 04/14/2024] [Accepted: 04/19/2024] [Indexed: 05/07/2024] Open
Abstract
Bacteria in the oral microbiome are poorly identified owing to the lack of established culture methods for them. Thus, this study aimed to use culture-free analysis techniques, including bacterial single-cell genome sequencing, to identify bacterial species and investigate gene distribution in saliva. Saliva samples from the same individual were classified as inactivated or viable and then analyzed using 16S rRNA sequencing, metagenomic shotgun sequencing, and bacterial single-cell sequencing. The results of 16S rRNA sequencing revealed similar microbiota structures in both samples, with Streptococcus being the predominant genus. Metagenomic shotgun sequencing showed that approximately 80 % of the DNA in the samples was of non-bacterial origin, whereas single-cell sequencing showed an average contamination rate of 10.4 % per genome. Single-cell sequencing also yielded genome sequences for 43 out of 48 wells for the inactivated samples and 45 out of 48 wells for the viable samples. With respect to resistance genes, four out of 88 isolates carried cfxA, which encodes a β-lactamase, and four isolates carried erythromycin resistance genes. Tetracycline resistance genes were found in nine bacteria. Metagenomic shotgun sequencing provided complete sequences of cfxA, ermF, and ermX, whereas other resistance genes, such as tetQ and tetM, were detected as fragments. In addition, virulence factors from Streptococcus pneumoniae were the most common, with 13 genes detected. Our average nucleotide identity analysis also suggested five single-cell-isolated bacteria as potential novel species. These data would contribute to expanding the oral microbiome data resource.
Collapse
Affiliation(s)
- Masaya Yamaguchi
- Bioinformatics Research Unit, Osaka University Graduate School of Dentistry, Suita, Osaka, Japan
- Department of Microbiology, Osaka University Graduate School of Dentistry, Suita, Osaka, Japan
- Bioinformatics Center, Research Institute for Microbial Diseases, Osaka University, Japan
- Center for Infectious Diseases Education and Research, Osaka University, Japan
| | - Toshihiro Uchihashi
- Department of Oral and Maxillofacial Surgery, Osaka University Graduate School of Dentistry, Suita, Osaka, Japan
| | - Shigetada Kawabata
- Department of Microbiology, Osaka University Graduate School of Dentistry, Suita, Osaka, Japan
- Center for Infectious Diseases Education and Research, Osaka University, Japan
| |
Collapse
|
22
|
Wang YC, Fu HM, Shen Y, Wang J, Wang N, Chen YP, Yan P. Biosynthetic potential of uncultured anammox community bacteria revealed through multi-omics analysis. BIORESOURCE TECHNOLOGY 2024; 401:130740. [PMID: 38677385 DOI: 10.1016/j.biortech.2024.130740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 03/11/2024] [Accepted: 04/24/2024] [Indexed: 04/29/2024]
Abstract
Microbial secondary metabolites (SMs) and their derivatives have been widely used in medicine, agriculture, and energy. Growing needs for renewable energy and the challenges posed by antibiotic resistance, cancer, and pesticides emphasize the crucial hunt for new SMs. Anaerobic ammonium-oxidation (anammox) systems harbor many uncultured or underexplored bacteria, representing potential resources for discovering novel SMs. Leveraging HiFi long-read metagenomic sequencing, 1,040 biosynthetic gene clusters (BGCs) were unearthed from the anammox microbiome with 58% being complete and showcasing rich diversity. Most of them showed distant relations to known BGCs, implying novelty. Members of the underexplored lineages (Chloroflexota and Planctomycetota) and Proteobacteria contained lots of BGCs, showcasing substantial biosynthetic potential. Metaproteomic results indicated that Planctomycetota members harbored the most active BGCs, particularly those involved in producing potential biofuel-ladderane. Overall, these findings underscore that anammox microbiomes could serve as valuable resources for mining novel BGCs and discovering new SMs for practical application.
Collapse
Affiliation(s)
- Yi-Cheng Wang
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Hui-Min Fu
- National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University, Chongqing 400067, China
| | - Yu Shen
- National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University, Chongqing 400067, China
| | - Jin Wang
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Nuo Wang
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - You-Peng Chen
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Peng Yan
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China.
| |
Collapse
|
23
|
Agustinho DP, Fu Y, Menon VK, Metcalf GA, Treangen TJ, Sedlazeck FJ. Unveiling microbial diversity: harnessing long-read sequencing technology. Nat Methods 2024; 21:954-966. [PMID: 38689099 DOI: 10.1038/s41592-024-02262-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 03/29/2024] [Indexed: 05/02/2024]
Abstract
Long-read sequencing has recently transformed metagenomics, enhancing strain-level pathogen characterization, enabling accurate and complete metagenome-assembled genomes, and improving microbiome taxonomic classification and profiling. These advancements are not only due to improvements in sequencing accuracy, but also happening across rapidly changing analysis methods. In this Review, we explore long-read sequencing's profound impact on metagenomics, focusing on computational pipelines for genome assembly, taxonomic characterization and variant detection, to summarize recent advancements in the field and provide an overview of available analytical methods to fully leverage long reads. We provide insights into the advantages and disadvantages of long reads over short reads and their evolution from the early days of long-read sequencing to their recent impact on metagenomics and clinical diagnostics. We further point out remaining challenges for the field such as the integration of methylation signals in sub-strain analysis and the lack of benchmarks.
Collapse
Affiliation(s)
- Daniel P Agustinho
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA
| | - Yilei Fu
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Vipin K Menon
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA
- Senior research project manager, Human Genetics, Genentech, South San Francisco, CA, USA
| | - Ginger A Metcalf
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA
| | - Todd J Treangen
- Department of Computer Science, Rice University, Houston, TX, USA
- Department of Bioengineering, Rice University, Houston, TX, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA.
- Department of Computer Science, Rice University, Houston, TX, USA.
| |
Collapse
|
24
|
Wang YC, Mao Y, Fu HM, Wang J, Weng X, Liu ZH, Xu XW, Yan P, Fang F, Guo JS, Shen Y, Chen YP. New insights into functional divergence and adaptive evolution of uncultured bacteria in anammox community by complete genome-centric analysis. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 924:171530. [PMID: 38453092 DOI: 10.1016/j.scitotenv.2024.171530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 11/13/2023] [Accepted: 03/04/2024] [Indexed: 03/09/2024]
Abstract
Anaerobic ammonium-oxidation (anammox) bacteria play a crucial role in global nitrogen cycling and wastewater nitrogen removal, but they share symbiotic relationships with various other microorganisms. Functional divergence and adaptive evolution of uncultured bacteria in anammox community remain underexplored. Although shotgun metagenomics based on short reads has been widely used in anammox research, metagenome-assembled genomes (MAGs) are often discontinuous and highly contaminated, which limits in-depth analyses of anammox communities. Here, for the first time, we performed Pacific Biosciences high-fidelity (HiFi) long-read sequencing on the anammox granule sludge sample from a lab-scale bioreactor, and obtained 30 accurate and complete metagenome-assembled genomes (cMAGs). These cMAGs were obtained by selecting high-quality circular contigs from initial assemblies of long reads generated by HiFi sequencing, eliminating the need for Illumina short reads, binning, and reassembly. One new anammox species affiliated with Candidatus Jettenia and three species affiliated with novel families were found in this anammox community. cMAG-centric analysis revealed functional divergence in general and nitrogen metabolism among the anammox community members, and they might adopt a cross-feeding strategy in organic matter, cofactors, and vitamins. Furthermore, we identified 63 mobile genetic elements (MGEs) and 50 putative horizontal gene transfer (HGT) events within these cMAGs. The results suggest that HGT events and MGEs related to phage and integration or excision, particularly transposons containing tnpA in anammox bacteria, might play important roles in the adaptive evolution of this anammox community. The cMAGs generated in the present study could be used to establish of a comprehensive database for anammox bacteria and associated microorganisms. These findings highlight the advantages of HiFi sequencing for the studies of complex mixed cultures and advance the understanding of anammox communities.
Collapse
Affiliation(s)
- Yi-Cheng Wang
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Yanping Mao
- College of Chemistry and Environmental Engineering, Shenzhen University, Shenzhen 518071, Guangdong, China
| | - Hui-Min Fu
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China; National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University, Chongqing 400067, China
| | - Jin Wang
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Xun Weng
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Zi-Hao Liu
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Xiao-Wei Xu
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Peng Yan
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Fang Fang
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Jin-Song Guo
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Yu Shen
- National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University, Chongqing 400067, China
| | - You-Peng Chen
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China.
| |
Collapse
|
25
|
Feng X, Li H. Evaluating and improving the representation of bacterial contents in long-read metagenome assemblies. Genome Biol 2024; 25:92. [PMID: 38605401 PMCID: PMC11007910 DOI: 10.1186/s13059-024-03234-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 03/29/2024] [Indexed: 04/13/2024] Open
Abstract
BACKGROUND In the metagenomic assembly of a microbial community, abundant species are often thought to assemble well given their deeper sequencing coverage. This conjuncture is rarely tested or evaluated in practice. We often do not know how many abundant species are missing and do not have an approach to recover them. RESULTS Here, we propose k-mer based and 16S RNA based methods to measure the completeness of metagenome assembly. We show that even with PacBio high-fidelity (HiFi) reads, abundant species are often not assembled, as high strain diversity may lead to fragmented contigs. We develop a novel reference-free algorithm to recover abundant metagenome-assembled genomes (MAGs) by identifying circular assembly subgraphs. Complemented with a reference-free genome binning heuristics based on dimension reduction, the proposed method rescues many abundant species that would be missing with existing methods and produces competitive results compared to those state-of-the-art binners in terms of total number of near-complete genome bins. CONCLUSIONS Our work emphasizes the importance of metagenome completeness, which has often been overlooked. Our algorithm generates more circular MAGs and moves a step closer to the complete representation of microbial communities.
Collapse
Affiliation(s)
- Xiaowen Feng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, USA
| | - Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, USA.
| |
Collapse
|
26
|
Long DR, Bryson-Cahn C, Waalkes A, Holmes EA, Penewit K, Tavolaro C, Bellabarba C, Zhang F, Chan JD, Fang FC, Lynch JB, Salipante SJ. Contribution of the patient microbiome to surgical site infection and antibiotic prophylaxis failure in spine surgery. Sci Transl Med 2024; 16:eadk8222. [PMID: 38598612 DOI: 10.1126/scitranslmed.adk8222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 03/18/2024] [Indexed: 04/12/2024]
Abstract
Despite modern antiseptic techniques, surgical site infection (SSI) remains a leading complication of surgery. However, the origins of SSI and the high rates of antimicrobial resistance observed in these infections are poorly understood. Using instrumented spine surgery as a model of clean (class I) skin incision, we prospectively sampled preoperative microbiomes and postoperative SSI isolates in a cohort of 204 patients. Combining multiple forms of genomic analysis, we correlated the identity, anatomic distribution, and antimicrobial resistance profiles of SSI pathogens with those of preoperative strains obtained from the patient skin microbiome. We found that 86% of SSIs, comprising a broad range of bacterial species, originated endogenously from preoperative strains, with no evidence of common source infection among a superset of 1610 patients. Most SSI isolates (59%) were resistant to the prophylactic antibiotic administered during surgery, and their resistance phenotypes correlated with the patient's preoperative resistome (P = 0.0002). These findings indicate the need for SSI prevention strategies tailored to the preoperative microbiome and resistome present in individual patients.
Collapse
Affiliation(s)
- Dustin R Long
- Division of Critical Care Medicine, Department of Anesthesiology and Pain Medicine, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Chloe Bryson-Cahn
- Division of Allergy and Infectious Diseases, Department of Medicine, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Adam Waalkes
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Elizabeth A Holmes
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Kelsi Penewit
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Celeste Tavolaro
- Department of Orthopaedics and Sports Medicine, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Carlo Bellabarba
- Department of Orthopaedics and Sports Medicine, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Fangyi Zhang
- Department of Orthopaedics and Sports Medicine, University of Washington School of Medicine, Seattle, WA 98195, USA
- Department of Neurological Surgery, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Jeannie D Chan
- Division of Allergy and Infectious Diseases, Department of Medicine, University of Washington School of Medicine, Seattle, WA 98195, USA
- Department of Pharmacy, Harborview Medical Center, University of Washington School of Pharmacy, Seattle, WA 98104, USA
| | - Ferric C Fang
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine, Seattle, WA 98195, USA
- Department of Microbiology, University of Washington School of Medicine, Seattle, WA 98195, USA
- Clinical Microbiology Laboratory, Harborview Medical Center, Seattle, WA 98104, USA
| | - John B Lynch
- Division of Allergy and Infectious Diseases, Department of Medicine, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Stephen J Salipante
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine, Seattle, WA 98195, USA
| |
Collapse
|
27
|
Eisenhofer R, Nesme J, Santos-Bay L, Koziol A, Sørensen SJ, Alberdi A, Aizpurua O. A comparison of short-read, HiFi long-read, and hybrid strategies for genome-resolved metagenomics. Microbiol Spectr 2024; 12:e0359023. [PMID: 38451230 PMCID: PMC10986573 DOI: 10.1128/spectrum.03590-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 02/11/2024] [Indexed: 03/08/2024] Open
Abstract
Shotgun metagenomics enables the reconstruction of complex microbial communities at a high level of detail. Such an approach can be conducted using both short-read and long-read sequencing data, as well as a combination of both. To assess the pros and cons of these different approaches, we used 22 fecal DNA extracts collected weekly for 11 weeks from two respective lab mice to study seven performance metrics over four combinations of sequencing depth and technology: (i) 20 Gbp of Illumina short-read data, (ii) 40 Gbp of short-read data, (iii) 20 Gbp of PacBio HiFi long-read data, and (iv) 40 Gbp of hybrid (20 Gbp of short-read +20 Gbp of long-read) data. No strategy was best for all metrics; instead, each one excelled across different metrics. The long-read approach yielded the best assembly statistics, with the highest N50 and lowest number of contigs. The 40 Gbp short-read approach yielded the highest number of refined bins. Finally, the hybrid approach yielded the longest assemblies and the highest mapping rate to the bacterial genomes. Our results suggest that while long-read sequencing significantly improves the quality of reconstructed bacterial genomes, it is more expensive and requires deeper sequencing than short-read approaches to recover a comparable amount of reconstructed genomes. The most optimal strategy is study-specific and depends on how researchers assess the trade-off between the quantity and quality of recovered genomes.IMPORTANCEMice are an important model organism for understanding the gut microbiome. When studying these gut microbiomes using DNA techniques, researchers can choose from technologies that use short or long DNA reads. In this study, we perform an extensive benchmark between short- and long-read DNA sequencing for studying mice gut microbiomes. We find that no one approach was best for all metrics and provide information that can help guide researchers in planning their experiments.
Collapse
Affiliation(s)
- Raphael Eisenhofer
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Joseph Nesme
- Section of Microbiology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Luisa Santos-Bay
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Adam Koziol
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Søren Johannes Sørensen
- Section of Microbiology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Antton Alberdi
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Ostaizka Aizpurua
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
28
|
Cook R, Telatin A, Hsieh SY, Newberry F, Tariq MA, Baker DJ, Carding SR, Adriaenssens EM. Nanopore and Illumina sequencing reveal different viral populations from human gut samples. Microb Genom 2024; 10. [PMID: 38683195 DOI: 10.1099/mgen.0.001236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2024] Open
Abstract
The advent of viral metagenomics, or viromics, has improved our knowledge and understanding of global viral diversity. High-throughput sequencing technologies enable explorations of the ecological roles, contributions to host metabolism, and the influence of viruses in various environments, including the human intestinal microbiome. However, bacterial metagenomic studies frequently have the advantage. The adoption of advanced technologies like long-read sequencing has the potential to be transformative in refining viromics and metagenomics. Here, we examined the effectiveness of long-read and hybrid sequencing by comparing Illumina short-read and Oxford Nanopore Technology (ONT) long-read sequencing technologies and different assembly strategies on recovering viral genomes from human faecal samples. Our findings showed that if a single sequencing technology is to be chosen for virome analysis, Illumina is preferable due to its superior ability to recover fully resolved viral genomes and minimise erroneous genomes. While ONT assemblies were effective in recovering viral diversity, the challenges related to input requirements and the necessity for amplification made it less ideal as a standalone solution. However, using a combined, hybrid approach enabled a more authentic representation of viral diversity to be obtained within samples.
Collapse
Affiliation(s)
- Ryan Cook
- Quadram Institute Bioscience, Norwich, NR4 7UQ, UK
| | | | | | - Fiona Newberry
- Department of Biosciences, Nottingham Trent University, Nottingham, NG11 8NS, UK
| | - Mohammad A Tariq
- Faculty of Health and Life Sciences, University of Northumbria, Newcastle upon Tyne, NE1 8ST, UK
| | - Dave J Baker
- Quadram Institute Bioscience, Norwich, NR4 7UQ, UK
| | - Simon R Carding
- Quadram Institute Bioscience, Norwich, NR4 7UQ, UK
- Norwich Medical School, University of East Anglia, Norwich, NR4 7TJ, UK
| | | |
Collapse
|
29
|
Yu W, Luo H, Yang J, Zhang S, Jiang H, Zhao X, Hui X, Sun D, Li L, Wei XQ, Lonardi S, Pan W. Comprehensive assessment of 11 de novo HiFi assemblers on complex eukaryotic genomes and metagenomes. Genome Res 2024; 34:326-340. [PMID: 38428994 PMCID: PMC10984382 DOI: 10.1101/gr.278232.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 01/23/2024] [Indexed: 03/03/2024]
Abstract
Pacific Biosciences (PacBio) HiFi sequencing technology generates long reads (>10 kbp) with very high accuracy (<0.01% sequencing error). Although several de novo assembly tools are available for HiFi reads, there are no comprehensive studies on the evaluation of these assemblers. We evaluated the performance of 11 de novo HiFi assemblers on (1) real data for three eukaryotic genomes; (2) 34 synthetic data sets with different ploidy, sequencing coverage levels, heterozygosity rates, and sequencing error rates; (3) one real metagenomic data set; and (4) five synthetic metagenomic data sets with different composition abundance and heterozygosity rates. The 11 assemblers were evaluated using quality assessment tool (QUAST) and benchmarking universal single-copy ortholog (BUSCO). We also used several additional criteria, namely, completion rate, single-copy completion rate, duplicated completion rate, average proportion of largest category, average distance difference, quality value, run-time, and memory utilization. Results show that hifiasm and hifiasm-meta should be the first choice for assembling eukaryotic genomes and metagenomes with HiFi data. We performed a comprehensive benchmarking study of commonly used assemblers on complex eukaryotic genomes and metagenomes. Our study will help the research community to choose the most appropriate assembler for their data and identify possible improvements in assembly algorithms.
Collapse
Affiliation(s)
- Wenjuan Yu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Haohui Luo
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Jinbao Yang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Shengchen Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Heling Jiang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Xianjia Zhao
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou, Henan 450001, China
| | - Xingqi Hui
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou, Henan 450001, China
| | - Da Sun
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Liang Li
- Fruit Research Institute, Fujian Academy of Agricultural Sciences, Fuzhou, Fujian 350002, China
| | - Xiu-Qing Wei
- Fruit Research Institute, Fujian Academy of Agricultural Sciences, Fuzhou, Fujian 350002, China;
| | - Stefano Lonardi
- Department of Computer Science and Engineering, University of California, Riverside, California 92521, USA;
| | - Weihua Pan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China;
| |
Collapse
|
30
|
Qiu Z, Yuan L, Lian CA, Lin B, Chen J, Mu R, Qiao X, Zhang L, Xu Z, Fan L, Zhang Y, Wang S, Li J, Cao H, Li B, Chen B, Song C, Liu Y, Shi L, Tian Y, Ni J, Zhang T, Zhou J, Zhuang WQ, Yu K. BASALT refines binning from metagenomic data and increases resolution of genome-resolved metagenomic analysis. Nat Commun 2024; 15:2179. [PMID: 38467684 PMCID: PMC10928208 DOI: 10.1038/s41467-024-46539-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 03/01/2024] [Indexed: 03/13/2024] Open
Abstract
Metagenomic binning is an essential technique for genome-resolved characterization of uncultured microorganisms in various ecosystems but hampered by the low efficiency of binning tools in adequately recovering metagenome-assembled genomes (MAGs). Here, we introduce BASALT (Binning Across a Series of Assemblies Toolkit) for binning and refinement of short- and long-read sequencing data. BASALT employs multiple binners with multiple thresholds to produce initial bins, then utilizes neural networks to identify core sequences to remove redundant bins and refine non-redundant bins. Using the same assemblies generated from Critical Assessment of Metagenome Interpretation (CAMI) datasets, BASALT produces up to twice as many MAGs as VAMB, DASTool, or metaWRAP. Processing assemblies from a lake sediment dataset, BASALT produces ~30% more MAGs than metaWRAP, including 21 unique class-level prokaryotic lineages. Functional annotations reveal that BASALT can retrieve 47.6% more non-redundant opening-reading frames than metaWRAP. These results highlight the robust handling of metagenomic sequencing data of BASALT.
Collapse
Affiliation(s)
- Zhiguang Qiu
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
| | - Li Yuan
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
- School of Electronic and Computer Engineering, Peking University, Shenzhen, China
- Peng Cheng Laboratory, Shenzhen, China
| | - Chun-Ang Lian
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
| | - Bin Lin
- School of Electronic and Computer Engineering, Peking University, Shenzhen, China
| | - Jie Chen
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
- School of Electronic and Computer Engineering, Peking University, Shenzhen, China
- Peng Cheng Laboratory, Shenzhen, China
| | - Rong Mu
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
| | - Xuejiao Qiao
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
| | - Liyu Zhang
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
| | - Zheng Xu
- Southern University of Sciences and Technology Yantian Hospital, Shenzhen, China
- Institute of Biomedicine and Biotechnology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China
| | - Lu Fan
- Department of Ocean Science and Engineering, Southern University of Science and Technology (SUSTech), Shenzhen, China
| | - Yunzeng Zhang
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, the Ministry of Education of China, Yangzhou University, Yangzhou, China
| | - Shanquan Wang
- Environmental Microbiomics Research Center, School of Environmental Science and Engineering, Sun Yat-Sen University, Guangzhou, China
| | - Junyi Li
- School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, Guangdong, China
| | - Huiluo Cao
- Department of Microbiology, University of Hong Kong, Hong Kong, China
| | - Bing Li
- Shenzhen International Graduate School, Tsinghua University, Shenzhen, China
| | - Baowei Chen
- Guangdong Provincial Key Laboratory of Marine Resources and Coastal Engineering, School of Marine Sciences, Sun Yat-sen University, Zhuhai, China
| | - Chi Song
- Institute of Herbgenomics, Chengdu University of Traditional Chinese Medicine, Chengdu, China
- Wuhan Benagen Technology Co., Ltd, Wuhan, China
| | - Yongxin Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Lili Shi
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
- State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Yonghong Tian
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
- School of Electronic and Computer Engineering, Peking University, Shenzhen, China
- Peng Cheng Laboratory, Shenzhen, China
| | - Jinren Ni
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
- College of Environmental Sciences and Engineering, Key Laboratory of Water and Sediment Sciences, Ministry of Education, Peking University, Beijing, China
| | - Tong Zhang
- Department of Civil Engineering, University of Hong Kong, Hong Kong, China
| | - Jizhong Zhou
- Institute for Environmental Genomics, University of Oklahoma, Norman, OK, USA
| | - Wei-Qin Zhuang
- Department of Civil and Environmental Engineering, Faculty of Engineering, University of Auckland, Auckland, New Zealand
| | - Ke Yu
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China.
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China.
| |
Collapse
|
31
|
Hui X, Yang J, Sun J, Liu F, Pan W. MCSS: microbial community simulator based on structure. Front Microbiol 2024; 15:1358257. [PMID: 38516019 PMCID: PMC10956353 DOI: 10.3389/fmicb.2024.1358257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 02/20/2024] [Indexed: 03/23/2024] Open
Abstract
De novo assembly plays a pivotal role in metagenomic analysis, and the incorporation of third-generation sequencing technology can significantly improve the integrity and accuracy of assembly results. Recently, with advancements in sequencing technology (Hi-Fi, ultra-long), several long-read-based bioinformatic tools have been developed. However, the validation of the performance and reliability of these tools is a crucial concern. To address this gap, we present MCSS (microbial community simulator based on structure), which has the capability to generate simulated microbial community and sequencing datasets based on the structure attributes of real microbiome communities. The evaluation results indicate that it can generate simulated communities that exhibit both diversity and similarity to actual community structures. Additionally, MCSS generates synthetic PacBio Hi-Fi and Oxford Nanopore Technologies (ONT) long reads for the species within the simulated community. This innovative tool provides a valuable resource for benchmarking and refining metagenomic analysis methods. Code available at: https://github.com/panlab-bio/mcss.
Collapse
Affiliation(s)
- Xingqi Hui
- Zhengzhou Research Base, State Key Laboratory of Cotton Biology, School of Agricultural Sciences, Zhengzhou University, Zhengzhou, China
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences (ICR, CAAS), Shenzhen, China
| | - Jinbao Yang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences (ICR, CAAS), Shenzhen, China
- College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Jinhuan Sun
- Key Laboratory of Plant Molecular Physiology, CAS Center for Excellence in Molecular Plant Sciences, Institute of Botany, Chinese Academy of Sciences, Beijing, China
| | - Fang Liu
- Zhengzhou Research Base, State Key Laboratory of Cotton Biology, School of Agricultural Sciences, Zhengzhou University, Zhengzhou, China
- National Key Laboratory of Cotton Bio-Breeding and Integrated Utilization, Institute of Cotton Research, Chinese Academy of Agricultural Sciences (ICR, CAAS), Anyang, China
| | - Weihua Pan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences (ICR, CAAS), Shenzhen, China
| |
Collapse
|
32
|
Ma J, Sun H, Li B, Wu B, Zhang X, Ye L. Horizontal transfer potential of antibiotic resistance genes in wastewater treatment plants unraveled by microfluidic-based mini-metagenomics. JOURNAL OF HAZARDOUS MATERIALS 2024; 465:133493. [PMID: 38228000 DOI: 10.1016/j.jhazmat.2024.133493] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 12/30/2023] [Accepted: 01/08/2024] [Indexed: 01/18/2024]
Abstract
Wastewater treatment plants (WWTPs) are known to harbor antibiotic resistance genes (ARGs), which can potentially spread to the environment and human populations. However, the extent and mechanisms of ARG transfer in WWTPs are not well understood due to the high microbial diversity and limitations of molecular techniques. In this study, we used a microfluidic-based mini-metagenomics approach to investigate the transfer potential and mechanisms of ARGs in activated sludge from WWTPs. Our results show that while diverse ARGs are present in activated sludge, only a few highly similar ARGs are observed across different taxa, indicating limited transfer potential. We identified two ARGs, ermF and tla-1, which occur in a variety of bacterial taxa and may have high transfer potential facilitated by mobile genetic elements. Interestingly, genes that are highly similar to the sequences of these two ARGs, as identified in this study, display varying patterns of abundance across geographic regions. Genes similar to ermF found are widely found in Asia and the Americas, while genes resembling tla-1 are primarily detected in Asia. Genes similar to both genes are barely detected in European WWTPs. These findings shed light on the limited horizontal transfer potential of ARGs in WWTPs and highlight the importance of monitoring specific ARGs in different regions to mitigate the spread of antibiotic resistance.
Collapse
Affiliation(s)
- Jiachen Ma
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Haohao Sun
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China; School of Environmental Science and Engineering, Changzhou University, Changzhou 213164, China
| | - Bing Li
- State Environmental Protection Key Laboratory of Microorganism Application and Risk Control, Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
| | - Bing Wu
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Xuxiang Zhang
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Lin Ye
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China.
| |
Collapse
|
33
|
Law SR, Mathes F, Paten AM, Alexandre PA, Regmi R, Reid C, Safarchi A, Shaktivesh S, Wang Y, Wilson A, Rice SA, Gupta VVSR. Life at the borderlands: microbiomes of interfaces critical to One Health. FEMS Microbiol Rev 2024; 48:fuae008. [PMID: 38425054 PMCID: PMC10977922 DOI: 10.1093/femsre/fuae008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 02/12/2024] [Accepted: 02/27/2024] [Indexed: 03/02/2024] Open
Abstract
Microbiomes are foundational components of the environment that provide essential services relating to food security, carbon sequestration, human health, and the overall well-being of ecosystems. Microbiota exert their effects primarily through complex interactions at interfaces with their plant, animal, and human hosts, as well as within the soil environment. This review aims to explore the ecological, evolutionary, and molecular processes governing the establishment and function of microbiome-host relationships, specifically at interfaces critical to One Health-a transdisciplinary framework that recognizes that the health outcomes of people, animals, plants, and the environment are tightly interconnected. Within the context of One Health, the core principles underpinning microbiome assembly will be discussed in detail, including biofilm formation, microbial recruitment strategies, mechanisms of microbial attachment, community succession, and the effect these processes have on host function and health. Finally, this review will catalogue recent advances in microbiology and microbial ecology methods that can be used to profile microbial interfaces, with particular attention to multi-omic, advanced imaging, and modelling approaches. These technologies are essential for delineating the general and specific principles governing microbiome assembly and functions, mapping microbial interconnectivity across varying spatial and temporal scales, and for the establishment of predictive frameworks that will guide the development of targeted microbiome-interventions to deliver One Health outcomes.
Collapse
Affiliation(s)
- Simon R Law
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Agriculture and Food, Canberra, ACT 2601, Australia
| | - Falko Mathes
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Environment, Floreat, WA 6014, Australia
| | - Amy M Paten
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Environment, Canberra, ACT 2601, Australia
| | - Pamela A Alexandre
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Agriculture and Food, St Lucia, Qld 4072, Australia
| | - Roshan Regmi
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Agriculture and Food, Urrbrae, SA 5064, Australia
| | - Cameron Reid
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Environment, Urrbrae, SA 5064, Australia
| | - Azadeh Safarchi
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Health and Biosecurity, Westmead, NSW 2145, Australia
| | - Shaktivesh Shaktivesh
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Data 61, Clayton, Vic 3168, Australia
| | - Yanan Wang
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Health and Biosecurity, Adelaide SA 5000, Australia
| | - Annaleise Wilson
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Health and Biosecurity, Geelong, Vic 3220, Australia
| | - Scott A Rice
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Agriculture, and Food, Westmead, NSW 2145, Australia
| | - Vadakattu V S R Gupta
- CSIRO MOSH-Future Science Platform, Australia
- CSIRO Agriculture and Food, Urrbrae, SA 5064, Australia
| |
Collapse
|
34
|
Vancaester E, Blaxter ML. MarkerScan: Separation and assembly of cobionts sequenced alongside target species in biodiversity genomics projects. Wellcome Open Res 2024; 9:33. [PMID: 38617467 PMCID: PMC11016177 DOI: 10.12688/wellcomeopenres.20730.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/18/2023] [Indexed: 04/16/2024] Open
Abstract
Contamination of public databases by mislabelled sequences has been highlighted for many years and the avalanche of novel sequencing data now being deposited has the potential to make databases difficult to use effectively. It is therefore crucial that sequencing projects and database curators perform pre-submission checks to remove obvious contamination and avoid propagating erroneous taxonomic relationships. However, it is important also to recognise that biological contamination of a target sample with unexpected species' DNA can also lead to the discovery of fascinating biological phenomena through the identification of environmental organisms or endosymbionts. Here, we present a novel, integrated method for detection and generation of high-quality genomes of all non-target genomes co-sequenced in eukaryotic genome sequencing projects. After performing taxonomic profiling of an assembly from the raw data, and leveraging the identity of small rRNA sequences discovered therein as markers, a targeted classification approach retrieves and assembles high-quality genomes. The genomes of these cobionts are then not only removed from the target species' genome but also available for further interrogation. Source code is available from https://github.com/CobiontID/MarkerScan. MarkerScan is written in Python and is deployed as a Docker container.
Collapse
Affiliation(s)
| | - Mark L. Blaxter
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| |
Collapse
|
35
|
Hosokawa M, Nishikawa Y. Tools for microbial single-cell genomics for obtaining uncultured microbial genomes. Biophys Rev 2024; 16:69-77. [PMID: 38495448 PMCID: PMC10937852 DOI: 10.1007/s12551-023-01124-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 08/23/2023] [Indexed: 03/19/2024] Open
Abstract
The advent of next-generation sequencing technologies has facilitated the acquisition of large amounts of DNA sequence data at a relatively low cost, leading to numerous breakthroughs in decoding microbial genomes. Among the various genome sequencing activities, metagenomic analysis, which entails the direct analysis of uncultured microbial DNA, has had a profound impact on microbiome research and has emerged as an indispensable technology in this field. Despite its valuable contributions, metagenomic analysis is a "bulk analysis" technique that analyzes samples containing a wide diversity of microbes, such as bacteria, yielding information that is averaged across the entire microbial population. In order to gain a deeper understanding of the heterogeneous nature of the microbial world, there is a growing need for single-cell analysis, similar to its use in human cell biology. With this paradigm shift in mind, comprehensive single-cell genomics technology has become a much-anticipated innovation that is now poised to revolutionize microbiome research. It has the potential to enable the discovery of differences at the strain level and to facilitate a more comprehensive examination of microbial ecosystems. In this review, we summarize the current state-of-the-art in microbial single-cell genomics, highlighting the potential impact of this technology on our understanding of the microbial world. The successful implementation of this technology is expected to have a profound impact in the field, leading to new discoveries and insights into the diversity and evolution of microbes.
Collapse
Affiliation(s)
- Masahito Hosokawa
- Department of Life Science and Medical Bioscience, Waseda University, 2-2 Wakamatsu-Cho, Shinjuku-Ku, Tokyo, 162-8480 Japan
- Computational Bio Big-Data Open Innovation Laboratory, National Institute of Advanced Industrial Science and Technology, 3-4-1 Okubo, Shinjuku-Ku, Tokyo, 169-8555 Japan
- Research Organization for Nano and Life Innovation, Waseda University, 513 Wasedatsurumaki-Cho, Shinjuku-Ku, Tokyo, 162-0041 Japan
- Institute for Advanced Research of Biosystem Dynamics, Waseda Research Institute for Science and Engineering, 3-4-1 Okubo, Shinjuku-Ku, Tokyo, 169-8555 Japan
- bitBiome, Inc., 513 Wasedatsurumaki-Cho, Shinjuku-Ku, Tokyo, 162-0041 Japan
| | - Yohei Nishikawa
- Computational Bio Big-Data Open Innovation Laboratory, National Institute of Advanced Industrial Science and Technology, 3-4-1 Okubo, Shinjuku-Ku, Tokyo, 169-8555 Japan
- Research Organization for Nano and Life Innovation, Waseda University, 513 Wasedatsurumaki-Cho, Shinjuku-Ku, Tokyo, 162-0041 Japan
| |
Collapse
|
36
|
Babajanyan SG, Garushyants SK, Wolf YI, Koonin EV. Microbial diversity and ecological complexity emerging from environmental variation and horizontal gene transfer in a simple mathematical model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.17.576128. [PMID: 38313259 PMCID: PMC10836074 DOI: 10.1101/2024.01.17.576128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2024]
Abstract
Microbiomes are generally characterized by high diversity of coexisting microbial species and strains that remains stable within a broad range of conditions. However, under fixed conditions, microbial ecology conforms with the exclusion principle under which two populations competing for the same resource within the same niche cannot coexist because the less fit population inevitably goes extinct. To explore the conditions for stabilization of microbial diversity, we developed a simple mathematical model consisting of two competing populations that could exchange a single gene allele via horizontal gene transfer (HGT). We found that, although in a fixed environment, with unbiased HGT, the system obeyed the exclusion principle, in an oscillating environment, within large regions of the phase space bounded by the rates of reproduction and HGT, the two populations coexist. Moreover, depending on the parameter combination, all three major types of symbiosis obtained, namely, pure competition, host-parasite relationship and mutualism. In each of these regimes, certain parameter combinations provided for synergy, that is, a greater total abundance of both populations compared to the abundance of the winning population in the fixed environments. These findings show that basic phenomena that are universal in microbial communities, environmental variation and HGT, provide for stabilization of microbial diversity and ecological complexity.
Collapse
Affiliation(s)
- Sanasar G. Babajanyan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Sofya K. Garushyants
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Yuri I. Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
37
|
Xu Y, Leung SKK, Li TMW, Yung CCM. Hidden genomic diversity drives niche partitioning in a cosmopolitan eukaryotic picophytoplankton. THE ISME JOURNAL 2024; 18:wrae163. [PMID: 39141834 PMCID: PMC11409870 DOI: 10.1093/ismejo/wrae163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Revised: 08/01/2024] [Accepted: 08/13/2024] [Indexed: 08/16/2024]
Abstract
Marine eukaryotic phytoplankton are fundamental to the marine food web, yet the lack of reference genomes or just a single genome representing a taxon has led to an underestimation of their taxonomic, adaptive, and functional diversity. Here, we integrated strain isolation with metagenomic binning to recover genomes from the cosmopolitan picophytoplankton genus Bathycoccus, traditionally considered monospecific. Our recovery and analysis of 37 Bathycoccus genomes delineated their global genomic diversity and established four evolutionary clades (BI, BII, BIII, BIV). Our metagenomic abundance survey revealed well-differentiated ecological niches and distinct biogeographic distributions for each clade, predominantly shaped by temperature, salinity, and nutrient availability. Comparative genomics analyses further revealed clade-specific genomic traits that underpin niche adaptation and contribute to the global prevalence of Bathycoccus. Our findings underscore temperature as a major driver of genome diversification in this genus, with clade divergences coinciding with major paleoclimatic events that influenced their contemporary thermal niches. Moreover, the unique enrichment of C2H2 zinc finger and ankyrin repeat gene families in polar-adapted clades suggests previously unrecognized cold-adaptation mechanisms in marine eukaryotic phytoplankton. Our study offers a comprehensive genomic landscape of this crucial eukaryotic picophytoplankton, providing insights into their microdiversity and adaptive evolution in response to changing environments.
Collapse
Affiliation(s)
- Yangbing Xu
- Department of Ocean Science, The Hong Kong University of Science and Technology, Hong Kong SAR
| | - Shara K K Leung
- Department of Ocean Science, The Hong Kong University of Science and Technology, Hong Kong SAR
| | - Taylor M W Li
- Department of Ocean Science, The Hong Kong University of Science and Technology, Hong Kong SAR
| | - Charmaine C M Yung
- Department of Ocean Science, The Hong Kong University of Science and Technology, Hong Kong SAR
| |
Collapse
|
38
|
Haft DH, Badretdin A, Coulouris G, DiCuccio M, Durkin A, Jovenitti E, Li W, Mersha M, O’Neill K, Virothaisakun J, Thibaud-Nissen F. RefSeq and the prokaryotic genome annotation pipeline in the age of metagenomes. Nucleic Acids Res 2024; 52:D762-D769. [PMID: 37962425 PMCID: PMC10767926 DOI: 10.1093/nar/gkad988] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/13/2023] [Accepted: 10/18/2023] [Indexed: 11/15/2023] Open
Abstract
The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) contains over 315 000 bacterial and archaeal genomes and 236 million proteins with up-to-date and consistent annotation. In the past 3 years, we have expanded the diversity of the RefSeq collection by including the best quality metagenome-assembled genomes (MAGs) submitted to INSDC (DDBJ, ENA and GenBank), while maintaining its quality by adding validation checks. Assemblies are now more stringently evaluated for contamination and for completeness of annotation prior to acceptance into RefSeq. MAGs now account for over 17000 assemblies in RefSeq, split over 165 orders and 362 families. Changes in the Prokaryotic Genome Annotation Pipeline (PGAP), which is used to annotate nearly all RefSeq assemblies include better detection of protein-coding genes. Nearly 83% of RefSeq proteins are now named by a curated Protein Family Model, a 4.7% increase in the past three years ago. In addition to literature citations, Enzyme Commission numbers, and gene symbols, Gene Ontology terms are now assigned to 48% of RefSeq proteins, allowing for easier multi-genome comparison. RefSeq is found at https://www.ncbi.nlm.nih.gov/refseq/. PGAP is available as a stand-alone tool able to produce GenBank-ready files at https://github.com/ncbi/pgap.
Collapse
Affiliation(s)
- Daniel H Haft
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Azat Badretdin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - George Coulouris
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Michael DiCuccio
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - A Scott Durkin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Eric Jovenitti
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Wenjun Li
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Megdelawit Mersha
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Kathleen R O’Neill
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Joel Virothaisakun
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
39
|
Cerk K, Ugalde‐Salas P, Nedjad CG, Lecomte M, Muller C, Sherman DJ, Hildebrand F, Labarthe S, Frioux C. Community-scale models of microbiomes: Articulating metabolic modelling and metagenome sequencing. Microb Biotechnol 2024; 17:e14396. [PMID: 38243750 PMCID: PMC10832553 DOI: 10.1111/1751-7915.14396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 11/27/2023] [Accepted: 12/20/2023] [Indexed: 01/21/2024] Open
Abstract
Building models is essential for understanding the functions and dynamics of microbial communities. Metabolic models built on genome-scale metabolic network reconstructions (GENREs) are especially relevant as a means to decipher the complex interactions occurring among species. Model reconstruction increasingly relies on metagenomics, which permits direct characterisation of naturally occurring communities that may contain organisms that cannot be isolated or cultured. In this review, we provide an overview of the field of metabolic modelling and its increasing reliance on and synergy with metagenomics and bioinformatics. We survey the means of assigning functions and reconstructing metabolic networks from (meta-)genomes, and present the variety and mathematical fundamentals of metabolic models that foster the understanding of microbial dynamics. We emphasise the characterisation of interactions and the scaling of model construction to large communities, two important bottlenecks in the applicability of these models. We give an overview of the current state of the art in metagenome sequencing and bioinformatics analysis, focusing on the reconstruction of genomes in microbial communities. Metagenomics benefits tremendously from third-generation sequencing, and we discuss the opportunities of long-read sequencing, strain-level characterisation and eukaryotic metagenomics. We aim at providing algorithmic and mathematical support, together with tool and application resources, that permit bridging the gap between metagenomics and metabolic modelling.
Collapse
Affiliation(s)
- Klara Cerk
- Quadram Institute BioscienceNorwichUK
- Earlham InstituteNorwichUK
| | | | - Chabname Ghassemi Nedjad
- Inria, University of Bordeaux, INRAETalenceFrance
- University of Bordeaux, CNRS, Bordeaux INP, LaBRI, UMR 5800TalenceFrance
| | - Maxime Lecomte
- Inria, University of Bordeaux, INRAETalenceFrance
- INRAE STLO¸University of RennesRennesFrance
| | | | | | - Falk Hildebrand
- Quadram Institute BioscienceNorwichUK
- Earlham InstituteNorwichUK
| | - Simon Labarthe
- Inria, University of Bordeaux, INRAETalenceFrance
- INRAE, University of Bordeaux, BIOGECO, UMR 1202CestasFrance
| | | |
Collapse
|
40
|
Burz SD, Causevic S, Dal Co A, Dmitrijeva M, Engel P, Garrido-Sanz D, Greub G, Hapfelmeier S, Hardt WD, Hatzimanikatis V, Heiman CM, Herzog MKM, Hockenberry A, Keel C, Keppler A, Lee SJ, Luneau J, Malfertheiner L, Mitri S, Ngyuen B, Oftadeh O, Pacheco AR, Peaudecerf F, Resch G, Ruscheweyh HJ, Sahin A, Sanders IR, Slack E, Sunagawa S, Tackmann J, Tecon R, Ugolini GS, Vacheron J, van der Meer JR, Vayena E, Vonaesch P, Vorholt JA. From microbiome composition to functional engineering, one step at a time. Microbiol Mol Biol Rev 2023; 87:e0006323. [PMID: 37947420 PMCID: PMC10732080 DOI: 10.1128/mmbr.00063-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023] Open
Abstract
SUMMARYCommunities of microorganisms (microbiota) are present in all habitats on Earth and are relevant for agriculture, health, and climate. Deciphering the mechanisms that determine microbiota dynamics and functioning within the context of their respective environments or hosts (the microbiomes) is crucially important. However, the sheer taxonomic, metabolic, functional, and spatial complexity of most microbiomes poses substantial challenges to advancing our knowledge of these mechanisms. While nucleic acid sequencing technologies can chart microbiota composition with high precision, we mostly lack information about the functional roles and interactions of each strain present in a given microbiome. This limits our ability to predict microbiome function in natural habitats and, in the case of dysfunction or dysbiosis, to redirect microbiomes onto stable paths. Here, we will discuss a systematic approach (dubbed the N+1/N-1 concept) to enable step-by-step dissection of microbiome assembly and functioning, as well as intervention procedures to introduce or eliminate one particular microbial strain at a time. The N+1/N-1 concept is informed by natural invasion events and selects culturable, genetically accessible microbes with well-annotated genomes to chart their proliferation or decline within defined synthetic and/or complex natural microbiota. This approach enables harnessing classical microbiological and diversity approaches, as well as omics tools and mathematical modeling to decipher the mechanisms underlying N+1/N-1 microbiota outcomes. Application of this concept further provides stepping stones and benchmarks for microbiome structure and function analyses and more complex microbiome intervention strategies.
Collapse
Affiliation(s)
- Sebastian Dan Burz
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | - Senka Causevic
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | - Alma Dal Co
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
| | - Marija Dmitrijeva
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Philipp Engel
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | - Daniel Garrido-Sanz
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | - Gilbert Greub
- Institut de microbiologie, CHUV University Hospital Lausanne, Lausanne, Switzerland
| | | | | | | | - Clara Margot Heiman
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | | | | | - Christoph Keel
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | | | - Soon-Jae Lee
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Julien Luneau
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
| | - Lukas Malfertheiner
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Sara Mitri
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | - Bidong Ngyuen
- Institute of Microbiology, ETH Zürich, Zürich, Switzerland
| | - Omid Oftadeh
- Laboratory of Computational Systems Biotechnology, EPF Lausanne, Lausanne, Switzerland
| | | | | | - Grégory Resch
- Center for Research and Innovation in Clinical Pharmaceutical Sciences, CHUV University Hospital Lausanne, Lausanne, Switzerland
| | | | - Asli Sahin
- Laboratory of Computational Systems Biotechnology, EPF Lausanne, Lausanne, Switzerland
| | - Ian R. Sanders
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Emma Slack
- Department of Health Sciences and Technology, ETH Zürich, Zürich, Switzerland
| | | | - Janko Tackmann
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Robin Tecon
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | | | - Jordan Vacheron
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | | | - Evangelia Vayena
- Laboratory of Computational Systems Biotechnology, EPF Lausanne, Lausanne, Switzerland
| | - Pascale Vonaesch
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
| | | |
Collapse
|
41
|
Hamilton T, Joris BR, Shrestha A, Browne TS, Rodrigue S, Karas BJ, Gloor GB, Edgell DR. De Novo Synthesis of a Conjugative System from Human Gut Metagenomic Data for Targeted Delivery of Cas9 Antimicrobials. ACS Synth Biol 2023; 12:3578-3590. [PMID: 38049144 PMCID: PMC10729033 DOI: 10.1021/acssynbio.3c00319] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 11/03/2023] [Accepted: 11/08/2023] [Indexed: 12/06/2023]
Abstract
Metagenomic sequences represent an untapped source of genetic novelty, particularly for conjugative systems that could be used for plasmid-based delivery of Cas9-derived antimicrobial agents. However, unlocking the functional potential of conjugative systems purely from metagenomic sequences requires the identification of suitable candidate systems as starting scaffolds for de novo DNA synthesis. Here, we developed a bioinformatics approach that searches through the metagenomic "trash bin" for genes associated with conjugative systems present on contigs that are typically excluded from common metagenomic analysis pipelines. Using a human metagenomic gut data set representing 2805 taxonomically distinct units, we identified 1598 contigs containing conjugation genes with a differential distribution in human cohorts. We synthesized de novo an entire Citrobacter spp. conjugative system of 54 kb containing at least 47 genes and assembled it into a plasmid, pCitro. We found that pCitro conjugates from Escherichia coli to Citrobacter rodentium with a 30-fold higher frequency than to E. coli, and is compatible with Citrobacter resident plasmids. Mutations in the traV and traY conjugation components of pCitro inhibited conjugation. We showed that pCitro can be repurposed as an antimicrobial delivery agent by programming it with the TevCas9 nuclease and Citrobacter-specific sgRNAs to kill C. rodentium. Our study reveals a trove of uncharacterized conjugative systems in metagenomic data and describes an experimental framework to animate these large genetic systems as novel target-adapted delivery vectors for Cas9-based editing of bacterial genomes.
Collapse
Affiliation(s)
- Thomas
A. Hamilton
- Department
of Biochemistry, Schulich School of Medicine & Dentistry, Western University, London N6A 5C1, ON, Canada
| | - Benjamin R. Joris
- Department
of Biochemistry, Schulich School of Medicine & Dentistry, Western University, London N6A 5C1, ON, Canada
| | - Arina Shrestha
- Department
of Biochemistry, Schulich School of Medicine & Dentistry, Western University, London N6A 5C1, ON, Canada
| | - Tyler S. Browne
- Department
of Biochemistry, Schulich School of Medicine & Dentistry, Western University, London N6A 5C1, ON, Canada
| | - Sébastien Rodrigue
- Départment
de Biologie, Université de Sherbrooke, Sherbrooke J1K 2R1, QC, Canada
| | - Bogumil J. Karas
- Department
of Biochemistry, Schulich School of Medicine & Dentistry, Western University, London N6A 5C1, ON, Canada
| | - Gregory B. Gloor
- Department
of Biochemistry, Schulich School of Medicine & Dentistry, Western University, London N6A 5C1, ON, Canada
| | - David R. Edgell
- Department
of Biochemistry, Schulich School of Medicine & Dentistry, Western University, London N6A 5C1, ON, Canada
| |
Collapse
|
42
|
Sun Y, Wang M, Cao L, Seim I, Zhou L, Chen J, Wang H, Zhong Z, Chen H, Fu L, Li M, Li C, Sun S. Mosaic environment-driven evolution of the deep-sea mussel Gigantidas platifrons bacterial endosymbiont. MICROBIOME 2023; 11:253. [PMID: 37974296 PMCID: PMC10652631 DOI: 10.1186/s40168-023-01695-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 10/11/2023] [Indexed: 11/19/2023]
Abstract
BACKGROUND The within-species diversity of symbiotic bacteria represents an important genetic resource for their environmental adaptation, especially for horizontally transmitted endosymbionts. Although strain-level intraspecies variation has recently been detected in many deep-sea endosymbionts, their ecological role in environmental adaptation, their genome evolution pattern under heterogeneous geochemical environments, and the underlying molecular forces remain unclear. RESULTS Here, we conducted a fine-scale metagenomic analysis of the deep-sea mussel Gigantidas platifrons bacterial endosymbiont collected from distinct habitats: hydrothermal vent and methane seep. Endosymbiont genomes were assembled using a pipeline that distinguishes within-species variation and revealed highly heterogeneous compositions in mussels from different habitats. Phylogenetic analysis separated the assemblies into three distinct environment-linked clades. Their functional differentiation follows a mosaic evolutionary pattern. Core genes, essential for central metabolic function and symbiosis, were conserved across all clades. Clade-specific genes associated with heavy metal resistance, pH homeostasis, and nitrate utilization exhibited signals of accelerated evolution. Notably, transposable elements and plasmids contributed to the genetic reshuffling of the symbiont genomes and likely accelerated adaptive evolution through pseudogenization and the introduction of new genes. CONCLUSIONS The current study uncovers the environment-driven evolution of deep-sea symbionts mediated by mobile genetic elements. Its findings highlight a potentially common and critical role of within-species diversity in animal-microbiome symbioses. Video Abstract.
Collapse
Affiliation(s)
- Yan Sun
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China
| | - Minxiao Wang
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China
| | - Lei Cao
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China
| | - Inge Seim
- Integrative Biology Laboratory, College of Life Sciences, Nanjing Normal University, Nanjing, 210046, China
- School of Biology and Environmental Science, Queensland University of Technology, Brisbane, QLD, 4000, Australia
| | - Li Zhou
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China
| | - Jianwei Chen
- BGI Research-Qingdao, BGI, Qingdao, 266555, China
| | - Hao Wang
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China
| | - Zhaoshan Zhong
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China
| | - Hao Chen
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China
| | - Lulu Fu
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China
| | - Mengna Li
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China
| | - Chaolun Li
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China.
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China.
- South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510301, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Song Sun
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, and Center of Deep Sea Research, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China.
- Laboratory for Marine Ecology and Environmental Science, Laoshan Laboratory, Qingdao, 266237, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
43
|
Fu S, Zhang Y, Wang R, Qiu Z, Song W, Yang Q, Shen L. A novel culture-enriched metagenomic sequencing strategy effectively guarantee the microbial safety of drinking water by uncovering the low abundance pathogens. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 345:118737. [PMID: 37657296 DOI: 10.1016/j.jenvman.2023.118737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 07/21/2023] [Accepted: 07/31/2023] [Indexed: 09/03/2023]
Abstract
Assessing the presence of waterborne pathogens and antibiotic resistance genes (ARGs) is crucial for managing the environmental quality of drinking water sources. However, detecting low abundance pathogens in such settings is challenging. In this study, a workflow was developed to enrich for broad spectrum pathogens from drinking water samples. A mock community was used to evaluate the effectiveness of various enrichment broths in detecting low-abundance pathogens. Monthly metagenomic surveillance was conducted in a drinking water source from May to September 2021, and water samples were subjected to five enrichment procedures for 6 h to recover the majority of waterborne bacterial pathogens. Oxford Nanopore Technology (ONT) was used for metagenomic sequencing of enriched samples to obtain high-quality pathogen genomes. The results showed that selective enrichment significantly increased the proportions of targeted bacterial pathogens. Compared to direct metagenomic sequencing of untreated water samples, targeted enrichment followed by ONT sequencing significantly improved the detection of waterborne pathogens and the quality of metagenome-assembled genomes (MAGs). Eighty-six high-quality MAGs, including 70 pathogen MAGs, were obtained from ONT sequencing, while only 12 MAGs representing 10 species were obtained from direct metagenomic sequencing of untreated water samples. In addition, ONT sequencing improved the recovery of mobile genetic elements and the accuracy of phylogenetic analysis. This study highlights the urgent need for efficient methodologies to detect and manage microbial risks in drinking water sources. The developed workflow provides a cost-effective approach for environmental management of drinking water sources with microbial risks. The study also uncovered pathogens that were not detected by traditional methods, thereby advancing microbial risk management of drinking water sources.
Collapse
Affiliation(s)
- Songzhe Fu
- Key Laboratory of Resource Biology and Biotechnology in Western China, Ministry of Education, Northwest University, Xi'an, 710069, China; Key Laboratory of Environment Controlled Aquaculture (Dalian Ocean University), Ministry of Education, 116023, China.
| | - Yixiang Zhang
- CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences. Shanghai, China; University of Chinese Academy of Sciences, Shanghai, China
| | - Rui Wang
- Key Laboratory of Environment Controlled Aquaculture (Dalian Ocean University), Ministry of Education, 116023, China
| | - Zhiguang Qiu
- School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, 518055, China
| | - Weizhi Song
- School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, SAR, Hong Kong, China
| | - Qian Yang
- Center for Microbial Ecology and Technology, Ghent University, Ghent, Belgium
| | - Lixin Shen
- Key Laboratory of Resource Biology and Biotechnology in Western China, Ministry of Education, Northwest University, Xi'an, 710069, China.
| |
Collapse
|
44
|
Rojas CA, Gardy J, Eisen JA, Ganz HH. Recovery of 52 bacterial genomes from the fecal microbiome of the domestic cat ( Felis catus) using Hi-C proximity ligation and shotgun metagenomics. Microbiol Resour Announc 2023; 12:e0060123. [PMID: 37695121 PMCID: PMC10586161 DOI: 10.1128/mra.00601-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 08/08/2023] [Indexed: 09/12/2023] Open
Abstract
We used Hi-C proximity ligation with shotgun sequencing to retrieve metagenome-assembled genomes (MAGs) from the fecal microbiomes of two domestic cats (Felis catus). The genomes were assessed for completeness and contamination, classified taxonomically, and annotated for putative antimicrobial resistance (AMR) genes.
Collapse
Affiliation(s)
| | - Jennifer Gardy
- Bill & Melinda Gates Foundation, Seattle, Washington, USA
| | - Jonathan A. Eisen
- Evolution and Ecology, University of California, Davis, California, USA
| | | |
Collapse
|
45
|
Li K, Xu P, Wang J, Yi X, Jiao Y. Identification of errors in draft genome assemblies at single-nucleotide resolution for quality assessment and improvement. Nat Commun 2023; 14:6556. [PMID: 37848433 PMCID: PMC10582259 DOI: 10.1038/s41467-023-42336-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 10/05/2023] [Indexed: 10/19/2023] Open
Abstract
Assembly of a high-quality genome is important for downstream comparative and functional genomic studies. However, most tools for genome assembly assessment only give qualitative reports, which do not pinpoint assembly errors at specific regions. Here, we develop a new reference-free tool, Clipping information for Revealing Assembly Quality (CRAQ), which maps raw reads back to assembled sequences to identify regional and structural assembly errors based on effective clipped alignment information. Error counts are transformed into corresponding assembly evaluation indexes to reflect the assembly quality at single-nucleotide resolution. Notably, CRAQ distinguishes assembly errors from heterozygous sites or structural differences between haplotypes. This tool can clearly indicate low-quality regions and potential structural error breakpoints; thus, it can identify misjoined regions that should be split for further scaffold building and improvement of the assembly. We have benchmarked CRAQ on multiple genomes assembled using different strategies, and demonstrated the misjoin correction for improving the constructed pseudomolecules.
Collapse
Affiliation(s)
- Kunpeng Li
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Peng Xu
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jinpeng Wang
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xin Yi
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing, China
- China National Botanical Garden, Beijing, China
| | - Yuannian Jiao
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
- China National Botanical Garden, Beijing, China.
| |
Collapse
|
46
|
Du Y, Sun F. MetaCC allows scalable and integrative analyses of both long-read and short-read metagenomic Hi-C data. Nat Commun 2023; 14:6231. [PMID: 37802989 PMCID: PMC10558524 DOI: 10.1038/s41467-023-41209-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 08/25/2023] [Indexed: 10/08/2023] Open
Abstract
Metagenomic Hi-C (metaHi-C) can identify contig-to-contig relationships with respect to their proximity within the same physical cell. Shotgun libraries in metaHi-C experiments can be constructed by next-generation sequencing (short-read metaHi-C) or more recent third-generation sequencing (long-read metaHi-C). However, all existing metaHi-C analysis methods are developed and benchmarked on short-read metaHi-C datasets and there exists much room for improvement in terms of more scalable and stable analyses, especially for long-read metaHi-C data. Here we report MetaCC, an efficient and integrative framework for analyzing both short-read and long-read metaHi-C datasets. MetaCC outperforms existing methods on normalization and binning. In particular, the MetaCC normalization module, named NormCC, is more than 3000 times faster than the current state-of-the-art method HiCzin on a complex wastewater dataset. When applied to one sheep gut long-read metaHi-C dataset, MetaCC binning module can retrieve 709 high-quality genomes with the largest species diversity using one single sample, including an expansion of five uncultured members from the order Erysipelotrichales, and is the only binner that can recover the genome of one important species Bacteroides vulgatus. Further plasmid analyses reveal that MetaCC binning is able to capture multi-copy plasmids.
Collapse
Affiliation(s)
- Yuxuan Du
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Fengzhu Sun
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
47
|
Ahmad N, Ritz M, Calchera A, Otte J, Schmitt I, Brueck T, Mehlmer N. Biosynthetic gene cluster synteny: Orthologous polyketide synthases in Hypogymnia physodes, Hypogymnia tubulosa, and Parmelia sulcata. Microbiologyopen 2023; 12:e1386. [PMID: 37877655 PMCID: PMC10582450 DOI: 10.1002/mbo3.1386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 09/15/2023] [Accepted: 09/26/2023] [Indexed: 10/26/2023] Open
Abstract
Lichens are symbiotic associations consisting of a photobiont (algae or cyanobacteria) and a mycobiont (fungus), which together generate a variety of unique secondary metabolites. To access this biosynthetic potential for biotechnological applications, deeper insights into the biosynthetic pathways and corresponding gene clusters are necessary. Here, we provide a comparative view of the biosynthetic gene clusters of three lichen mycobionts derived from Hypogymnia physodes, Hypogymnia tubulosa, and Parmelia sulcata. In addition, we present a high-quality PacBio metagenome of Parmelia sulcata, from which we extracted the mycobiont bin containing 214 biosynthetic gene clusters. Most biosynthetic gene clusters in these genomes were associated with T1PKSs, followed by NRPSs and terpenes. This study focused on biosynthetic gene clusters related to polyketide synthesis. Based on ketosynthase homology, we identified nine highly syntenic clusters present in all three species. Among the four clusters belonging to nonreducing PKSs, two are putatively linked to lichen substances derived from orsellinic acid (orcinol depsides and depsidones, e.g., lecanoric acid, physodic acid, lobaric acid), one to compounds derived from methylated forms of orsellinic acid (beta orcinol depsides, e.g., atranorin), and one to melanins. Five clusters with orthologs in all three species are linked to reducing PKSs. Our study contributes to sorting and dereplicating the vast PKS diversity found in lichenized fungi. High-quality sequences of biosynthetic gene clusters of these three common species provide a foundation for further exploration into biotechnological applications and the molecular evolution of lichen substances.
Collapse
Affiliation(s)
- Nadim Ahmad
- Department of Chemistry, Werner Siemens Chair of Synthetic Biotechnology, TUM School of Natural SciencesTechnical University of Munich (TUM)GarchingGermany
| | - Manfred Ritz
- Department of Chemistry, Werner Siemens Chair of Synthetic Biotechnology, TUM School of Natural SciencesTechnical University of Munich (TUM)GarchingGermany
| | - Anjuli Calchera
- Senckenberg Biodiversity and Climate Research Centre (SBiK‐F)Frankfurt am MainGermany
| | - Jürgen Otte
- Senckenberg Biodiversity and Climate Research Centre (SBiK‐F)Frankfurt am MainGermany
| | - Imke Schmitt
- Senckenberg Biodiversity and Climate Research Centre (SBiK‐F)Frankfurt am MainGermany
- Institute of Ecology, Evolution and DiversityGoethe University FrankfurtFrankfurt am MainGermany
| | - Thomas Brueck
- Department of Chemistry, Werner Siemens Chair of Synthetic Biotechnology, TUM School of Natural SciencesTechnical University of Munich (TUM)GarchingGermany
| | - Norbert Mehlmer
- Department of Chemistry, Werner Siemens Chair of Synthetic Biotechnology, TUM School of Natural SciencesTechnical University of Munich (TUM)GarchingGermany
| |
Collapse
|
48
|
Ho H, Chovatia M, Egan R, He G, Yoshinaga Y, Liachko I, O’Malley R, Wang Z. Integrating chromatin conformation information in a self-supervised learning model improves metagenome binning. PeerJ 2023; 11:e16129. [PMID: 37753177 PMCID: PMC10519199 DOI: 10.7717/peerj.16129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 08/28/2023] [Indexed: 09/28/2023] Open
Abstract
Metagenome binning is a key step, downstream of metagenome assembly, to group scaffolds by their genome of origin. Although accurate binning has been achieved on datasets containing multiple samples from the same community, the completeness of binning is often low in datasets with a small number of samples due to a lack of robust species co-abundance information. In this study, we exploited the chromatin conformation information obtained from Hi-C sequencing and developed a new reference-independent algorithm, Metagenome Binning with Abundance and Tetra-nucleotide frequencies-Long Range (metaBAT-LR), to improve the binning completeness of these datasets. This self-supervised algorithm builds a model from a set of high-quality genome bins to predict scaffold pairs that are likely to be derived from the same genome. Then, it applies these predictions to merge incomplete genome bins, as well as recruit unbinned scaffolds. We validated metaBAT-LR's ability to bin-merge and recruit scaffolds on both synthetic and real-world metagenome datasets of varying complexity. Benchmarking against similar software tools suggests that metaBAT-LR uncovers unique bins that were missed by all other methods. MetaBAT-LR is open-source and is available at https://bitbucket.org/project-metabat/metabat-lr.
Collapse
Affiliation(s)
- Harrison Ho
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, United States
- School of Natural Sciences, University of California, Merced, CA, United States
| | - Mansi Chovatia
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, United States
| | - Rob Egan
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, United States
| | - Guifen He
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, United States
| | - Yuko Yoshinaga
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, United States
| | | | - Ronan O’Malley
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Lab, Berkeley, CA, United States
| | - Zhong Wang
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, United States
- School of Natural Sciences, University of California, Merced, CA, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Lab, Berkeley, CA, United States
| |
Collapse
|
49
|
Li S, Lian WH, Han JR, Ali M, Lin ZL, Liu YH, Li L, Zhang DY, Jiang XZ, Li WJ, Dong L. Capturing the microbial dark matter in desert soils using culturomics-based metagenomics and high-resolution analysis. NPJ Biofilms Microbiomes 2023; 9:67. [PMID: 37736746 PMCID: PMC10516943 DOI: 10.1038/s41522-023-00439-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 09/14/2023] [Indexed: 09/23/2023] Open
Abstract
Deserts occupy one-third of the Earth's terrestrial surface and represent a potentially significant reservoir of microbial biodiversity, yet the majority of desert microorganisms remain uncharacterized and are seen as "microbial dark matter". Here, we introduce a multi-omics strategy, culturomics-based metagenomics (CBM) that integrates large-scale cultivation, full-length 16S rRNA gene amplicon, and shotgun metagenomic sequencing. The results showed that CBM captured a significant amount of taxonomic and functional diversity missed in direct sequencing by increasing the recovery of amplicon sequence variants (ASVs) and high/medium-quality metagenome-assembled genomes (MAGs). Importantly, CBM allowed the post hoc recovery of microbes of interest (e.g., novel or specific taxa), even those with extremely low abundance in the culture. Furthermore, strain-level analyses based on CBM and direct sequencing revealed that the desert soils harbored a considerable number of novel bacterial candidates (1941, 51.4%), of which 1095 (from CBM) were culturable. However, CBM would not exactly reflect the relative abundance of true microbial composition and functional pathways in the in situ environment, and its use coupled with direct metagenomic sequencing could provide greater insight into desert microbiomes. Overall, this study exemplifies the CBM strategy with high-resolution is an ideal way to deeply explore the untapped novel bacterial resources in desert soils, and substantially expands our knowledge on the microbial dark matter hidden in the vast expanse of deserts.
Collapse
Affiliation(s)
- Shuai Li
- State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources and Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), School of Life Sciences, Sun Yat‑sen University, Guangzhou, 510275, China
- School of Life Science, Jiaying University, Meizhou, 514015, China
| | - Wen-Hui Lian
- State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources and Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), School of Life Sciences, Sun Yat‑sen University, Guangzhou, 510275, China
| | - Jia-Rui Han
- State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources and Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), School of Life Sciences, Sun Yat‑sen University, Guangzhou, 510275, China
| | - Mukhtiar Ali
- State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources and Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), School of Life Sciences, Sun Yat‑sen University, Guangzhou, 510275, China
| | - Zhi-Liang Lin
- State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources and Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), School of Life Sciences, Sun Yat‑sen University, Guangzhou, 510275, China
| | - Yong-Hong Liu
- State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi, 830011, China
| | - Li Li
- State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi, 830011, China
| | - Dong-Ya Zhang
- Microbiome Research Center, Moon (Guangzhou) Biotech Ltd., Guangzhou, 510700, China
| | - Xian-Zhi Jiang
- Microbiome Research Center, Moon (Guangzhou) Biotech Ltd., Guangzhou, 510700, China
| | - Wen-Jun Li
- State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources and Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), School of Life Sciences, Sun Yat‑sen University, Guangzhou, 510275, China.
- State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi, 830011, China.
| | - Lei Dong
- State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources and Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), School of Life Sciences, Sun Yat‑sen University, Guangzhou, 510275, China.
| |
Collapse
|
50
|
Seong HJ, Kim JJ, Sul WJ. ACR: metagenome-assembled prokaryotic and eukaryotic genome refinement tool. Brief Bioinform 2023; 24:bbad381. [PMID: 37889119 DOI: 10.1093/bib/bbad381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 09/16/2023] [Accepted: 10/03/2023] [Indexed: 10/28/2023] Open
Abstract
Microbial genome recovery from metagenomes can further explain microbial ecosystem structures, functions and dynamics. Thus, this study developed the Additional Clustering Refiner (ACR) to enhance high-purity prokaryotic and eukaryotic metagenome-assembled genome (MAGs) recovery. ACR refines low-quality MAGs by subjecting them to iterative k-means clustering predicated on contig abundance and increasing bin purity through validated universal marker genes. Synthetic and real-world metagenomic datasets, including short- and long-read sequences, evaluated ACR's effectiveness. The results demonstrated improved MAG purity and a significant increase in high- and medium-quality MAG recovery rates. In addition, ACR seamlessly integrates with various binning algorithms, augmenting their strengths without modifying core features. Furthermore, its multiple sequencing technology compatibilities expand its applicability. By efficiently recovering high-quality prokaryotic and eukaryotic genomes, ACR is a promising tool for deepening our understanding of microbial communities through genome-centric metagenomics.
Collapse
Affiliation(s)
- Hoon Je Seong
- Korean Medicine Data Division, Korea Institute of Oriental Medicine, Daejeon, Republic of Korea
| | - Jin Ju Kim
- Department of Systems Biotechnology, Chung-Ang University, Anseong, Republic of Korea
| | - Woo Jun Sul
- Department of Systems Biotechnology, Chung-Ang University, Anseong, Republic of Korea
| |
Collapse
|