1
|
Espinoza JL, Phillips A, Prentice MB, Tan GS, Kamath PL, Lloyd KG, Dupont CL. Unveiling the microbial realm with VEBA 2.0: a modular bioinformatics suite for end-to-end genome-resolved prokaryotic, (micro)eukaryotic and viral multi-omics from either short- or long-read sequencing. Nucleic Acids Res 2024; 52:e63. [PMID: 38909293 DOI: 10.1093/nar/gkae528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Revised: 05/21/2024] [Accepted: 06/10/2024] [Indexed: 06/24/2024] Open
Abstract
The microbiome is a complex community of microorganisms, encompassing prokaryotic (bacterial and archaeal), eukaryotic, and viral entities. This microbial ensemble plays a pivotal role in influencing the health and productivity of diverse ecosystems while shaping the web of life. However, many software suites developed to study microbiomes analyze only the prokaryotic community and provide limited to no support for viruses and microeukaryotes. Previously, we introduced the Viral Eukaryotic Bacterial Archaeal (VEBA) open-source software suite to address this critical gap in microbiome research by extending genome-resolved analysis beyond prokaryotes to encompass the understudied realms of eukaryotes and viruses. Here we present VEBA 2.0 with key updates including a comprehensive clustered microeukaryotic protein database, rapid genome/protein-level clustering, bioprospecting, non-coding/organelle gene modeling, genome-resolved taxonomic/pathway profiling, long-read support, and containerization. We demonstrate VEBA's versatile application through the analysis of diverse case studies including marine water, Siberian permafrost, and white-tailed deer lung tissues with the latter showcasing how to identify integrated viruses. VEBA represents a crucial advancement in microbiome research, offering a powerful and accessible software suite that bridges the gap between genomics and biotechnological solutions.
Collapse
Affiliation(s)
- Josh L Espinoza
- Department of Environment and Sustainability, J. Craig Venter Institute, La Jolla, CA 92037, USA
- Department of Genomic Medicine and Infectious Diseases, J. Craig Venter Institute, La Jolla, CA 92037, USA
| | - Allan Phillips
- Department of Environment and Sustainability, J. Craig Venter Institute, La Jolla, CA 92037, USA
- Department of Genomic Medicine and Infectious Diseases, J. Craig Venter Institute, La Jolla, CA 92037, USA
| | - Melanie B Prentice
- School of Food and Agriculture, University of Maine, Orono, ME 04469, USA
| | - Gene S Tan
- Department of Genomic Medicine and Infectious Diseases, J. Craig Venter Institute, La Jolla, CA 92037, USA
| | - Pauline L Kamath
- School of Food and Agriculture, University of Maine, Orono, ME 04469, USA
- Maine Center for Genetics in the Environment, University of Maine, Orono, ME 04469, USA
| | - Karen G Lloyd
- Microbiology Department, University of Tennessee, Knoxville, TN 37917, USA
| | - Chris L Dupont
- Department of Environment and Sustainability, J. Craig Venter Institute, La Jolla, CA 92037, USA
- Department of Genomic Medicine and Infectious Diseases, J. Craig Venter Institute, La Jolla, CA 92037, USA
| |
Collapse
|
2
|
Hu K, Chou CW, Wilke CO, Finkelstein IJ. Distinct horizontal transfer mechanisms for type I and type V CRISPR-associated transposons. Nat Commun 2024; 15:6653. [PMID: 39103341 DOI: 10.1038/s41467-024-50816-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 07/22/2024] [Indexed: 08/07/2024] Open
Abstract
CASTs use both CRISPR-associated proteins and Tn7-family transposons for RNA-guided vertical and horizontal transmission. CASTs encode minimal CRISPR arrays but can't acquire new spacers. Here, we report that CASTs can co-opt defense-associated CRISPR arrays for horizontal transmission. A bioinformatic analysis shows that CASTs co-occur with defense-associated CRISPR systems, with the highest prevalence for type I-B and type V CAST sub-types. Using an E. coli quantitative transposition assay and in vitro reconstitution, we show that CASTs can use CRISPR RNAs from these defense systems. A high-resolution structure of the type I-F CAST-Cascade in complex with a type III-B CRISPR RNA reveals that Cas6 recognizes direct repeats via sequence-independent π - π interactions. In addition to using heterologous CRISPR arrays, type V CASTs can also transpose via an unguided mechanism, even when the S15 co-factor is over-expressed. Over-expressing S15 and the trans-activating CRISPR RNA or a single guide RNA reduces, but does not abrogate, off-target integration for type V CASTs. Our findings suggest that some CASTs may exploit defense-associated CRISPR arrays and that this fact must be considered when porting CASTs to heterologous bacterial hosts. More broadly, this work will guide further efforts to engineer the activity and specificity of CASTs for gene editing applications.
Collapse
Affiliation(s)
- Kuang Hu
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX, 78712, USA.
| | - Chia-Wei Chou
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX, 78712, USA
| | - Claus O Wilke
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, 78712, USA
| | - Ilya J Finkelstein
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX, 78712, USA.
- Center for Systems and Synthetic Biology, University of Texas at Austin, Austin, TX, 78712, USA.
| |
Collapse
|
3
|
Biswas T, Vogel H, Biedermann PHW, Lehenberger M, Yuvaraj JK, Andersson MN. Few chemoreceptor genes in the ambrosia beetle Trypodendron lineatum may reflect its specialized ecology. BMC Genomics 2024; 25:764. [PMID: 39107741 PMCID: PMC11302349 DOI: 10.1186/s12864-024-10678-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Accepted: 07/31/2024] [Indexed: 08/10/2024] Open
Abstract
BACKGROUND Chemoreception is crucial for insect fitness, underlying for instance food-, host-, and mate finding. Chemicals in the environment are detected by receptors from three divergent gene families: odorant receptors (ORs), gustatory receptors (GRs), and ionotropic receptors (IRs). However, how the chemoreceptor gene families evolve in parallel with ecological specializations remains poorly understood, especially in the order Coleoptera. Hence, we sequenced the genome and annotated the chemoreceptor genes of the specialised ambrosia beetle Trypodendron lineatum (Coleoptera, Curculionidae, Scolytinae) and compared its chemoreceptor gene repertoires with those of other scolytines with different ecological adaptations, as well as a polyphagous cerambycid species. RESULTS We identified 67 ORs, 38 GRs, and 44 IRs in T. lineatum ('Tlin'). Across gene families, T. lineatum has fewer chemoreceptors compared to related scolytines, the coffee berry borer Hypothenemus hampei and the mountain pine beetle Dendroctonus ponderosae, and clearly fewer receptors than the polyphagous cerambycid Anoplophora glabripennis. The comparatively low number of chemoreceptors is largely explained by the scarcity of large receptor lineage radiations, especially among the bitter taste GRs and the 'divergent' IRs, and the absence of alternatively spliced GR genes. Only one non-fructose sugar receptor was found, suggesting several sugar receptors have been lost. Also, we found no orthologue in the 'GR215 clade', which is widely conserved across Coleoptera. Two TlinORs are orthologous to ORs that are functionally conserved across curculionids, responding to 2-phenylethanol (2-PE) and green leaf volatiles (GLVs), respectively. CONCLUSIONS Trypodendron lineatum reproduces inside the xylem of decaying conifers where it feeds on its obligate fungal mutualist Phialophoropsis ferruginea. Like previous studies, our results suggest that stenophagy correlates with small chemoreceptor numbers in wood-boring beetles; indeed, the few GRs may be due to its restricted fungal diet. The presence of TlinORs orthologous to those detecting 2-PE and GLVs in other species suggests these compounds are important for T. lineatum. Future functional studies should test this prediction, and chemoreceptor annotations should be conducted on additional ambrosia beetle species to investigate whether few chemoreceptors is a general trait in this specialized group of beetles.
Collapse
Affiliation(s)
- Twinkle Biswas
- Department of Biology, Lund University, Sölvegatan 37, 223 62, Lund, Sweden
| | - Heiko Vogel
- Department of Insect Symbiosis, Max Planck Institute for Chemical Ecology, Jena, Germany
| | - Peter H W Biedermann
- Chair of Forest Entomology and Protection, University of Freiburg, Stegen-Wittental, Germany
| | | | | | - Martin N Andersson
- Department of Biology, Lund University, Sölvegatan 37, 223 62, Lund, Sweden.
| |
Collapse
|
4
|
Medina JE, Castañeda S, Camargo M, Garcia-Corredor DJ, Muñoz M, Ramírez JD. Exploring viral diversity and metagenomics in livestock: insights into disease emergence and spillover risks in cattle. Vet Res Commun 2024; 48:2029-2049. [PMID: 38865041 DOI: 10.1007/s11259-024-10403-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 05/01/2024] [Indexed: 06/13/2024]
Abstract
Cattle have a significant impact on human societies in terms of both economics and health. Viral infections pose a relevant problem as they directly or indirectly disrupt the balance within cattle populations. This has negative consequences at the economic level for producers and territories, and also jeopardizes human health through the transmission of zoonotic diseases that can escalate into outbreaks or pandemics. To establish prevention strategies and control measures at various levels (animal, farm, region, or global), it is crucial to identify the viral agents present in animals. Various techniques, including virus isolation, serological tests, and molecular techniques like PCR, are typically employed for this purpose. However, these techniques have two major drawbacks: they are ineffective for non-culturable viruses, and they only detect a small fraction of the viruses present. In contrast, metagenomics offers a promising approach by providing a comprehensive and unbiased analysis for detecting all viruses in a given sample. It has the potential to identify rare or novel infectious agents promptly and establish a baseline of healthy animals. Nevertheless, the routine application of viral metagenomics for epidemiological surveillance and diagnostics faces challenges related to socioeconomic variables, such as resource availability and space dedicated to metagenomics, as well as the lack of standardized protocols and resulting heterogeneity in presenting results. This review aims to provide an overview of the current knowledge and prospects for using viral metagenomics to detect and identify viruses in cattle raised for livestock, while discussing the epidemiological and clinical implications.
Collapse
Affiliation(s)
- Julián Esteban Medina
- Centro de Investigaciones en Microbiología y Biotecnología - UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Sergio Castañeda
- Centro de Investigaciones en Microbiología y Biotecnología - UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Milena Camargo
- Centro de Investigaciones en Microbiología y Biotecnología - UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
- Centro de Tecnología en Salud (CETESA), Innovaseq SAS, Mosquera, Cundinamarca, Colombia
| | - Diego J Garcia-Corredor
- Centro de Investigaciones en Microbiología y Biotecnología - UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
- Grupo de Investigación en Medicina Veterinaria y Zootecnia, Facultad de Ciencias Agropecuarias, Universidad Pedagógica y Tecnológica de Colombia, Tunja, Colombia
| | - Marina Muñoz
- Centro de Investigaciones en Microbiología y Biotecnología - UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Juan David Ramírez
- Centro de Investigaciones en Microbiología y Biotecnología - UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia.
- Molecular Microbiology Laboratory, Department of Pathology, Molecular and Cell-Based Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|
5
|
Valentin-Alvarado LE, Appler KE, De Anda V, Schoelmerich MC, West-Roberts J, Kivenson V, Crits-Christoph A, Ly L, Sachdeva R, Greening C, Savage DF, Baker BJ, Banfield JF. Asgard archaea modulate potential methanogenesis substrates in wetland soil. Nat Commun 2024; 15:6384. [PMID: 39085194 PMCID: PMC11291895 DOI: 10.1038/s41467-024-49872-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Accepted: 06/20/2024] [Indexed: 08/02/2024] Open
Abstract
The roles of Asgard archaea in eukaryogenesis and marine biogeochemical cycles are well studied, yet their contributions in soil ecosystems remain unknown. Of particular interest are Asgard archaeal contributions to methane cycling in wetland soils. To investigate this, we reconstructed two complete genomes for soil-associated Atabeyarchaeia, a new Asgard lineage, and a complete genome of Freyarchaeia, and predicted their metabolism in situ. Metatranscriptomics reveals expression of genes for [NiFe]-hydrogenases, pyruvate oxidation and carbon fixation via the Wood-Ljungdahl pathway. Also expressed are genes encoding enzymes for amino acid metabolism, anaerobic aldehyde oxidation, hydrogen peroxide detoxification and carbohydrate breakdown to acetate and formate. Overall, soil-associated Asgard archaea are predicted to include non-methanogenic acetogens, highlighting their potential role in carbon cycling in terrestrial environments.
Collapse
Affiliation(s)
- Luis E Valentin-Alvarado
- Innovative Genomics Institute, University of California, Berkeley, California, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
| | - Kathryn E Appler
- Department of Marine Science, University of Texas at Austin; Marine Science Institute, Port Aransas, TX, USA
| | - Valerie De Anda
- Department of Marine Science, University of Texas at Austin; Marine Science Institute, Port Aransas, TX, USA
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
| | - Marie C Schoelmerich
- Innovative Genomics Institute, University of California, Berkeley, California, USA
- Department of Environmental Systems Sciences; ETH Zürich, Zürich, Switzerland
| | - Jacob West-Roberts
- Environmental Science, Policy and Management, University of California, Berkeley, CA, USA
| | - Veronika Kivenson
- Innovative Genomics Institute, University of California, Berkeley, California, USA
| | - Alexander Crits-Christoph
- Innovative Genomics Institute, University of California, Berkeley, California, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
- Cultivarium, Watertown, MA, USA
| | - Lynn Ly
- Oxford Nanopore Technologies Inc, New York, NY, USA
| | - Rohan Sachdeva
- Innovative Genomics Institute, University of California, Berkeley, California, USA
| | - Chris Greening
- Department of Microbiology, Biomedicine Discovery Institute; Monash University, Clayton, VIC, Australia
- Securing Antarctica's Environmental Future, Monash University, Clayton, VIC, Australia
| | - David F Savage
- Innovative Genomics Institute, University of California, Berkeley, California, USA
- Howard Hughes Medical Institute, University of California, Berkeley, California, USA
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, USA
| | - Brett J Baker
- Department of Marine Science, University of Texas at Austin; Marine Science Institute, Port Aransas, TX, USA.
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA.
| | - Jillian F Banfield
- Innovative Genomics Institute, University of California, Berkeley, California, USA.
- Environmental Science, Policy and Management, University of California, Berkeley, CA, USA.
- Department of Microbiology, Biomedicine Discovery Institute; Monash University, Clayton, VIC, Australia.
- Earth and Planetary Science, University of California, Berkeley, CA, USA.
| |
Collapse
|
6
|
Wang X, Liu Z, Zhang F, Xiao H, Cao S, Xue H, Liu W, Su Y, Liu Z, Zhong H, Zhang F, Ahmad B, Long Q, Zhang Y, Liu Y, Gan Y, Hou T, Jin Z, Wu X, Liu G, Wang Y, Peng Y, Zhou Y. Integrative genomics reveals the polygenic basis of seedlessness in grapevine. Curr Biol 2024:S0960-9822(24)00925-4. [PMID: 39094571 DOI: 10.1016/j.cub.2024.07.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2024] [Revised: 05/04/2024] [Accepted: 07/03/2024] [Indexed: 08/04/2024]
Abstract
Seedlessness is a crucial quality trait in table grape (Vitis vinifera L.) breeding. However, the development of seeds involved intricate regulations, and the polygenic basis of seed abortion remains unclear. Here, we combine comparative genomics, population genetics, quantitative genetics, and integrative genomics to unravel the evolution and polygenic basis of seedlessness in grapes. We generated the haplotype-resolved genomes for two seedless grape cultivars, "Thompson Seedless" (TS, syn. "Sultania") and "Black Monukka" (BM). Comparative genomics identified a ∼4.25 Mb hemizygous inversion on Chr10 specific in seedless cultivars, with seedless-associated genes VvTT16 and VvSUS2 located at breakpoints. Population genomic analyses of 548 grapevine accessions revealed two distinct clusters of seedless cultivars, and the identity-by-descent (IBD) results indicated that the origin of the seedlessness trait could be traced back to "Sultania." Introgression, rather than convergent selection, shaped the evolutionary history of seedlessness in grape improvement. Genome-wide association study (GWAS) analysis identified 110 quantitative trait loci (QTLs) associated with 634 candidate genes, including previously unidentified candidate genes, such as three 11S GLOBULIN SEED STORAGE PROTEIN and two CYTOCHROME P450 genes, and well-known genes like VviAGL11. Integrative genomic analyses resulted in 339 core candidate genes categorized into 13 functional categories related to seed development. Machine learning-based genomic selection achieved a remarkable prediction accuracy of 97% for seedlessness in grapevines. Our findings highlight the polygenic nature of seedlessness and provide candidate genes for molecular genetics and an effective prediction for seedlessness in grape genomic breeding.
Collapse
Affiliation(s)
- Xu Wang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China; School of Agriculture and Food Science, University College Dublin, Belfield, Dublin, Ireland
| | - Zhongjie Liu
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Fan Zhang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Hua Xiao
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Shuo Cao
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Hui Xue
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Wenwen Liu
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Ying Su
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhenya Liu
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Haixia Zhong
- The State Key Laboratory of Genetic Improvement and Germplasm Innovation of Crop Resistance in Arid Desert Regions (Preparation), Key Laboratory of Genome Research and Genetic Improvement of Xinjiang Characteristic Fruits and Vegetables, Institute of Horticultural Crops, Xinjiang Academy of Agricultural Sciences, Urumqi, China
| | - Fuchun Zhang
- The State Key Laboratory of Genetic Improvement and Germplasm Innovation of Crop Resistance in Arid Desert Regions (Preparation), Key Laboratory of Genome Research and Genetic Improvement of Xinjiang Characteristic Fruits and Vegetables, Institute of Horticultural Crops, Xinjiang Academy of Agricultural Sciences, Urumqi, China
| | - Bilal Ahmad
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Qiming Long
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yingchun Zhang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yuting Liu
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yu Gan
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Ting Hou
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhongxin Jin
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Xinyu Wu
- The State Key Laboratory of Genetic Improvement and Germplasm Innovation of Crop Resistance in Arid Desert Regions (Preparation), Key Laboratory of Genome Research and Genetic Improvement of Xinjiang Characteristic Fruits and Vegetables, Institute of Horticultural Crops, Xinjiang Academy of Agricultural Sciences, Urumqi, China
| | - Guotian Liu
- State Key Laboratory of Crop Stress Resistance and High-Efficiency Production, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yiwen Wang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yanling Peng
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yongfeng Zhou
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China; National Key Laboratory of Tropical Crop Breeding, Tropical Crops Genetic Resources Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou, China.
| |
Collapse
|
7
|
Mallawaarachchi V, Wickramarachchi A, Xue H, Papudeshi B, Grigson SR, Bouras G, Prahl RE, Kaphle A, Verich A, Talamantes-Becerra B, Dinsdale EA, Edwards RA. Solving genomic puzzles: computational methods for metagenomic binning. Brief Bioinform 2024; 25:bbae372. [PMID: 39082646 PMCID: PMC11289683 DOI: 10.1093/bib/bbae372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 06/05/2024] [Accepted: 07/15/2024] [Indexed: 08/03/2024] Open
Abstract
Metagenomics involves the study of genetic material obtained directly from communities of microorganisms living in natural environments. The field of metagenomics has provided valuable insights into the structure, diversity and ecology of microbial communities. Once an environmental sample is sequenced and processed, metagenomic binning clusters the sequences into bins representing different taxonomic groups such as species, genera, or higher levels. Several computational tools have been developed to automate the process of metagenomic binning. These tools have enabled the recovery of novel draft genomes of microorganisms allowing us to study their behaviors and functions within microbial communities. This review classifies and analyzes different approaches of metagenomic binning and different refinement, visualization, and evaluation techniques used by these methods. Furthermore, the review highlights the current challenges and areas of improvement present within the field of research.
Collapse
Affiliation(s)
- Vijini Mallawaarachchi
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| | - Anuradha Wickramarachchi
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Hansheng Xue
- School of Computing, National University of Singapore, Singapore 119077, Singapore
| | - Bhavya Papudeshi
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| | - Susanna R Grigson
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| | - George Bouras
- Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, SA 5005, Australia
- The Department of Surgery—Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, SA 5011, Australia
| | - Rosa E Prahl
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Anubhav Kaphle
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Andrey Verich
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
- The Kirby Institute, The University of New South Wales, Randwick, Sydney, NSW 2052, Australia
| | - Berenice Talamantes-Becerra
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Elizabeth A Dinsdale
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| | - Robert A Edwards
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| |
Collapse
|
8
|
Kohtz AJ, Petrosian N, Krukenberg V, Jay ZJ, Pilhofer M, Hatzenpichler R. Cultivation and visualization of a methanogen of the phylum Thermoproteota. Nature 2024:10.1038/s41586-024-07631-6. [PMID: 39048824 DOI: 10.1038/s41586-024-07631-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 05/30/2024] [Indexed: 07/27/2024]
Abstract
Methane is the second most abundant climate-active gas, and understanding its sources and sinks is an important endeavour in microbiology, biogeochemistry, and climate sciences1,2. For decades, it was thought that methanogenesis, the ability to conserve energy coupled to methane production, was taxonomically restricted to a metabolically specialized group of archaea, the Euryarchaeota1. The discovery of marker genes for anaerobic alkane cycling in metagenome-assembled genomes obtained from diverse habitats has led to the hypothesis that archaeal lineages outside the Euryarchaeota are also involved in methanogenesis3-6. Here we cultured Candidatus Methanosuratincola verstraetei strain LCB70, a member of the archaeal class Methanomethylicia (formerly Verstraetearchaeota) within the phylum Thermoproteota, from a terrestrial hot spring. Growth experiments combined with activity assays, stable isotope tracing, and genomic and transcriptomic analyses demonstrated that this thermophilic archaeon grows by means of methyl-reducing hydrogenotrophic methanogenesis. Cryo-electron tomography revealed that Ca. M. verstraetei are coccoid cells with archaella and chemoreceptor arrays, and that they can form intercellular bridges connecting two to three cells with continuous cytoplasm and S-layer. The wide environmental distribution of Ca. M. verstraetei suggests that they might play important and hitherto overlooked roles in carbon cycling within diverse anoxic habitats.
Collapse
Affiliation(s)
- Anthony J Kohtz
- Department of Chemistry and Biochemistry, Center for Biofilm Engineering, and Thermal Biology Institute, Montana State University, Bozeman, MT, USA
| | - Nikolai Petrosian
- Institute of Molecular Biology and Biophysics, ETH Zürich, Zürich, Switzerland
| | - Viola Krukenberg
- Department of Chemistry and Biochemistry, Center for Biofilm Engineering, and Thermal Biology Institute, Montana State University, Bozeman, MT, USA
| | - Zackary J Jay
- Department of Chemistry and Biochemistry, Center for Biofilm Engineering, and Thermal Biology Institute, Montana State University, Bozeman, MT, USA
| | - Martin Pilhofer
- Institute of Molecular Biology and Biophysics, ETH Zürich, Zürich, Switzerland
| | - Roland Hatzenpichler
- Department of Chemistry and Biochemistry, Center for Biofilm Engineering, and Thermal Biology Institute, Montana State University, Bozeman, MT, USA.
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA.
| |
Collapse
|
9
|
Wu K, Zhou L, Tahon G, Liu L, Li J, Zhang J, Zheng F, Deng C, Han W, Bai L, Fu L, Dong X, Zhang C, Ettema TJG, Sousa DZ, Cheng L. Isolation of a methyl-reducing methanogen outside the Euryarchaeota. Nature 2024:10.1038/s41586-024-07728-y. [PMID: 39048829 DOI: 10.1038/s41586-024-07728-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2023] [Accepted: 06/18/2024] [Indexed: 07/27/2024]
Abstract
Methanogenic archaea are main contributors to methane emissions, and have a crucial role in carbon cycling and global warming. Until recently, methanogens were confined to Euryarchaeota, but metagenomic studies revealed the presence of genes encoding the methyl coenzyme M reductase complex in other archaeal clades1-4, thereby opening up the premise that methanogenesis is taxonomically more widespread. Nevertheless, laboratory cultivation of these non-euryarchaeal methanogens was lacking to corroborate their potential methanogenic ability and physiology. Here we report the isolation of a thermophilic archaeon LWZ-6 from an oil field. This archaeon belongs to the class Methanosuratincolia (originally affiliated with 'Candidatus Verstraetearchaeota') in the phylum Thermoproteota. Methanosuratincola petrocarbonis LWZ-6 is a strict hydrogen-dependent methylotrophic methanogen. Although previous metagenomic studies speculated on the fermentative potential of Methanosuratincolia members, strain LWZ-6 does not ferment sugars, peptides or amino acids. Its energy metabolism is linked only to methanogenesis, with methanol and monomethylamine as electron acceptors and hydrogen as an electron donor. Comparative (meta)genome analysis confirmed that hydrogen-dependent methylotrophic methanogenesis is a widespread trait among Methanosuratincolia. Our findings confirm that the diversity of methanogens expands beyond the classical Euryarchaeota and imply the importance of hydrogen-dependent methylotrophic methanogenesis in global methane emissions and carbon cycle.
Collapse
Affiliation(s)
- Kejia Wu
- Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu, China
- Laboratory of Microbiology, Wageningen University and Research, Wageningen, The Netherlands
| | - Lei Zhou
- Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu, China
| | - Guillaume Tahon
- Laboratory of Microbiology, Wageningen University and Research, Wageningen, The Netherlands
| | - Laiyan Liu
- Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu, China
| | - Jiang Li
- Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu, China
| | - Jianchao Zhang
- School of Earth System Science, Institute of Surface-Earth System Science, Tianjin University, Tianjin, China
| | - Fengfeng Zheng
- Shenzhen Key Laboratory of Marine Geo-Omics Research, Southern University of Science and Technology, Shenzhen, China
| | - Chengpeng Deng
- Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu, China
| | - Wenhao Han
- Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu, China
| | - Liping Bai
- Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu, China
| | - Lin Fu
- Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu, China
| | - Xiuzhu Dong
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Chuanlun Zhang
- Shenzhen Key Laboratory of Marine Geo-Omics Research, Southern University of Science and Technology, Shenzhen, China
| | - Thijs J G Ettema
- Laboratory of Microbiology, Wageningen University and Research, Wageningen, The Netherlands
| | - Diana Z Sousa
- Laboratory of Microbiology, Wageningen University and Research, Wageningen, The Netherlands.
| | - Lei Cheng
- Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu, China.
| |
Collapse
|
10
|
Wang D, Liu L, Xu X, Wang C, Wang Y, Deng Y, Zhang T. Distributions, interactions, and dynamics of prokaryotes and phages in a hybrid biological wastewater treatment system. MICROBIOME 2024; 12:134. [PMID: 39039555 PMCID: PMC11265110 DOI: 10.1186/s40168-024-01853-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 06/03/2024] [Indexed: 07/24/2024]
Abstract
BACKGROUND Understanding the interactions and dynamics of microbiotas within biological wastewater treatment systems is essential for ensuring their stability and long-term sustainability. In this study, we developed a systematic framework employing multi-omics and Hi-C sequencing to extensively investigate prokaryotic and phage communities within a hybrid biofilm and activated sludge system. RESULTS We uncovered distinct distribution patterns, metabolic capabilities, and activities of functional prokaryotes through the analysis of 454 reconstructed prokaryotic genomes. Additionally, we reconstructed a phage catalog comprising 18,645 viral operational taxonomic units (vOTUs) with high length and contiguity using hybrid assembly, and a distinct distribution of phages was depicted between activated sludge (AS) and biofilm. Importantly, 1340 host-phage pairs were established using Hi-C and conventional in silico methods, unveiling the host-determined phage prevalence. The majority of predicted hosts were found to be involved in various crucial metabolic processes, highlighting the potential vital roles of phages in influencing substance metabolism within this system. Moreover, auxiliary metabolic genes (AMGs) related to various categories (e.g., carbohydrate degradation, sulfur metabolism, transporter) were predicted. Subsequent activity analysis emphasized their potential ability to mediate host metabolism during infection. We also profiled the temporal dynamics of phages and their associated hosts using 13-month time-series metagenomic data, further demonstrating their tight interactions. Notably, we observed lineage-specific infection patterns, such as potentially host abundance- or phage/host ratio-driven phage population changes. CONCLUSIONS The insights gained from this research contribute to the growing body of knowledge surrounding interactions and dynamics of host-phage and pave the way for further exploration and potential applications in the field of microbial ecology. Video Abstract.
Collapse
Affiliation(s)
- Dou Wang
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong SAR, China
| | - Lei Liu
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong SAR, China
| | - Xiaoqing Xu
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong SAR, China
| | - Chunxiao Wang
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong SAR, China
| | - Yulin Wang
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong SAR, China
| | - Yu Deng
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong SAR, China
| | - Tong Zhang
- Environmental Microbiome Engineering and Biotechnology Laboratory, Center for Environmental Engineering Research, Department of Civil Engineering, The University of Hong Kong, Hong Kong SAR, China.
- School of Public Health, The University of Hong Kong, Hong Kong SAR, China.
- Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau SAR, China.
| |
Collapse
|
11
|
Manning VA, Moore PA, Trippe KM. Metagenome-assembled genomes of an acid-tolerant nitrifying bacterial community isolated from a bioreactor used in ammonium scrubbers at animal-rearing facilities. Microbiol Resour Announc 2024; 13:e0038624. [PMID: 38864651 PMCID: PMC11256797 DOI: 10.1128/mra.00386-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Accepted: 05/09/2024] [Indexed: 06/13/2024] Open
Abstract
We report 12 metagenome-assembled genomes (MAGS) of a bioreactor community of acid-tolerant nitrifying bacteria. The MAGS include autotrophs in the Nitrospira genus and heterotrophs in the Xanthomonadales, Ktedonobacterales, Cytophagales, Burkholderiales, and Hyphomicrobiales. These taxonomic and genomic data provide insights into the core community members required for nitrification at low pH.
Collapse
Affiliation(s)
- Viola A. Manning
- USDA-Agricultural Research Service, National Forage Seed Production Research Center, Corvallis, Oregon, USA
| | - Phillip A. Moore
- USDA-Agricultural Research Service, Poultry Production and Product Safety Research Center, Fayetteville, Arkansas, USA
| | - Kristin M. Trippe
- USDA-Agricultural Research Service, National Forage Seed Production Research Center, Corvallis, Oregon, USA
- Department of Crop and Soil Science, Oregon State University, Corvallis, Oregon, USA
| |
Collapse
|
12
|
Tang T, Leisner JJ. Complete genome sequence of Listeria seeligeri strain 43-1 isolated from a Danish forest freshwater swamp. Microbiol Resour Announc 2024; 13:e0031124. [PMID: 38864652 PMCID: PMC11256833 DOI: 10.1128/mra.00311-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 05/17/2024] [Indexed: 06/13/2024] Open
Abstract
Here, we report the genome sequence of Listeria seeligeri 43-1 isolated from a Danish freshwater swamp using Oxford Nanopore sequencing. The isolate shared a high genomic similarity to two other L. seeligeri isolates from soil and water.
Collapse
Affiliation(s)
- Taya Tang
- Department of Veterinary and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Frederiksberg, Denmark
| | - Jørgen J. Leisner
- Department of Veterinary and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Frederiksberg, Denmark
| |
Collapse
|
13
|
Moreno KMF, de Andrade VA, de Melo Iani FC, Fonseca V, Lima MT, de Castro Barbosa E, Tomé LMR, Guimarães NR, Fritsch HM, Adelino T, Oliveira Fereguetti T, Aspahan MC, Gamarano Barros T, Alcantara LCJ, Giovanetti M. Exploring Microorganisms Associated to Acute Febrile Illness and Severe Neurological Disorders of Unknown Origin: A Nanopore Metagenomics Approach. Genes (Basel) 2024; 15:922. [PMID: 39062701 PMCID: PMC11276239 DOI: 10.3390/genes15070922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 07/10/2024] [Accepted: 07/12/2024] [Indexed: 07/28/2024] Open
Abstract
Acute febrile illness (AFI) and severe neurological disorders (SNDs) often present diagnostic challenges due to their potential origins from a wide range of infectious agents. Nanopore metagenomics is emerging as a powerful tool for identifying the microorganisms potentially responsible for these undiagnosed clinical cases. In this study, we aim to shed light on the etiological agents underlying AFI and SND cases that conventional diagnostic methods have not been able to fully elucidate. Our approach involved analyzing samples from fourteen hospitalized patients using a comprehensive nanopore metagenomic approach. This process included RNA extraction and enrichment using the SMART-9N protocol, followed by nanopore sequencing. Subsequent steps involved quality control, host DNA/cDNA removal, de novo genome assembly, and taxonomic classification. Our findings in AFI cases revealed a spectrum of disease-associated microbes, including Escherichia coli, Streptococcus sp., Human Immunodeficiency Virus 1 (Subtype B), and Human Pegivirus. Similarly, SND cases revealed the presence of pathogens such as Escherichia coli, Clostridium sp., and Dengue virus type 2 (Genotype-II lineage). This study employed a metagenomic analysis method, demonstrating its efficiency and adaptability in pathogen identification. Our investigation successfully identified pathogens likely associated with AFI and SNDs, underscoring the feasibility of retrieving near-complete genomes from RNA viruses. These findings offer promising prospects for advancing our understanding and control of infectious diseases, by facilitating detailed genomic analysis which is critical for developing targeted interventions and therapeutic strategies.
Collapse
Affiliation(s)
- Keldenn Melo Farias Moreno
- Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil; (K.M.F.M.); (M.T.L.); (H.M.F.)
| | | | - Felipe Campos de Melo Iani
- Central Public Health Laboratory of the State of Minas Gerais, Ezequiel Dias Foundation, Belo Horizonte 30510-010, Brazil; (F.C.d.M.I.); (T.A.)
| | - Vagner Fonseca
- Department of Exact and Earth Sciences, University of the State of Bahia, Salvador 41150-000, Brazil;
| | - Maurício Teixeira Lima
- Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil; (K.M.F.M.); (M.T.L.); (H.M.F.)
- René Rachou Institute, Oswaldo Cruz Foundation, Belo Horizonte 30190-002, Brazil; (E.d.C.B.); (L.M.R.T.); (N.R.G.); (L.C.J.A.)
| | - Emerson de Castro Barbosa
- René Rachou Institute, Oswaldo Cruz Foundation, Belo Horizonte 30190-002, Brazil; (E.d.C.B.); (L.M.R.T.); (N.R.G.); (L.C.J.A.)
| | - Luiz Marcelo Ribeiro Tomé
- René Rachou Institute, Oswaldo Cruz Foundation, Belo Horizonte 30190-002, Brazil; (E.d.C.B.); (L.M.R.T.); (N.R.G.); (L.C.J.A.)
| | - Natália Rocha Guimarães
- René Rachou Institute, Oswaldo Cruz Foundation, Belo Horizonte 30190-002, Brazil; (E.d.C.B.); (L.M.R.T.); (N.R.G.); (L.C.J.A.)
| | - Hegger Machado Fritsch
- Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil; (K.M.F.M.); (M.T.L.); (H.M.F.)
- Morphogenesis and Antigenicity of HIV and Hepatitis Viruses, University of Tours, 37032 Tours, France
| | - Talita Adelino
- Central Public Health Laboratory of the State of Minas Gerais, Ezequiel Dias Foundation, Belo Horizonte 30510-010, Brazil; (F.C.d.M.I.); (T.A.)
| | | | - Maíra Cardoso Aspahan
- Eduardo de Menezes Hospital, Belo Horizonte 30622-020, Brazil; (V.A.d.A.); (T.O.F.); (M.C.A.); (T.G.B.)
| | - Tereza Gamarano Barros
- Eduardo de Menezes Hospital, Belo Horizonte 30622-020, Brazil; (V.A.d.A.); (T.O.F.); (M.C.A.); (T.G.B.)
| | - Luiz Carlos Junior Alcantara
- René Rachou Institute, Oswaldo Cruz Foundation, Belo Horizonte 30190-002, Brazil; (E.d.C.B.); (L.M.R.T.); (N.R.G.); (L.C.J.A.)
| | - Marta Giovanetti
- Department of Sciences and Technologies for Sustainable Development and One Health, Università Campus Bio-Medico di Roma, 00128 Rome, Italy
- Oswaldo Cruz Foundation, Rio de Janeiro 21040-900, Brazil
| |
Collapse
|
14
|
Hiralal A, Geelhoed JS, Neukirchen S, Meysman FJR. Comparative genomic analysis of nickel homeostasis in cable bacteria. BMC Genomics 2024; 25:692. [PMID: 39009997 PMCID: PMC11247825 DOI: 10.1186/s12864-024-10594-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 07/03/2024] [Indexed: 07/17/2024] Open
Abstract
BACKGROUND Cable bacteria are filamentous members of the Desulfobulbaceae family that are capable of performing centimetre‑scale electron transport in marine and freshwater sediments. This long‑distance electron transport is mediated by a network of parallel conductive fibres embedded in the cell envelope. This fibre network efficiently transports electrical currents along the entire length of the centimetre‑long filament. Recent analyses show that these fibres consist of metalloproteins that harbour a novel nickel‑containing cofactor, which indicates that cable bacteria have evolved a unique form of biological electron transport. This nickel‑dependent conduction mechanism suggests that cable bacteria are strongly dependent on nickel as a biosynthetic resource. Here, we performed a comprehensive comparative genomic analysis of the genes linked to nickel homeostasis. We compared the genome‑encoded adaptation to nickel of cable bacteria to related members of the Desulfobulbaceae family and other members of the Desulfobulbales order. RESULTS Presently, four closed genomes are available for the monophyletic cable bacteria clade that consists of the genera Candidatus Electrothrix and Candidatus Electronema. To increase the phylogenomic coverage, we additionally generated two closed genomes of cable bacteria: Candidatus Electrothrix gigas strain HY10‑6 and Candidatus Electrothrix antwerpensis strain GW3‑4, which are the first closed genomes of their respective species. Nickel homeostasis genes were identified in a database of 38 cable bacteria genomes (including 6 closed genomes). Gene prevalence was compared to 19 genomes of related strains, residing within the Desulfobulbales order but outside of the cable bacteria clade, revealing several genome‑encoded adaptations to nickel homeostasis in cable bacteria. Phylogenetic analysis indicates that nickel importers, nickel‑binding enzymes and nickel chaperones of cable bacteria are affiliated to organisms outside the Desulfobulbaceae family, with several proteins showing affiliation to organisms outside of the Desulfobacterota phylum. Conspicuously, cable bacteria encode a unique periplasmic nickel export protein RcnA, which possesses a putative cytoplasmic histidine‑rich loop that has been largely expanded compared to RcnA homologs in other organisms. CONCLUSION Cable bacteria genomes show a clear genetic adaptation for nickel utilization when compared to closely related genera. This fully aligns with the nickel‑dependent conduction mechanism that is uniquely found in cable bacteria.
Collapse
Affiliation(s)
- Anwar Hiralal
- Geobiology Research Group, University of Antwerp, Antwerp, Belgium
| | | | - Sinje Neukirchen
- Geobiology Research Group, University of Antwerp, Antwerp, Belgium
| | - Filip J R Meysman
- Geobiology Research Group, University of Antwerp, Antwerp, Belgium.
- Department of Biotechnology, Delft University of Technology, Delft, The Netherlands.
| |
Collapse
|
15
|
Uppal S, Waterworth SC, Nick A, Vogel H, Flórez LV, Kaltenpoth M, Kwan JC. Repeated horizontal acquisition of lagriamide-producing symbionts in Lagriinae beetles. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.23.576914. [PMID: 39026795 PMCID: PMC11257431 DOI: 10.1101/2024.01.23.576914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Microbial symbionts associate with multicellular organisms on a continuum from facultative associations to mutual codependency. In some of the oldest intracellular symbioses there is exclusive vertical symbiont transmission, and co-diversification of symbiotic partners over millions of years. Such symbionts often undergo genome reduction due to low effective population sizes, frequent population bottlenecks, and reduced purifying selection. Here, we describe multiple independent acquisition events of closely related defensive symbionts followed by genome erosion in a group of Lagriinae beetles. Previous work in Lagria villosa revealed the dominant genome-eroded symbiont of the genus Burkholderia produces the antifungal compound lagriamide and protects the beetle's eggs and larvae from antagonistic fungi. Here, we use metagenomics to assemble 11 additional genomes of lagriamide-producing symbionts from seven different host species within Lagriinae from five countries, to unravel the evolutionary history of this symbiotic relationship. In each host species, we detected one dominant genome-eroded Burkholderia symbiont encoding the lagriamide biosynthetic gene cluster (BGC). Surprisingly, however, we did not find evidence for host-symbiont co-diversification, or for a monophyly of the lagriamide-producing symbionts. Instead, our analyses support at least four independent acquisition events of lagriamide-encoding symbionts and subsequent genome erosion in each of these lineages. By contrast, a clade of plant-associated relatives retained large genomes but secondarily lost the lagriamide BGC. In conclusion, our results reveal a dynamic evolutionary history with multiple independent symbiont acquisitions characterized by high degree of specificity. They highlight the importance of the specialized metabolite lagriamide for the establishment and maintenance of this defensive symbiosis.
Collapse
Affiliation(s)
- Siddharth Uppal
- Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, USA
| | - Samantha C. Waterworth
- Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, USA
- Current address: National Cancer Institute, Frederick, Maryland, USA
| | - Alina Nick
- Max Planck Institute for Chemical Ecology, Jena, Germany
| | - Heiko Vogel
- Max Planck Institute for Chemical Ecology, Jena, Germany
| | - Laura V. Flórez
- Department of Plant and Environmental Science, University of Copenhagen, Copenhagen, Denmark
| | | | - Jason C. Kwan
- Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, USA
| |
Collapse
|
16
|
Luan T, Commichaux S, Hoffmann M, Jayeola V, Jang JH, Pop M, Rand H, Luo Y. Benchmarking short and long read polishing tools for nanopore assemblies: achieving near-perfect genomes for outbreak isolates. BMC Genomics 2024; 25:679. [PMID: 38978005 PMCID: PMC11232133 DOI: 10.1186/s12864-024-10582-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 07/01/2024] [Indexed: 07/10/2024] Open
Abstract
BACKGROUND Oxford Nanopore provides high throughput sequencing platforms able to reconstruct complete bacterial genomes with 99.95% accuracy. However, even small levels of error can obscure the phylogenetic relationships between closely related isolates. Polishing tools have been developed to correct these errors, but it is uncertain if they obtain the accuracy needed for the high-resolution source tracking of foodborne illness outbreaks. RESULTS We tested 132 combinations of assembly and short- and long-read polishing tools to assess their accuracy for reconstructing the genome sequences of 15 highly similar Salmonella enterica serovar Newport isolates from a 2020 onion outbreak. While long-read polishing alone improved accuracy, near perfect accuracy (99.9999% accuracy or ~ 5 nucleotide errors across the 4.8 Mbp genome, excluding low confidence regions) was only obtained by pipelines that combined both long- and short-read polishing tools. Notably, medaka was a more accurate and efficient long-read polisher than Racon. Among short-read polishers, NextPolish showed the highest accuracy, but Pilon, Polypolish, and POLCA performed similarly. Among the 5 best performing pipelines, polishing with medaka followed by NextPolish was the most common combination. Importantly, the order of polishing tools mattered i.e., using less accurate tools after more accurate ones introduced errors. Indels in homopolymers and repetitive regions, where the short reads could not be uniquely mapped, remained the most challenging errors to correct. CONCLUSIONS Short reads are still needed to correct errors in nanopore sequenced assemblies to obtain the accuracy required for source tracking investigations. Our granular assessment of the performance of the polishing pipelines allowed us to suggest best practices for tool users and areas for improvement for tool developers.
Collapse
Affiliation(s)
- Tu Luan
- Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
| | - Seth Commichaux
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, Laurel, MD, 20708, USA.
| | - Maria Hoffmann
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Victor Jayeola
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Jae Hee Jang
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Mihai Pop
- Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
| | - Hugh Rand
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Yan Luo
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| |
Collapse
|
17
|
Funnicelli MIG, de Carvalho LAL, Teheran-Sierra LG, Dibelli SC, Lemos EGDM, Pinheiro DG. Unveiling genomic features linked to traits of plant growth-promoting bacterial communities from sugarcane. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 947:174577. [PMID: 38981540 DOI: 10.1016/j.scitotenv.2024.174577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 07/04/2024] [Accepted: 07/05/2024] [Indexed: 07/11/2024]
Abstract
Microorganisms are ubiquitous, and those inhabiting plants have been the subject of several studies. Plant-associated bacteria exhibit various biological mechanisms that enable them to colonize host plants and, in some cases, enhance their fitness. In this study, we describe the genomic features predicted to be associated with plant growth-promoting traits in six bacterial communities isolated from sugarcane. The use of highly accurate single-molecule real-time sequencing technology for metagenomic samples from these bacterial communities allowed us to recover 17 genomes. The taxonomic assignments for the binned genomes were performed, revealing taxa distributed across three main phyla: Bacillota, Bacteroidota, and Pseudomonadota, with the latter being the most representative. Subsequently, we functionally annotated the metagenome-assembled genomes (MAGs) to characterize their metabolic pathways related to plant growth-promoting traits. Our study successfully identified the enrichment of important functions related to phosphate and potassium acquisition, modulation of phytohormones, and mechanisms for coping with abiotic stress. These findings could be linked to the robust colonization of these sugarcane endophytes.
Collapse
Affiliation(s)
- Michelli Inácio Gonçalves Funnicelli
- Laboratory of Bioinformatics, Department of Agricultural, Livestock and Environmental Biotechnology, São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil; Graduate Program in Agricultural and Livestock Microbiology, São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil
| | - Lucas Amoroso Lopes de Carvalho
- Laboratory of Bioinformatics, Department of Agricultural, Livestock and Environmental Biotechnology, São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil; Graduate Program in Agricultural and Livestock Microbiology, São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil
| | - Luis Guillermo Teheran-Sierra
- Agronomy Research Program, Colombian Oil Palm Research Center, Cenipalma, Calle 98 No. 70-91, Piso 14, Bogotá 111121, Colombia
| | - Sabrina Custodio Dibelli
- Laboratory of Bioinformatics, Department of Agricultural, Livestock and Environmental Biotechnology, São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil; Graduate Program in Agricultural and Livestock Microbiology, São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil
| | - Eliana Gertrudes de Macedo Lemos
- Graduate Program in Agricultural and Livestock Microbiology, São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil; Molecular Biology Laboratory, Institute for Research in Bioenergy (IPBEN), São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil
| | - Daniel Guariz Pinheiro
- Laboratory of Bioinformatics, Department of Agricultural, Livestock and Environmental Biotechnology, São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil; Graduate Program in Agricultural and Livestock Microbiology, São Paulo State University (UNESP), School of Agricultural and Veterinary Sciences, Jaboticabal, SP, Brazil.
| |
Collapse
|
18
|
Jia H, Tan S, Cai Y, Guo Y, Shen J, Zhang Y, Ma H, Zhang Q, Chen J, Qiao G, Ruan J, Zhang YE. Low-input PacBio sequencing generates high-quality individual fly genomes and characterizes mutational processes. Nat Commun 2024; 15:5644. [PMID: 38969648 PMCID: PMC11226609 DOI: 10.1038/s41467-024-49992-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 06/20/2024] [Indexed: 07/07/2024] Open
Abstract
Long-read sequencing, exemplified by PacBio, revolutionizes genomics, overcoming challenges like repetitive sequences. However, the high DNA requirement ( > 1 µg) is prohibitive for small organisms. We develop a low-input (100 ng), low-cost, and amplification-free library-generation method for PacBio sequencing (LILAP) using Tn5-based tagmentation and DNA circularization within one tube. We test LILAP with two Drosophila melanogaster individuals, and generate near-complete genomes, surpassing preexisting single-fly genomes. By analyzing variations in these two genomes, we characterize mutational processes: complex transpositions (transposon insertions together with extra duplications and/or deletions) prefer regions characterized by non-B DNA structures, and gene conversion of transposons occurs on both DNA and RNA levels. Concurrently, we generate two complete assemblies for the endosymbiotic bacterium Wolbachia in these flies and similarly detect transposon conversion. Thus, LILAP promises a broad PacBio sequencing adoption for not only mutational studies of flies and their symbionts but also explorations of other small organisms or precious samples.
Collapse
Affiliation(s)
- Hangxing Jia
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
| | - Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
| | - Yingao Cai
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yanyan Guo
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jieyu Shen
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yaqiong Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Huijing Ma
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Qingzhu Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jinfeng Chen
- University of Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Gexia Qiao
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
19
|
Dong Y, Chen WH, Zhao XM. VirRep: a hybrid language representation learning framework for identifying viruses from human gut metagenomes. Genome Biol 2024; 25:177. [PMID: 38965579 PMCID: PMC11229495 DOI: 10.1186/s13059-024-03320-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 06/24/2024] [Indexed: 07/06/2024] Open
Abstract
Identifying viruses from metagenomes is a common step to explore the virus composition in the human gut. Here, we introduce VirRep, a hybrid language representation learning framework, for identifying viruses from human gut metagenomes. VirRep combines a context-aware encoder and an evolution-aware encoder to improve sequence representation by incorporating k-mer patterns and sequence homologies. Benchmarking on both simulated and real datasets with varying viral proportions demonstrates that VirRep outperforms state-of-the-art methods. When applied to fecal metagenomes from a colorectal cancer cohort, VirRep identifies 39 high-quality viral species associated with the disease, many of which cannot be detected by existing methods.
Collapse
Affiliation(s)
- Yanqi Dong
- Department of Neurology, Zhongshan Hospital and Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200433, China
| | - Wei-Hua Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China.
- Institution of Medical Artificial Intelligence, Binzhou Medical University, Yantai, 264003, China.
| | - Xing-Ming Zhao
- Department of Neurology, Zhongshan Hospital and Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200433, China.
- State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China.
- MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
| |
Collapse
|
20
|
Sumang FA, Ward A, Errington J, Dashti Y. Hibiscus acid and hydroxycitric acid dimethyl esters from Hibiscus flowers induce production of dithiolopyrrolone antibiotics by Streptomyces Strain MBN2-2. NATURAL PRODUCTS AND BIOPROSPECTING 2024; 14:40. [PMID: 38955942 PMCID: PMC11219617 DOI: 10.1007/s13659-024-00460-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Accepted: 06/14/2024] [Indexed: 07/04/2024]
Abstract
Plants and microbes are closely associated with each other in their ecological niches. Much has been studied about plant-microbe interactions, but little is known about the effect of phytochemicals on microbes at the molecular level. To access the products of cryptic biosynthetic gene clusters in bacteria, we incorporated an organic extract of hibiscus flowers into the culture media of different Actinobacteria isolated from plant rhizospheres. This approach led to the production of broad-spectrum dithiolopyrrolone (DTP) antibiotics, thiolutin (1) and aureothricin (2), by Streptomyces sp. MBN2-2. The compounds from the hibiscus extract responsible for triggering the production of these two DTPs were found to be hibiscus acid dimethyl ester (3) and hydroxycitric acid 1,3-dimethyl ester (4). It was subsequently found that the addition of either Fe2+ or Fe3+ to culture media induced the production of 1 and 2. The Chrome Azurol S (CAS) assay revealed that 3 and 4 can chelate iron, and therefore, the mechanism leading to the production of thiolutin and aureothricin appears to be related to changes in iron concentration levels. This work supports the idea that phytochemicals can be used to activate the production of cryptic microbial biosynthetic gene clusters and further understand plant-microbe interactions.
Collapse
Affiliation(s)
- Felaine Anne Sumang
- Faculty of Medicine and Health, University of Sydney, Sydney, NSW, 2015, Australia
| | - Alan Ward
- School of Biology, Newcastle University, Newcastle Upon Tyne, UK
| | - Jeff Errington
- Faculty of Medicine and Health, University of Sydney, Sydney, NSW, 2015, Australia
- Sydney Infectious Diseases Institute, University of Sydney, Sydney, NSW, 2015, Australia
| | - Yousef Dashti
- Faculty of Medicine and Health, University of Sydney, Sydney, NSW, 2015, Australia.
- Sydney Infectious Diseases Institute, University of Sydney, Sydney, NSW, 2015, Australia.
| |
Collapse
|
21
|
Liu M, Xu N, Chen B, Zhang Z, Chen X, Zhu Y, Hong W, Wang T, Zhang Q, Ye Y, Lu T, Qian H. Effects of different assembly strategies on gene annotation in activated sludge. ENVIRONMENTAL RESEARCH 2024; 252:119116. [PMID: 38734289 DOI: 10.1016/j.envres.2024.119116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 04/27/2024] [Accepted: 05/08/2024] [Indexed: 05/13/2024]
Abstract
Activated sludge comprises diverse bacteria, fungi, and other microorganisms, featuring a rich repertoire of genes involved in antibiotic resistance, pollutant degradation, and elemental cycling. In this regard, hybrid assembly technology can revolutionize metagenomics by detecting greater gene diversity in environmental samples. Nonetheless, the optimal utilization and comparability of genomic information between hybrid assembly and short- or long-read technology remain unclear. To address this gap, we compared the performance of the hybrid assembly, short- and long-read technologies, abundance and diversity of annotated genes, and taxonomic diversity by analysing 46, 161, and 45 activated sludge metagenomic datasets, respectively. The results revealed that hybrid assembly technology exhibited the best performance, generating the most contiguous and longest contigs but with a lower proportion of high-quality metagenome-assembled genomes than short-read technology. Compared with short- or long-read technologies, hybrid assembly technology can detect a greater diversity of microbiota and antibiotic resistance genes, as well as a wider range of potential hosts. However, this approach may yield lower gene abundance and pathogen detection. Our study revealed the specific advantages and disadvantages of hybrid assembly and short- and long-read applications in wastewater treatment plants, and our approach could serve as a blueprint to be extended to terrestrial environments.
Collapse
Affiliation(s)
- Meng Liu
- College of Environment, Zhejiang University of Technology, Hangzhou, 310032, PR China
| | - Nuohan Xu
- College of Environment, Zhejiang University of Technology, Hangzhou, 310032, PR China
| | - Bingfeng Chen
- College of Environment, Zhejiang University of Technology, Hangzhou, 310032, PR China
| | - Zhenyan Zhang
- College of Environment, Zhejiang University of Technology, Hangzhou, 310032, PR China
| | - Xinyu Chen
- College of Environment, Zhejiang University of Technology, Hangzhou, 310032, PR China
| | - Yuke Zhu
- College of Environment, Zhejiang University of Technology, Hangzhou, 310032, PR China
| | - Wenjie Hong
- Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province, Hangzhou, 310012, PR China
| | - Tingzhang Wang
- Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province, Hangzhou, 310012, PR China
| | - Qi Zhang
- College of Environment, Zhejiang University of Technology, Hangzhou, 310032, PR China
| | - Yangqing Ye
- College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou, 310032, PR China
| | - Tao Lu
- College of Environment, Zhejiang University of Technology, Hangzhou, 310032, PR China
| | - Haifeng Qian
- College of Environment, Zhejiang University of Technology, Hangzhou, 310032, PR China.
| |
Collapse
|
22
|
Chen D, Fan H, Tang S, Gan Z, Lu Y, Long M. Thioclava litoralis sp. nov., a novel species of alphaproteobacterium, isolated from surface seawater. Arch Microbiol 2024; 206:333. [PMID: 38951168 DOI: 10.1007/s00203-024-04057-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 06/15/2024] [Accepted: 06/16/2024] [Indexed: 07/03/2024]
Abstract
A Gram-negative, aerobic, rod-shaped, non-motile bacterium, designated as FTW29T, was isolated from surface seawater sampled in Futian district, Shenzhen, China. Growth of strain FTW29T was observed at 15-42 ℃ (optimum, 28-30 ℃), pH 4.0-9.0 (optimum, pH 5.5-7.5) and in the presence of 0.5-10% NaCl (optimum, 3.0% NaCl). Strain FTW29T showed 95.0-96.8% 16 S rRNA gene sequence similarity to various type strains of the genera Thioclava, Sinirhodobacter, Rhodobacter, Haematobacter and Frigidibacter of the family Paracoccaceae, and its most closely related strains were Thioclava pacifica DSM 10,166T (96.8%) and Thioclava marina 11.10-0-13T (96.7%). The phylogenomic tree constructed on the bac120 gene set showed that strain FTW29T formed a clade with the genus Thioclava, with a bootstrap value of 100%. The evolutionary distance values between FTW29T and type strains of the genus Thioclava were 0.17-0.19, which are below the recommended standard (0.21-0.23) for defining a novel genus in the family Paracoccaceae. In strain FTW29T, the major fatty acids identified were summed feature 8 (C18:1ω7c) and C16:0, and the predominant respiratory quinones were ubiquinone-10 and ubiquinone-9. The composition of polar lipids in strain FTW29T included diphosphatidylglycerol, phosphatidylglycerol, phosphatidylethanolamine, an unidentified phospholipid, an unidentified aminolipid, two unidentified glycolipids and an unidentified lipid. The genome of strain FTW29T comprised one circle chromosome and six plasmids, with a G + C content of 61.4%. The average nucleotide identity, average amino acid identity, and digital DNA-DNA hybridization values between strain FTW29T and seven type strains of the genus Thioclava were 76.6-78.4%, 53.2-56.4% and 19.3-20.4%, respectively. Altogether, the phenotypic, phylogenetic and chemotaxonomic evidence illustrated in this study suggested that strain FTW29T represents a novel species of the genus Thioclava, with the proposed name Thioclava litoralis sp. nov. The type strain is FTW29T (= KCTC 82,841T = MCCC 1K08523T).
Collapse
Affiliation(s)
- Dakun Chen
- Guangdong Provincial Key Laboratory of Aquatic Animal Disease Control and Healthy culture, Key Laboratory of Control for Disease of Aquatic Animals of Guangdong Higher Education Institute, College of Fishery, Guangdong Ocean University, Zhanjiang, China
- Guangdong Provincial Engineering Research Center for Aquatic Animal Health Assessment, Shenzhen Public Service Platform for Evaluation of Marine Economic Animal Seedings, Shenzhen Institute of Guangdong Ocean University, Shenzhen, China
| | - Huimin Fan
- Guangdong Provincial Engineering Research Center for Aquatic Animal Health Assessment, Shenzhen Public Service Platform for Evaluation of Marine Economic Animal Seedings, Shenzhen Institute of Guangdong Ocean University, Shenzhen, China
| | - Shaoshuai Tang
- Guangdong Provincial Engineering Research Center for Aquatic Animal Health Assessment, Shenzhen Public Service Platform for Evaluation of Marine Economic Animal Seedings, Shenzhen Institute of Guangdong Ocean University, Shenzhen, China
| | - Zhen Gan
- Guangdong Provincial Key Laboratory of Aquatic Animal Disease Control and Healthy culture, Key Laboratory of Control for Disease of Aquatic Animals of Guangdong Higher Education Institute, College of Fishery, Guangdong Ocean University, Zhanjiang, China
- Guangdong Provincial Engineering Research Center for Aquatic Animal Health Assessment, Shenzhen Public Service Platform for Evaluation of Marine Economic Animal Seedings, Shenzhen Institute of Guangdong Ocean University, Shenzhen, China
| | - Yishan Lu
- Guangdong Provincial Key Laboratory of Aquatic Animal Disease Control and Healthy culture, Key Laboratory of Control for Disease of Aquatic Animals of Guangdong Higher Education Institute, College of Fishery, Guangdong Ocean University, Zhanjiang, China.
- Guangdong Provincial Engineering Research Center for Aquatic Animal Health Assessment, Shenzhen Public Service Platform for Evaluation of Marine Economic Animal Seedings, Shenzhen Institute of Guangdong Ocean University, Shenzhen, China.
| | - Meng Long
- Guangdong Provincial Key Laboratory of Aquatic Animal Disease Control and Healthy culture, Key Laboratory of Control for Disease of Aquatic Animals of Guangdong Higher Education Institute, College of Fishery, Guangdong Ocean University, Zhanjiang, China.
- Guangdong Provincial Engineering Research Center for Aquatic Animal Health Assessment, Shenzhen Public Service Platform for Evaluation of Marine Economic Animal Seedings, Shenzhen Institute of Guangdong Ocean University, Shenzhen, China.
| |
Collapse
|
23
|
Klages LJ, Kaup O, Busche T, Kalinowski J, Rückert-Reed C. Classification of a novel Serratia species, isolated from a wound swab in North Rhine-Westphalia: Proposal of Serratia sarumanii sp. nov. Syst Appl Microbiol 2024; 47:126527. [PMID: 38959748 DOI: 10.1016/j.syapm.2024.126527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 05/08/2024] [Accepted: 06/27/2024] [Indexed: 07/05/2024]
Abstract
Novel, white-pigmented, Gram-negative bacterial strains (K-M0706T, K-M0228, K-M0252, K-M0260) were isolated from clinical samples. With a similarity of up to 69.7 % to Serratia nevei S15T and up to 63.8 % to Serratia marcescens ATCC 13880T, as determined by digital DNA-DNA hybridization, the strains were assigned as novel species of the genus Serratia. The species can easily be differentiated from the red colored Serratia marcescens sensu stricto by its white phenotype. Discrimination between this strain and Serratia nevei is possible due to alpha-glucosidase activity and O/129-resistance, as shown for strain K-M0706T. The major fatty acids were determined as myristate, palmitate, cis-9,10-methylenehexadecanoate, linoleate, and (all cis-9,10)-methyleneoctadecanoate. These phenotypical and genomic data support the assignment of a novel species within the genus Serratia, named S. sarumanii due to its pathogenicity and white phenotype, with strain K-M0706T as the type strain.
Collapse
Affiliation(s)
- Levin Joe Klages
- Bielefeld University, Microbial Genomics and Biotechnology, Center for Biotechnology (CeBiTec), Bielefeld, Germany
| | - Olaf Kaup
- Bielefeld University, Medical School and University Medical Center OWL, Bielefeld Hospital, Institute of Laboratory Medicine, Microbiology and Transfusion Medicine, Bielefeld, Germany
| | - Tobias Busche
- Bielefeld University, Microbial Genomics and Biotechnology, Center for Biotechnology (CeBiTec), Bielefeld, Germany; Bielefeld University, Medical School OWL, Bielefeld, Germany
| | - Jörn Kalinowski
- Bielefeld University, Microbial Genomics and Biotechnology, Center for Biotechnology (CeBiTec), Bielefeld, Germany
| | - Christian Rückert-Reed
- Bielefeld University, Microbial Genomics and Biotechnology, Center for Biotechnology (CeBiTec), Bielefeld, Germany; Bielefeld University, Medical School OWL, Bielefeld, Germany.
| |
Collapse
|
24
|
Curry KD, Yu FB, Vance SE, Segarra S, Bhaya D, Chikhi R, Rocha EPC, Treangen TJ. Reference-free structural variant detection in microbiomes via long-read co-assembly graphs. Bioinformatics 2024; 40:i58-i67. [PMID: 38940156 PMCID: PMC11211843 DOI: 10.1093/bioinformatics/btae224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
MOTIVATION The study of bacterial genome dynamics is vital for understanding the mechanisms underlying microbial adaptation, growth, and their impact on host phenotype. Structural variants (SVs), genomic alterations of 50 base pairs or more, play a pivotal role in driving evolutionary processes and maintaining genomic heterogeneity within bacterial populations. While SV detection in isolate genomes is relatively straightforward, metagenomes present broader challenges due to the absence of clear reference genomes and the presence of mixed strains. In response, our proposed method rhea, forgoes reference genomes and metagenome-assembled genomes (MAGs) by encompassing all metagenomic samples in a series (time or other metric) into a single co-assembly graph. The log fold change in graph coverage between successive samples is then calculated to call SVs that are thriving or declining. RESULTS We show rhea to outperform existing methods for SV and horizontal gene transfer (HGT) detection in two simulated mock metagenomes, particularly as the simulated reads diverge from reference genomes and an increase in strain diversity is incorporated. We additionally demonstrate use cases for rhea on series metagenomic data of environmental and fermented food microbiomes to detect specific sequence alterations between successive time and temperature samples, suggesting host advantage. Our approach leverages previous work in assembly graph structural and coverage patterns to provide versatility in studying SVs across diverse and poorly characterized microbial communities for more comprehensive insights into microbial gene flux. AVAILABILITY AND IMPLEMENTATION rhea is open source and available at: https://github.com/treangenlab/rhea.
Collapse
Affiliation(s)
- Kristen D Curry
- Department of Computer Science, Rice University, 6100 Main St., Houston, TX 77005, United States
- Department of Genomes and Genetics, Microbial Evolutionary Genomics, Institut Pasteur, Université Paris Cité, CNRS, UMR3525, Paris 75015, France
| | | | - Summer E Vance
- Department of Environmental Science, Policy, and Management, University of California, Berkeley, CA 94720, United States
| | - Santiago Segarra
- Department of Electrical and Computer Engineering, Rice University, Houston, TX 77005, United States
| | - Devaki Bhaya
- Carnegie Institution for Science, Department of Plant Biology, Stanford, CA 94305, United States
| | - Rayan Chikhi
- Department of Computational Biology, Institut Pasteur, Université Paris Cité, Paris 75015, France
| | - Eduardo P C Rocha
- Department of Genomes and Genetics, Microbial Evolutionary Genomics, Institut Pasteur, Université Paris Cité, CNRS, UMR3525, Paris 75015, France
| | - Todd J Treangen
- Department of Computer Science, Rice University, 6100 Main St., Houston, TX 77005, United States
| |
Collapse
|
25
|
Shaw J, Gounot JS, Chen H, Nagarajan N, Yu YW. Floria: fast and accurate strain haplotyping in metagenomes. Bioinformatics 2024; 40:i30-i38. [PMID: 38940183 PMCID: PMC11211831 DOI: 10.1093/bioinformatics/btae252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
SUMMARY Shotgun metagenomics allows for direct analysis of microbial community genetics, but scalable computational methods for the recovery of bacterial strain genomes from microbiomes remains a key challenge. We introduce Floria, a novel method designed for rapid and accurate recovery of strain haplotypes from short and long-read metagenome sequencing data, based on minimum error correction (MEC) read clustering and a strain-preserving network flow model. Floria can function as a standalone haplotyping method, outputting alleles and reads that co-occur on the same strain, as well as an end-to-end read-to-assembly pipeline (Floria-PL) for strain-level assembly. Benchmarking evaluations on synthetic metagenomes show that Floria is > 3× faster and recovers 21% more strain content than base-level assembly methods (Strainberry) while being over an order of magnitude faster when only phasing is required. Applying Floria to a set of 109 deeply sequenced nanopore metagenomes took <20 min on average per sample and identified several species that have consistent strain heterogeneity. Applying Floria's short-read haplotyping to a longitudinal gut metagenomics dataset revealed a dynamic multi-strain Anaerostipes hadrus community with frequent strain loss and emergence events over 636 days. With Floria, accurate haplotyping of metagenomic datasets takes mere minutes on standard workstations, paving the way for extensive strain-level metagenomic analyses. AVAILABILITY AND IMPLEMENTATION Floria is available at https://github.com/bluenote-1577/floria, and the Floria-PL pipeline is available at https://github.com/jsgounot/Floria_analysis_workflow along with code for reproducing the benchmarks.
Collapse
Affiliation(s)
- Jim Shaw
- Department of Mathematics, University of Toronto, Toronto, Ontario, M5S 2E4, Canada
| | - Jean-Sebastien Gounot
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Singapore, 138672, Republic of Singapore
| | - Hanrong Chen
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Singapore, 138672, Republic of Singapore
| | - Niranjan Nagarajan
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Singapore, 138672, Republic of Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 117597, Republic of Singapore
| | - Yun William Yu
- Department of Mathematics, University of Toronto, Toronto, Ontario, M5S 2E4, Canada
- Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, 15213, United States
| |
Collapse
|
26
|
Schoelmerich MC, Ly L, West-Roberts J, Shi LD, Shen C, Malvankar NS, Taib N, Gribaldo S, Woodcroft BJ, Schadt CW, Al-Shayeb B, Dai X, Mozsary C, Hickey S, He C, Beaulaurier J, Juul S, Sachdeva R, Banfield JF. Borg extrachromosomal elements of methane-oxidizing archaea have conserved and expressed genetic repertoires. Nat Commun 2024; 15:5414. [PMID: 38926353 PMCID: PMC11208441 DOI: 10.1038/s41467-024-49548-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Accepted: 06/10/2024] [Indexed: 06/28/2024] Open
Abstract
Borgs are huge extrachromosomal elements (ECE) of anaerobic methane-consuming "Candidatus Methanoperedens" archaea. Here, we used nanopore sequencing to validate published complete genomes curated from short reads and to reconstruct new genomes. 13 complete and four near-complete linear genomes share 40 genes that define a largely syntenous genome backbone. We use these conserved genes to identify new Borgs from peatland soil and to delineate Borg phylogeny, revealing two major clades. Remarkably, Borg genes encoding nanowire-like electron-transferring cytochromes and cell surface proteins are more highly expressed than those of host Methanoperedens, indicating that Borgs augment the Methanoperedens activity in situ. We reconstructed the first complete 4.00 Mbp genome for a Methanoperedens that is inferred to be a Borg host and predicted its methylation motifs, which differ from pervasive TC and CC methylation motifs of the Borgs. Thus, methylation may enable Methanoperedens to distinguish their genomes from those of Borgs. Very high Borg to Methanoperedens ratios and structural predictions suggest that Borgs may be capable of encapsulation. The findings clearly define Borgs as a distinct class of ECE with shared genomic signatures, establish their diversification from a common ancestor with genetic inheritance, and raise the possibility of periodic existence outside of host cells.
Collapse
Affiliation(s)
- Marie C Schoelmerich
- Innovative Genomics Institute, University of California, Berkeley, CA, USA
- Department of Environmental Systems Sciences, ETH Zurich, 8092, Zurich, Switzerland
| | - Lynn Ly
- Oxford Nanopore Technologies Inc, New York, NY, USA
| | - Jacob West-Roberts
- Department of Environmental Science, Policy and Management, University of California, Berkeley, CA, USA
| | - Ling-Dong Shi
- Innovative Genomics Institute, University of California, Berkeley, CA, USA
| | - Cong Shen
- Microbial Sciences Institute, Yale University, New Haven, CT, USA
- Deptartment of Molecular Biophysics and Biochemistry, Yale University, West Haven, CT, USA
| | - Nikhil S Malvankar
- Microbial Sciences Institute, Yale University, New Haven, CT, USA
- Deptartment of Molecular Biophysics and Biochemistry, Yale University, West Haven, CT, USA
| | - Najwa Taib
- Institut Pasteur, Université de Paris cité, Unit Evolutionary Biology of the Microbial Cell, Paris, France
| | - Simonetta Gribaldo
- Institut Pasteur, Université de Paris cité, Unit Evolutionary Biology of the Microbial Cell, Paris, France
| | - Ben J Woodcroft
- Centre for Microbiome Research, School of Biomedical Sciences, Queensland University of Technology (QUT), Translational Research Institute, Woolloongabba, QLD, Australia
| | - Christopher W Schadt
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
- Department of Microbiology, University of Tennessee, Knoxville, TN, USA
| | - Basem Al-Shayeb
- Innovative Genomics Institute, University of California, Berkeley, CA, USA
| | | | | | - Scott Hickey
- Oxford Nanopore Technologies Inc, New York, NY, USA
| | - Christine He
- Oxford Nanopore Technologies Inc, New York, NY, USA
| | | | - Sissel Juul
- Oxford Nanopore Technologies Inc, New York, NY, USA
| | - Rohan Sachdeva
- Innovative Genomics Institute, University of California, Berkeley, CA, USA
| | - Jillian F Banfield
- Innovative Genomics Institute, University of California, Berkeley, CA, USA.
- Department of Environmental Science, Policy and Management, University of California, Berkeley, CA, USA.
- Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia.
- Department of Earth and Planetary Science, University of California, Berkeley, CA, USA.
| |
Collapse
|
27
|
Pinto Y, Bhatt AS. Sequencing-based analysis of microbiomes. Nat Rev Genet 2024:10.1038/s41576-024-00746-6. [PMID: 38918544 DOI: 10.1038/s41576-024-00746-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/15/2024] [Indexed: 06/27/2024]
Abstract
Microbiomes occupy a range of niches and, in addition to having diverse compositions, they have varied functional roles that have an impact on agriculture, environmental sciences, and human health and disease. The study of microbiomes has been facilitated by recent technological and analytical advances, such as cheaper and higher-throughput DNA and RNA sequencing, improved long-read sequencing and innovative computational analysis methods. These advances are providing a deeper understanding of microbiomes at the genomic, transcriptional and translational level, generating insights into their function and composition at resolutions beyond the species level.
Collapse
Affiliation(s)
- Yishay Pinto
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Medicine, Divisions of Hematology and Blood & Marrow Transplantation, Stanford University, Stanford, CA, USA
| | - Ami S Bhatt
- Department of Genetics, Stanford University, Stanford, CA, USA.
- Department of Medicine, Divisions of Hematology and Blood & Marrow Transplantation, Stanford University, Stanford, CA, USA.
| |
Collapse
|
28
|
Pan C, Reinert K. Leaf: an ultrafast filter for population-scale long-read SV detection. Genome Biol 2024; 25:155. [PMID: 38872200 PMCID: PMC11170821 DOI: 10.1186/s13059-024-03297-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 06/04/2024] [Indexed: 06/15/2024] Open
Abstract
Advances in sequencing technology have facilitated population-scale long-read structural variant (SV) detection. Arguably, one of the main challenges in population-scale analysis is developing effective computational pipelines. Here, we present a new filter-based pipeline for population-scale long-read SV detection. It better captures SV signals at an early stage than conventional assembly-based or alignment-based pipelines. Assessments in this work suggest that the filter-based pipeline helps better resolve intra-read rearrangements. Moreover, it is also more computationally efficient than conventional pipelines and thus may facilitate population-scale long-read applications.
Collapse
Affiliation(s)
- Chenxu Pan
- Department of Mathematics and Computer Science, Freie Universität Berlin, Takustr. 9, 14195, Berlin, Germany.
| | - Knut Reinert
- Department of Mathematics and Computer Science, Freie Universität Berlin, Takustr. 9, 14195, Berlin, Germany
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, 14195, Germany
| |
Collapse
|
29
|
Young R, Ahmed KA, Court L, Castro-Vargas C, Marcora A, Boctor J, Paull C, Wijffels G, Rane R, Edwards O, Walsh T, Pandey G. Improved reference quality genome sequence of the plastic-degrading greater wax moth, Galleria mellonella. G3 (BETHESDA, MD.) 2024; 14:jkae070. [PMID: 38564250 DOI: 10.1093/g3journal/jkae070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 12/19/2023] [Accepted: 03/22/2024] [Indexed: 04/04/2024]
Abstract
Galleria mellonella is a pest of honeybees in many countries because its larvae feed on beeswax. However, G. mellonella larvae can also eat various plastics, including polyethylene, polystyrene, and polypropylene, and therefore, the species is garnering increasing interest as a tool for plastic biodegradation research. This paper presents an improved genome (99.3% completed lepidoptera_odb10 BUSCO; genome mode) for G. mellonella. This 472 Mb genome is in 221 contigs with an N50 of 6.4 Mb and contains 13,604 protein-coding genes. Genes that code for known and putative polyethylene-degrading enzymes and their similarity to proteins found in other Lepidoptera are highlighted. An analysis of secretory proteins more likely to be involved in the plastic catabolic process has also been carried out.
Collapse
Affiliation(s)
| | | | - Leon Court
- CSIRO Environment, Acton, ACT 2601, Australia
| | | | - Anna Marcora
- CSIRO Agriculture and Food, Dutton Park, QLD 4102, Australia
| | - Joseph Boctor
- Bioplastics Innovation Hub, Food Futures Institute, Murdoch University, Murdoch, WA 6150, Australia
| | - Cate Paull
- CSIRO Agriculture and Food, Dutton Park, QLD 4102, Australia
| | - Gene Wijffels
- CSIRO Agriculture and Food, St Lucia, QLD 4067, Australia
| | - Rahul Rane
- CSIRO Health and Biosecurity, Parkville, VIC 3052, Australia
| | | | - Tom Walsh
- CSIRO Environment, Acton, ACT 2601, Australia
| | | |
Collapse
|
30
|
Chen Z, Grim CJ, Ramachandran P, Meng J. Advancing metagenome-assembled genome-based pathogen identification: unraveling the power of long-read assembly algorithms in Oxford Nanopore sequencing. Microbiol Spectr 2024; 12:e0011724. [PMID: 38687063 PMCID: PMC11237517 DOI: 10.1128/spectrum.00117-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 04/05/2024] [Indexed: 05/02/2024] Open
Abstract
Oxford Nanopore sequencing is one of the high-throughput sequencing technologies that facilitates the reconstruction of metagenome-assembled genomes (MAGs). This study aimed to assess the potential of long-read assembly algorithms in Oxford Nanopore sequencing to enhance the MAG-based identification of bacterial pathogens using both simulated and mock communities. Simulated communities were generated to mimic those on fresh spinach and in surface water. Long reads were produced using R9.4.1+SQK-LSK109 and R10.4 + SQK-LSK112, with 0.5, 1, and 2 million reads. The simulated bacterial communities included multidrug-resistant Salmonella enterica serotypes Heidelberg, Montevideo, and Typhimurium in the fresh spinach community individually or in combination, as well as multidrug-resistant Pseudomonas aeruginosa in the surface water community. Real data sets of the ZymoBIOMICS HMW DNA Standard were also studied. A bioinformatic pipeline (MAGenie, freely available at https://github.com/jackchen129/MAGenie) that combines metagenome assembly, taxonomic classification, and sequence extraction was developed to reconstruct draft MAGs from metagenome assemblies. Five assemblers were evaluated based on a series of genomic analyses. Overall, Flye outperformed the other assemblers, followed by Shasta, Raven, and Unicycler, while Canu performed least effectively. In some instances, the extracted sequences resulted in draft MAGs and provided the locations and structures of antimicrobial resistance genes and mobile genetic elements. Our study showcases the viability of utilizing the extracted sequences for precise phylogenetic inference, as demonstrated by the consistent alignment of phylogenetic topology between the reference genome and the extracted sequences. R9.4.1+SQK-LSK109 was more effective in most cases than R10.4+SQK-LSK112, and greater sequencing depths generally led to more accurate results.IMPORTANCEBy examining diverse bacterial communities, particularly those housing multiple Salmonella enterica serotypes, this study holds significance in uncovering the potential of long-read assembly algorithms to improve metagenome-assembled genome (MAG)-based pathogen identification through Oxford Nanopore sequencing. Our research demonstrates that long-read assembly stands out as a promising avenue for boosting precision in MAG-based pathogen identification, thus advancing the development of more robust surveillance measures. The findings also support ongoing endeavors to fine-tune a bioinformatic pipeline for accurate pathogen identification within complex metagenomic samples.
Collapse
Affiliation(s)
- Zhao Chen
- Joint Institute for Food Safety and Applied Nutrition, Center for Food Safety and Security Systems, University of Maryland, College Park, Maryland, USA
| | - Christopher J. Grim
- Center for Food Safety and Applied Nutrition, United States Food and Drug Administration, College Park, Maryland, USA
| | - Padmini Ramachandran
- Center for Food Safety and Applied Nutrition, United States Food and Drug Administration, College Park, Maryland, USA
| | - Jianghong Meng
- Joint Institute for Food Safety and Applied Nutrition, Center for Food Safety and Security Systems, University of Maryland, College Park, Maryland, USA
- Department of Nutrition and Food Science, University of Maryland, College Park, Maryland, USA
| |
Collapse
|
31
|
Goussarov G, Mysara M, Cleenwerck I, Claesen J, Leys N, Vandamme P, Van Houdt R. Benchmarking short-, long- and hybrid-read assemblers for metagenome sequencing of complex microbial communities. MICROBIOLOGY (READING, ENGLAND) 2024; 170:001469. [PMID: 38916949 PMCID: PMC11261854 DOI: 10.1099/mic.0.001469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 05/23/2024] [Indexed: 06/26/2024]
Abstract
Metagenome community analyses, driven by the continued development in sequencing technology, is rapidly providing insights in many aspects of microbiology and becoming a cornerstone tool. Illumina, Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PacBio) are the leading technologies, each with their own advantages and drawbacks. Illumina provides accurate reads at a low cost, but their length is too short to close bacterial genomes. Long reads overcome this limitation, but these technologies produce reads with lower accuracy (ONT) or with lower throughput (PacBio high-fidelity reads). In a critical first analysis step, reads are assembled to reconstruct genomes or individual genes within the community. However, to date, the performance of existing assemblers has never been challenged with a complex mock metagenome. Here, we evaluate the performance of current assemblers that use short, long or both read types on a complex mock metagenome consisting of 227 bacterial strains with varying degrees of relatedness. We show that many of the current assemblers are not suited to handle such a complex metagenome. In addition, hybrid assemblies do not fulfil their potential. We conclude that ONT reads assembled with CANU and Illumina reads assembled with SPAdes offer the best value for reconstructing genomes and individual genes of complex metagenomes, respectively.
Collapse
Affiliation(s)
- Gleb Goussarov
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of Sciences, Ghent University, Ghent, Belgium
| | - Mohamed Mysara
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium
- Bioinformatics group, Information Technology & Computer Science, Nile University, Giza, Egypt
| | - Ilse Cleenwerck
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of Sciences, Ghent University, Ghent, Belgium
| | - Jürgen Claesen
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium
| | - Natalie Leys
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium
| | - Peter Vandamme
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of Sciences, Ghent University, Ghent, Belgium
| | - Rob Van Houdt
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium
| |
Collapse
|
32
|
Agustinho DP, Fu Y, Menon VK, Metcalf GA, Treangen TJ, Sedlazeck FJ. Unveiling microbial diversity: harnessing long-read sequencing technology. Nat Methods 2024; 21:954-966. [PMID: 38689099 DOI: 10.1038/s41592-024-02262-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 03/29/2024] [Indexed: 05/02/2024]
Abstract
Long-read sequencing has recently transformed metagenomics, enhancing strain-level pathogen characterization, enabling accurate and complete metagenome-assembled genomes, and improving microbiome taxonomic classification and profiling. These advancements are not only due to improvements in sequencing accuracy, but also happening across rapidly changing analysis methods. In this Review, we explore long-read sequencing's profound impact on metagenomics, focusing on computational pipelines for genome assembly, taxonomic characterization and variant detection, to summarize recent advancements in the field and provide an overview of available analytical methods to fully leverage long reads. We provide insights into the advantages and disadvantages of long reads over short reads and their evolution from the early days of long-read sequencing to their recent impact on metagenomics and clinical diagnostics. We further point out remaining challenges for the field such as the integration of methylation signals in sub-strain analysis and the lack of benchmarks.
Collapse
Affiliation(s)
- Daniel P Agustinho
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA
| | - Yilei Fu
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Vipin K Menon
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA
- Senior research project manager, Human Genetics, Genentech, South San Francisco, CA, USA
| | - Ginger A Metcalf
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA
| | - Todd J Treangen
- Department of Computer Science, Rice University, Houston, TX, USA
- Department of Bioengineering, Rice University, Houston, TX, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA.
- Department of Computer Science, Rice University, Houston, TX, USA.
| |
Collapse
|
33
|
Mo C, Wang H, Wei M, Zeng Q, Zhang X, Fei Z, Zhang Y, Kong Q. Complete genome assembly provides a high-quality skeleton for pan-NLRome construction in melon. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 118:2249-2268. [PMID: 38430487 DOI: 10.1111/tpj.16705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 02/16/2024] [Accepted: 02/22/2024] [Indexed: 03/03/2024]
Abstract
Melon (Cucumis melo L.), being under intensive domestication and selective breeding, displays an abundant phenotypic diversity. Wild germplasm with tolerance to stress represents an untapped genetic resource for discovery of disease-resistance genes. To comprehensively characterize resistance genes in melon, we generate a telomere-to-telomere (T2T) and gap-free genome of wild melon accession PI511890 (C. melo var. chito) with a total length of 375.0 Mb and a contig N50 of 31.24 Mb. The complete genome allows us to dissect genome architecture and identify resistance gene analogs. We construct a pan-NLRome using seven melon genomes, which include 208 variable and 18 core nucleotide-binding leucine-rich repeat receptors (NLRs). Multiple disease-related transcriptome analyses indicate that most up-regulated NLRs induced by pathogens are shell or cloud NLRs. The T2T gap-free assembly and the pan-NLRome not only serve as essential resources for genomic studies and molecular breeding of melon but also provide insights into the genome architecture and NLR diversity.
Collapse
Affiliation(s)
- Changjuan Mo
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Haiyan Wang
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Minghua Wei
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Qingguo Zeng
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Xuejun Zhang
- Hami-melon Research Center, Xinjiang Academy of Agricultural Sciences, Urumqi, 830091, China
| | | | - Yongbing Zhang
- Hami-melon Research Center, Xinjiang Academy of Agricultural Sciences, Urumqi, 830091, China
| | - Qiusheng Kong
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
34
|
Zhang Z, Xiao J, Wang H, Yang C, Huang Y, Yue Z, Chen Y, Han L, Yin K, Lyu A, Fang X, Zhang L. Exploring high-quality microbial genomes by assembling short-reads with long-range connectivity. Nat Commun 2024; 15:4631. [PMID: 38821971 PMCID: PMC11143213 DOI: 10.1038/s41467-024-49060-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Accepted: 05/17/2024] [Indexed: 06/02/2024] Open
Abstract
Although long-read sequencing enables the generation of complete genomes for unculturable microbes, its high cost limits the widespread adoption of long-read sequencing in large-scale metagenomic studies. An alternative method is to assemble short-reads with long-range connectivity, which can be a cost-effective way to generate high-quality microbial genomes. Here, we develop Pangaea, a bioinformatic approach designed to enhance metagenome assembly using short-reads with long-range connectivity. Pangaea leverages connectivity derived from physical barcodes of linked-reads or virtual barcodes by aligning short-reads to long-reads. Pangaea utilizes a deep learning-based read binning algorithm to assemble co-barcoded reads exhibiting similar sequence contexts and abundances, thereby improving the assembly of high- and medium-abundance microbial genomes. Pangaea also leverages a multi-thresholding algorithm strategy to refine assembly for low-abundance microbes. We benchmark Pangaea on linked-reads and a combination of short- and long-reads from simulation data, mock communities and human gut metagenomes. Pangaea achieves significantly higher contig continuity as well as more near-complete metagenome-assembled genomes (NCMAGs) than the existing assemblers. Pangaea also generates three complete and circular NCMAGs on the human gut microbiomes.
Collapse
Grants
- This research was partially supported by the Young Collaborative Research Grant (C2004-23Y, L.Z.), HMRF (11221026, L.Z.), the open project of BGI-Shenzhen, Shenzhen 518000, China (BGIRSZ20220012, L.Z.), the Hong Kong Research Grant Council Early Career Scheme (HKBU 22201419, L.Z.), HKBU Start-up Grant Tier 2 (RC-SGT2/19-20/SCI/007, L.Z.), HKBU IRCMS (No. IRCMS/19-20/D02, L.Z.).
- This research was partially supported by the open project of BGI-Shenzhen, Shenzhen 518000, China (BGIRSZ20220014, KJ.Y.).
- The study were partially supported by the Science Technology and Innovation Committee of Shenzhen Municipality, China (SGDX20190919142801722, XD.F.),
Collapse
Affiliation(s)
- Zhenmiao Zhang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | - Jin Xiao
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | - Hongbo Wang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | - Chao Yang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | | | - Zhen Yue
- BGI Research, Sanya, 572025, China
| | - Yang Chen
- State Key Laboratory of Dampness Syndrome of Chinese Medicine, The Second Affiliated Hospital of Guangzhou University of Chinese, Guangzhou, China
| | - Lijuan Han
- Department of Scientific Research, Kangmeihuada GeneTech Co., Ltd (KMHD), Shenzhen, China
| | - Kejing Yin
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
- Institute for Research and Continuing Education, Hong Kong Baptist University, Shenzhen, China
| | - Aiping Lyu
- School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, China
| | - Xiaodong Fang
- BGI Research, Shenzhen, 518083, China
- BGI Research, Sanya, 572025, China
- Department of Scientific Research, Kangmeihuada GeneTech Co., Ltd (KMHD), Shenzhen, China
| | - Lu Zhang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China.
- Institute for Research and Continuing Education, Hong Kong Baptist University, Shenzhen, China.
| |
Collapse
|
35
|
Wattanasombat S, Tongjai S. Easing genomic surveillance: A comprehensive performance evaluation of long-read assemblers across multi-strain mixture data of HIV-1 and Other pathogenic viruses for constructing a user-friendly bioinformatic pipeline. F1000Res 2024; 13:556. [PMID: 38984017 PMCID: PMC11231628 DOI: 10.12688/f1000research.149577.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/14/2024] [Indexed: 07/11/2024] Open
Abstract
Background Determining the appropriate computational requirements and software performance is essential for efficient genomic surveillance. The lack of standardized benchmarking complicates software selection, especially with limited resources. Methods We developed a containerized benchmarking pipeline to evaluate seven long-read assemblers-Canu, GoldRush, MetaFlye, Strainline, HaploDMF, iGDA, and RVHaplo-for viral haplotype reconstruction, using both simulated and experimental Oxford Nanopore sequencing data of HIV-1 and other viruses. Benchmarking was conducted on three computational systems to assess each assembler's performance, utilizing QUAST and BLASTN for quality assessment. Results Our findings show that assembler choice significantly impacts assembly time, with CPU and memory usage having minimal effect. Assembler selection also influences the size of the contigs, with a minimum read length of 2,000 nucleotides required for quality assembly. A 4,000-nucleotide read length improves quality further. Canu was efficient among de novo assemblers but not suitable for multi-strain mixtures, while GoldRush produced only consensus assemblies. Strainline and MetaFlye were suitable for metagenomic sequencing data, with Strainline requiring high memory and MetaFlye operable on low-specification machines. Among reference-based assemblers, iGDA had high error rates, RVHaplo showed the best runtime and accuracy but became ineffective with similar sequences, and HaploDMF, utilizing machine learning, had fewer errors with a slightly longer runtime. Conclusions The HIV-64148 pipeline, containerized using Docker, facilitates easy deployment and offers flexibility to select from a range of assemblers to match computational systems or study requirements. This tool aids in genome assembly and provides valuable information on HIV-1 sequences, enhancing viral evolution monitoring and understanding.
Collapse
Affiliation(s)
- Sara Wattanasombat
- Department of Microbiology, Faculty of Medicine, Chiang Mai University, Chiang Mai, 50200, Thailand
| | - Siripong Tongjai
- Department of Microbiology, Faculty of Medicine, Chiang Mai University, Chiang Mai, 50200, Thailand
| |
Collapse
|
36
|
Chen X, Li H, Dong Y, Xu Y, Xu K, Zhang Q, Yao Z, Yu Q, Zhang H, Zhang Z. A wild melon reference genome provides novel insights into the domestication of a key gene responsible for melon fruit acidity. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:144. [PMID: 38809285 DOI: 10.1007/s00122-024-04647-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 05/07/2024] [Indexed: 05/30/2024]
Abstract
KEY MESSAGE A wild melon reference genome elucidates the genomic basis of fruit acidity domestication. Structural variants (SVs) have been reported to impose major effects on agronomic traits, representing a significant contributor to crop domestication. However, the landscape of SVs between wild and cultivated melons is elusive and how SVs have contributed to melon domestication remains largely unexplored. Here, we report a 379-Mb chromosome-scale genome of a wild progenitor melon accession "P84", with a contig N50 of 14.9 Mb. Genome comparison identifies 10,589 SVs between P84 and four cultivated melons with 6937 not characterized in previously analysis of 25 melon genome sequences. Furthermore, the population-scale genotyping of these SVs was determined in 1175 accessions, and 18 GWAS signals including fruit acidity, fruit length, fruit weight, fruit color and sex determination were detected. Based on these genotyped SVs, we identified 3317 highly diverged SVs between wild and cultivated melons, which could be the potential SVs associated with domestication-related traits. Furthermore, we identify novel SVs affecting fruit acidity and proposed the diverged evolutionary trajectories of CmPH, a key regulator of melon fruit acidity, during domestication and selection of different populations. These results will offer valuable resources for genomic studies and genetic improvement in melon.
Collapse
Affiliation(s)
- Xinxiu Chen
- Engineering Laboratory of Genetic Improvement of Horticultural Crops of Shandong Province, College of Horticulture, Qingdao Agricultural University, Qingdao, 266109, China
| | - Hongbo Li
- Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Shenzhen Branch, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
- College of Horticulture Science and Engineering, Shandong Agricultural University, Tai'an, 271018, Shandong, China
| | - Yuanhua Dong
- College of Horticulture Science and Engineering, Shandong Agricultural University, Tai'an, 271018, Shandong, China
| | - Yuanchao Xu
- Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Shenzhen Branch, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
| | - Kuipeng Xu
- Engineering Laboratory of Genetic Improvement of Horticultural Crops of Shandong Province, College of Horticulture, Qingdao Agricultural University, Qingdao, 266109, China
| | - Qiqi Zhang
- College of Horticulture Science and Engineering, Shandong Agricultural University, Tai'an, 271018, Shandong, China
| | - Zhiwang Yao
- Engineering Laboratory of Genetic Improvement of Horticultural Crops of Shandong Province, College of Horticulture, Qingdao Agricultural University, Qingdao, 266109, China
| | - Qing Yu
- Engineering Laboratory of Genetic Improvement of Horticultural Crops of Shandong Province, College of Horticulture, Qingdao Agricultural University, Qingdao, 266109, China
| | - Huimin Zhang
- Engineering Laboratory of Genetic Improvement of Horticultural Crops of Shandong Province, College of Horticulture, Qingdao Agricultural University, Qingdao, 266109, China.
| | - Zhonghua Zhang
- Engineering Laboratory of Genetic Improvement of Horticultural Crops of Shandong Province, College of Horticulture, Qingdao Agricultural University, Qingdao, 266109, China.
| |
Collapse
|
37
|
Makri N, Ring N, Shaw DJ, Athinodorou A, Robinson V, Paterson GK, Richardson J, Gow D, Nuttall T. Cytological evaluation, culture and genomics to evaluate the microbiome in healthy rabbit external ear canals. Vet Dermatol 2024. [PMID: 38742484 DOI: 10.1111/vde.13256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 02/27/2024] [Accepted: 04/18/2024] [Indexed: 05/16/2024]
Abstract
BACKGROUND Lop-eared rabbits may be predisposed to otitis externa (OE) as a consequence of their ear conformation. Although otoscopy, otic cytological evaluation and culture are valuable tools in dogs and cats, published data on rabbits remain lacking. HYPOTHESIS/OBJECTIVES This study aimed to assess the utility of otoscopy and cytological results in evaluating healthy rabbit external ear canals (EECs) and to characterise ear cytological and microbiological findings through culture techniques and metagenomic sequencing. ANIMALS Sixty-three otitis-free client-owned rabbits. MATERIALS AND METHODS All rabbits underwent otoscopy and ear cytological evaluation. In a subset of 12 rabbits, further bacterial and fungal culture, fungal DNA assessment and metagenomic sequencing were performed. RESULTS Otic cytological results revealed yeast in 73%, cocci in 42.9% and rods in 28.6% of healthy rabbit EECs. Compared to upright-eared rabbits, lop-eared rabbits had more discharge and more bacteria per oil immersion field. Culture isolated eight different species yet metagenomic sequencing identified 36, belonging to the Bacillota (Firmicutes), Pseudomonadota and Actinomycetota phyla. Staphylococcus were the most commonly observed species with both methods. Ten of 12 rabbits were yeast-positive on cytological evaluation with only three yielding fungal growth identified as Yarrowia (Candida) lipolytica, Eurotium echinulatum and Cystofilobasidium infirmominiatum. CONCLUSIONS AND CLINICAL RELEVANCE Healthy rabbit EECs lack inflammatory cells yet can host yeast and bacteria, emphasising the need to evaluate cytological results alongside the clinical signs. Lop-ear anatomy may predispose to bacterial overgrowth and OE. Notably, yeasts may be present despite a negative culture.
Collapse
Affiliation(s)
- Nikoleta Makri
- Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Midlothian, UK
| | - Natalie Ring
- Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Midlothian, UK
| | - Darren J Shaw
- Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Midlothian, UK
| | - Athinodoros Athinodorou
- Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Midlothian, UK
| | - Victoria Robinson
- Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Midlothian, UK
| | - Gavin K Paterson
- Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Midlothian, UK
| | - Jenna Richardson
- Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Midlothian, UK
| | - Debbie Gow
- Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Midlothian, UK
| | - Tim Nuttall
- Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Midlothian, UK
| |
Collapse
|
38
|
Warwick-Dugdale J, Tian F, Michelsen ML, Cronin DR, Moore K, Farbos A, Chittick L, Bell A, Zayed AA, Buchholz HH, Bolanos LM, Parsons RJ, Allen MJ, Sullivan MB, Temperton B. Long-read powered viral metagenomics in the oligotrophic Sargasso Sea. Nat Commun 2024; 15:4089. [PMID: 38744831 PMCID: PMC11094077 DOI: 10.1038/s41467-024-48300-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 04/24/2024] [Indexed: 05/16/2024] Open
Abstract
Dominant microorganisms of the Sargasso Sea are key drivers of the global carbon cycle. However, associated viruses that shape microbial community structure and function are not well characterised. Here, we combined short and long read sequencing to survey Sargasso Sea phage communities in virus- and cellular fractions at viral maximum (80 m) and mesopelagic (200 m) depths. We identified 2,301 Sargasso Sea phage populations from 186 genera. Over half of the phage populations identified here lacked representation in global ocean viral metagenomes, whilst 177 of the 186 identified genera lacked representation in genomic databases of phage isolates. Viral fraction and cell-associated viral communities were decoupled, indicating viral turnover occurred across periods longer than the sampling period of three days. Inclusion of long-read data was critical for capturing the breadth of viral diversity. Phage isolates that infect the dominant bacterial taxa Prochlorococcus and Pelagibacter, usually regarded as cosmopolitan and abundant, were poorly represented.
Collapse
Affiliation(s)
- Joanna Warwick-Dugdale
- School of Biosciences, University of Exeter, Exeter, Devon, EX4 4SB, UK.
- Plymouth Marine Laboratory, Plymouth, Devon, PL1 3DH, UK.
| | - Funing Tian
- Center of Microbiome Science and Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
| | | | - Dylan R Cronin
- Center of Microbiome Science and Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- EMERGE Biology Integration Institute, Ohio State University, Columbus, OH, 43210, USA
| | - Karen Moore
- School of Biosciences, University of Exeter, Exeter, Devon, EX4 4SB, UK
| | - Audrey Farbos
- School of Biosciences, University of Exeter, Exeter, Devon, EX4 4SB, UK
| | - Lauren Chittick
- Center of Microbiome Science and Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
| | - Ashley Bell
- School of Biosciences, University of Exeter, Exeter, Devon, EX4 4SB, UK
| | - Ahmed A Zayed
- Center of Microbiome Science and Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- EMERGE Biology Integration Institute, Ohio State University, Columbus, OH, 43210, USA
| | - Holger H Buchholz
- School of Biosciences, University of Exeter, Exeter, Devon, EX4 4SB, UK
- Department of Microbiology, Oregon State University, Corvallis, OR, 97331, USA
| | - Luis M Bolanos
- School of Biosciences, University of Exeter, Exeter, Devon, EX4 4SB, UK
| | - Rachel J Parsons
- Bermuda Institute of Ocean Sciences, St.George's, GE, 01, Bermuda
- School of Ocean Futures, Arizona State University, Tempe, AZ, US
| | - Michael J Allen
- School of Biosciences, University of Exeter, Exeter, Devon, EX4 4SB, UK
| | - Matthew B Sullivan
- Center of Microbiome Science and Department of Microbiology, Ohio State University, Columbus, OH, 43210, USA
- EMERGE Biology Integration Institute, Ohio State University, Columbus, OH, 43210, USA
- Department of Civil, Environmental, and Geodetic Engineering, Ohio State University, Columbus, OH, 43210, USA
| | - Ben Temperton
- School of Biosciences, University of Exeter, Exeter, Devon, EX4 4SB, UK.
| |
Collapse
|
39
|
Buysse M, Koual R, Binetruy F, de Thoisy B, Baudrimont X, Garnier S, Douine M, Chevillon C, Delsuc F, Catzeflis F, Bouchon D, Duron O. Detection of Anaplasma and Ehrlichia bacteria in humans, wildlife, and ticks in the Amazon rainforest. Nat Commun 2024; 15:3988. [PMID: 38734682 PMCID: PMC11088697 DOI: 10.1038/s41467-024-48459-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 04/29/2024] [Indexed: 05/13/2024] Open
Abstract
Tick-borne bacteria of the genera Ehrlichia and Anaplasma cause several emerging human infectious diseases worldwide. In this study, we conduct an extensive survey for Ehrlichia and Anaplasma infections in the rainforests of the Amazon biome of French Guiana. Through molecular genetics and metagenomics reconstruction, we observe a high indigenous biodiversity of infections circulating among humans, wildlife, and ticks inhabiting these ecosystems. Molecular typing identifies these infections as highly endemic, with a majority of new strains and putative species specific to French Guiana. They are detected in unusual rainforest wild animals, suggesting they have distinctive sylvatic transmission cycles. They also present potential health hazards, as revealed by the detection of Candidatus Anaplasma sparouinense in human red blood cells and that of a new close relative of the human pathogen Ehrlichia ewingii, Candidatus Ehrlichia cajennense, in the tick species that most frequently bite humans in South America. The genome assembly of three new putative species obtained from human, sloth, and tick metagenomes further reveals the presence of major homologs of Ehrlichia and Anaplasma virulence factors. These observations converge to classify health hazards associated with Ehrlichia and Anaplasma infections in the Amazon biome as distinct from those in the Northern Hemisphere.
Collapse
Affiliation(s)
- Marie Buysse
- MIVEGEC, University of Montpellier, CNRS, IRD, Montpellier, France
| | - Rachid Koual
- MIVEGEC, University of Montpellier, CNRS, IRD, Montpellier, France
| | - Florian Binetruy
- MIVEGEC, University of Montpellier, CNRS, IRD, Montpellier, France
| | - Benoit de Thoisy
- Laboratoire des Interactions Virus-Hôtes, Institut Pasteur de Guyane, Cayenne, France
- Association Kwata 'Study and Conservation of Guianan Wildlife', Cayenne, France
| | - Xavier Baudrimont
- Direction Générale des Territoires et de la Mer (DGTM) - Direction de l'environnement, de l'agriculture, de l'alimentation et de la forêt (DEAAF), Cayenne, France
| | - Stéphane Garnier
- Biogéosciences, UMR 6282 uB/CNRS/EPHE, Université Bourgogne Franche-Comté, Dijon, France
| | - Maylis Douine
- Centre d'Investigation Clinique Antilles-Guyane, INSERM 1424, Centre Hospitalier de Cayenne, Cayenne, France
| | | | - Frédéric Delsuc
- Institut des Sciences de l'Evolution de Montpellier (ISEM), CNRS, IRD, EPHE, Université de Montpellier, Montpellier, France
| | - François Catzeflis
- Institut des Sciences de l'Evolution de Montpellier (ISEM), CNRS, IRD, EPHE, Université de Montpellier, Montpellier, France
| | - Didier Bouchon
- EBI, University of Poitiers, UMR CNRS 7267, Poitiers, France
| | - Olivier Duron
- MIVEGEC, University of Montpellier, CNRS, IRD, Montpellier, France.
| |
Collapse
|
40
|
Wang YC, Mao Y, Fu HM, Wang J, Weng X, Liu ZH, Xu XW, Yan P, Fang F, Guo JS, Shen Y, Chen YP. New insights into functional divergence and adaptive evolution of uncultured bacteria in anammox community by complete genome-centric analysis. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 924:171530. [PMID: 38453092 DOI: 10.1016/j.scitotenv.2024.171530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 11/13/2023] [Accepted: 03/04/2024] [Indexed: 03/09/2024]
Abstract
Anaerobic ammonium-oxidation (anammox) bacteria play a crucial role in global nitrogen cycling and wastewater nitrogen removal, but they share symbiotic relationships with various other microorganisms. Functional divergence and adaptive evolution of uncultured bacteria in anammox community remain underexplored. Although shotgun metagenomics based on short reads has been widely used in anammox research, metagenome-assembled genomes (MAGs) are often discontinuous and highly contaminated, which limits in-depth analyses of anammox communities. Here, for the first time, we performed Pacific Biosciences high-fidelity (HiFi) long-read sequencing on the anammox granule sludge sample from a lab-scale bioreactor, and obtained 30 accurate and complete metagenome-assembled genomes (cMAGs). These cMAGs were obtained by selecting high-quality circular contigs from initial assemblies of long reads generated by HiFi sequencing, eliminating the need for Illumina short reads, binning, and reassembly. One new anammox species affiliated with Candidatus Jettenia and three species affiliated with novel families were found in this anammox community. cMAG-centric analysis revealed functional divergence in general and nitrogen metabolism among the anammox community members, and they might adopt a cross-feeding strategy in organic matter, cofactors, and vitamins. Furthermore, we identified 63 mobile genetic elements (MGEs) and 50 putative horizontal gene transfer (HGT) events within these cMAGs. The results suggest that HGT events and MGEs related to phage and integration or excision, particularly transposons containing tnpA in anammox bacteria, might play important roles in the adaptive evolution of this anammox community. The cMAGs generated in the present study could be used to establish of a comprehensive database for anammox bacteria and associated microorganisms. These findings highlight the advantages of HiFi sequencing for the studies of complex mixed cultures and advance the understanding of anammox communities.
Collapse
Affiliation(s)
- Yi-Cheng Wang
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Yanping Mao
- College of Chemistry and Environmental Engineering, Shenzhen University, Shenzhen 518071, Guangdong, China
| | - Hui-Min Fu
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China; National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University, Chongqing 400067, China
| | - Jin Wang
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Xun Weng
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Zi-Hao Liu
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Xiao-Wei Xu
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Peng Yan
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Fang Fang
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Jin-Song Guo
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China
| | - Yu Shen
- National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University, Chongqing 400067, China
| | - You-Peng Chen
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environments of MOE, Chongqing University, Chongqing 400045, China.
| |
Collapse
|
41
|
Greenman N, Hassouneh SAD, Abdelli LS, Johnston C, Azarian T. Improving Bacterial Metagenomic Research through Long-Read Sequencing. Microorganisms 2024; 12:935. [PMID: 38792764 PMCID: PMC11124196 DOI: 10.3390/microorganisms12050935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 04/22/2024] [Accepted: 04/25/2024] [Indexed: 05/26/2024] Open
Abstract
Metagenomic sequencing analysis is central to investigating microbial communities in clinical and environmental studies. Short-read sequencing remains the primary approach for metagenomic research; however, long-read sequencing may offer advantages of improved metagenomic assembly and resolved taxonomic identification. To compare the relative performance for metagenomic studies, we simulated short- and long-read datasets using increasingly complex metagenomes comprising 10, 20, and 50 microbial taxa. Additionally, we used an empirical dataset of paired short- and long-read data generated from mouse fecal pellets to assess real-world performance. We compared metagenomic assembly quality, taxonomic classification, and metagenome-assembled genome (MAG) recovery rates. We show that long-read sequencing data significantly improve taxonomic classification and assembly quality. Metagenomic assemblies using simulated long reads were more complete and more contiguous with higher rates of MAG recovery. This resulted in more precise taxonomic classifications. Principal component analysis of empirical data demonstrated that sequencing technology affects compositional results as samples clustered by sequence type, not sample type. Overall, we highlight strengths of long-read metagenomic sequencing for microbiome studies, including improving the accuracy of classification and relative abundance estimates. These results will aid researchers when considering which sequencing approaches to use for metagenomic projects.
Collapse
Affiliation(s)
- Noah Greenman
- College of Medicine, University of Central Florida, Orlando, FL 32827, USA; (N.G.); (S.A.-D.H.); (C.J.)
| | - Sayf Al-Deen Hassouneh
- College of Medicine, University of Central Florida, Orlando, FL 32827, USA; (N.G.); (S.A.-D.H.); (C.J.)
| | - Latifa S. Abdelli
- Department of Health Science, College of Health Professions and Sciences, University of Central Florida, Orlando, FL 32816, USA;
| | - Catherine Johnston
- College of Medicine, University of Central Florida, Orlando, FL 32827, USA; (N.G.); (S.A.-D.H.); (C.J.)
| | - Taj Azarian
- College of Medicine, University of Central Florida, Orlando, FL 32827, USA; (N.G.); (S.A.-D.H.); (C.J.)
| |
Collapse
|
42
|
Bouras G, Houtak G, Wick RR, Mallawaarachchi V, Roach MJ, Papudeshi B, Judd LM, Sheppard AE, Edwards RA, Vreugde S. Hybracter: enabling scalable, automated, complete and accurate bacterial genome assemblies. Microb Genom 2024; 10:001244. [PMID: 38717808 PMCID: PMC11165638 DOI: 10.1099/mgen.0.001244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 04/16/2024] [Indexed: 05/21/2024] Open
Abstract
Improvements in the accuracy and availability of long-read sequencing mean that complete bacterial genomes are now routinely reconstructed using hybrid (i.e. short- and long-reads) assembly approaches. Complete genomes allow a deeper understanding of bacterial evolution and genomic variation beyond single nucleotide variants. They are also crucial for identifying plasmids, which often carry medically significant antimicrobial resistance genes. However, small plasmids are often missed or misassembled by long-read assembly algorithms. Here, we present Hybracter which allows for the fast, automatic and scalable recovery of near-perfect complete bacterial genomes using a long-read first assembly approach. Hybracter can be run either as a hybrid assembler or as a long-read only assembler. We compared Hybracter to existing automated hybrid and long-read only assembly tools using a diverse panel of samples of varying levels of long-read accuracy with manually curated ground truth reference genomes. We demonstrate that Hybracter as a hybrid assembler is more accurate and faster than the existing gold standard automated hybrid assembler Unicycler. We also show that Hybracter with long-reads only is the most accurate long-read only assembler and is comparable to hybrid methods in accurately recovering small plasmids.
Collapse
Affiliation(s)
- George Bouras
- Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia
- The Department of Surgery – Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, South Australia, Australia
| | - Ghais Houtak
- Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia
- The Department of Surgery – Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, South Australia, Australia
| | - Ryan R. Wick
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Vijini Mallawaarachchi
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Michael J. Roach
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
- Adelaide Centre for Epigenetics and South Australian Immunogenomics Cancer Institute, The University of Adelaide, Adelaide, Australia
| | - Bhavya Papudeshi
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Lousie M. Judd
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Anna E. Sheppard
- School of Biological Sciences, The University of Adelaide, Adelaide, Australia
| | - Robert A. Edwards
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Sarah Vreugde
- Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia
- The Department of Surgery – Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, South Australia, Australia
| |
Collapse
|
43
|
Li H, Durbin R. Genome assembly in the telomere-to-telomere era. Nat Rev Genet 2024:10.1038/s41576-024-00718-w. [PMID: 38649458 DOI: 10.1038/s41576-024-00718-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/27/2024] [Indexed: 04/25/2024]
Abstract
Genome sequences largely determine the biology and encode the history of an organism, and de novo assembly - the process of reconstructing the genome sequence of an organism from sequencing reads - has been a central problem in bioinformatics for four decades. Until recently, genomes were typically assembled into fragments of a few megabases at best, but now technological advances in long-read sequencing enable the near-complete assembly of each chromosome - also known as telomere-to-telomere assembly - for many organisms. Here, we review recent progress on assembly algorithms and protocols, with a focus on how to derive near-telomere-to-telomere assemblies. We also discuss the additional developments that will be required to resolve remaining assembly gaps and to assemble non-diploid genomes.
Collapse
Affiliation(s)
- Heng Li
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| | - Richard Durbin
- Department of Genetics, Cambridge University, Cambridge, UK.
| |
Collapse
|
44
|
Peres da Silva R, Suphavilai C, Nagarajan N. MetageNN: a memory-efficient neural network taxonomic classifier robust to sequencing errors and missing genomes. BMC Bioinformatics 2024; 25:153. [PMID: 38627615 PMCID: PMC11022314 DOI: 10.1186/s12859-024-05760-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 03/22/2024] [Indexed: 04/19/2024] Open
Abstract
BACKGROUND With the rapid increase in throughput of long-read sequencing technologies, recent studies have explored their potential for taxonomic classification by using alignment-based approaches to reduce the impact of higher sequencing error rates. While alignment-based methods are generally slower, k-mer-based taxonomic classifiers can overcome this limitation, potentially at the expense of lower sensitivity for strains and species that are not in the database. RESULTS We present MetageNN, a memory-efficient long-read taxonomic classifier that is robust to sequencing errors and missing genomes. MetageNN is a neural network model that uses short k-mer profiles of sequences to reduce the impact of distribution shifts on error-prone long reads. Benchmarking MetageNN against other machine learning approaches for taxonomic classification (GeNet) showed substantial improvements with long-read data (20% improvement in F1 score). By utilizing nanopore sequencing data, MetageNN exhibits improved sensitivity in situations where the reference database is incomplete. It surpasses the alignment-based MetaMaps and MEGAN-LR, as well as the k-mer-based Kraken2 tools, with improvements of 100%, 36%, and 23% respectively at the read-level analysis. Notably, at the community level, MetageNN consistently demonstrated higher sensitivities than the previously mentioned tools. Furthermore, MetageNN requires < 1/4th of the database storage used by Kraken2, MEGAN-LR and MMseqs2 and is > 7× faster than MetaMaps and GeNet and > 2× faster than MEGAN-LR and MMseqs2. CONCLUSION This proof of concept work demonstrates the utility of machine-learning-based methods for taxonomic classification using long reads. MetageNN can be used on sequences not classified by conventional methods and offers an alternative approach for memory-efficient classifiers that can be optimized further.
Collapse
Affiliation(s)
- Rafael Peres da Silva
- School of Computing, National University of Singapore, Singapore, 117417, Republic of Singapore.
- Agency for Science, Technology and Research (A*STAR), Genome Institute of Singapore (GIS), Singapore, 138672, Republic of Singapore.
| | - Chayaporn Suphavilai
- Agency for Science, Technology and Research (A*STAR), Genome Institute of Singapore (GIS), Singapore, 138672, Republic of Singapore
| | - Niranjan Nagarajan
- School of Computing, National University of Singapore, Singapore, 117417, Republic of Singapore.
- Agency for Science, Technology and Research (A*STAR), Genome Institute of Singapore (GIS), Singapore, 138672, Republic of Singapore.
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 119228, Republic of Singapore.
| |
Collapse
|
45
|
Ridley RS, Conrad RE, Lindner BG, Woo S, Konstantinidis KT. Potential routes of plastics biotransformation involving novel plastizymes revealed by global multi-omic analysis of plastic associated microbes. Sci Rep 2024; 14:8798. [PMID: 38627476 PMCID: PMC11021508 DOI: 10.1038/s41598-024-59279-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 04/09/2024] [Indexed: 04/19/2024] Open
Abstract
Despite increasing efforts across various disciplines, the fate, transport, and impact of synthetic plastics on the environment and public health remain poorly understood. To better elucidate the microbial ecology of plastic waste and its potential for biotransformation, we conducted a large-scale analysis of all publicly available meta-omic studies investigating plastics (n = 27) in the environment. Notably, we observed low prevalence of known plastic degraders throughout most environments, except for substantial enrichment in riverine systems. This indicates rivers may be a highly promising environment for discovery of novel plastic bioremediation products. Ocean samples associated with degrading plastics showed clear differentiation from non-degrading polymers, showing enrichment of novel putative biodegrading taxa in the degraded samples. Regarding plastisphere pathogenicity, we observed significant enrichment of antimicrobial resistance genes on plastics but not of virulence factors. Additionally, we report a co-occurrence network analysis of 10 + million proteins associated with the plastisphere. This analysis revealed a localized sub-region enriched with known and putative plastizymes-these may be useful for deeper investigation of nature's ability to biodegrade man-made plastics. Finally, the combined data from our meta-analysis was used to construct a publicly available database, the Plastics Meta-omic Database (PMDB)-accessible at plasticmdb.org. These data should aid in the integrated exploration of the microbial plastisphere and facilitate research efforts investigating the fate and bioremediation potential of environmental plastic waste.
Collapse
Affiliation(s)
- Rodney S Ridley
- School of Chemical and Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA.
| | - Roth E Conrad
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, 30332, USA
- School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
| | - Blake G Lindner
- School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
| | - Seongwook Woo
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, 30332, USA
| | - Konstantinos T Konstantinidis
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, 30332, USA.
- School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA.
| |
Collapse
|
46
|
Bouras G, Houtak G, Wick RR, Mallawaarachchi V, Roach MJ, Papudeshi B, Judd LM, Sheppard AE, Edwards RA, Vreugde S. Hybracter: Enabling Scalable, Automated, Complete and Accurate Bacterial Genome Assemblies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.12.12.571215. [PMID: 38168369 PMCID: PMC10760025 DOI: 10.1101/2023.12.12.571215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Improvements in the accuracy and availability of long-read sequencing mean that complete bacterial genomes are now routinely reconstructed using hybrid (i.e. short- and long-reads) assembly approaches. Complete genomes allow a deeper understanding of bacterial evolution and genomic variation beyond single nucleotide variants (SNVs). They are also crucial for identifying plasmids, which often carry medically significant antimicrobial resistance (AMR) genes. However, small plasmids are often missed or misassembled by long-read assembly algorithms. Here, we present Hybracter which allows for the fast, automatic, and scalable recovery of near-perfect complete bacterial genomes using a long-read first assembly approach. Hybracter can be run either as a hybrid assembler or as a long-read only assembler. We compared Hybracter to existing automated hybrid and long-read only assembly tools using a diverse panel of samples of varying levels of long-read accuracy with manually curated ground truth reference genomes. We demonstrate that Hybracter as a hybrid assembler is more accurate and faster than the existing gold standard automated hybrid assembler Unicycler. We also show that Hybracter with long-reads only is the most accurate long-read only assembler and is comparable to hybrid methods in accurately recovering small plasmids.
Collapse
Affiliation(s)
- George Bouras
- Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia
- The Department of Surgery - Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, South Australia, Australia
| | - Ghais Houtak
- Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia
- The Department of Surgery - Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, South Australia, Australia
| | - Ryan R. Wick
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Vijini Mallawaarachchi
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Michael J. Roach
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
- Adelaide Centre for Epigenetics and South Australian Immunogenomics Cancer Institute, The University of Adelaide, Adelaide, Australia
| | - Bhavya Papudeshi
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Lousie M. Judd
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Anna E. Sheppard
- School of Biological Sciences, The University of Adelaide, Adelaide, Australia
| | - Robert A. Edwards
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Sarah Vreugde
- Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia
- The Department of Surgery - Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, South Australia, Australia
| |
Collapse
|
47
|
Feng X, Li H. Evaluating and improving the representation of bacterial contents in long-read metagenome assemblies. Genome Biol 2024; 25:92. [PMID: 38605401 PMCID: PMC11007910 DOI: 10.1186/s13059-024-03234-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 03/29/2024] [Indexed: 04/13/2024] Open
Abstract
BACKGROUND In the metagenomic assembly of a microbial community, abundant species are often thought to assemble well given their deeper sequencing coverage. This conjuncture is rarely tested or evaluated in practice. We often do not know how many abundant species are missing and do not have an approach to recover them. RESULTS Here, we propose k-mer based and 16S RNA based methods to measure the completeness of metagenome assembly. We show that even with PacBio high-fidelity (HiFi) reads, abundant species are often not assembled, as high strain diversity may lead to fragmented contigs. We develop a novel reference-free algorithm to recover abundant metagenome-assembled genomes (MAGs) by identifying circular assembly subgraphs. Complemented with a reference-free genome binning heuristics based on dimension reduction, the proposed method rescues many abundant species that would be missing with existing methods and produces competitive results compared to those state-of-the-art binners in terms of total number of near-complete genome bins. CONCLUSIONS Our work emphasizes the importance of metagenome completeness, which has often been overlooked. Our algorithm generates more circular MAGs and moves a step closer to the complete representation of microbial communities.
Collapse
Affiliation(s)
- Xiaowen Feng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, USA
| | - Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, USA.
| |
Collapse
|
48
|
Narsing Rao MP, Singh RN, Sani RK, Banerjee A. Genome-based approach to evaluate the metabolic potentials and exopolysaccharides production of Bacillus paralicheniformis CamBx3 isolated from a Chilean hot spring. Front Microbiol 2024; 15:1377965. [PMID: 38628868 PMCID: PMC11018918 DOI: 10.3389/fmicb.2024.1377965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2024] [Accepted: 03/13/2024] [Indexed: 04/19/2024] Open
Abstract
In the present study, a thermophilic strain designated CamBx3 was isolated from the Campanario hot spring, Chile. Based on 16S rRNA gene sequence, phylogenomic, and average nucleotide identity analysis the strain CamBx3 was identified as Bacillus paralicheniformis. Genome analysis of B. paralicheniformis CamBx3 revealed the presence of genes related to heat tolerance, exopolysaccharides (EPS), dissimilatory nitrate reduction, and assimilatory sulfate reduction. The pangenome analysis of strain CamBx3 with eight Bacillus spp. resulted in 26,562 gene clusters, 7,002 shell genes, and 19,484 cloud genes. The EPS produced by B. paralicheniformis CamBx3 was extracted, partially purified, and evaluated for its functional activities. B. paralicheniformis CamBx3 EPS with concentration 5 mg mL-1 showed an optimum 92 mM ferrous equivalent FRAP activity, while the same concentration showed a maximum 91% of Fe2+ chelating activity. B. paralicheniformis CamBx3 EPS (0.2 mg mL-1) demonstrated β-glucosidase inhibition. The EPS formed a viscoelastic gel at 45°C with a maximum instantaneous viscosity of 315 Pa.s at acidic pH 5. The present study suggests that B. paralicheniformis CamBx3 could be a valuable resource for biopolymers and bioactive molecules for industrial applications.
Collapse
Affiliation(s)
- Manik Prabhu Narsing Rao
- Instituto de Ciencias Aplicadas, Facultad de Ingeniería, Universidad Autónoma de Chile, Sede Talca, Talca, Chile
| | - Ram Nageena Singh
- Department of Chemical and Biological Engineering, South Dakota Mines, Rapid City, SD, United States
- 2-Dimensional Materials for Biofilm Engineering, Science and Technology, South Dakota Mines, Rapid City, SD, United States
| | - Rajesh K. Sani
- Department of Chemical and Biological Engineering, South Dakota Mines, Rapid City, SD, United States
- 2-Dimensional Materials for Biofilm Engineering, Science and Technology, South Dakota Mines, Rapid City, SD, United States
- Data Driven Material Discovery Center for Bioengineering Innovation, South Dakota Mines, Rapid City, SD, United States
- BioWRAP (Bioplastics With Regenerative Agricultural Properties), Rapid City, SD, United States
| | - Aparna Banerjee
- Instituto de Ciencias Aplicadas, Facultad de Ingeniería, Universidad Autónoma de Chile, Sede Talca, Talca, Chile
| |
Collapse
|
49
|
Ding Y, Er S, Tan A, Gounot JS, Saw WY, Tan LWL, Teo YY, Nagarajan N, Seedorf H. Comparison of tet(X4)-containing contigs assembled from metagenomic sequencing data with plasmid sequences of isolates from a cohort of healthy subjects. Microbiol Spectr 2024; 12:e0396923. [PMID: 38441466 PMCID: PMC10986321 DOI: 10.1128/spectrum.03969-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 02/12/2024] [Indexed: 04/06/2024] Open
Abstract
Recently discovered tet(X) gene variants have provided new insights into microbial antibiotic resistance mechanisms and their potential consequences for public health. This study focused on detection, analysis, and characterization of Tet(X4)-positive Enterobacterales from the gut microbiota of a healthy cohort of individuals in Singapore using cultivation-dependent and cultivation-independent approaches. Twelve Tet(X4)-positive Enterobacterales strains that were previously obtained from the cohort were fully genome-sequenced and comparatively analyzed. A metagenomic sequencing (MS) data set of the same samples was mined for contigs that harbored the tet(X4) resistance gene. The sequences of tet(X4)-containing contigs and plasmids sequences were compared. The presence of the resistance genes floR and estT (previously annotated as catD) was detected in the same cassette in 10 and 12 out of the 12 tet(X4)-carrying plasmids, respectively. MS detected tet(X4)-containing contigs in 2 out of the 109 subjects, while cultivation-dependent analysis previously reported a prevalence of 10.1%. The tet(X4)-containing sequences assembled from MS data are relatively short (~14 to 33 kb) but show high similarity to the respective plasmid sequences of the isolates. Our findings show that MS can complement efforts in the surveillance of antibiotic resistance genes for clinical samples, while it has a lower sensitivity than a cultivation-based method when the target organism has a low abundance. Further optimization is required if MS is to be utilized in antibiotic resistance surveillance.IMPORTANCEThe global rise in antibiotic resistance makes it necessary to develop and apply new approaches to detect and monitor the prevalence of antibiotic resistance genes in human populations. In this regard, of particular interest are resistances against last-resort antibiotics, such as tigecycline. In this study, we show that metagenomic sequencing can help to detect high abundance of the tigecycline resistance gene tet(X4) in fecal samples from a cohort of healthy human subjects. However, cultivation-based approaches currently remain the most reliable and cost-effective method for detection of antibiotic-resistant bacteria.
Collapse
Affiliation(s)
- Yichen Ding
- Temasek Life Sciences Laboratory, 1 Research Link, Singapore, Singapore
| | - Shuan Er
- Temasek Life Sciences Laboratory, 1 Research Link, Singapore, Singapore
| | - Abel Tan
- Temasek Life Sciences Laboratory, 1 Research Link, Singapore, Singapore
| | - Jean-Sebastien Gounot
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Woei-Yuh Saw
- Baker Heart and Diabetes Institute, Melbourne, Victoria, Singapore
| | - Linda Wei Lin Tan
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore
| | - Yik Ying Teo
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore
- NUS Graduate School for Integrative Science and Engineering, National University of Singapore, Singapore, Singapore
- Department of Statistics and Applied Probability, National University of Singapore, Singapore, Singapore
- Life Sciences Institute, National University of Singapore, Singapore, Singapore
| | - Niranjan Nagarajan
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- NUS Graduate School for Integrative Science and Engineering, National University of Singapore, Singapore, Singapore
| | - Henning Seedorf
- Temasek Life Sciences Laboratory, 1 Research Link, Singapore, Singapore
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| |
Collapse
|
50
|
Cook R, Telatin A, Hsieh SY, Newberry F, Tariq MA, Baker DJ, Carding SR, Adriaenssens EM. Nanopore and Illumina sequencing reveal different viral populations from human gut samples. Microb Genom 2024; 10. [PMID: 38683195 DOI: 10.1099/mgen.0.001236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2024] Open
Abstract
The advent of viral metagenomics, or viromics, has improved our knowledge and understanding of global viral diversity. High-throughput sequencing technologies enable explorations of the ecological roles, contributions to host metabolism, and the influence of viruses in various environments, including the human intestinal microbiome. However, bacterial metagenomic studies frequently have the advantage. The adoption of advanced technologies like long-read sequencing has the potential to be transformative in refining viromics and metagenomics. Here, we examined the effectiveness of long-read and hybrid sequencing by comparing Illumina short-read and Oxford Nanopore Technology (ONT) long-read sequencing technologies and different assembly strategies on recovering viral genomes from human faecal samples. Our findings showed that if a single sequencing technology is to be chosen for virome analysis, Illumina is preferable due to its superior ability to recover fully resolved viral genomes and minimise erroneous genomes. While ONT assemblies were effective in recovering viral diversity, the challenges related to input requirements and the necessity for amplification made it less ideal as a standalone solution. However, using a combined, hybrid approach enabled a more authentic representation of viral diversity to be obtained within samples.
Collapse
Affiliation(s)
- Ryan Cook
- Quadram Institute Bioscience, Norwich, NR4 7UQ, UK
| | | | | | - Fiona Newberry
- Department of Biosciences, Nottingham Trent University, Nottingham, NG11 8NS, UK
| | - Mohammad A Tariq
- Faculty of Health and Life Sciences, University of Northumbria, Newcastle upon Tyne, NE1 8ST, UK
| | - Dave J Baker
- Quadram Institute Bioscience, Norwich, NR4 7UQ, UK
| | - Simon R Carding
- Quadram Institute Bioscience, Norwich, NR4 7UQ, UK
- Norwich Medical School, University of East Anglia, Norwich, NR4 7TJ, UK
| | | |
Collapse
|