1
|
Wu Z, Guo L, Wu Y, Yang M, Du S, Shao J, Zhang Z, Zhao Y. Novel phage infecting the Roseobacter CHUG lineage reveals a diverse and globally distributed phage family. mSphere 2024; 9:e0045824. [PMID: 38926906 PMCID: PMC11288001 DOI: 10.1128/msphere.00458-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 06/01/2024] [Indexed: 06/28/2024] Open
Abstract
Bacteriophages play an essential role in shaping the diversity and metabolism of bacterial communities. Marine Roseobacter group is an abundant heterotrophic bacterial group that is involved in many major element cycles, especially carbon and sulfur. Members of the Roseobacter CHUG (Clade Hidden and Underappreciated Globally) lineage are globally distributed and are activated in pelagic marine environments. In this study, we isolated and characterized a phage, CRP-810, that infects the CHUG strain FZCC0198. The genome of CRP-810 was dissimilar to those of other known phages. Additionally, 251 uncultured viral genomes (UViGs) closely related to CRP-810 were obtained from the uncultivated marine viral contig databases. Comparative genomic and phylogenetic analyses revealed that CRP-810 and these related UViGs exhibited conserved genome synteny, representing a new phage family with at least eight subgroups. Most of the CRP-810-type phages contain an integrase gene, and CRP-810 can be integrated into the host genome. Further analysis revealed that three CRP-810-type members were prophages found in the genomes of marine SAR11, Poseidonocella, and Sphingomonadaceae. Finally, viromic read-mapping analysis showed that CRP-810-type phages were globally distributed and displayed distinct biogeographic patterns related to temperature and latitude. Many members with a lower G + C content were mainly distributed in the trade station, whereas members with a higher G + C content were mainly distributed in polar and westerlies station, indicating that the niche differentiation of phages was subject to host adaptation. Collectively, these findings identify a novel phage family and expand our understanding of phylogenetic diversity, evolution, and biogeography of marine phages. IMPORTANCE The Roseobacter CHUG lineage, affiliated with the Pelagic Roseobacter Cluster (PRC), is widely distributed in the global oceans and is active in oligotrophic seawater. However, knowledge of the bacteriophages that infect CHUG members is limited. In this study, a CHUG phage, CRP-810, that infects the CHUG strain FZCC0198, was isolated and shown to have a novel genomic architecture. In addition, 251 uncultured viral genomes closely related to CRP-810 were recovered and included in the analyses. Phylogenomic analyses revealed that the CRP-810-type phages represent a new phage family containing at least eight genus-level subgroups. Members of this family were predicted to infect various marine bacteria. We also demonstrated that the CRP-810-type phages are widely distributed in global oceans and display distinct biogeographic patterns related to latitude. Collectively, this study provides important insights into the genomic organization, diversity, and ecology of a novel phage family that infect ecologically important bacteria in the global ocean.
Collapse
Affiliation(s)
- Zuqing Wu
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of JunCao Sciences and Ecology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Luyuan Guo
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of JunCao Sciences and Ecology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Ying Wu
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of JunCao Sciences and Ecology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Mingyu Yang
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of JunCao Sciences and Ecology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Sen Du
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of JunCao Sciences and Ecology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Jiabing Shao
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of JunCao Sciences and Ecology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Zefeng Zhang
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of JunCao Sciences and Ecology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Yanlin Zhao
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of JunCao Sciences and Ecology, Fujian Agriculture and Forestry University, Fuzhou, China
| |
Collapse
|
2
|
Shi LD, West-Roberts J, Schoelmerich MC, Penev PI, Chen L, Amano Y, Lei S, Sachdeva R, Banfield JF. Methanotrophic Methanoperedens archaea host diverse and interacting extrachromosomal elements. Nat Microbiol 2024:10.1038/s41564-024-01740-8. [PMID: 38918468 DOI: 10.1038/s41564-024-01740-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 05/20/2024] [Indexed: 06/27/2024]
Abstract
Methane emissions are mitigated by anaerobic methane-oxidizing archaea, including Methanoperedens. Some Methanoperedens host huge extrachromosomal genetic elements (ECEs) called Borgs that may modulate their activity, yet the broader diversity of Methanoperedens ECEs is understudied. Here we report small enigmatic linear ECEs, circular viruses and unclassified ECEs that are predicted to replicate within Methanoperedens. Linear ECEs have inverted terminal repeats, tandem repeats and coding patterns that are strongly reminiscent of Borgs, but they are only 52-145 kb in length. As they share proteins with Borgs and Methanoperedens, we refer to them as mini-Borgs. Mini-Borgs are genetically diverse and can be assigned to at least five family-level groups. We identify eight families of Methanoperedens viruses, some of which encode multi-haem cytochromes, and circular ECEs encoding transposon-associated TnpB genes with proximal population-heterogeneous CRISPR arrays. These ECEs exchange genetic information with each other and with Methanoperedens, probably impacting their archaeal host activity and evolution.
Collapse
Affiliation(s)
- Ling-Dong Shi
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Jacob West-Roberts
- Environmental Science, Policy and Management, University of California, Berkeley, Berkeley, CA, USA
| | - Marie C Schoelmerich
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
- Department of Environmental Systems Sciences, ETH Zurich, Zurich, Switzerland
| | - Petar I Penev
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - LinXing Chen
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Yuki Amano
- Sector of Decommissioning and Radioactive Wastes Management, Japan Atomic Energy Agency, Ibaraki, Japan
| | - Shufei Lei
- Earth and Planetary Science, University of California, Berkeley, Berkeley, CA, USA
| | - Rohan Sachdeva
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Jillian F Banfield
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA.
- Environmental Science, Policy and Management, University of California, Berkeley, Berkeley, CA, USA.
- Earth and Planetary Science, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
3
|
Dantas CWD, Martins DT, Nogueira WG, Alegria OVC, Ramos RTJ. Tools and methodology to in silico phage discovery in freshwater environments. Front Microbiol 2024; 15:1390726. [PMID: 38881659 PMCID: PMC11176557 DOI: 10.3389/fmicb.2024.1390726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Accepted: 05/16/2024] [Indexed: 06/18/2024] Open
Abstract
Freshwater availability is essential, and its maintenance has become an enormous challenge. Due to population growth and climate changes, freshwater sources are becoming scarce, imposing the need for strategies for its reuse. Currently, the constant discharge of waste into water bodies from human activities leads to the dissemination of pathogenic bacteria, negatively impacting water quality from the source to the infrastructure required for treatment, such as the accumulation of biofilms. Current water treatment methods cannot keep pace with bacterial evolution, which increasingly exhibits a profile of multidrug resistance to antibiotics. Furthermore, using more powerful disinfectants may affect the balance of aquatic ecosystems. Therefore, there is a need to explore sustainable ways to control the spreading of pathogenic bacteria. Bacteriophages can infect bacteria and archaea, hijacking their host machinery to favor their replication. They are widely abundant globally and provide a biological alternative to bacterial treatment with antibiotics. In contrast to common disinfectants and antibiotics, bacteriophages are highly specific, minimizing adverse effects on aquatic microbial communities and offering a lower cost-benefit ratio in production compared to antibiotics. However, due to the difficulty involving cultivating and identifying environmental bacteriophages, alternative approaches using NGS metagenomics in combination with some bioinformatic tools can help identify new bacteriophages that can be useful as an alternative treatment against resistant bacteria. In this review, we discuss advances in exploring the virome of freshwater, as well as current applications of bacteriophages in freshwater treatment, along with current challenges and future perspectives.
Collapse
Affiliation(s)
- Carlos Willian Dias Dantas
- Department of Biochemistry and Immunology, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
- Laboratory of Simulation and Computational Biology - SIMBIC, High Performance Computing Center - CCAD, Federal University of Pará, Belém, Pará, Brazil
- Laboratory of Bioinformatics and Genomics of Microorganisms, Institute of Biological Sciences, Federal University of Pará, Belém, Pará, Brazil
| | - David Tavares Martins
- Laboratory of Simulation and Computational Biology - SIMBIC, High Performance Computing Center - CCAD, Federal University of Pará, Belém, Pará, Brazil
- Laboratory of Bioinformatics and Genomics of Microorganisms, Institute of Biological Sciences, Federal University of Pará, Belém, Pará, Brazil
| | - Wylerson Guimarães Nogueira
- Department of Biochemistry and Immunology, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Oscar Victor Cardenas Alegria
- Laboratory of Simulation and Computational Biology - SIMBIC, High Performance Computing Center - CCAD, Federal University of Pará, Belém, Pará, Brazil
- Laboratory of Bioinformatics and Genomics of Microorganisms, Institute of Biological Sciences, Federal University of Pará, Belém, Pará, Brazil
| | - Rommel Thiago Jucá Ramos
- Laboratory of Simulation and Computational Biology - SIMBIC, High Performance Computing Center - CCAD, Federal University of Pará, Belém, Pará, Brazil
- Laboratory of Bioinformatics and Genomics of Microorganisms, Institute of Biological Sciences, Federal University of Pará, Belém, Pará, Brazil
| |
Collapse
|
4
|
Cook R, Crisci MA, Pye HV, Telatin A, Adriaenssens EM, Santini JM. Decoding huge phage diversity: a taxonomic classification of Lak megaphages. J Gen Virol 2024; 105:001997. [PMID: 38814706 PMCID: PMC11165621 DOI: 10.1099/jgv.0.001997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 05/21/2024] [Indexed: 05/31/2024] Open
Abstract
High-throughput sequencing for uncultivated viruses has accelerated the understanding of global viral diversity and uncovered viral genomes substantially larger than any that have so far been cultured. Notably, the Lak phages are an enigmatic group of viruses that present some of the largest known phage genomes identified in human and animal microbiomes, and are dissimilar to any cultivated viruses. Despite the wealth of viral diversity that exists within sequencing datasets, uncultivated viruses have rarely been used for taxonomic classification. We investigated the evolutionary relationships of 23 Lak phages and propose a taxonomy for their classification. Predicted protein analysis revealed the Lak phages formed a deeply branching monophyletic clade within the class Caudoviricetes which contained no other phage genomes. One of the interesting features of this clade is that all current members are characterised by an alternative genetic code. We propose the Lak phages belong to a new order, the 'Grandevirales'. Protein and nucleotide-based analyses support the creation of two families, three sub-families, and four genera within the order 'Grandevirales'. We anticipate that the proposed taxonomy of Lak megaphages will simplify the future classification of related viral genomes as they are uncovered. Continued efforts to classify divergent viruses are crucial to aid common analyses of viral genomes and metagenomes.
Collapse
Affiliation(s)
- Ryan Cook
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
| | - Marco A. Crisci
- Department of Structural and Molecular Biology, Division of Biosciences, UCL, London, UK
| | - Hannah V. Pye
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
| | - Andrea Telatin
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
| | | | - Joanne M. Santini
- Department of Structural and Molecular Biology, Division of Biosciences, UCL, London, UK
| |
Collapse
|
5
|
Liu X, Liu Y, Liu J, Zhang H, Shan C, Guo Y, Gong X, Cui M, Li X, Tang M. Correlation between the gut microbiome and neurodegenerative diseases: a review of metagenomics evidence. Neural Regen Res 2024; 19:833-845. [PMID: 37843219 PMCID: PMC10664138 DOI: 10.4103/1673-5374.382223] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 04/19/2023] [Accepted: 06/17/2023] [Indexed: 10/17/2023] Open
Abstract
A growing body of evidence suggests that the gut microbiota contributes to the development of neurodegenerative diseases via the microbiota-gut-brain axis. As a contributing factor, microbiota dysbiosis always occurs in pathological changes of neurodegenerative diseases, such as Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis. High-throughput sequencing technology has helped to reveal that the bidirectional communication between the central nervous system and the enteric nervous system is facilitated by the microbiota's diverse microorganisms, and for both neuroimmune and neuroendocrine systems. Here, we summarize the bioinformatics analysis and wet-biology validation for the gut metagenomics in neurodegenerative diseases, with an emphasis on multi-omics studies and the gut virome. The pathogen-associated signaling biomarkers for identifying brain disorders and potential therapeutic targets are also elucidated. Finally, we discuss the role of diet, prebiotics, probiotics, postbiotics and exercise interventions in remodeling the microbiome and reducing the symptoms of neurodegenerative diseases.
Collapse
Affiliation(s)
- Xiaoyan Liu
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Yi Liu
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
- Institute of Animal Husbandry, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu Province, China
| | - Junlin Liu
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Hantao Zhang
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Chaofan Shan
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Yinglu Guo
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Xun Gong
- Department of Rheumatology & Immunology, Affiliated Hospital of Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Mengmeng Cui
- Department of Neurology, The Second Affiliated Hospital of Shandong First Medical University, Taian, Shandong Province, China
| | - Xiubin Li
- Department of Neurology, The Second Affiliated Hospital of Shandong First Medical University, Taian, Shandong Province, China
| | - Min Tang
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| |
Collapse
|
6
|
Nie W, Qiu T, Wei Y, Ding H, Guo Z, Qiu J. Advances in phage-host interaction prediction: in silico method enhances the development of phage therapies. Brief Bioinform 2024; 25:bbae117. [PMID: 38555471 PMCID: PMC10981677 DOI: 10.1093/bib/bbae117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Revised: 01/15/2024] [Accepted: 03/02/2024] [Indexed: 04/02/2024] Open
Abstract
Phages can specifically recognize and kill bacteria, which lead to important application value of bacteriophage in bacterial identification and typing, livestock aquaculture and treatment of human bacterial infection. Considering the variety of human-infected bacteria and the continuous discovery of numerous pathogenic bacteria, screening suitable therapeutic phages that are capable of infecting pathogens from massive phage databases has been a principal step in phage therapy design. Experimental methods to identify phage-host interaction (PHI) are time-consuming and expensive; high-throughput computational method to predict PHI is therefore a potential substitute. Here, we systemically review bioinformatic methods for predicting PHI, introduce reference databases and in silico models applied in these methods and highlight the strengths and challenges of current tools. Finally, we discuss the application scope and future research direction of computational prediction methods, which contribute to the performance improvement of prediction models and the development of personalized phage therapy.
Collapse
Affiliation(s)
- Wanchun Nie
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| | - Tianyi Qiu
- Institute of Clinical Science, Zhongshan Hospital; Intelligent Medicine Institute, Fudan University, Shanghai, 200032, China
- Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, 200032, China
| | - Yiwen Wei
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| | - Hao Ding
- Institute of Clinical Science, Zhongshan Hospital; Intelligent Medicine Institute, Fudan University, Shanghai, 200032, China
| | - Zhixiang Guo
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| | - Jingxuan Qiu
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| |
Collapse
|
7
|
Mahony J. Biological and bioinformatic tools for the discovery of unknown phage-host combinations. Curr Opin Microbiol 2024; 77:102426. [PMID: 38246125 DOI: 10.1016/j.mib.2024.102426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 12/21/2023] [Accepted: 01/02/2024] [Indexed: 01/23/2024]
Abstract
The field of microbial ecology has been transformed by metagenomics in recent decades and has culminated in vast datasets that facilitate the bioinformatic dissection of complex microbial communities. Recently, attention has turned from defining the microbiota composition to the interactions and relationships that occur between members of the microbiota. Within complex microbiota, the identification of bacteriophage-host combinations has been a major challenge. Recent developments in artificial intelligence tools to predict protein structure and function as well as the relationships between bacteria and their infecting bacteriophages allow a strategic approach to identifying and validating phage-host relationships. However, biological validation of these predictions remains essential and will serve to improve the existing predictive tools. In this review, I provide an overview of the most recent developments in both bioinformatic and experimental approaches to predicting and experimentally validating unknown phage-host combinations.
Collapse
Affiliation(s)
- Jennifer Mahony
- School of Microbiology & APC Microbiome Ireland, University College Cork, Western Road, T12 YT20 Cork, Ireland.
| |
Collapse
|
8
|
Ni Y, Xu T, Yan S, Chen L, Wang Y. Hiding in plain sight: The discovery of complete genomes of 11 hypothetical spindle-shaped viruses that putatively infect mesophilic ammonia-oxidizing archaea. ENVIRONMENTAL MICROBIOLOGY REPORTS 2024; 16:e13230. [PMID: 38263861 PMCID: PMC10866085 DOI: 10.1111/1758-2229.13230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 12/14/2023] [Indexed: 01/25/2024]
Abstract
The genome of a putative Nitrosopumilaceae virus with a hypothetical spindle-shaped particle morphology was identified in the Yangshan Harbour metavirome from the East China Sea through protein similarity comparison and structure analysis. This discovery was accompanied by a set of 10 geographically dispersed close relatives found in the environmental virus datasets from typical locations of ammonia-oxidizing archaeon distribution. Its host prediction was supported by iPHoP prediction and protein sequence similarity. The structure of the predicted major capsid protein, together with the overall N-glycosylation site, the transmembrane helices prediction, the hydrophilicity profile, and the docking simulation of the major capsid proteins, indicate that these viruses resemble spindle-shaped viruses. It suggests a similarly assembled structure and, consequently, a possibly spindle-shaped morphology of these newly discovered archaeal viruses.
Collapse
Affiliation(s)
- Yimin Ni
- College of Food Science and TechnologyShanghai Ocean UniversityShanghaiChina
| | - Tianqi Xu
- College of Food Science and TechnologyShanghai Ocean UniversityShanghaiChina
| | - Shuling Yan
- Entwicklungsgenetik und Zellbiologie der TierePhilipps‐Universität MarburgMarburgGermany
| | - Lanming Chen
- College of Food Science and TechnologyShanghai Ocean UniversityShanghaiChina
- Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage and Preservation (Shanghai)Ministry of AgricultureShanghaiChina
| | - Yongjie Wang
- College of Food Science and TechnologyShanghai Ocean UniversityShanghaiChina
- Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage and Preservation (Shanghai)Ministry of AgricultureShanghaiChina
- Laboratory for Marine Biology and BiotechnologyQingdao National Laboratory for Marine Science and TechnologyQingdaoChina
| |
Collapse
|
9
|
Rozwalak P, Barylski J, Wijesekara Y, Dutilh BE, Zielezinski A. Ultraconserved bacteriophage genome sequence identified in 1300-year-old human palaeofaeces. Nat Commun 2024; 15:495. [PMID: 38263397 PMCID: PMC10805732 DOI: 10.1038/s41467-023-44370-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 12/11/2023] [Indexed: 01/25/2024] Open
Abstract
Bacteriophages are widely recognised as rapidly evolving biological entities. However, knowledge about ancient bacteriophages is limited. Here, we analyse DNA sequence datasets previously generated from ancient palaeofaeces and human gut-content samples, and identify an ancient phage genome nearly identical to present-day Mushuvirus mushu, a virus that infects gut commensal bacteria. The DNA damage patterns of the genome are consistent with its ancient origin and, despite 1300 years of evolution, the ancient Mushuvirus genome shares 97.7% nucleotide identity with its modern counterpart, indicating a long-term relationship between the prophage and its host. In addition, we reconstruct and authenticate 297 other phage genomes from the last 5300 years, including those belonging to unknown families. Our findings demonstrate the feasibility of reconstructing ancient phage genome sequences, thus expanding the known virosphere and offering insights into phage-bacteria interactions spanning several millennia.
Collapse
Affiliation(s)
- Piotr Rozwalak
- Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University, Poznan, 61-614, Poland
| | - Jakub Barylski
- Department of Molecular Virology, Faculty of Biology, Adam Mickiewicz University, Poznan, 61-614, Poland
| | - Yasas Wijesekara
- Institute of Bioinformatics, University Medicine Greifswald, Felix-Hausdorff-Str. 8, 17475, Greifswald, Germany
| | - Bas E Dutilh
- Institute of Biodiversity, Faculty of Biological Sciences, Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, 07743, Jena, Germany.
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Padualaan 8, 3584 CH, Utrecht, the Netherlands.
| | - Andrzej Zielezinski
- Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University, Poznan, 61-614, Poland.
| |
Collapse
|
10
|
Yin H, Wu S, Tan J, Guo Q, Li M, Guo J, Wang Y, Jiang X, Zhu H. IPEV: identification of prokaryotic and eukaryotic virus-derived sequences in virome using deep learning. Gigascience 2024; 13:giae018. [PMID: 38649300 PMCID: PMC11034026 DOI: 10.1093/gigascience/giae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 03/14/2024] [Accepted: 03/25/2024] [Indexed: 04/25/2024] Open
Abstract
BACKGROUND The virome obtained through virus-like particle enrichment contains a mixture of prokaryotic and eukaryotic virus-derived fragments. Accurate identification and classification of these elements are crucial to understanding their roles and functions in microbial communities. However, the rapid mutation rates of viral genomes pose challenges in developing high-performance tools for classification, potentially limiting downstream analyses. FINDINGS We present IPEV, a novel method to distinguish prokaryotic and eukaryotic viruses in viromes, with a 2-dimensional convolutional neural network combining trinucleotide pair relative distance and frequency. Cross-validation assessments of IPEV demonstrate its state-of-the-art precision, significantly improving the F1-score by approximately 22% on an independent test set compared to existing methods when query viruses share less than 30% sequence similarity with known viruses. Furthermore, IPEV outperforms other methods in accuracy on marine and gut virome samples based on annotations by sequence alignments. IPEV reduces runtime by at most 1,225 times compared to existing methods under the same computing configuration. We also utilized IPEV to analyze longitudinal samples and found that the gut virome exhibits a higher degree of temporal stability than previously observed in persistent personal viromes, providing novel insights into the resilience of the gut virome in individuals. CONCLUSIONS IPEV is a high-performance, user-friendly tool that assists biologists in identifying and classifying prokaryotic and eukaryotic viruses within viromes. The tool is available at https://github.com/basehc/IPEV.
Collapse
Affiliation(s)
- Hengchuang Yin
- Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Shufang Wu
- Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Jie Tan
- Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Qian Guo
- Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Mo Li
- Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China
- School of Life Sciences, Peking University, Beijing 100871, China
| | - Jinyuan Guo
- Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China
- Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA 30332, USA
| | - Yaqi Wang
- Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Xiaoqing Jiang
- Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, and China National Center for Bioinformation, Beijing 100101, China
| | - Huaiqiu Zhu
- Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China
- School of Life Sciences, Peking University, Beijing 100871, China
- Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA 30332, USA
| |
Collapse
|
11
|
Rossi FPN, Flores VS, Uceda-Campos G, Amgarten DE, Setubal JC, da Silva AM. Comparative Analyses of Bacteriophage Genomes. Methods Mol Biol 2024; 2802:427-453. [PMID: 38819567 DOI: 10.1007/978-1-0716-3838-5_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Bacterial viruses (bacteriophages or phages) are the most abundant and diverse biological entities on Earth. There is a renewed worldwide interest in phage-centered research motivated by their enormous potential as antimicrobials to cope with multidrug-resistant pathogens. An ever-growing number of complete phage genomes are becoming available, derived either from newly isolated phages (cultivated phages) or recovered from metagenomic sequencing data (uncultivated phages). Robust comparative analysis is crucial for a comprehensive understanding of genotypic variations of phages and their related evolutionary processes, and to investigate the interaction mechanisms between phages and their hosts. In this chapter, we present a protocol for phage comparative genomics employing tools selected out of the many currently available, focusing on complete genomes of phages classified in the class Caudoviricetes. This protocol provides accurate identification of similarities, differences, and patterns among new and previously known complete phage genomes as well as phage clustering and taxonomic classification.
Collapse
Affiliation(s)
| | - Vinicius Sousa Flores
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Sao Paulo, SP, Brazil
| | - Guillermo Uceda-Campos
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Sao Paulo, SP, Brazil
| | | | - João Carlos Setubal
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Sao Paulo, SP, Brazil
| | - Aline Maria da Silva
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Sao Paulo, SP, Brazil.
| |
Collapse
|
12
|
Shuai X, Zhou Z, Ba X, Lin Y, Lin Z, Liu Z, Yu X, Zhou J, Zeng G, Ge Z, Chen H. Bacteriophages: Vectors of or weapons against the transmission of antibiotic resistance genes in hospital wastewater systems? WATER RESEARCH 2024; 248:120833. [PMID: 37952327 DOI: 10.1016/j.watres.2023.120833] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 10/26/2023] [Accepted: 11/03/2023] [Indexed: 11/14/2023]
Abstract
Antimicrobial resistance poses a serious threat to human health and is responsible for the death of millions of people annually. Hospital wastewater is an important hotspot for antibiotic-resistance genes (ARGs) and antibiotic-resistant bacteria (ARB). However, little is known about the relationship between phages and ARGs in hospital wastewater systems (HWS). In the present study, the viral diversity of 12 HWSs using data from public metagenomic databases was investigated. Viruses were widely found in both the influent and effluent of each HWS. A total of 45 unique ARGs were carried by 85 viral contigs, which accounted for only 0.14% of the total viral populations, implying that ARGs were not commonly present in phages. Three efflux pump genes were identified as shared between phages and bacterial genomes. However, the predominant types of ARGs in HWS such as aminoglycoside- and beta-lactam-resistance genes were rarely found in phages. Based on CRISPR spacer and tRNA matches, interactions between 171 viral contigs and 60 antibiotic-resistant genomes were predicted, including interactions involving phages and vancomycin-resistant Enterococcus_B faecium or beta-lactam-resistant Klebsiella pneumoniae. More than half (56.1%) of these viral contigs indicated lytic and none of them carried ARGs. As the vOTUs in this study had few ARGs and were primarily lytic, HWS may be a valuable source for phage discovery. Future studies will be able to experimentally validate these sequence-based results to confirm the suitability of HWS phages for pathogen control measures in wastewater.
Collapse
Affiliation(s)
- Xinyi Shuai
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zhenchao Zhou
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Xiaoliang Ba
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Yanhan Lin
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zejun Lin
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zhe Liu
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Xi Yu
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Jinyu Zhou
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Guangshu Zeng
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Ziye Ge
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Hong Chen
- Department of Environmental Engineering, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China; International Cooperation Base of Environmental Pollution and Ecological Health, Science and Technology Agency of Zhejiang, Zhejiang University, China; Key Laboratory of Environment Remediation and Ecological Health, Ministry of Education, College of Environmental Resource Sciences, Zhejiang University, Hangzhou, China.
| |
Collapse
|
13
|
Zhou Y, Wang Y, Prangishvili D, Krupovic M. Exploring the Archaeal Virosphere by Metagenomics. Methods Mol Biol 2024; 2732:1-22. [PMID: 38060114 DOI: 10.1007/978-1-0716-3515-5_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023]
Abstract
During the past decade, environmental research has demonstrated that archaea are abundant and widespread in nature and play important ecological roles at a global scale. Currently, however, the majority of archaeal lineages cannot be cultivated under laboratory conditions and are known exclusively or nearly exclusively through metagenomics. A similar trend extends to the archaeal virosphere, where isolated representatives are available for a handful of model archaeal virus-host systems. Viral metagenomics provides an alternative way to circumvent the limitations of culture-based virus discovery and offers insight into the diversity, distribution, and environmental impact of uncultured archaeal viruses. Presently, metagenomics approaches have been successfully applied to explore the viromes associated with various lineages of extremophilic and mesophilic archaea, including Asgard archaea (Asgardarchaeota), ANME-1 archaea (Methanophagales), thaumarchaea (Nitrososphaeria), altiarchaea (Altiarchaeota), and marine group II archaea (Poseidoniales). Here, we provide an overview of methods widely used in archaeal virus metagenomics, covering metavirome preparation, genome annotation, phylogenetic and phylogenomic analyses, and archaeal host assignment. We hope that this summary will contribute to further exploration and characterization of the enigmatic archaeal virome lurking in diverse environments.
Collapse
Affiliation(s)
- Yifan Zhou
- Institut Pasteur, Université Paris Cité, Archaeal Virology Unit, Paris, France
- Sorbonne Université, Collège Doctoral, Paris, France
| | - Yongjie Wang
- College of Food Science and Technology, Shanghai Ocean University, Shanghai, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage and Preservation (Shanghai), Ministry of Agriculture, Shanghai, China
| | - David Prangishvili
- Institut Pasteur, Université Paris Cité, Archaeal Virology Unit, Paris, France
- Ivane Javakhishvili Tbilisi State University, Tbilisi, Georgia
| | - Mart Krupovic
- Institut Pasteur, Université Paris Cité, Archaeal Virology Unit, Paris, France.
| |
Collapse
|
14
|
Lopez-Simon J, Vila-Nistal M, Rosenova A, De Corte D, Baltar F, Martinez-Garcia M. Viruses under the Antarctic Ice Shelf are active and potentially involved in global nutrient cycles. Nat Commun 2023; 14:8295. [PMID: 38097581 PMCID: PMC10721903 DOI: 10.1038/s41467-023-44028-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 11/28/2023] [Indexed: 12/17/2023] Open
Abstract
Viruses play an important role in the marine ecosystem. However, our comprehension of viruses inhabiting the dark ocean, and in particular, under the Antarctic Ice Shelves, remains limited. Here, we mine single-cell genomic, transcriptomic, and metagenomic data to uncover the viral diversity, biogeography, activity, and their role as metabolic facilitators of microbes beneath the Ross Ice Shelf. This is the largest Antarctic ice shelf with a major impact on global carbon cycle. The viral community found in the cavity under the ice shelf mainly comprises endemic viruses adapted to polar and mesopelagic environments. The low abundance of genes related to lysogenic lifestyle (<3%) does not support a predominance of the Piggyback-the-Winner hypothesis, consistent with a low-productivity habitat. Our results indicate a viral community actively infecting key ammonium and sulfur-oxidizing chemolithoautotrophs (e.g. Nitrosopumilus spp, Thioglobus spp.), supporting a "kill-the-winner" dynamic. Based on genome analysis, these viruses carry specific auxiliary metabolic genes potentially involved in nitrogen, sulfur, and phosphorus acquisition. Altogether, the viruses under Antarctic ice shelves are putatively involved in programming the metabolism of ecologically relevant microbes that maintain primary production in these chemosynthetically-driven ecosystems, which have a major role in global nutrient cycles.
Collapse
Affiliation(s)
- Javier Lopez-Simon
- Department of Physiology, Genetics, and Microbiology, University of Alicante, Carretera San Vicente del Raspeig, San Vicente del Raspeig, Alicante, 03690, Spain
| | - Marina Vila-Nistal
- Department of Physiology, Genetics, and Microbiology, University of Alicante, Carretera San Vicente del Raspeig, San Vicente del Raspeig, Alicante, 03690, Spain
| | - Aleksandra Rosenova
- Department of Physiology, Genetics, and Microbiology, University of Alicante, Carretera San Vicente del Raspeig, San Vicente del Raspeig, Alicante, 03690, Spain
| | - Daniele De Corte
- Institute for Chemistry and Biology of the Marine Environment, Carl von Ossietzky University of Oldenburg, Oldenburg, Germany
- Ocean Technology and Engineering, National Oceanography Centre, Southampton, UK
| | - Federico Baltar
- Department of Functional & Evolutionary Ecology, University of Vienna, Djerassi-Platz 1, 1030, Vienna, Austria.
| | - Manuel Martinez-Garcia
- Department of Physiology, Genetics, and Microbiology, University of Alicante, Carretera San Vicente del Raspeig, San Vicente del Raspeig, Alicante, 03690, Spain.
- Instituto Multidisciplinar para el Estudio del Medio Ramon Margalef, University of Alicante, San Vicente del Raspeig, Alicante, 03690, Spain.
| |
Collapse
|
15
|
Grigson SR, Giles SK, Edwards RA, Papudeshi B. Knowing and Naming: Phage Annotation and Nomenclature for Phage Therapy. Clin Infect Dis 2023; 77:S352-S359. [PMID: 37932119 PMCID: PMC10627814 DOI: 10.1093/cid/ciad539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2023] Open
Abstract
Bacteriophages, or phages, are viruses that infect bacteria shaping microbial communities and ecosystems. They have gained attention as potential agents against antibiotic resistance. In phage therapy, lytic phages are preferred for their bacteria killing ability, while temperate phages, which can transfer antibiotic resistance or toxin genes, are avoided. Selection relies on plaque morphology and genome sequencing. This review outlines annotating genomes, identifying critical genomic features, and assigning functional labels to protein-coding sequences. These annotations prevent the transfer of unwanted genes, such as antimicrobial resistance or toxin genes, during phage therapy. Additionally, it covers International Committee on Taxonomy of Viruses (ICTV)-an established phage nomenclature system for simplified classification and communication. Accurate phage genome annotation and nomenclature provide insights into phage-host interactions, replication strategies, and evolution, accelerating our understanding of the diversity and evolution of phages and facilitating the development of phage-based therapies.
Collapse
Affiliation(s)
- Susanna R Grigson
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Sarah K Giles
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Robert A Edwards
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Bhavya Papudeshi
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
| |
Collapse
|
16
|
Peng Y, Lu Z, Pan D, Shi LD, Zhao Z, Liu Q, Zhang C, Jia K, Li J, Hubert CRJ, Dong X. Viruses in deep-sea cold seep sediments harbor diverse survival mechanisms and remain genetically conserved within species. THE ISME JOURNAL 2023; 17:1774-1784. [PMID: 37573455 PMCID: PMC10504277 DOI: 10.1038/s41396-023-01491-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 08/01/2023] [Accepted: 08/01/2023] [Indexed: 08/14/2023]
Abstract
Deep sea cold seep sediments have been discovered to harbor novel, abundant, and diverse bacterial and archaeal viruses. However, little is known about viral genetic features and evolutionary patterns in these environments. Here, we examined the evolutionary ecology of viruses across active and extinct seep stages in the area of Haima cold seeps in the South China Sea. A total of 338 viral operational taxonomic units are identified and linked to 36 bacterial and archaeal phyla. The dynamics of host-virus interactions are informed by diverse antiviral defense systems across 43 families found in 487 microbial genomes. Cold seep viruses are predicted to harbor diverse adaptive strategies to persist in this environment, including counter-defense systems, auxiliary metabolic genes, reverse transcriptases, and alternative genetic code assignments. Extremely low nucleotide diversity is observed in cold seep viral populations, being influenced by factors including microbial host, sediment depth, and cold seep stage. Most cold seep viral genes are under strong purifying selection with trajectories that differ depending on whether cold seeps are active or extinct. This work sheds light on the understanding of environmental adaptation mechanisms and evolutionary patterns of viruses in the sub-seafloor biosphere.
Collapse
Affiliation(s)
- Yongyi Peng
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen, 361005, China
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519082, China
| | - Zijian Lu
- South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510301, China
| | - Donald Pan
- School of Oceanography, Shanghai Jiao Tong University, Shanghai, 200030, China
| | - Ling-Dong Shi
- College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zhao Zhao
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519082, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China
| | - Qing Liu
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519082, China
| | - Chuwen Zhang
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen, 361005, China
| | - Kuntong Jia
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519082, China
| | - Jiwei Li
- Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, 572000, China
| | - Casey R J Hubert
- Department of Biological Sciences, University of Calgary, Calgary, AB, T2N 1N4, Canada
| | - Xiyang Dong
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen, 361005, China.
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China.
| |
Collapse
|
17
|
Alarcón-Schumacher T, Lücking D, Erdmann S. Revisiting evolutionary trajectories and the organization of the Pleolipoviridae family. PLoS Genet 2023; 19:e1010998. [PMID: 37831715 PMCID: PMC10599561 DOI: 10.1371/journal.pgen.1010998] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 10/25/2023] [Accepted: 09/26/2023] [Indexed: 10/15/2023] Open
Abstract
Archaeal pleomorphic viruses belonging to the Pleolipoviridae family represent an enigmatic group as they exhibit unique genomic features and are thought to have evolved through recombination with different archaeal plasmids. However, most of our understanding of the diversity and evolutionary trajectories of this clade comes from a handful of isolated representatives. Here we present 164 new genomes of pleolipoviruses obtained from metagenomic data of Australian hypersaline lakes and publicly available metagenomic data. We perform a comprehensive analysis on the diversity and evolutionary relationships of the newly discovered viruses and previously described pleolipoviruses. We propose to classify the viruses into five genera within the Pleolipoviridae family, with one new genus represented only by virus genomes retrieved in this study. Our data support the current hypothesis that pleolipoviruses reshaped their genomes through recombining with multiple different groups of plasmids, which is reflected in the diversity of their predicted replication strategies. We show that the proposed genus Epsilonpleolipovirus has evolutionary ties to pRN1-like plasmids from Sulfolobus, suggesting that this group could be infecting other archaeal phyla. Interestingly, we observed that the genome size of pleolipoviruses is correlated to the presence or absence of an integrase. Analyses of the host range revealed that all but one virus exhibit an extremely narrow range, and we show that the predicted tertiary structure of the spike protein is strongly associated with the host family, suggesting a specific adaptation to the host S-layer glycoprotein organization.
Collapse
Affiliation(s)
| | - Dominik Lücking
- Max-Planck-Institute for Marine Microbiology, Bremen, Germany
| | - Susanne Erdmann
- Max-Planck-Institute for Marine Microbiology, Bremen, Germany
| |
Collapse
|
18
|
Yi Y, Liu S, Hao Y, Sun Q, Lei X, Wang Y, Wang J, Zhang M, Tang S, Tang Q, Zhang Y, Liu X, Wang Y, Xiao X, Jian H. A systematic analysis of marine lysogens and proviruses. Nat Commun 2023; 14:6013. [PMID: 37758717 PMCID: PMC10533544 DOI: 10.1038/s41467-023-41699-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 09/13/2023] [Indexed: 09/29/2023] Open
Abstract
Viruses are ubiquitous in the oceans, exhibiting high abundance and diversity. Here, we systematically analyze existing genomic sequences of marine prokaryotes to compile a Marine Prokaryotic Genome Dataset (MPGD, consisting of over 12,000 bacterial and archaeal genomes) and a Marine Temperate Viral Genome Dataset (MTVGD). At least 40% of the MPGD genomes contain one or more proviral sequences, indicating that they are lysogens. The MTVGD includes over 12,900 viral contigs or putative proviruses, clustered into 10,897 viral genera. We show that lysogens and proviruses are abundant in marine ecosystems, particularly in the deep sea, and marine lysogens differ from non-lysogens in multiple genomic features and growth properties. We reveal several virus-host interaction networks of potential ecological relevance, and identify proviruses that appear to be able to infect (or to be transferred between) different bacterial classes and phyla. Auxiliary metabolic genes in the MTVGD are enriched in functions related to carbohydrate metabolism. Finally, we experimentally demonstrate the impact of a prophage on the transcriptome of a representative marine Shewanella bacterium. Our work contributes to a better understanding of the ecology of marine prokaryotes and their viruses.
Collapse
Affiliation(s)
- Yi Yi
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Shunzhang Liu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Yali Hao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- Yazhou Bay Institute of Deepsea Sci-Tech, Shanghai Jiao Tong University, Sanya, China
| | - Qingyang Sun
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Xinjuan Lei
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- Yazhou Bay Institute of Deepsea Sci-Tech, Shanghai Jiao Tong University, Sanya, China
| | - Yecheng Wang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Jiahua Wang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Mujie Zhang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- Yazhou Bay Institute of Deepsea Sci-Tech, Shanghai Jiao Tong University, Sanya, China
| | - Shan Tang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- Yazhou Bay Institute of Deepsea Sci-Tech, Shanghai Jiao Tong University, Sanya, China
| | - Qingxue Tang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Yue Zhang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Xipeng Liu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- Yazhou Bay Institute of Deepsea Sci-Tech, Shanghai Jiao Tong University, Sanya, China
| | - Yinzhao Wang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- Yazhou Bay Institute of Deepsea Sci-Tech, Shanghai Jiao Tong University, Sanya, China
| | - Xiang Xiao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- Yazhou Bay Institute of Deepsea Sci-Tech, Shanghai Jiao Tong University, Sanya, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Huahua Jian
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China.
- Yazhou Bay Institute of Deepsea Sci-Tech, Shanghai Jiao Tong University, Sanya, China.
| |
Collapse
|
19
|
Zhang YZ, Liu Y, Bai Z, Fujimoto K, Uematsu S, Imoto S. Zero-shot-capable identification of phage-host relationships with whole-genome sequence representation by contrastive learning. Brief Bioinform 2023; 24:bbad239. [PMID: 37466138 PMCID: PMC10516345 DOI: 10.1093/bib/bbad239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 05/17/2023] [Accepted: 06/08/2023] [Indexed: 07/20/2023] Open
Abstract
Accurately identifying phage-host relationships from their genome sequences is still challenging, especially for those phages and hosts with less homologous sequences. In this work, focusing on identifying the phage-host relationships at the species and genus level, we propose a contrastive learning based approach to learn whole-genome sequence embeddings that can take account of phage-host interactions (PHIs). Contrastive learning is used to make phages infecting the same hosts close to each other in the new representation space. Specifically, we rephrase whole-genome sequences with frequency chaos game representation (FCGR) and learn latent embeddings that 'encapsulate' phages and host relationships through contrastive learning. The contrastive learning method works well on the imbalanced dataset. Based on the learned embeddings, a proposed pipeline named CL4PHI can predict known hosts and unseen hosts in training. We compare our method with two recently proposed state-of-the-art learning-based methods on their benchmark datasets. The experiment results demonstrate that the proposed method using contrastive learning improves the prediction accuracy on known hosts and demonstrates a zero-shot prediction capability on unseen hosts. In terms of potential applications, the rapid pace of genome sequencing across different species has resulted in a vast amount of whole-genome sequencing data that require efficient computational methods for identifying phage-host interactions. The proposed approach is expected to address this need by efficiently processing whole-genome sequences of phages and prokaryotic hosts and capturing features related to phage-host relationships for genome sequence representation. This approach can be used to accelerate the discovery of phage-host interactions and aid in the development of phage-based therapies for infectious diseases.
Collapse
Affiliation(s)
- Yao-zhong Zhang
- Division of Health Medical Intelligence, Human Genome Center, The Institute of Medical Science, The University of Tokyo, Shirokanedai 4-6-1, Minato-ku, 108-8639 Tokyo, Japan
| | - Yunjie Liu
- Division of Health Medical Intelligence, Human Genome Center, The Institute of Medical Science, The University of Tokyo, Shirokanedai 4-6-1, Minato-ku, 108-8639 Tokyo, Japan
| | - Zeheng Bai
- Division of Health Medical Intelligence, Human Genome Center, The Institute of Medical Science, The University of Tokyo, Shirokanedai 4-6-1, Minato-ku, 108-8639 Tokyo, Japan
| | - Kosuke Fujimoto
- Department of Immunology and Genomics, Graduate School of Medicine, Osaka Metropolitan University, Asahi-machi 1-4-3, Abeno-ku, 545-8585 Osaka, Japan
- Division of Metagenome Medicine, Human Genome Center, The Institute of Medical Science, The University of Tokyo, Shirokanedai 4-6-1, Minato-ku, 108-8639 Tokyo, Japan
| | - Satoshi Uematsu
- Department of Immunology and Genomics, Graduate School of Medicine, Osaka Metropolitan University, Asahi-machi 1-4-3, Abeno-ku, 545-8585 Osaka, Japan
- Division of Metagenome Medicine, Human Genome Center, The Institute of Medical Science, The University of Tokyo, Shirokanedai 4-6-1, Minato-ku, 108-8639 Tokyo, Japan
| | - Seiya Imoto
- Division of Health Medical Intelligence, Human Genome Center, The Institute of Medical Science, The University of Tokyo, Shirokanedai 4-6-1, Minato-ku, 108-8639 Tokyo, Japan
| |
Collapse
|
20
|
Yang M, Du S, Zhang Z, Xia Q, Liu H, Qin F, Wu Z, Ying H, Wu Y, Shao J, Zhao Y. Genomic diversity and biogeographic distributions of a novel lineage of bacteriophages that infect marine OM43 bacteria. Microbiol Spectr 2023; 11:e0494222. [PMID: 37607063 PMCID: PMC10580990 DOI: 10.1128/spectrum.04942-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2022] [Accepted: 07/07/2023] [Indexed: 08/24/2023] Open
Abstract
The marine methylotrophic OM43 clade is considered an important bacterial group in coastal microbial communities. OM43 bacteria, which are closely related to phytoplankton blooms, have small cell sizes and streamlined genomes. Bacteriophages profoundly shape the evolutionary trajectories, population dynamics, and physiology of microbes. The prevalence and diversity of several phages that infect OM43 bacteria have been reported. In this study, we isolated and sequenced two novel OM43 phages, MEP401 and MEP402. These phages share 90% of their open reading frames (ORFs) and are distinct from other known phage isolates. Furthermore, a total of 99 metagenomic viral genomes (MVGs) closely related to MEP401 and MEP402 were identified. Phylogenomic analyses suggest that MEP401, MEP402, and these identified MVGs belong to a novel subfamily in the family Zobellviridae and that they can be separated into two groups. Group I MVGs show conserved whole-genome synteny with MEP401, while group II MVGs possess the MEP401-type DNA replication module and a distinct type of morphogenesis and packaging module, suggesting that genomic recombination occurred between phages. Most members in these two groups were predicted to infect OM43 bacteria. Metagenomic read-mapping analysis revealed that the phages in these two groups are globally ubiquitous and display distinct biogeographic distributions, with some phages being predominant in cold regions, some exclusively detected in estuarine stations, and others displaying wider distributions. This study expands our knowledge of the diversity and ecology of a novel phage lineage that infects OM43 bacteria by describing their genomic diversity and global distribution patterns. IMPORTANCE OM43 phages that infect marine OM43 bacteria are important for host mortality, community structure, and physiological functions. In this study, two OM43 phages were isolated and characterized. Metagenomic viral genome (MVG) retrieval using these two OM43 phages as baits led to the identification of two phage groups of a new subfamily in the family Zobellviridae. We found that group I MVGs share similar genomic content and arrangement with MEP401 and MEP402, whereas group II MVGs only possess the MEP401-type DNA replication module. Metagenomic mapping analysis suggests that members in these two groups are globally ubiquitous with distinct distribution patterns. This study provides important insights into the genomic diversity and biogeography of the OM43 phages in the global ocean.
Collapse
Affiliation(s)
- Mingyu Yang
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Sen Du
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Zefeng Zhang
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Qian Xia
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - He Liu
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Fang Qin
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Zuqing Wu
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Hanqi Ying
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Yin Wu
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Jiabing Shao
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Yanlin Zhao
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Key Laboratory of Marine Biotechnology of Fujian Province, Institute of Oceanology, Fujian Agriculture and Forestry University, Fuzhou, China
| |
Collapse
|
21
|
Gonzales MEM, Ureta JC, Shrestha AMS. Protein embeddings improve phage-host interaction prediction. PLoS One 2023; 18:e0289030. [PMID: 37486915 PMCID: PMC10365317 DOI: 10.1371/journal.pone.0289030] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 07/07/2023] [Indexed: 07/26/2023] Open
Abstract
With the growing interest in using phages to combat antimicrobial resistance, computational methods for predicting phage-host interactions have been explored to help shortlist candidate phages. Most existing models consider entire proteomes and rely on manual feature engineering, which poses difficulty in selecting the most informative sequence properties to serve as input to the model. In this paper, we framed phage-host interaction prediction as a multiclass classification problem that takes as input the embeddings of a phage's receptor-binding proteins, which are known to be the key machinery for host recognition, and predicts the host genus. We explored different protein language models to automatically encode these protein sequences into dense embeddings without the need for additional alignment or structural information. We show that the use of embeddings of receptor-binding proteins presents improvements over handcrafted genomic and protein sequence features. The highest performance was obtained using the transformer-based protein language model ProtT5, resulting in a 3% to 4% increase in weighted F1 and recall scores across different prediction confidence thresholds, compared to using selected handcrafted sequence features.
Collapse
Affiliation(s)
- Mark Edward M Gonzales
- Bioinformatics Laboratory, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila, Philippines
- Department of Software Technology, College of Computer Studies, De La Salle University, Manila, Philippines
| | - Jennifer C Ureta
- Bioinformatics Laboratory, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila, Philippines
- Department of Software Technology, College of Computer Studies, De La Salle University, Manila, Philippines
| | - Anish M S Shrestha
- Bioinformatics Laboratory, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila, Philippines
- Systems and Computational Biology Research Unit, Center for Natural Sciences and Environmental Research, De La Salle University, Manila, Philippines
- Department of Software Technology, College of Computer Studies, De La Salle University, Manila, Philippines
| |
Collapse
|
22
|
Roux S, Camargo AP, Coutinho FH, Dabdoub SM, Dutilh BE, Nayfach S, Tritt A. iPHoP: An integrated machine learning framework to maximize host prediction for metagenome-derived viruses of archaea and bacteria. PLoS Biol 2023; 21:e3002083. [PMID: 37083735 PMCID: PMC10155999 DOI: 10.1371/journal.pbio.3002083] [Citation(s) in RCA: 54] [Impact Index Per Article: 54.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 05/03/2023] [Accepted: 03/15/2023] [Indexed: 04/22/2023] Open
Abstract
The extraordinary diversity of viruses infecting bacteria and archaea is now primarily studied through metagenomics. While metagenomes enable high-throughput exploration of the viral sequence space, metagenome-derived sequences lack key information compared to isolated viruses, in particular host association. Different computational approaches are available to predict the host(s) of uncultivated viruses based on their genome sequences, but thus far individual approaches are limited either in precision or in recall, i.e., for a number of viruses they yield erroneous predictions or no prediction at all. Here, we describe iPHoP, a two-step framework that integrates multiple methods to reliably predict host taxonomy at the genus rank for a broad range of viruses infecting bacteria and archaea, while retaining a low false discovery rate. Based on a large dataset of metagenome-derived virus genomes from the IMG/VR database, we illustrate how iPHoP can provide extensive host prediction and guide further characterization of uncultivated viruses.
Collapse
Affiliation(s)
- Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | - Antonio Pedro Camargo
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | | | - Shareef M Dabdoub
- Division of Biostatistics and Computational Biology, University of Iowa College of Dentistry, Iowa City, Iowa, United States of America
| | - Bas E Dutilh
- Institute of Biodiversity, Faculty of Biological Sciences, Cluster of Excellence Balance of the Microverse, Friedrich Schiller University, Jena, Germany
- Theoretical Biology and Bioinformatics, Science for Life, Utrecht University, Utrecht, the Netherlands
| | - Stephen Nayfach
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | - Andrew Tritt
- Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| |
Collapse
|
23
|
Beamud B, García-González N, Gómez-Ortega M, González-Candelas F, Domingo-Calap P, Sanjuan R. Genetic determinants of host tropism in Klebsiella phages. Cell Rep 2023; 42:112048. [PMID: 36753420 PMCID: PMC9989827 DOI: 10.1016/j.celrep.2023.112048] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Revised: 11/25/2022] [Accepted: 01/13/2023] [Indexed: 02/08/2023] Open
Abstract
Bacteriophages play key roles in bacterial ecology and evolution and are potential antimicrobials. However, the determinants of phage-host specificity remain elusive. Here, we isolate 46 phages to challenge 138 representative clinical isolates of Klebsiella pneumoniae, a widespread opportunistic pathogen. Spot tests show a narrow host range for most phages, with <2% of 6,319 phage-host combinations tested yielding detectable interactions. Bacterial capsule diversity is the main factor restricting phage host range. Consequently, phage-encoded depolymerases are key determinants of host tropism, and depolymerase sequence types are associated with the ability to infect specific capsular types across phage families. However, all phages with a broader host range found do not encode canonical depolymerases, suggesting alternative modes of entry. These findings expand our knowledge of the complex interactions between bacteria and their viruses and point out the feasibility of predicting the first steps of phage infection using bacterial and phage genome sequences.
Collapse
Affiliation(s)
- Beatriz Beamud
- Joint Research Unit Infection and Public Health, FISABIO-Universitat de València, 46020 València, Spain; Institute for Integrative Systems Biology (I(2)SysBio), Universitat de València-CSIC, 46980 Paterna, Spain
| | - Neris García-González
- Joint Research Unit Infection and Public Health, FISABIO-Universitat de València, 46020 València, Spain; Institute for Integrative Systems Biology (I(2)SysBio), Universitat de València-CSIC, 46980 Paterna, Spain
| | - Mar Gómez-Ortega
- Joint Research Unit Infection and Public Health, FISABIO-Universitat de València, 46020 València, Spain
| | - Fernando González-Candelas
- Joint Research Unit Infection and Public Health, FISABIO-Universitat de València, 46020 València, Spain; Institute for Integrative Systems Biology (I(2)SysBio), Universitat de València-CSIC, 46980 Paterna, Spain.
| | - Pilar Domingo-Calap
- Institute for Integrative Systems Biology (I(2)SysBio), Universitat de València-CSIC, 46980 Paterna, Spain.
| | - Rafael Sanjuan
- Institute for Integrative Systems Biology (I(2)SysBio), Universitat de València-CSIC, 46980 Paterna, Spain.
| |
Collapse
|
24
|
Spatiotemporal Dynamics of Coastal Viral Community Structure and Potential Biogeochemical Roles Affected by an Ulva prolifera Green Tide. mSystems 2023; 8:e0121122. [PMID: 36815859 PMCID: PMC10134843 DOI: 10.1128/msystems.01211-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023] Open
Abstract
The world's largest macroalgal green tide, caused by Ulva prolifera, has resulted in serious consequences for coastal waters of the Yellow Sea, China. Although viruses are considered to be one of the key factors in controlling microalgal bloom demise, understanding of the relationship between viral communities and the macroalgal green tide is still poor. Here, a Qingdao coastal virome (QDCV) time-series data set was constructed based on the metagenomic analysis of 17 DNA viromes along three coastal stations of the Yellow Sea, covering different stages of the green tide from Julian days 165 to 271. A total of 40,076 viral contigs were detected and clustered into 28,058 viral operational taxonomic units (vOTUs). About 84% of the vOTUs could not be classified, and 62% separated from vOTUs in other ecosystems. Green tides significantly influenced the spatiotemporal dynamics of the viral community structure, diversity, and potential functions. For the classified vOTUs, the relative abundance of Pelagibacter phages declined with the arrival of the bloom and rebounded after the bloom, while Synechococcus and Roseobacter phages increased, although with a time lag from the peak of their hosts. More than 80% of the vOTUs reached peaks in abundance at different specific stages, and the viral peaks were correlated with specific hosts at different stages of the green tide. Most of the viral auxiliary metabolic genes (AMGs) were associated with carbon and sulfur metabolism and showed spatiotemporal dynamics relating to the degradation of the large amount of organic matter released by the green tide. IMPORTANCE To the best of our knowledge, this study is the first to investigate the responses of viruses to the world's largest macroalgal green tide. It revealed the spatiotemporal dynamics of the unique viral assemblages and auxiliary metabolic genes (AMGs) following the variation and degradation of Ulva prolifera. These findings demonstrate a tight coupling between viral assemblages, and prokaryotic and eukaryotic abundances were influenced by the green tide.
Collapse
|
25
|
Bajiya N, Dhall A, Aggarwal S, Raghava GPS. Advances in the field of phage-based therapy with special emphasis on computational resources. Brief Bioinform 2023; 24:6961791. [PMID: 36575815 DOI: 10.1093/bib/bbac574] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 11/07/2022] [Accepted: 11/25/2022] [Indexed: 12/29/2022] Open
Abstract
In the current era, one of the major challenges is to manage the treatment of drug/antibiotic-resistant strains of bacteria. Phage therapy, a century-old technique, may serve as an alternative to antibiotics in treating bacterial infections caused by drug-resistant strains of bacteria. In this review, a systematic attempt has been made to summarize phage-based therapy in depth. This review has been divided into the following two sections: general information and computer-aided phage therapy (CAPT). In the case of general information, we cover the history of phage therapy, the mechanism of action, the status of phage-based products (approved and clinical trials) and the challenges. This review emphasizes CAPT, where we have covered primary phage-associated resources, phage prediction methods and pipelines. This review covers a wide range of databases and resources, including viral genomes and proteins, phage receptors, host genomes of phages, phage-host interactions and lytic proteins. In the post-genomic era, identifying the most suitable phage for lysing a drug-resistant strain of bacterium is crucial for developing alternate treatments for drug-resistant bacteria and this remains a challenging problem. Thus, we compile all phage-associated prediction methods that include the prediction of phages for a bacterial strain, the host for a phage and the identification of interacting phage-host pairs. Most of these methods have been developed using machine learning and deep learning techniques. This review also discussed recent advances in the field of CAPT, where we briefly describe computational tools available for predicting phage virions, the life cycle of phages and prophage identification. Finally, we describe phage-based therapy's advantages, challenges and opportunities.
Collapse
Affiliation(s)
- Nisha Bajiya
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India
| | - Anjali Dhall
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India
| | - Suchet Aggarwal
- Department of Computer Science and Engineering, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India
| |
Collapse
|
26
|
Iuchi H, Kawasaki J, Kubo K, Fukunaga T, Hokao K, Yokoyama G, Ichinose A, Suga K, Hamada M. Bioinformatics approaches for unveiling virus-host interactions. Comput Struct Biotechnol J 2023; 21:1774-1784. [PMID: 36874163 PMCID: PMC9969756 DOI: 10.1016/j.csbj.2023.02.044] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 02/22/2023] [Accepted: 02/22/2023] [Indexed: 03/03/2023] Open
Abstract
The coronavirus disease-2019 (COVID-19) pandemic has elucidated major limitations in the capacity of medical and research institutions to appropriately manage emerging infectious diseases. We can improve our understanding of infectious diseases by unveiling virus-host interactions through host range prediction and protein-protein interaction prediction. Although many algorithms have been developed to predict virus-host interactions, numerous issues remain to be solved, and the entire network remains veiled. In this review, we comprehensively surveyed algorithms used to predict virus-host interactions. We also discuss the current challenges, such as dataset biases toward highly pathogenic viruses, and the potential solutions. The complete prediction of virus-host interactions remains difficult; however, bioinformatics can contribute to progress in research on infectious diseases and human health.
Collapse
Affiliation(s)
- Hitoshi Iuchi
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan
| | - Junna Kawasaki
- Faculty of Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Kento Kubo
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Tsukasa Fukunaga
- Waseda Institute for Advanced Study, Waseda University, Nishi Waseda, Shinjuku-ku, Tokyo 169-0051, Japan
| | - Koki Hokao
- School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Gentaro Yokoyama
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Akiko Ichinose
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan
| | - Kanta Suga
- School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Michiaki Hamada
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan.,Graduate School of Medicine, Nippon Medical School, Tokyo 113-8602, Japan
| |
Collapse
|
27
|
Zhang Z, Wu Z, Liu H, Yang M, Wang R, Zhao Y, Chen F. Genomic analysis and characterization of phages infecting the marine Roseobacter CHAB-I-5 lineage reveal a globally distributed and abundant phage genus. Front Microbiol 2023; 14:1164101. [PMID: 37138617 PMCID: PMC10149686 DOI: 10.3389/fmicb.2023.1164101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Accepted: 03/27/2023] [Indexed: 05/05/2023] Open
Abstract
Marine phages play an important role in marine biogeochemical cycles by regulating the death, physiological metabolism, and evolutionary trajectory of bacteria. The Roseobacter group is an abundant and important heterotrophic bacterial group in the ocean, and plays an important role in carbon, nitrogen, sulfur and phosphorus cycling. The CHAB-I-5 lineage is one of the most dominant Roseobacter lineages, but remains largely uncultured. Phages infecting CHAB-I-5 bacteria have not yet been investigated due to the lack of culturable CHAB-I-5 strains. In this study, we isolated and sequenced two new phages (CRP-901 and CRP-902) infecting the CHAB-I-5 strain FZCC0083. We applied metagenomic data mining, comparative genomics, phylogenetic analysis, and metagenomic read-mapping to investigate the diversity, evolution, taxonomy, and biogeography of the phage group represented by the two phages. The two phages are highly similar, with an average nucleotide identity of 89.17%, and sharing 77% of their open reading frames. We identified several genes involved in DNA replication and metabolism, virion structure, DNA packing, and host lysis from their genomes. Metagenomic mining identified 24 metagenomic viral genomes closely related to CRP-901 and CRP-902. Genomic comparison and phylogenetic analysis demonstrated that these phages are distinct from other known viruses, representing a novel genus-level phage group (CRP-901-type). The CRP-901-type phages do not contain DNA primase and DNA polymerase genes, but possess a novel bifunctional DNA primase-polymerase gene with both primase and polymerase activities. Read-mapping analysis showed that the CRP-901-type phages are widespread across the world's oceans and are most abundant in estuarine and polar waters. Their abundance is generally higher than other known roseophages and even higher than most pelagiphages in the polar region. In summary, this study has greatly expanded our understanding of the genetic diversity, evolution, and distribution of roseophages. Our analysis suggests that the CRP-901-type phage is an important and novel marine phage group that plays important roles in the physiology and ecology of roseobacters.
Collapse
Affiliation(s)
- Zefeng Zhang
- Institute of Marine Science and Technology, Shandong University, Qingdao, China
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Zuqing Wu
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - He Liu
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Mingyu Yang
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Rui Wang
- Institute of Marine Science and Technology, Shandong University, Qingdao, China
| | - Yanlin Zhao
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- *Correspondence: Yanlin Zhao,
| | - Feng Chen
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, MD, United States
- Feng Chen,
| |
Collapse
|
28
|
Andrianjakarivony HF, Bettarel Y, Armougom F, Desnues C. Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing. Viruses 2022; 15:76. [PMID: 36680116 PMCID: PMC9862649 DOI: 10.3390/v15010076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 12/13/2022] [Accepted: 12/20/2022] [Indexed: 12/29/2022] Open
Abstract
Metagenomics studies have revealed tremendous viral diversity in aquatic environments. Yet, while the genomic data they have provided is extensive, it is unannotated. For example, most phage sequences lack accurate information about their bacterial host, which prevents reliable phage identification and the investigation of phage-host interactions. This study aimed to take this knowledge further, using a viral metagenomic framework to decipher the composition and diversity of phage communities and to predict their bacterial hosts. To this end, we used water and sediment samples collected from seven sites with varying contamination levels in the Ebrié Lagoon in Abidjan, Ivory Coast. The bacterial communities were characterized using the 16S rRNA metabarcoding approach, and a framework was developed to investigate the virome datasets that: (1) identified phage contigs with VirSorter and VIBRANT; (2) classified these contigs with MetaPhinder using the phage database (taxonomic annotation); and (3) predicted the phages' bacterial hosts with a machine learning-based tool: the Prokaryotic Virus-Host Predictor. The findings showed that the taxonomic profiles of phages and bacteria were specific to sediment or water samples. Phage sequences assigned to the Microviridae family were widespread in sediment samples, whereas phage sequences assigned to the Siphoviridae, Myoviridae and Podoviridae families were predominant in water samples. In terms of bacterial communities, the phyla Latescibacteria, Zixibacteria, Bacteroidetes, Acidobacteria, Calditrichaeota, Gemmatimonadetes, Cyanobacteria and Patescibacteria were most widespread in sediment samples, while the phyla Epsilonbacteraeota, Tenericutes, Margulisbacteria, Proteobacteria, Actinobacteria, Planctomycetes and Marinimicrobia were most prevalent in water samples. Significantly, the relative abundance of bacterial communities (at major phylum level) estimated by 16S rRNA metabarcoding and phage-host prediction were significantly similar. These results demonstrate the reliability of this novel approach for predicting the bacterial hosts of phages from shotgun metagenomic sequencing data.
Collapse
Affiliation(s)
- Harilanto Felana Andrianjakarivony
- Microbes, Evolution, Phylogeny, and Infection (MEΦI), IHU—Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France
- Microbiologie Environnementale Biotechnologie (MEB), Mediterranean Institute of Oceanography (MIO), 163 Avenue de Luminy, 13009 Marseille, France
| | - Yvan Bettarel
- MARBEC, Marine Biodiversity, Exploitation & Conservation, Université de Montpellier, CNRS, Ifremer, IRD, 093 Place Eugène Bataillon, 34090 Montpellier, France
| | - Fabrice Armougom
- Microbiologie Environnementale Biotechnologie (MEB), Mediterranean Institute of Oceanography (MIO), 163 Avenue de Luminy, 13009 Marseille, France
| | - Christelle Desnues
- Microbes, Evolution, Phylogeny, and Infection (MEΦI), IHU—Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France
- Microbiologie Environnementale Biotechnologie (MEB), Mediterranean Institute of Oceanography (MIO), 163 Avenue de Luminy, 13009 Marseille, France
| |
Collapse
|
29
|
Phage Therapy for Crops: Concepts, Experimental and Bioinformatics Approaches to Direct Its Application. Int J Mol Sci 2022; 24:ijms24010325. [PMID: 36613768 PMCID: PMC9820149 DOI: 10.3390/ijms24010325] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 12/14/2022] [Accepted: 12/16/2022] [Indexed: 12/28/2022] Open
Abstract
Phage therapy consists of applying bacteriophages, whose natural function is to kill specific bacteria. Bacteriophages are safe, evolve together with their host, and are environmentally friendly. At present, the indiscriminate use of antibiotics and salt minerals (Zn2+ or Cu2+) has caused the emergence of resistant strains that infect crops, causing difficulties and loss of food production. Phage therapy is an alternative that has shown positive results and can improve the treatments available for agriculture. However, the success of phage therapy depends on finding effective bacteriophages. This review focused on describing the potential, up to now, of applying phage therapy as an alternative treatment against bacterial diseases, with sustainable improvement in food production. We described the current isolation techniques, characterization, detection, and selection of lytic phages, highlighting the importance of complementary studies using genome analysis of the phage and its host. Finally, among these studies, we concentrated on the most relevant bacteriophages used for biocontrol of Pseudomonas spp., Xanthomonas spp., Pectobacterium spp., Ralstonia spp., Burkholderia spp., Dickeya spp., Clavibacter michiganensis, and Agrobacterium tumefaciens as agents that cause damage to crops, and affect food production around the world.
Collapse
|
30
|
Impact of HIV infection and integrase strand transfer inhibitors-based treatment on the gut virome. Sci Rep 2022; 12:21658. [PMID: 36522388 PMCID: PMC9755154 DOI: 10.1038/s41598-022-25979-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 12/07/2022] [Indexed: 12/23/2022] Open
Abstract
Viruses are the most abundant components of the human gut microbiome with a significant impact on health and disease. The effects of human immunodeficiency virus (HIV) infection on gut virome has been scarcely analysed. Several studies suggested that integrase strand transfers inhibitors (INSTIs) are associated with a healthier gut. Thus, the objective of this work was to evaluate the effects of HIV infection and INSTIs on gut virome composition. 26 non-HIV-infected volunteers, 15 naive HIV-infected patients and 15 INSTIs-treated HIV-infected patients were recruited and their gut virome composition was analysed using shotgun sequencing. Bacteriophages were the most abundant and diverse viruses present in gut. HIV infection was accompanied by a decrease in phage richness which was reverted after INSTIs-based treatment. β-diversity of phages revealed that samples from HIV-infected patients clustered separately from those belonging to the control group. Differential abundant analysis showed an increase in phages belonging to Caudoviricetes class in the naive group and a decrease of Malgrandaviricetes class phages in the INSTIs-treated group compared to the control group. Besides, it was observed that INSTIs-based treatment was not able to reverse the increase of lysogenic phages associated with HIV infection or to modify the decrease observed on the relative abundance of Proteobacteria-infecting phages. Our study describes for the first time the impact of HIV and INSTIs on gut virome and demonstrates that INSTIs-based treatments are able to partially restore gut dysbiosis at the viral level, which opens several opportunities for new studies focused on microbiota-based therapies.
Collapse
|
31
|
Amgarten D, Iha BKV, Piroupo CM, da Silva AM, Setubal JC. vHULK, a New Tool for Bacteriophage Host Prediction Based on Annotated Genomic Features and Neural Networks. PHAGE (NEW ROCHELLE, N.Y.) 2022; 3:204-212. [PMID: 36793881 PMCID: PMC9917316 DOI: 10.1089/phage.2021.0016] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Background The experimental determination of a bacteriophage host is a laborious procedure. Thus, there is a pressing need for reliable computational predictions of bacteriophage hosts. Materials and Methods We developed the program vHULK for phage host prediction based on 9504 phage genome features, which consider alignment significance scores between predicted proteins and a curated database of viral protein families. The features were fed to a neural network, and two models were trained to predict 77 host genera and 118 host species. Results In controlled random test sets with 90% redundancy reduction in terms of protein similarity, vHULK obtained on average 83% precision and 79% recall at the genus level, and 71% precision and 67% recall at the species level. The performance of vHULK was compared against three other tools on a test data set with 2153 phage genomes. On this data set, vHULK achieved better performance at both the genus and the species levels than the other tools. Conclusions Our results suggest that vHULK represents an advance on the state of art in phage host prediction.
Collapse
Affiliation(s)
- Deyvid Amgarten
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, Brazil
| | | | - Carlos Morais Piroupo
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, Brazil
| | - Aline Maria da Silva
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, Brazil
| | - João Carlos Setubal
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, Brazil
| |
Collapse
|
32
|
Albrycht K, Rynkiewicz AA, Harasymczuk M, Barylski J, Zielezinski A. Daily Reports on Phage-Host Interactions. Front Microbiol 2022; 13:946070. [PMID: 35910653 PMCID: PMC9329054 DOI: 10.3389/fmicb.2022.946070] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 06/23/2022] [Indexed: 11/13/2022] Open
Abstract
Understanding phage-host relationships is crucial for the study of virus biology and the application of phages in biotechnology and medicine. However, information concerning the range of hosts for bacterial and archaeal viruses is scattered across numerous databases and is difficult to obtain. Therefore, here we present PHD (Phage & Host Daily), a web application that offers a comprehensive, up-to-date catalog of known phage-host associations that allows users to select viruses targeting specific bacterial and archaeal taxa of interest. Our service combines the latest information on virus-host interactions from seven source databases with current taxonomic classification retrieved directly from the groups and institutions responsible for its maintenance. The web application also provides summary statistics on host and virus diversity, their pairwise interactions, and the host range of deposited phages. PHD is updated daily and available at http://phdaily.info or http://combio.pl/phdaily.
Collapse
Affiliation(s)
- Kamil Albrycht
- Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University, Poznań, Poland
| | - Adam A. Rynkiewicz
- Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University, Poznań, Poland
| | - Michal Harasymczuk
- Department of Traumatology, Orthopaedics and Hand Surgery, University of Medical Sciences, Poznan, Poland
| | - Jakub Barylski
- Department of Molecular Virology, Faculty of Biology, Adam Mickiewicz University, Poznań, Poland
| | - Andrzej Zielezinski
- Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University, Poznań, Poland
- *Correspondence: Andrzej Zielezinski,
| |
Collapse
|
33
|
Díaz-Galián MV, Vega-Rodríguez MA, Molina F. PhageCocktail: An R package to design phage cocktails from experimental phage-bacteria infection networks. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 221:106865. [PMID: 35576688 DOI: 10.1016/j.cmpb.2022.106865] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 04/18/2022] [Accepted: 05/07/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND AND OBJECTIVE Phage therapy is a resurgent strategy used in medicine and the food industry to lyse bacteria that cause damage to health or spoil a food product. Frequently, phage-bacteria infection networks have a large size, making it impossible to manually study all possible phage cocktails. Thus, this article presents an R package called PhageCocktail to automatically design efficient phage cocktails from phage-bacteria infection networks. METHODS This R package includes four different methods for designing phage cocktails: ExhaustiveSearch, ExhaustivePhi, ClusteringSearch, and ClusteringPhi. These four methods are explained in detail and are evaluated using 13 empirical phage-bacteria infection networks. More specifically, runtime and expected success (fraction of lysed bacteria) are analyzed. RESULTS The four methods have variations in terms of runtime and quality of the results. ExhaustiveSearch always provides the best possible phage cocktail, but its runtime could be long. ExhaustivePhi only focuses on one cocktail size, the one estimated as the best; thus, its runtime is less than ExhaustiveSearch, but it can produce cocktails with more phages than necessary. ClusteringSearch and ClusteringPhi are very fast (generally, less than one millisecond), providing always immediate results due to clustering techniques, but their accuracies can be lower, yielding cocktails with lower expected successes. CONCLUSIONS The larger the phage-bacteria infection network is, the more complex its analysis is. Thus, this tool eases this task for scientists and other users while designing phage cocktails of good quality. This R package includes four different methods; therefore, users may choose among them, considering their preferences in speed and accuracy of results.
Collapse
Affiliation(s)
- María Victoria Díaz-Galián
- Escuela Politécnica, Universidad de Extremadura (https://ror.org/0174shg90), Avda. de la Universidad s/n, Cáceres, 10003, Spain.
| | - Miguel A Vega-Rodríguez
- Escuela Politécnica, Universidad de Extremadura (https://ror.org/0174shg90), Avda. de la Universidad s/n, Cáceres, 10003, Spain.
| | - Felipe Molina
- Facultad de Ciencias, Universidad de Extremadura (https://ror.org/0174shg90), Avda. de Elvas s/n, Badajoz, 06006, Spain.
| |
Collapse
|
34
|
Shang J, Sun Y. CHERRY: a Computational metHod for accuratE pRediction of virus-pRokarYotic interactions using a graph encoder-decoder model. Brief Bioinform 2022; 23:6589865. [PMID: 35595715 PMCID: PMC9487644 DOI: 10.1093/bib/bbac182] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 04/01/2022] [Accepted: 04/24/2022] [Indexed: 01/01/2023] Open
Abstract
Prokaryotic viruses, which infect bacteria and archaea, are key players in microbial communities. Predicting the hosts of prokaryotic viruses helps decipher the dynamic relationship between microbes. Experimental methods for host prediction cannot keep pace with the fast accumulation of sequenced phages. Thus, there is a need for computational host prediction. Despite some promising results, computational host prediction remains a challenge because of the limited known interactions and the sheer amount of sequenced phages by high-throughput sequencing technologies. The state-of-the-art methods can only achieve 43% accuracy at the species level. In this work, we formulate host prediction as link prediction in a knowledge graph that integrates multiple protein and DNA-based sequence features. Our implementation named CHERRY can be applied to predict hosts for newly discovered viruses and to identify viruses infecting targeted bacteria. We demonstrated the utility of CHERRY for both applications and compared its performance with 11 popular host prediction methods. To our best knowledge, CHERRY has the highest accuracy in identifying virus–prokaryote interactions. It outperforms all the existing methods at the species level with an accuracy increase of 37%. In addition, CHERRY’s performance on short contigs is more stable than other tools.
Collapse
Affiliation(s)
- Jiayu Shang
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China SAR
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China SAR
| |
Collapse
|
35
|
Newly identified HMO-2011-type phages reveal genomic diversity and biogeographic distributions of this marine viral group. THE ISME JOURNAL 2022; 16:1363-1375. [PMID: 35022515 PMCID: PMC9038755 DOI: 10.1038/s41396-021-01183-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 12/17/2021] [Accepted: 12/23/2021] [Indexed: 12/13/2022]
Abstract
Viruses play critical roles in influencing biogeochemical cycles and adjusting host mortality, population structure, physiology, and evolution in the ocean. Marine viral communities are composed of numerous genetically distinct subfamily/genus-level viral groups. Among currently identified viral groups, the HMO-2011-type group is known to be dominant and broadly distributed. However, only four HMO-2011-type cultivated representatives that infect marine SAR116 and Roseobacter strains have been reported to date, and the genetic diversity, potential hosts, and ecology of this group remain poorly elucidated. Here, we present the genomes of seven HMO-2011-type phages that were isolated using four Roseobacter strains and one SAR11 strain, as well as additional 207 HMO-2011-type metagenomic viral genomes (MVGs) identified from various marine viromes. Phylogenomic and shared-gene analyses revealed that the HMO-2011-type group is a subfamily-level group comprising at least 10 discernible genus-level subgroups. Moreover, >2000 HMO-2011-type DNA polymerase sequences were identified, and the DNA polymerase phylogeny also revealed that the HMO-2011-type group contains diverse subgroups and is globally distributed. Metagenomic read-mapping results further showed that most HMO-2011-type phages are prevalent in global oceans and display distinct geographic distributions, with the distribution of most HMO-2011-type phages being associated with temperature. Lastly, we found that members in subgroup IX, represented by pelagiphage HTVC033P, were among the most abundant HMO-2011-type phages, which implies that SAR11 bacteria are crucial hosts for this viral group. In summary, our findings substantially expand current knowledge regarding the phylogenetic diversity, evolution, and distribution of HMO-2011-type phages, highlighting HMO-2011-type phages as major ecological agents that can infect certain key bacterial groups.
Collapse
|
36
|
A Novel and Ubiquitous Marine Methylophage Provides Insights into Viral-Host Coevolution and Possible Host-Range Expansion in Streamlined Marine Heterotrophic Bacteria. Appl Environ Microbiol 2022; 88:e0025522. [PMID: 35311512 PMCID: PMC9004378 DOI: 10.1128/aem.00255-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
The methylotrophic OM43 clade are Gammaproteobacteria that comprise some of the smallest free-living cells known and have highly streamlined genomes. OM43 represents an important microbial link between marine primary production and remineralization of carbon back to the atmosphere. Bacteriophages shape microbial communities and are major drivers of mortality and global marine biogeochemistry. Recent cultivation efforts have brought the first viruses infecting members of the OM43 clade into culture. Here, we characterize a novel myophage infecting OM43 called Melnitz. Melnitz was isolated independently from water samples from a subtropical ocean gyre (Sargasso Sea) and temperate coastal (Western English Channel) systems. Metagenomic recruitment from global ocean viromes confirmed that Melnitz is globally ubiquitous, congruent with patterns of host abundance. Bacteria with streamlined genomes such as OM43 and the globally dominant SAR11 clade use riboswitches as an efficient method to regulate metabolism. Melnitz encodes a two-piece tmRNA (ssrA), controlled by a glutamine riboswitch, providing evidence that riboswitch use also occurs for regulation during phage infection of streamlined heterotrophs. Virally encoded tRNAs and ssrA found in Melnitz were phylogenetically more closely related to those found within the alphaproteobacterial SAR11 clade and their associated myophages than those within their gammaproteobacterial hosts. This suggests the possibility of an ancestral host transition event between SAR11 and OM43. Melnitz and a related myophage that infects SAR11 were unable to infect hosts of the SAR11 and OM43, respectively, suggesting host transition rather than a broadening of host range. IMPORTANCE Isolation and cultivation of viruses are the foundations on which the mechanistic understanding of virus-host interactions and parameterization of bioinformatic tools for viral ecology are based. This study isolated and characterized the first myophage known to infect the OM43 clade, expanding our knowledge of this understudied group of microbes. The nearly identical genomes of four strains of Melnitz isolated from different marine provinces and the global abundance estimations from metagenomic data suggest that this viral population is globally ubiquitous. Genome analysis revealed several unusual features in Melnitz and related genomes recovered from viromes, such as a curli operon and virally encoded tmRNA controlled by a glutamine riboswitch, neither of which are found in the host. Further phylogenetic analysis of shared genes indicates that this group of viruses infecting the gammaproteobacterial OM43 shares a recent common ancestor with viruses infecting the abundant alphaproteobacterial SAR11 clade. Host ranges are affected by compatible cell surface receptors, successful circumvention of superinfection exclusion systems, and the presence of required accessory proteins, which typically limits phages to singular narrow groups of closely related bacterial hosts. This study provides intriguing evidence that for streamlined heterotrophic bacteria, virus-host transitioning may not be necessarily restricted to phylogenetically related hosts but is a function of shared physical and biochemical properties of the cell.
Collapse
|
37
|
Zuppi M, Hendrickson HL, O’Sullivan JM, Vatanen T. Phages in the Gut Ecosystem. Front Cell Infect Microbiol 2022; 11:822562. [PMID: 35059329 PMCID: PMC8764184 DOI: 10.3389/fcimb.2021.822562] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 12/10/2021] [Indexed: 12/23/2022] Open
Abstract
Phages, short for bacteriophages, are viruses that specifically infect bacteria and are the most abundant biological entities on earth found in every explored environment, from the deep sea to the Sahara Desert. Phages are abundant within the human biome and are gaining increasing recognition as potential modulators of the gut ecosystem. For example, they have been connected to gastrointestinal diseases and the treatment efficacy of Fecal Microbiota Transplant. The ability of phages to modulate the human gut microbiome has been attributed to the predation of bacteria or the promotion of bacterial survival by the transfer of genes that enhance bacterial fitness upon infection. In addition, phages have been shown to interact with the human immune system with variable outcomes. Despite the increasing evidence supporting the importance of phages in the gut ecosystem, the extent of their influence on the shape of the gut ecosystem is yet to be fully understood. Here, we discuss evidence for phage modulation of the gut microbiome, postulating that phages are pivotal contributors to the gut ecosystem dynamics. We therefore propose novel research questions to further elucidate the role(s) that they have within the human ecosystem and its impact on our health and well-being.
Collapse
Affiliation(s)
- Michele Zuppi
- The Liggins Institute, University of Auckland, Auckland, New Zealand
| | - Heather L. Hendrickson
- The School of Natural and Computational Sciences, Massey University, Auckland, New Zealand
| | - Justin M. O’Sullivan
- The Liggins Institute, University of Auckland, Auckland, New Zealand
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, United Kingdom
| | - Tommi Vatanen
- The Liggins Institute, University of Auckland, Auckland, New Zealand
- The Broad Institute of MIT and Harvard, Cambridge, MA, United States
| |
Collapse
|
38
|
Abstract
Motivation Phage–host associations play important roles in microbial communities. But in natural communities, as opposed to culture-based lab studies where phages are discovered and characterized metagenomically, their hosts are generally not known. Several programs have been developed for predicting which phage infects which host based on various sequence similarity measures or machine learning approaches. These are often based on whole viral and host genomes, but in metagenomics-based studies, we rarely have whole genomes but rather must rely on contigs that are sometimes as short as hundreds of bp long. Therefore, we need programs that predict hosts of phage contigs on the basis of these short contigs. Although most existing programs can be applied to metagenomic datasets for these predictions, their accuracies are generally low. Here, we develop ContigNet, a convolutional neural network-based model capable of predicting phage–host matches based on relatively short contigs, and compare it to previously published VirHostMatcher (VHM) and WIsH. Results On the validation set, ContigNet achieves 72–85% area under the receiver operating characteristic curve (AUROC) scores, compared to the maximum of 68% by VHM or WIsH for contigs of lengths between 200 bps to 50 kbps. We also apply the model to the Metagenomic Gut Virus (MGV) catalogue, a dataset containing a wide range of draft genomes from metagenomic samples and achieve 60–70% AUROC scores compared to that of VHM and WIsH of 52%. Surprisingly, ContigNet can also be used to predict plasmid-host contig associations with high accuracy, indicating a similar genetic exchange between mobile genetic elements and their hosts. Availability and implementation The source code of ContigNet and related datasets can be downloaded from https://github.com/tianqitang1/ContigNet.
Collapse
Affiliation(s)
- Tianqi Tang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Shengwei Hou
- Department of Ocean Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Marine and Environmental Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Jed A Fuhrman
- Marine and Environmental Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Fengzhu Sun
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
39
|
Menor-Flores M, Vega-Rodríguez MA, Molina F. Computational design of phage cocktails based on phage-bacteria infection networks. Comput Biol Med 2022; 142:105186. [PMID: 34998221 DOI: 10.1016/j.compbiomed.2021.105186] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Revised: 12/22/2021] [Accepted: 12/26/2021] [Indexed: 01/16/2023]
Abstract
The misuse and overuse of antibiotics have boosted the proliferation of multidrug-resistant (MDR) bacteria, which are considered a major public health issue in the twenty-first century. Phage therapy may be a promising way in the treatment of infections caused by MDR pathogens, without the side effects of the current available antimicrobials. Phage therapy is based on phage cocktails, that is, combinations of phages able to lyse the target bacteria. In this work, we present and explain in detail two innovative computational methods to design phage cocktails taking into account a given phage-bacteria infection network. One of the methods (Exhaustive Search) always generates the best possible phage cocktail, while the other method (Network Metrics) always keeps a very reduced runtime (a few milliseconds). Both methods have been included in a Cytoscape application that is available for any user. A complete experimental study has been performed, evaluating and comparing the biological quality, runtime, and the impact when additional phages are included in the cocktail.
Collapse
Affiliation(s)
- Manuel Menor-Flores
- Escuela Politécnica, Universidad de Extremadura(1), Avda. de la Universidad s/n, 10 003, Cáceres, Spain.
| | - Miguel A Vega-Rodríguez
- Escuela Politécnica, Universidad de Extremadura(1), Avda. de la Universidad s/n, 10 003, Cáceres, Spain.
| | - Felipe Molina
- Facultad de Ciencias, Universidad de Extremadura(1), Avda. de Elvas s/n, 06 006, Badajoz, Spain.
| |
Collapse
|
40
|
Shang J, Sun Y. Predicting the hosts of prokaryotic viruses using GCN-based semi-supervised learning. BMC Biol 2021; 19:250. [PMID: 34819064 PMCID: PMC8611875 DOI: 10.1186/s12915-021-01180-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 10/29/2021] [Indexed: 11/23/2022] Open
Abstract
Background Prokaryotic viruses, which infect bacteria and archaea, are the most abundant and diverse biological entities in the biosphere. To understand their regulatory roles in various ecosystems and to harness the potential of bacteriophages for use in therapy, more knowledge of viral-host relationships is required. High-throughput sequencing and its application to the microbiome have offered new opportunities for computational approaches for predicting which hosts particular viruses can infect. However, there are two main challenges for computational host prediction. First, the empirically known virus-host relationships are very limited. Second, although sequence similarity between viruses and their prokaryote hosts have been used as a major feature for host prediction, the alignment is either missing or ambiguous in many cases. Thus, there is still a need to improve the accuracy of host prediction. Results In this work, we present a semi-supervised learning model, named HostG, to conduct host prediction for novel viruses. We construct a knowledge graph by utilizing both virus-virus protein similarity and virus-host DNA sequence similarity. Then graph convolutional network (GCN) is adopted to exploit viruses with or without known hosts in training to enhance the learning ability. During the GCN training, we minimize the expected calibrated error (ECE) to ensure the confidence of the predictions. We tested HostG on both simulated and real sequencing data and compared its performance with other state-of-the-art methods specifically designed for virus host classification (VHM-net, WIsH, PHP, HoPhage, RaFAH, vHULK, and VPF-Class). Conclusion HostG outperforms other popular methods, demonstrating the efficacy of using a GCN-based semi-supervised learning approach. A particular advantage of HostG is its ability to predict hosts from new taxa. Supplementary Information The online version contains supplementary material available at (10.1186/s12915-021-01180-4).
Collapse
Affiliation(s)
- Jiayu Shang
- Electrical Engineering, City University of Hong Kong, Hong Kong, China
| | - Yanni Sun
- Electrical Engineering, City University of Hong Kong, Hong Kong, China.
| |
Collapse
|
41
|
Zielezinski A, Barylski J, Karlowski WM. Taxonomy-aware, sequence similarity ranking reliably predicts phage-host relationships. BMC Biol 2021; 19:223. [PMID: 34625070 PMCID: PMC8501573 DOI: 10.1186/s12915-021-01146-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 09/06/2021] [Indexed: 12/02/2022] Open
Abstract
Background Characterizing phage–host interactions is critical to understanding the ecological role of both partners and effective isolation of phage therapeuticals. Unfortunately, experimental methods for studying these interactions are markedly slow, low-throughput, and unsuitable for phages or hosts difficult to maintain in laboratory conditions. Therefore, a number of in silico methods emerged to predict prokaryotic hosts based on viral sequences. One of the leading approaches is the application of the BLAST tool that searches for local similarities between viral and microbial genomes. However, this prediction method has three major limitations: (i) top-scoring sequences do not always point to the actual host; (ii) mosaic virus genomes may match to many, typically related, bacteria; and (iii) viral and host sequences may diverge beyond the point where their relationship can be detected by a BLAST alignment. Results We created an extension to BLAST, named Phirbo, that improves host prediction quality beyond what is obtainable from standard BLAST searches. The tool harnesses information concerning sequence similarity and bacteria relatedness to predict phage–host interactions. Phirbo was evaluated on three benchmark sets of known virus–host pairs, and it improved precision and recall by 11–40 percentage points over currently available, state-of-the-art, alignment-based, alignment-free, and machine-learning host prediction tools. Moreover, the discriminatory power of Phirbo for the recognition of virus–host relationships surpassed the results of other tools by at least 10 percentage points (area under the curve = 0.95), yielding a mean host prediction accuracy of 57% and 68% at the genus and family levels, respectively, and drops by 12 percentage points when using only a fraction of viral genome sequences (3 kb). Finally, we provide insights into a repertoire of protein and ncRNA genes that are shared between phages and hosts and may be prone to horizontal transfer during infection. Conclusions Our results suggest that Phirbo is a simple and effective tool for predicting phage–host relationships. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-021-01146-6.
Collapse
Affiliation(s)
- Andrzej Zielezinski
- Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University Poznan, Uniwersytetu Poznanskiego 6, 61-614, Poznan, Poland.
| | - Jakub Barylski
- Molecular Virology Research Unit, Faculty of Biology, Adam Mickiewicz University Poznan, Uniwersytetu Poznanskiego 6, 61-614, Poznan, Poland
| | - Wojciech M Karlowski
- Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University Poznan, Uniwersytetu Poznanskiego 6, 61-614, Poznan, Poland.
| |
Collapse
|