1
|
Lai S, Wang H, Bork P, Chen WH, Zhao XM. Long-read sequencing reveals extensive gut phageome structural variations driven by genetic exchange with bacterial hosts. SCIENCE ADVANCES 2024; 10:eadn3316. [PMID: 39141729 PMCID: PMC11323893 DOI: 10.1126/sciadv.adn3316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 07/10/2024] [Indexed: 08/16/2024]
Abstract
Genetic variations are instrumental for unraveling phage evolution and deciphering their functional implications. Here, we explore the underlying fine-scale genetic variations in the gut phageome, especially structural variations (SVs). By using virome-enriched long-read metagenomic sequencing across 91 individuals, we identified a total of 14,438 nonredundant phage SVs and revealed their prevalence within the human gut phageome. These SVs are mainly enriched in genes involved in recombination, DNA methylation, and antibiotic resistance. Notably, a substantial fraction of phage SV sequences share close homology with bacterial fragments, with most SVs enriched for horizontal gene transfer (HGT) mechanism. Further investigations showed that these SV sequences were genetic exchanged between specific phage-bacteria pairs, particularly between phages and their respective bacterial hosts. Temperate phages exhibit a higher frequency of genetic exchange with bacterial chromosomes and then virulent phages. Collectively, our findings provide insights into the genetic landscape of the human gut phageome.
Collapse
Affiliation(s)
- Senying Lai
- Department of Neurology, Zhongshan Hospital and Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China
- MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Huarui Wang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Peer Bork
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Heidelberg, Germany
- Max Delbrück Centre for Molecular Medicine, Berlin, Germany
- Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany
| | - Wei-Hua Chen
- State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China
- College of Life Science, Henan Normal University, Xinxiang, Henan, China
| | - Xing-Ming Zhao
- Department of Neurology, Zhongshan Hospital and Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China
- MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| |
Collapse
|
2
|
Aldeguer-Riquelme B, Conrad RE, Antón J, Rossello-Mora R, Konstantinidis KT. A natural ANI gap that can define intra-species units of bacteriophages and other viruses. mBio 2024; 15:e0153624. [PMID: 39037288 PMCID: PMC11323488 DOI: 10.1128/mbio.01536-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Accepted: 06/24/2024] [Indexed: 07/23/2024] Open
Abstract
Despite the importance of intra-species variants of viruses for causing disease and/or disrupting ecosystem functioning, there is no universally applicable standard to define these. A (natural) gap in whole-genome average nucleotide identity (ANI) values around 95% is commonly used to define species, especially for bacteriophages, but whether a similar gap exists within species that can be used to define intra-species units has not been evaluated yet. Whole-genome comparisons among members of 1,016 bacteriophage (Caudoviricetes) species revealed a region of low frequency of ANI values around 99.2%-99.8%, showing threefold or fewer pairs than expected for an even distribution. This second gap is prevalent in viruses infecting various cultured or uncultured hosts from a variety of environments, although a few exceptions to this pattern were also observed (3.7% of total species) and are likely attributed to cultivation biases or other factors. Similar results were observed for a limited set of eukaryotic viruses that are adequately sampled, including SARS-CoV-2, whose ANI-based clusters matched well with the WHO-defined variants of concern, indicating that our findings from bacteriophages might be more broadly applicable and the ANI-based clusters may represent functionally and/or ecologically distinct units. These units appear to be predominantly driven by (high) ecological cohesiveness coupled to either frequent recombination for bacteriophages or selection and clonal evolution for other viruses such as SARS-CoV-2, indicating that fundamentally different underlying mechanisms could lead to similar diversity patterns. Accordingly, we propose the ANI gap approach outlined above for defining viral intra-species units, for which we propose the term genomovars. IMPORTANCE Viral species are composed of an ensemble of intra-species variants whose individual dynamics may have major implications for human and animal health and/or ecosystem functioning. However, the lack of universally accepted standards to define these intra-species variants has led researchers to use different approaches for this task, creating inconsistent intra-species units across different viral families and confusion in communication. By comparing hundreds of mostly bacteriophage genomes, we show that there is a widely distributed natural gap in whole-genome average nucleotide identity values in most, but not all, of these species that can be used to define intra-species units. Therefore, these results advance the molecular toolbox for tracking viral intra-species units and should facilitate future epidemiological and environmental studies.
Collapse
Affiliation(s)
- Borja Aldeguer-Riquelme
- School of Civil & Environmental Engineering and School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, USA
| | - Roth E. Conrad
- School of Civil & Environmental Engineering and School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, USA
| | - Josefa Antón
- Department of Physiology, Genetics and Microbiology, University of Alicante, San Vicente del Raspeig, Spain
| | - Ramon Rossello-Mora
- Marine Microbiology Group, Department of Animal and Microbial Biodiversity, Mediterranean Institute for Advanced Studies (IMEDEA, CSIC-UIB), Esporles, Spain
| | - Konstantinos T. Konstantinidis
- School of Civil & Environmental Engineering and School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, USA
| |
Collapse
|
3
|
Veldsman WP, Yang C, Zhang Z, Huang Y, Chowdhury D, Zhang L. Structural and Functional Disparities within the Human Gut Virome in Terms of Genome Topology and Representative Genome Selection. Viruses 2024; 16:134. [PMID: 38257834 PMCID: PMC10820185 DOI: 10.3390/v16010134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 01/12/2024] [Accepted: 01/16/2024] [Indexed: 01/24/2024] Open
Abstract
Circularity confers protection to viral genomes where linearity falls short, thereby fulfilling the form follows function aphorism. However, a shift away from morphology-based classification toward the molecular and ecological classification of viruses is currently underway within the field of virology. Recent years have seen drastic changes in the International Committee on Taxonomy of Viruses' operational definitions of viruses, particularly for the tailed phages that inhabit the human gut. After the abolition of the order Caudovirales, these tailed phages are best defined as members of the class Caudoviricetes. To determine the epistemological value of genome topology in the context of the human gut virome, we designed a set of seven experiments to assay the impact of genome topology and representative viral selection on biological interpretation. Using Oxford Nanopore long reads for viral genome assembly coupled with Illumina short-read polishing, we showed that circular and linear virus genomes differ remarkably in terms of genome quality, GC skew, transfer RNA gene frequency, structural variant frequency, cross-reference functional annotation (COG, KEGG, Pfam, and TIGRfam), state-of-the-art marker-based classification, and phage-host interaction. Furthermore, the disparity profile changes during dereplication. In particular, our phage-host interaction results demonstrated that proportional abundances cannot be meaningfully compared without due regard for genome topology and dereplication threshold, which necessitates the need for standardized reporting. As a best practice guideline, we recommend that comparative studies of the human gut virome always report the ratio of circular to linear viral genomes along with the dereplication threshold so that structural and functional metrics can be placed into context when assessing biologically relevant metagenomic properties such as proportional abundance.
Collapse
Affiliation(s)
- Werner P. Veldsman
- Department of Computer Science, Hong Kong Baptist University, Kowloon, Hong Kong SAR, China; (W.P.V.); (C.Y.); (Z.Z.)
| | - Chao Yang
- Department of Computer Science, Hong Kong Baptist University, Kowloon, Hong Kong SAR, China; (W.P.V.); (C.Y.); (Z.Z.)
| | - Zhenmiao Zhang
- Department of Computer Science, Hong Kong Baptist University, Kowloon, Hong Kong SAR, China; (W.P.V.); (C.Y.); (Z.Z.)
| | | | - Debajyoti Chowdhury
- School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, China;
- Computational Medicine Laboratory, Hong Kong Baptist University, Hong Kong SAR, China
| | - Lu Zhang
- Department of Computer Science, Hong Kong Baptist University, Kowloon, Hong Kong SAR, China; (W.P.V.); (C.Y.); (Z.Z.)
- Institute for Research and Continuing Education, Hong Kong Baptist University, Shenzhen 518057, China
| |
Collapse
|
4
|
Oh C, Zhou A, O'Brien K, Schmidt AR, Geltz J, Shisler JL, Schmidt AR, Keefer L, Brown WM, Nguyen TH. Improved performance of nucleic acid-based assays for genetically diverse norovirus surveillance. Appl Environ Microbiol 2023; 89:e0033123. [PMID: 37791775 PMCID: PMC10654041 DOI: 10.1128/aem.00331-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 07/07/2023] [Indexed: 10/05/2023] Open
Abstract
Nucleic acid-based assays, such as polymerase chain reaction (PCR), that amplify and detect organism-specific genome sequences are a standard method for infectious disease surveillance. However, challenges arise for virus surveillance because of their genetic diversity. Here, we calculated the variability of nucleotides within the genomes of 10 human viral species in silico and found that endemic viruses exhibit a high percentage of variable nucleotides (e.g., 51.4% for norovirus genogroup II). This genetic diversity led to the variable probability of detection of PCR assays (the proportion of viral sequences that contain the assay's target sequences divided by the total number of viral sequences). We then experimentally confirmed that the probability of the target sequence detection is indicative of the number of mismatches between PCR assays and norovirus genomes. Next, we developed a degenerate PCR assay that detects 97% of known norovirus genogroup II genome sequences and recognized norovirus in eight clinical samples. By contrast, previously developed assays with 31% and 16% probability of detection had 1.1 and 2.5 mismatches on average, respectively, which negatively impacted RNA quantification. In addition, the two PCR assays with a lower probability of detection also resulted in false negatives for wastewater-based epidemiology. Our findings suggest that the probability of detection serves as a simple metric for evaluating nucleic acid-based assays for genetically diverse virus surveillance.IMPORTANCENucleic acid-based assays, such as polymerase chain reaction (PCR), that amplify and detect organism-specific genome sequences are employed widely as a standard method for infectious disease surveillance. However, challenges arise for virus surveillance because of the rapid evolution and genetic variation of viruses. The study analyzed clinical and wastewater samples using multiple PCR assays and found significant performance variation among the PCR assays for genetically diverse norovirus surveillance. This finding suggests that some PCR assays may miss detecting certain virus strains, leading to a compromise in detection sensitivity. To address this issue, we propose a metric called the probability of detection, which can be simply calculated in silico using a code developed in this study, to evaluate nucleic acid-based assays for genetically diverse virus surveillance. This new approach can help improve the sensitivity and accuracy of virus detection, which is crucial for effective infectious disease surveillance and control.
Collapse
Affiliation(s)
- Chamteut Oh
- Department of Civil and Environmental Engineering, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- Department of Environmental Engineering Sciences, University of Florida, Gainesville, Florida, USA
| | - Aijia Zhou
- Department of Civil and Environmental Engineering, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Kate O'Brien
- School of Integrative Biology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Arthur R. Schmidt
- Department of Civil and Environmental Engineering, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Joshua Geltz
- Division of Laboratories, Illinois Department of Public Health, Springfield, Illinois, USA
| | - Joanna L. Shisler
- Department of Microbiology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Arthur R. Schmidt
- Department of Civil and Environmental Engineering, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Laura Keefer
- Illinois State Water Survey, Prairie Research Institute, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - William M. Brown
- Department of Pathobiology, College of Veterinary Medicine, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Thanh H. Nguyen
- Department of Civil and Environmental Engineering, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- Institute of Genomic Biology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- Carle Illinois College of Medicine, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
5
|
Peng Y, Lu Z, Pan D, Shi LD, Zhao Z, Liu Q, Zhang C, Jia K, Li J, Hubert CRJ, Dong X. Viruses in deep-sea cold seep sediments harbor diverse survival mechanisms and remain genetically conserved within species. THE ISME JOURNAL 2023; 17:1774-1784. [PMID: 37573455 PMCID: PMC10504277 DOI: 10.1038/s41396-023-01491-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 08/01/2023] [Accepted: 08/01/2023] [Indexed: 08/14/2023]
Abstract
Deep sea cold seep sediments have been discovered to harbor novel, abundant, and diverse bacterial and archaeal viruses. However, little is known about viral genetic features and evolutionary patterns in these environments. Here, we examined the evolutionary ecology of viruses across active and extinct seep stages in the area of Haima cold seeps in the South China Sea. A total of 338 viral operational taxonomic units are identified and linked to 36 bacterial and archaeal phyla. The dynamics of host-virus interactions are informed by diverse antiviral defense systems across 43 families found in 487 microbial genomes. Cold seep viruses are predicted to harbor diverse adaptive strategies to persist in this environment, including counter-defense systems, auxiliary metabolic genes, reverse transcriptases, and alternative genetic code assignments. Extremely low nucleotide diversity is observed in cold seep viral populations, being influenced by factors including microbial host, sediment depth, and cold seep stage. Most cold seep viral genes are under strong purifying selection with trajectories that differ depending on whether cold seeps are active or extinct. This work sheds light on the understanding of environmental adaptation mechanisms and evolutionary patterns of viruses in the sub-seafloor biosphere.
Collapse
Affiliation(s)
- Yongyi Peng
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen, 361005, China
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519082, China
| | - Zijian Lu
- South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510301, China
| | - Donald Pan
- School of Oceanography, Shanghai Jiao Tong University, Shanghai, 200030, China
| | - Ling-Dong Shi
- College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zhao Zhao
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519082, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China
| | - Qing Liu
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519082, China
| | - Chuwen Zhang
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen, 361005, China
| | - Kuntong Jia
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519082, China
| | - Jiwei Li
- Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, 572000, China
| | - Casey R J Hubert
- Department of Biological Sciences, University of Calgary, Calgary, AB, T2N 1N4, Canada
| | - Xiyang Dong
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen, 361005, China.
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China.
| |
Collapse
|
6
|
Genomes from Uncultivated Pelagiphages Reveal Multiple Phylogenetic Clades Exhibiting Extensive Auxiliary Metabolic Genes and Cross-Family Multigene Transfers. mSystems 2022; 7:e0152221. [PMID: 35972150 PMCID: PMC9599517 DOI: 10.1128/msystems.01522-21] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
For the abundant marine Alphaproteobacterium Pelagibacter (SAR11), and other bacteria, phages are powerful forces of mortality. However, little is known about the most abundant Pelagiphages in nature, such as the widespread HTVC023P-type, which is currently represented by two cultured phages. Using viral metagenomic data sets and fluorescence-activated cell sorting, we recovered 80 complete, undescribed Podoviridae genomes that form 10 phylogenomically distinct clades (herein, named Clades I to X) related to the HTVC023P-type. These expanded the HTVC023P-type pan-genome by 15-fold and revealed 41 previously unknown auxiliary metabolic genes (AMGs) in this viral lineage. Numerous instances of partner-AMGs (colocated and involved in related functions) were observed, including partners in nucleotide metabolism, DNA hypermodification, and Curli biogenesis. The Type VIII secretion system (T8SS) responsible for Curli biogenesis was identified in nine genomes and expanded the repertoire of T8SS proteins reported thus far in viruses. Additionally, the identified T8SS gene cluster contained an iron-dependent regulator (FecR), as well as a histidine kinase and adenylate cyclase that can be implicated in T8SS function but are not within T8SS operons in bacteria. While T8SS are lacking in known Pelagibacter, they contribute to aggregation and biofilm formation in other bacteria. Phylogenetic reconstructions of partner-AMGs indicate derivation from cellular lineages with a more recent transfer between viral families. For example, homologs of all T8SS genes are present in syntenic regions of distant Myoviridae Pelagiphages, and they appear to have alphaproteobacterial origins with a later transfer between viral families. The results point to an unprecedented multipartner-AMG transfer between marine Myoviridae and Podoviridae. Together with the expansion of known metabolic functions, our studies provide new prospects for understanding the ecology and evolution of marine phages and their hosts. IMPORTANCE One of the most abundant and diverse marine bacterial groups is Pelagibacter. Phages have roles in shaping Pelagibacter ecology; however, several Pelagiphage lineages are represented by only a few genomes. This paucity of data from even the most widespread lineages has imposed limits on the understanding of the diversity of Pelagiphages and their impacts on hosts. Here, we report 80 complete genomes, assembled directly from environmental data, which are from undescribed Pelagiphages and render new insights into the manipulation of host metabolism during infection. Notably, the viruses have functionally related partner genes that appear to be transferred between distant viruses, including a suite that encode a secretion system which both brings a new functional capability to the host and is abundant in phages across the ocean. Together, these functions have important implications for phage evolution and for how Pelagiphage infection influences host biology in manners extending beyond canonical viral lysis and mortality.
Collapse
|
7
|
Martinez-Hernandez F, Fornas O, Martinez-Garcia M. Into the Dark: Exploring the Deep Ocean with Single-Virus Genomics. Viruses 2022; 14:1589. [PMID: 35891567 PMCID: PMC9322844 DOI: 10.3390/v14071589] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 07/15/2022] [Accepted: 07/18/2022] [Indexed: 12/03/2022] Open
Abstract
Single-virus genomics (SVGs) has been successfully applied to ocean surface samples allowing the discovery of widespread dominant viruses overlooked for years by metagenomics, such as the uncultured virus vSAG 37-F6 infecting the ubiquitous Pelagibacter spp. In SVGs, one uncultured virus at a time is sorted from the environmental sample, whole-genome amplified, and sequenced. Here, we have applied SVGs to deep-ocean samples (200-4000 m depth) from global Malaspina and MEDIMAX expeditions, demonstrating the feasibility of this method in deep-ocean samples. A total of 1328 virus-like particles were sorted from the North Atlantic Ocean, the deep Mediterranean Sea, and the Pacific Ocean oxygen minimum zone (OMZ). For this proof of concept, sixty single viruses were selected at random for sequencing. Genome annotation identified 27 of these genomes as bona fide viruses, and detected three auxiliary metabolic genes involved in nucleotide biosynthesis and sugar metabolism. Massive protein profile analysis confirmed that these viruses represented novel viral groups not present in databases. Although they were not previously assembled by viromics, global fragment recruitment analysis showed a conserved profile of relative abundance of these viruses in all analyzed samples spanning different oceans. Altogether, these results reveal the feasibility in using SVGs in this vast environment to unveil the genomes of relevant viruses.
Collapse
Affiliation(s)
| | - Oscar Fornas
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology (BIST), PRBB Building, 08003 Barcelona, Spain;
| | - Manuel Martinez-Garcia
- Department of Physiology, Genetics and Microbiology, University of Alicante, 03690 Alicante, Spain;
| |
Collapse
|