1
|
Moeckel C, Mareboina M, Konnaris MA, Chan CS, Mouratidis I, Montgomery A, Chantzi N, Pavlopoulos GA, Georgakopoulos-Soares I. A survey of k-mer methods and applications in bioinformatics. Comput Struct Biotechnol J 2024; 23:2289-2303. [PMID: 38840832 PMCID: PMC11152613 DOI: 10.1016/j.csbj.2024.05.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 05/14/2024] [Accepted: 05/15/2024] [Indexed: 06/07/2024] Open
Abstract
The rapid progression of genomics and proteomics has been driven by the advent of advanced sequencing technologies, large, diverse, and readily available omics datasets, and the evolution of computational data processing capabilities. The vast amount of data generated by these advancements necessitates efficient algorithms to extract meaningful information. K-mers serve as a valuable tool when working with large sequencing datasets, offering several advantages in computational speed and memory efficiency and carrying the potential for intrinsic biological functionality. This review provides an overview of the methods, applications, and significance of k-mers in genomic and proteomic data analyses, as well as the utility of absent sequences, including nullomers and nullpeptides, in disease detection, vaccine development, therapeutics, and forensic science. Therefore, the review highlights the pivotal role of k-mers in addressing current genomic and proteomic problems and underscores their potential for future breakthroughs in research.
Collapse
Affiliation(s)
- Camille Moeckel
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Manvita Mareboina
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Maxwell A. Konnaris
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Candace S.Y. Chan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Penn State University, University Park, Pennsylvania, USA
| | - Austin Montgomery
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Nikol Chantzi
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | | | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Penn State University, University Park, Pennsylvania, USA
| |
Collapse
|
2
|
Rahimian M, Panahi B. Metagenome sequence data mining for viral interaction studies: Review on progress and prospects. Virus Res 2024; 349:199450. [PMID: 39151562 PMCID: PMC11388672 DOI: 10.1016/j.virusres.2024.199450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2024] [Revised: 08/11/2024] [Accepted: 08/13/2024] [Indexed: 08/19/2024]
Abstract
Metagenomics has been greatly accelerated by the development of next-generation sequencing (NGS) technologies, which allow scientists to discover and describe novel microorganisms without the need for conventional culture techniques. Examining integrative bioinformatics methods used in viral interaction research, this study highlights metagenomic data from various contexts. Accurate viral identification depends on high-purity genetic material extraction, appropriate NGS platform selection, and sophisticated bioinformatics tools like VirPipe and VirFinder. The efficiency and precision of metagenomic analysis are further improved with the advent of AI-based techniques. The diversity and dynamics of viral communities are demonstrated by case studies from a variety of environments, emphasizing the seasonal and geographical variations that influence viral populations. In addition to speeding up the discovery of new viruses, metagenomics offers thorough understanding of virus-host interactions and their ecological effects. This review provides a promising framework for comprehending the complexity of viral communities and their interactions with hosts, highlighting the transformational potential of metagenomics and bioinformatics in viral research.
Collapse
Affiliation(s)
- Mohammadreza Rahimian
- Department of Biology, Faculty of Basic Sciences, University of Maragheh, Maragheh, Iran
| | - Bahman Panahi
- Department of Genomics, Branch for Northwest & West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran.
| |
Collapse
|
3
|
Brait N, Hackl T, Morel C, Exbrayat A, Gutierrez S, Lequime S. A tale of caution: How endogenous viral elements affect virus discovery in transcriptomic data. Virus Evol 2023; 10:vead088. [PMID: 38516656 PMCID: PMC10956553 DOI: 10.1093/ve/vead088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Revised: 11/24/2023] [Accepted: 12/22/2023] [Indexed: 03/23/2024] Open
Abstract
Large-scale metagenomic and -transcriptomic studies have revolutionized our understanding of viral diversity and abundance. In contrast, endogenous viral elements (EVEs), remnants of viral sequences integrated into host genomes, have received limited attention in the context of virus discovery, especially in RNA-Seq data. EVEs resemble their original viruses, a challenge that makes distinguishing between active infections and integrated remnants difficult, affecting virus classification and biases downstream analyses. Here, we systematically assess the effects of EVEs on a prototypical virus discovery pipeline, evaluate their impact on data integrity and classification accuracy, and provide some recommendations for better practices. We examined EVEs and exogenous viral sequences linked to Orthomyxoviridae, a diverse family of negative-sense segmented RNA viruses, in 13 genomic and 538 transcriptomic datasets of Culicinae mosquitoes. Our analysis revealed a substantial number of viral sequences in transcriptomic datasets. However, a significant portion appeared not to be exogenous viruses but transcripts derived from EVEs. Distinguishing between transcribed EVEs and exogenous virus sequences was especially difficult in samples with low viral abundance. For example, three transcribed EVEs showed full-length segments, devoid of frameshift and nonsense mutations, exhibiting sufficient mean read depths that qualify them as exogenous virus hits. Mapping reads on a host genome containing EVEs before assembly somewhat alleviated the EVE burden, but it led to a drastic reduction of viral hits and reduced quality of assemblies, especially in regions of the viral genome relatively similar to EVEs. Our study highlights that our knowledge of the genetic diversity of viruses can be altered by the underestimated presence of EVEs in transcriptomic datasets, leading to false positives and altered or missing sequence information. Thus, recognizing and addressing the influence of EVEs in virus discovery pipelines will be key in enhancing our ability to capture the full spectrum of viral diversity.
Collapse
Affiliation(s)
- Nadja Brait
- Cluster of Microbial Ecology, Groningen Institute for Evolutionary Life Sciences, University of Groningen, Groningen 9747 AG, The Netherlands
| | | | - Côme Morel
- ASTRE research unit, Cirad, INRAe, Université de Montpellier, Montpellier 34398, France
| | - Antoni Exbrayat
- ASTRE research unit, Cirad, INRAe, Université de Montpellier, Montpellier 34398, France
| | - Serafin Gutierrez
- ASTRE research unit, Cirad, INRAe, Université de Montpellier, Montpellier 34398, France
| | - Sebastian Lequime
- Cluster of Microbial Ecology, Groningen Institute for Evolutionary Life Sciences, University of Groningen, Groningen 9747 AG, The Netherlands
| |
Collapse
|
4
|
Brinch ML, Hald T, Wainaina L, Merlotti A, Remondini D, Henri C, Njage PMK. Comparison of Source Attribution Methodologies for Human Campylobacteriosis. Pathogens 2023; 12:786. [PMID: 37375476 DOI: 10.3390/pathogens12060786] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 05/09/2023] [Accepted: 05/10/2023] [Indexed: 06/29/2023] Open
Abstract
Campylobacter spp. are the most common cause of bacterial gastrointestinal infection in humans both in Denmark and worldwide. Studies have found microbial subtyping to be a powerful tool for source attribution, but comparisons of different methodologies are limited. In this study, we compare three source attribution approaches (Machine Learning, Network Analysis, and Bayesian modeling) using three types of whole genome sequences (WGS) data inputs (cgMLST, 5-Mers and 7-Mers). We predicted and compared the sources of human campylobacteriosis cases in Denmark. Using 7mer as an input feature provided the best model performance. The network analysis algorithm had a CSC value of 78.99% and an F1-score value of 67%, while the machine-learning algorithm showed the highest accuracy (98%). The models attributed between 965 and all of the 1224 human cases to a source (network applying 5mer and machine learning applying 7mer, respectively). Chicken from Denmark was the primary source of human campylobacteriosis with an average percentage probability of attribution of 45.8% to 65.4%, representing Bayesian with 7mer and machine learning with cgMLST, respectively. Our results indicate that the different source attribution methodologies based on WGS have great potential for the surveillance and source tracking of Campylobacter. The results of such models may support decision makers to prioritize and target interventions.
Collapse
Affiliation(s)
- Maja Lykke Brinch
- Research Group for Foodborne Pathogens and Epidemiology, National Food Institute, Technical University of Denmark, 2800 Kongens Lyngby, Denmark
| | - Tine Hald
- Research Group for Foodborne Pathogens and Epidemiology, National Food Institute, Technical University of Denmark, 2800 Kongens Lyngby, Denmark
| | - Lynda Wainaina
- Department of Mathematics, University of Padova, 35121 Padova, Italy
| | - Alessandra Merlotti
- Department of Physics and Astronomy, University of Bologna, 40126 Bologna, Italy
| | - Daniel Remondini
- Department of Physics and Astronomy, University of Bologna, 40126 Bologna, Italy
| | - Clementine Henri
- Research Group for Foodborne Pathogens and Epidemiology, National Food Institute, Technical University of Denmark, 2800 Kongens Lyngby, Denmark
| | - Patrick Murigu Kamau Njage
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, 2800 Kongens Lyngby, Denmark
| |
Collapse
|
5
|
Plyusnin I, Vapalahti O, Sironen T, Kant R, Smura T. Enhanced Viral Metagenomics with Lazypipe 2. Viruses 2023; 15:v15020431. [PMID: 36851645 PMCID: PMC9960287 DOI: 10.3390/v15020431] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 01/29/2023] [Accepted: 01/31/2023] [Indexed: 02/08/2023] Open
Abstract
Viruses are the main agents causing emerging and re-emerging infectious diseases. It is therefore important to screen for and detect them and uncover the evolutionary processes that support their ability to jump species boundaries and establish themselves in new hosts. Metagenomic next-generation sequencing (mNGS) is a high-throughput, impartial technology that has enabled virologists to detect either known or novel, divergent viruses from clinical, animal, wildlife and environmental samples, with little a priori assumptions. mNGS is heavily dependent on bioinformatic analysis, with an emerging demand for integrated bioinformatic workflows. Here, we present Lazypipe 2, an updated mNGS pipeline with, as compared to Lazypipe1, significant improvements in code stability and transparency, with added functionality and support for new software components. We also present extensive benchmarking results, including evaluation of a novel canine simulated metagenome, precision and recall of virus detection at varying sequencing depth, and a low to extremely low proportion of viral genetic material. Additionally, we report accuracy of virus detection with two strategies: homology searches using nucleotide or amino acid sequences. We show that Lazypipe 2 with nucleotide-based annotation approaches near perfect detection for eukaryotic viruses and, in terms of accuracy, outperforms the compared pipelines. We also discuss the importance of homology searches with amino acid sequences for the detection of highly divergent novel viruses.
Collapse
Affiliation(s)
- Ilya Plyusnin
- Department of Veterinary Biosciences, University of Helsinki, 00014 Helsinki, Finland
- Department of Virology, University of Helsinki, 00014 Helsinki, Finland
- Correspondence:
| | - Olli Vapalahti
- Department of Veterinary Biosciences, University of Helsinki, 00014 Helsinki, Finland
- Department of Virology, University of Helsinki, 00014 Helsinki, Finland
- HUS Diagnostic Center, Clinical Microbiology, Helsinki University Hospital, University of Helsinki, 00029 Helsinki, Finland
| | - Tarja Sironen
- Department of Veterinary Biosciences, University of Helsinki, 00014 Helsinki, Finland
- Department of Virology, University of Helsinki, 00014 Helsinki, Finland
| | - Ravi Kant
- Department of Veterinary Biosciences, University of Helsinki, 00014 Helsinki, Finland
- Department of Virology, University of Helsinki, 00014 Helsinki, Finland
- Department of Tropical Parasitology, Institute of Maritime and Tropical Medicine, Medical University of Gdansk, 81-519 Gdynia, Poland
| | - Teemu Smura
- Department of Virology, University of Helsinki, 00014 Helsinki, Finland
- HUS Diagnostic Center, Clinical Microbiology, Helsinki University Hospital, University of Helsinki, 00029 Helsinki, Finland
| |
Collapse
|
6
|
Nguyen M, Wemheuer B, Laffy PW, Webster NS, Thomas T. Taxonomic, functional and expression analysis of viral communities associated with marine sponges. PeerJ 2021; 9:e10715. [PMID: 33604175 PMCID: PMC7863781 DOI: 10.7717/peerj.10715] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 12/15/2020] [Indexed: 12/19/2022] Open
Abstract
Viruses play an essential role in shaping the structure and function of ecological communities. Marine sponges have the capacity to filter large volumes of ‘virus-laden’ seawater through their bodies and host dense communities of microbial symbionts, which are likely accessible to viral infection. However, despite the potential of sponges and their symbionts to act as viral reservoirs, little is known about the sponge-associated virome. Here we address this knowledge gap by analysing metagenomic and (meta-) transcriptomic datasets from several sponge species to determine what viruses are present and elucidate their predicted and expressed functionality. Sponges were found to carry diverse, abundant and active bacteriophages as well as eukaryotic viruses belonging to the Megavirales and Phycodnaviridae. These viruses contain and express auxiliary metabolic genes (AMGs) for photosynthesis and vitamin synthesis as well as for the production of antimicrobials and the defence against toxins. These viral AMGs can therefore contribute to the metabolic capacities of their hosts and also potentially enhance the survival of infected cells. This suggest that viruses may play a key role in regulating the abundance and activities of members of the sponge holobiont.
Collapse
Affiliation(s)
- Mary Nguyen
- Centre for Marine Science and Innovation & School of Biological & Earth and Environmental Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Bernd Wemheuer
- Centre for Marine Science and Innovation & School of Biological & Earth and Environmental Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Patrick W Laffy
- Australian Institute of Marine Science, Townsville, QLD, Australia
| | - Nicole S Webster
- Australian Institute of Marine Science, Townsville, QLD, Australia.,Australian Centre for Ecogenomics, University of Queensland, Brisbane, QLD, Australia
| | - Torsten Thomas
- Centre for Marine Science and Innovation & School of Biological & Earth and Environmental Sciences, University of New South Wales, Sydney, NSW, Australia
| |
Collapse
|
7
|
Plyusnin I, Kant R, Jääskeläinen AJ, Sironen T, Holm L, Vapalahti O, Smura T. Novel NGS pipeline for virus discovery from a wide spectrum of hosts and sample types. Virus Evol 2020; 6:veaa091. [PMID: 33408878 PMCID: PMC7772471 DOI: 10.1093/ve/veaa091] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
The study of the microbiome data holds great potential for elucidating the biological and metabolic functioning of living organisms and their role in the environment. Metagenomic analyses have shown that humans, along with for example, domestic animals, wildlife and arthropods, are colonized by an immense community of viruses. The current Coronavirus pandemic (COVID-19) heightens the need to rapidly detect previously unknown viruses in an unbiased way. The increasing availability of metagenomic data in this era of next-generation sequencing (NGS), along with increasingly affordable sequencing technologies, highlight the need for reliable and comprehensive methods to manage such data. In this article, we present a novel bioinformatics pipeline called LAZYPIPE for identifying both previously known and novel viruses in host associated or environmental samples and give examples of virus discovery based on it. LAZYPIPE is a Unix-based pipeline for automated assembling and taxonomic profiling of NGS libraries implemented as a collection of C++, Perl, and R scripts.
Collapse
Affiliation(s)
- Ilya Plyusnin
- Institute of Biotechnology, University of Helsinki, Helsinki 00014, Finland
| | - Ravi Kant
- Department of Veterinary Bioscience, University of Helsinki, Helsinki 00014, Finland
| | - Anne J Jääskeläinen
- Department of Virology and Immunology, University of Helsinki and Helsinki University Hospital, Helsinki 00014, Finland
| | - Tarja Sironen
- Department of Veterinary Bioscience, University of Helsinki, Helsinki 00014, Finland
| | - Liisa Holm
- Institute of Biotechnology, University of Helsinki, Helsinki 00014, Finland
| | - Olli Vapalahti
- Department of Veterinary Bioscience, University of Helsinki, Helsinki 00014, Finland
| | - Teemu Smura
- Department of Virology, University of Helsinki, Helsinki 00014, Finland
| |
Collapse
|
8
|
Nieuwenhuijse DF, Oude Munnink BB, Phan MVT, Munk P, Venkatakrishnan S, Aarestrup FM, Cotten M, Koopmans MPG. Setting a baseline for global urban virome surveillance in sewage. Sci Rep 2020; 10:13748. [PMID: 32792677 PMCID: PMC7426863 DOI: 10.1038/s41598-020-69869-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 06/29/2020] [Indexed: 11/09/2022] Open
Abstract
The rapid development of megacities, and their growing connectedness across the world is becoming a distinct driver for emerging disease outbreaks. Early detection of unusual disease emergence and spread should therefore include such cities as part of risk-based surveillance. A catch-all metagenomic sequencing approach of urban sewage could potentially provide an unbiased insight into the dynamics of viral pathogens circulating in a community irrespective of access to care, a potential which already has been proven for the surveillance of poliovirus. Here, we present a detailed characterization of sewage viromes from a snapshot of 81 high density urban areas across the globe, including in-depth assessment of potential biases, as a proof of concept for catch-all viral pathogen surveillance. We show the ability to detect a wide range of viruses and geographical and seasonal differences for specific viral groups. Our findings offer a cross-sectional baseline for further research in viral surveillance from urban sewage samples and place previous studies in a global perspective.
Collapse
Affiliation(s)
| | - Bas B Oude Munnink
- Viroscience Department, Erasmus Medical Center, Rotterdam, The Netherlands
| | - My V T Phan
- Viroscience Department, Erasmus Medical Center, Rotterdam, The Netherlands
| | | | - Patrick Munk
- National Food Institute, Technical University of Denmark, Lyngby, Denmark
| | | | - Frank M Aarestrup
- National Food Institute, Technical University of Denmark, Lyngby, Denmark
| | - Matthew Cotten
- Viroscience Department, Erasmus Medical Center, Rotterdam, The Netherlands
| | | |
Collapse
|
9
|
Detecting viral sequences in NGS data. Curr Opin Virol 2019; 39:41-48. [DOI: 10.1016/j.coviro.2019.07.010] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Revised: 07/29/2019] [Accepted: 07/30/2019] [Indexed: 01/03/2023]
|
10
|
Modha S, Hughes J, Bianco G, Ferguson HM, Helm B, Tong L, Wilkie GS, Kohl A, Schnettler E. Metaviromics Reveals Unknown Viral Diversity in the Biting Midge Culicoides impunctatus. Viruses 2019; 11:v11090865. [PMID: 31533247 PMCID: PMC6784199 DOI: 10.3390/v11090865] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 09/11/2019] [Accepted: 09/14/2019] [Indexed: 12/15/2022] Open
Abstract
Biting midges (Culicoides species) are vectors of arboviruses and were responsible for the emergence and spread of Schmallenberg virus (SBV) in Europe in 2011 and are likely to be involved in the emergence of other arboviruses in Europe. Improved surveillance and better understanding of risks require a better understanding of the circulating viral diversity in these biting insects. In this study, we expand the sequence space of RNA viruses by identifying a number of novel RNA viruses from Culicoides impunctatus (biting midge) using a meta-transcriptomic approach. A novel metaviromic pipeline called MetaViC was developed specifically to identify novel virus sequence signatures from high throughput sequencing (HTS) datasets in the absence of a known host genome. MetaViC is a protein centric pipeline that looks for specific protein signatures in the reads and contigs generated as part of the pipeline. Several novel viruses, including an alphanodavirus with both segments, a novel relative of the Hubei sobemo-like virus 49, two rhabdo-like viruses and a chuvirus, were identified in the Scottish midge samples. The newly identified viruses were found to be phylogenetically distinct to those previous known. These findings expand our current knowledge of viral diversity in arthropods and especially in these understudied disease vectors.
Collapse
Affiliation(s)
- Sejal Modha
- MRC-University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK.
| | - Joseph Hughes
- MRC-University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK
| | - Giovanni Bianco
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow G12 8QQ, UK
| | - Heather M Ferguson
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow G12 8QQ, UK
| | - Barbara Helm
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow G12 8QQ, UK
| | - Lily Tong
- MRC-University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK
| | - Gavin S Wilkie
- MRC-University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK
| | - Alain Kohl
- MRC-University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK
| | - Esther Schnettler
- MRC-University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK.
| |
Collapse
|
11
|
Garretto A, Hatzopoulos T, Putonti C. virMine: automated detection of viral sequences from complex metagenomic samples. PeerJ 2019; 7:e6695. [PMID: 30993039 PMCID: PMC6462185 DOI: 10.7717/peerj.6695] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Accepted: 02/26/2019] [Indexed: 12/29/2022] Open
Abstract
Metagenomics has enabled sequencing of viral communities from a myriad of different environments. Viral metagenomic studies routinely uncover sequences with no recognizable homology to known coding regions or genomes. Nevertheless, complete viral genomes have been constructed directly from complex community metagenomes, often through tedious manual curation. To address this, we developed the software tool virMine to identify viral genomes from raw reads representative of viral or mixed (viral and bacterial) communities. virMine automates sequence read quality control, assembly, and annotation. Researchers can easily refine their search for a specific study system and/or feature(s) of interest. In contrast to other viral genome detection tools that often rely on the recognition of viral signature sequences, virMine is not restricted by the insufficient representation of viral diversity in public data repositories. Rather, viral genomes are identified through an iterative approach, first omitting non-viral sequences. Thus, both relatives of previously characterized viruses and novel species can be detected, including both eukaryotic viruses and bacteriophages. Here we present virMine and its analysis of synthetic communities as well as metagenomic data sets from three distinctly different environments: the gut microbiota, the urinary microbiota, and freshwater viromes. Several new viral genomes were identified and annotated, thus contributing to our understanding of viral genetic diversity in these three environments.
Collapse
Affiliation(s)
- Andrea Garretto
- Bioinformatics Program, Loyola University of Chicago, Chicago, IL, United States of America
| | - Thomas Hatzopoulos
- Department of Computer Science, Loyola University of Chicago, Chicago, IL, United States of America
| | - Catherine Putonti
- Bioinformatics Program, Loyola University of Chicago, Chicago, IL, United States of America.,Department of Computer Science, Loyola University of Chicago, Chicago, IL, United States of America.,Department of Biology, Loyola University of Chicago, Chicago, IL, United States of America.,Department of Microbiology and Immunology, Loyola University of Chicago, Maywood, IL, United States of America
| |
Collapse
|
12
|
Sharma A, Schmidt M, Kiesel B, Mahato NK, Cralle L, Singh Y, Richnow HH, Gilbert JA, Arnold W, Lal R. Bacterial and Archaeal Viruses of Himalayan Hot Springs at Manikaran Modulate Host Genomes. Front Microbiol 2018; 9:3095. [PMID: 30619174 PMCID: PMC6302217 DOI: 10.3389/fmicb.2018.03095] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Accepted: 11/29/2018] [Indexed: 11/30/2022] Open
Abstract
Hot spring-associated viruses, particularly the archaeal viruses, remain under-examined compared to bacteriophages. Previous metagenomic studies of the Manikaran hot springs in India suggested an abundance of viral DNA, which prompted us to examine the virus–host (bacterial and archaeal) interactions in sediment and microbial mat samples collected from the thermal discharges. Here, we characterize the viruses (both bacterial and archaeal) from this Himalayan hot spring using both metagenomics assembly and electron microscopy. We utilized four shotgun samples from sediment (78–98°C) and two from microbial mats (50°C) to reconstruct 65 bacteriophage genomes (24–200 kb). We also identified 59 archaeal viruses that were notably abundant across the sediment samples. Whole-genome analyses of the reconstructed bacteriophage genomes revealed greater genomic conservation in sediments (65%) compared to microbial mats (49%). However, a minimal phage genome was still maintained across both sediment and microbial mats suggesting a common origin. To complement the metagenomic data, scanning-electron and helium-ion microscopy were used to reveal diverse morphotypes of Caudovirales and archaeal viruses. The genome level annotations provide further evidence for gene-level exchange between virus and host in these hot springs, and augments our knowledgebase for bacteriophages, archaeal viruses and Clustered Regularly Interspaced Short Palindromic Repeat cassettes, which provide a critical resource for studying viromes in extreme natural environments.
Collapse
Affiliation(s)
- Anukriti Sharma
- Department of Zoology, University of Delhi, New Delhi, India.,Biosciences Division, Argonne National Laboratory, Lemont, IL, United States.,Department of Surgery, University of Chicago, Chicago, IL, United States
| | - Matthias Schmidt
- Department of Isotope Biogeochemistry, Helmholtz Centre for Environmental Research, Leipzig, Germany
| | - Bärbel Kiesel
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research, Leipzig, Germany
| | - Nitish K Mahato
- Department of Zoology, University of Delhi, New Delhi, India
| | - Lauren Cralle
- Biosciences Division, Argonne National Laboratory, Lemont, IL, United States.,Department of Surgery, University of Chicago, Chicago, IL, United States
| | - Yogendra Singh
- Department of Zoology, University of Delhi, New Delhi, India
| | - Hans H Richnow
- Department of Isotope Biogeochemistry, Helmholtz Centre for Environmental Research, Leipzig, Germany
| | - Jack A Gilbert
- Biosciences Division, Argonne National Laboratory, Lemont, IL, United States.,Department of Surgery, University of Chicago, Chicago, IL, United States
| | - Wyatt Arnold
- Department of Surgery, University of Chicago, Chicago, IL, United States
| | - Rup Lal
- Department of Zoology, University of Delhi, New Delhi, India
| |
Collapse
|
13
|
Kruppa J, Jo WK, van der Vries E, Ludlow M, Osterhaus A, Baumgaertner W, Jung K. Virus detection in high-throughput sequencing data without a reference genome of the host. INFECTION GENETICS AND EVOLUTION 2018; 66:180-187. [DOI: 10.1016/j.meegid.2018.09.026] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Revised: 09/25/2018] [Accepted: 09/27/2018] [Indexed: 01/19/2023]
|
14
|
Watkins SC, Sible E, Putonti C. Pseudomonas PB1-Like Phages: Whole Genomes from Metagenomes Offer Insight into an Abundant Group of Bacteriophages. Viruses 2018; 10:v10060331. [PMID: 29914169 PMCID: PMC6024596 DOI: 10.3390/v10060331] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Revised: 06/11/2018] [Accepted: 06/11/2018] [Indexed: 02/07/2023] Open
Abstract
Despite the abundance, ubiquity and impact of environmental viruses, their inherent genomic plasticity and extreme diversity pose significant challenges for the examination of bacteriophages on Earth. Viral metagenomic studies have offered insight into broader aspects of phage ecology and repeatedly uncover genes to which we are currently unable to assign function. A combined effort of phage isolation and metagenomic survey of Chicago’s nearshore waters of Lake Michigan revealed the presence of Pbunaviruses, relatives of the Pseudomonas phage PB1. This prompted our expansive investigation of PB1-like phages. Genomic signatures of PB1-like phages and Pbunaviruses were identified, permitting the unambiguous distinction between the presence/absence of these phages in soils, freshwater and wastewater samples, as well as publicly available viral metagenomic datasets. This bioinformatic analysis led to the de novo assembly of nine novel PB1-like phage genomes from a metagenomic survey of samples collected from Lake Michigan. While this study finds that Pbunaviruses are abundant in various environments of Northern Illinois, genomic variation also exists to a considerable extent within individual communities.
Collapse
Affiliation(s)
- Siobhan C Watkins
- Department of Biology, Loyola University Chicago, Chicago, IL 60660, USA.
| | - Emily Sible
- Department of Biology, Loyola University Chicago, Chicago, IL 60660, USA.
| | - Catherine Putonti
- Department of Biology, Loyola University Chicago, Chicago, IL 60660, USA.
- Department of Computer Science, Loyola University Chicago, Chicago, IL 60660, USA.
- Bioinformatics Program, Loyola University Chicago, Chicago, IL 60660, USA.
| |
Collapse
|
15
|
Nooij S, Schmitz D, Vennema H, Kroneman A, Koopmans MPG. Overview of Virus Metagenomic Classification Methods and Their Biological Applications. Front Microbiol 2018; 9:749. [PMID: 29740407 PMCID: PMC5924777 DOI: 10.3389/fmicb.2018.00749] [Citation(s) in RCA: 83] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Accepted: 04/03/2018] [Indexed: 12/20/2022] Open
Abstract
Metagenomics poses opportunities for clinical and public health virology applications by offering a way to assess complete taxonomic composition of a clinical sample in an unbiased way. However, the techniques required are complicated and analysis standards have yet to develop. This, together with the wealth of different tools and workflows that have been proposed, poses a barrier for new users. We evaluated 49 published computational classification workflows for virus metagenomics in a literature review. To this end, we described the methods of existing workflows by breaking them up into five general steps and assessed their ease-of-use and validation experiments. Performance scores of previous benchmarks were summarized and correlations between methods and performance were investigated. We indicate the potential suitability of the different workflows for (1) time-constrained diagnostics, (2) surveillance and outbreak source tracing, (3) detection of remote homologies (discovery), and (4) biodiversity studies. We provide two decision trees for virologists to help select a workflow for medical or biodiversity studies, as well as directions for future developments in clinical viral metagenomics.
Collapse
Affiliation(s)
- Sam Nooij
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands.,Viroscience Laboratory, Erasmus University Medical Centre, Rotterdam, Netherlands
| | - Dennis Schmitz
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands.,Viroscience Laboratory, Erasmus University Medical Centre, Rotterdam, Netherlands
| | - Harry Vennema
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Annelies Kroneman
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Marion P G Koopmans
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands.,Viroscience Laboratory, Erasmus University Medical Centre, Rotterdam, Netherlands
| |
Collapse
|
16
|
Nieuwenhuijse DF, Koopmans MPG. Metagenomic Sequencing for Surveillance of Food- and Waterborne Viral Diseases. Front Microbiol 2017; 8:230. [PMID: 28261185 PMCID: PMC5309255 DOI: 10.3389/fmicb.2017.00230] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Accepted: 02/01/2017] [Indexed: 12/25/2022] Open
Abstract
A plethora of viruses can be transmitted by the food- and waterborne route. However, their recognition is challenging because of the variety of viruses, heterogeneity of symptoms, the lack of awareness of clinicians, and limited surveillance efforts. Classical food- and waterborne viral disease outbreaks are mainly caused by caliciviruses, but the source of the virus is often not known and the foodborne mode of transmission is difficult to discriminate from human-to-human transmission. Atypical food- and waterborne viral disease can be caused by viruses such as hepatitis A and hepatitis E. In addition, a source of novel emerging viruses with a potential to spread via the food- and waterborne route is the repeated interaction of humans with wildlife. Wildlife-to-human adaptation may give rise to self- limiting outbreaks in some cases, but when fully adjusted to the human host can be devastating. Metagenomic sequencing has been investigated as a promising solution for surveillance purposes as it detects all viruses in a single protocol, delivers additional genomic information for outbreak tracing, and detects novel unknown viruses. Nevertheless, several issues must be addressed to apply metagenomic sequencing in surveillance. First, sample preparation is difficult since the genomic material of viruses is generally overshadowed by host- and bacterial genomes. Second, several data analysis issues hamper the efficient, robust, and automated processing of metagenomic data. Third, interpretation of metagenomic data is hard, because of the lack of general knowledge of the virome in the food chain and the environment. Further developments in virus-specific nucleic acid extraction methods, bioinformatic data processing applications, and unifying data visualization tools are needed to gain insightful surveillance knowledge from suspect food samples.
Collapse
|
17
|
Contreras AV, Cocom-Chan B, Hernandez-Montes G, Portillo-Bobadilla T, Resendis-Antonio O. Host-Microbiome Interaction and Cancer: Potential Application in Precision Medicine. Front Physiol 2016; 7:606. [PMID: 28018236 PMCID: PMC5145879 DOI: 10.3389/fphys.2016.00606] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Accepted: 11/21/2016] [Indexed: 12/19/2022] Open
Abstract
It has been experimentally shown that host-microbial interaction plays a major role in shaping the wellness or disease of the human body. Microorganisms coexisting in human tissues provide a variety of benefits that contribute to proper functional activity in the host through the modulation of fundamental processes such as signal transduction, immunity and metabolism. The unbalance of this microbial profile, or dysbiosis, has been correlated with the genesis and evolution of complex diseases such as cancer. Although this latter disease has been thoroughly studied using different high-throughput (HT) technologies, its heterogeneous nature makes its understanding and proper treatment in patients a remaining challenge in clinical settings. Notably, given the outstanding role of host-microbiome interactions, the ecological interactions with microorganisms have become a new significant aspect in the systems that can contribute to the diagnosis and potential treatment of solid cancers. As a part of expanding precision medicine in the area of cancer research, efforts aimed at effective treatments for various kinds of cancer based on the knowledge of genetics, biology of the disease and host-microbiome interactions might improve the prediction of disease risk and implement potential microbiota-directed therapeutics. In this review, we present the state of the art of sequencing and metabolome technologies, computational methods and schemes in systems biology that have addressed recent breakthroughs of uncovering relationships or associations between microorganisms and cancer. Together, microbiome studies extend the horizon of new personalized treatments against cancer from the perspective of precision medicine through a synergistic strategy integrating clinical knowledge, HT data, bioinformatics, and systems biology.
Collapse
Affiliation(s)
| | - Benjamin Cocom-Chan
- Instituto Nacional de Medicina GenómicaMexico City, Mexico; Human Systems Biology Laboratory, Instituto Nacional de Medicina GenómicaMexico City, Mexico
| | - Georgina Hernandez-Montes
- Coordinación de la Investigación Científica, Red de Apoyo a la Investigación-National Autonomous University of Mexico (UNAM) Mexico City, Mexico
| | - Tobias Portillo-Bobadilla
- Coordinación de la Investigación Científica, Red de Apoyo a la Investigación-National Autonomous University of Mexico (UNAM) Mexico City, Mexico
| | - Osbaldo Resendis-Antonio
- Instituto Nacional de Medicina GenómicaMexico City, Mexico; Human Systems Biology Laboratory, Instituto Nacional de Medicina GenómicaMexico City, Mexico; Coordinación de la Investigación Científica, Red de Apoyo a la Investigación-National Autonomous University of Mexico (UNAM)Mexico City, Mexico
| |
Collapse
|
18
|
Laffy PW, Wood-Charlson EM, Turaev D, Weynberg KD, Botté ES, van Oppen MJH, Webster NS, Rattei T. HoloVir: A Workflow for Investigating the Diversity and Function of Viruses in Invertebrate Holobionts. Front Microbiol 2016; 7:822. [PMID: 27375564 PMCID: PMC4899465 DOI: 10.3389/fmicb.2016.00822] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Accepted: 05/16/2016] [Indexed: 11/13/2022] Open
Abstract
Abundant bioinformatics resources are available for the study of complex microbial metagenomes, however their utility in viral metagenomics is limited. HoloVir is a robust and flexible data analysis pipeline that provides an optimized and validated workflow for taxonomic and functional characterization of viral metagenomes derived from invertebrate holobionts. Simulated viral metagenomes comprising varying levels of viral diversity and abundance were used to determine the optimal assembly and gene prediction strategy, and multiple sequence assembly methods and gene prediction tools were tested in order to optimize our analysis workflow. HoloVir performs pairwise comparisons of single read and predicted gene datasets against the viral RefSeq database to assign taxonomy and additional comparison to phage-specific and cellular markers is undertaken to support the taxonomic assignments and identify potential cellular contamination. Broad functional classification of the predicted genes is provided by assignment of COG microbial functional category classifications using EggNOG and higher resolution functional analysis is achieved by searching for enrichment of specific Swiss-Prot keywords within the viral metagenome. Application of HoloVir to viral metagenomes from the coral Pocillopora damicornis and the sponge Rhopaloeides odorabile demonstrated that HoloVir provides a valuable tool to characterize holobiont viral communities across species, environments, or experiments.
Collapse
Affiliation(s)
- Patrick W. Laffy
- Australian Institute of Marine ScienceTownsville, QLD, Australia
| | - Elisha M. Wood-Charlson
- Center for Microbial Oceanography: Research and Education, University of Hawai‘i at MānoaHonolulu, HI, USA
| | - Dmitrij Turaev
- Division of Computational Systems Biology, Department of Microbiology and Ecosystem Science, University of ViennaVienna, Austria
| | | | | | - Madeleine J. H. van Oppen
- Australian Institute of Marine ScienceTownsville, QLD, Australia
- School of Biosciences, University of MelbourneMelbourne, VIC, Australia
| | | | - Thomas Rattei
- Division of Computational Systems Biology, Department of Microbiology and Ecosystem Science, University of ViennaVienna, Austria
| |
Collapse
|
19
|
Turaev D, Rattei T. High definition for systems biology of microbial communities: metagenomics gets genome-centric and strain-resolved. Curr Opin Biotechnol 2016; 39:174-181. [PMID: 27115497 DOI: 10.1016/j.copbio.2016.04.011] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Revised: 04/08/2016] [Accepted: 04/12/2016] [Indexed: 11/28/2022]
Abstract
The systems biology of microbial communities, organismal communities inhabiting all ecological niches on earth, has in recent years been strongly facilitated by the rapid development of experimental, sequencing and data analysis methods. Novel experimental approaches and binning methods in metagenomics render the semi-automatic reconstructions of near-complete genomes of uncultivable bacteria possible, while advances in high-resolution amplicon analysis allow for efficient and less biased taxonomic community characterization. This will also facilitate predictive modeling approaches, hitherto limited by the low resolution of metagenomic data. In this review, we pinpoint the most promising current developments in metagenomics. They facilitate microbial systems biology towards a systemic understanding of mechanisms in microbial communities with scopes of application in many areas of our daily life.
Collapse
Affiliation(s)
- Dmitrij Turaev
- Department of Microbiology and Ecosystem Science, University of Vienna, 1090 Vienna, Austria
| | - Thomas Rattei
- Department of Microbiology and Ecosystem Science, University of Vienna, 1090 Vienna, Austria.
| |
Collapse
|
20
|
Alves JMP, de Oliveira AL, Sandberg TOM, Moreno-Gallego JL, de Toledo MAF, de Moura EMM, Oliveira LS, Durham AM, Mehnert DU, Zanotto PMDA, Reyes A, Gruber A. GenSeed-HMM: A Tool for Progressive Assembly Using Profile HMMs as Seeds and its Application in Alpavirinae Viral Discovery from Metagenomic Data. Front Microbiol 2016; 7:269. [PMID: 26973638 PMCID: PMC4777721 DOI: 10.3389/fmicb.2016.00269] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Accepted: 02/19/2016] [Indexed: 01/01/2023] Open
Abstract
This work reports the development of GenSeed-HMM, a program that implements seed-driven progressive assembly, an approach to reconstruct specific sequences from unassembled data, starting from short nucleotide or protein seed sequences or profile Hidden Markov Models (HMM). The program can use any one of a number of sequence assemblers. Assembly is performed in multiple steps and relatively few reads are used in each cycle, consequently the program demands low computational resources. As a proof-of-concept and to demonstrate the power of HMM-driven progressive assemblies, GenSeed-HMM was applied to metagenomic datasets in the search for diverse ssDNA bacteriophages from the recently described Alpavirinae subfamily. Profile HMMs were built using Alpavirinae-specific regions from multiple sequence alignments (MSA) using either the viral protein 1 (VP1; major capsid protein) or VP4 (genome replication initiation protein). These profile HMMs were used by GenSeed-HMM (running Newbler assembler) as seeds to reconstruct viral genomes from sequencing datasets of human fecal samples. All contigs obtained were annotated and taxonomically classified using similarity searches and phylogenetic analyses. The most specific profile HMM seed enabled the reconstruction of 45 partial or complete Alpavirinae genomic sequences. A comparison with conventional (global) assembly of the same original dataset, using Newbler in a standalone execution, revealed that GenSeed-HMM outperformed global genomic assembly in several metrics employed. This approach is capable of detecting organisms that have not been used in the construction of the profile HMM, which opens up the possibility of diagnosing novel viruses, without previous specific information, constituting a de novo diagnosis. Additional applications include, but are not limited to, the specific assembly of extrachromosomal elements such as plastid and mitochondrial genomes from metagenomic data. Profile HMM seeds can also be used to reconstruct specific protein coding genes for gene diversity studies, and to determine all possible gene variants present in a metagenomic sample. Such surveys could be useful to detect the emergence of drug-resistance variants in sensitive environments such as hospitals and animal production facilities, where antibiotics are regularly used. Finally, GenSeed-HMM can be used as an adjunct for gap closure on assembly finishing projects, by using multiple contig ends as anchored seeds.
Collapse
Affiliation(s)
- João M P Alves
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - André L de Oliveira
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Tatiana O M Sandberg
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | | | - Marcelo A F de Toledo
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Elisabeth M M de Moura
- Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Liliane S Oliveira
- Department of Parasitology, Institute of Biomedical Sciences, University of São PauloSão Paulo, Brazil; Department of Computer Science, Institute of Mathematics and Statistics, University of São PauloSão Paulo, Brazil
| | - Alan M Durham
- Department of Computer Science, Institute of Mathematics and Statistics, University of São Paulo São Paulo, Brazil
| | - Dolores U Mehnert
- Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Paolo M de A Zanotto
- Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Alejandro Reyes
- Department of Biological Sciences, Universidad de los AndesBogotá, Colombia; Center for Genome Sciences and Systems Biology, Department of Pathology and Immunology, Washington University in Saint LouisMO, USA
| | - Arthur Gruber
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| |
Collapse
|
21
|
Aarestrup FM, Koopmans MG. Sharing Data for Global Infectious Disease Surveillance and Outbreak Detection. Trends Microbiol 2016; 24:241-245. [PMID: 26875619 DOI: 10.1016/j.tim.2016.01.009] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Revised: 01/24/2016] [Accepted: 01/27/2016] [Indexed: 12/26/2022]
Abstract
Rapid global sharing and comparison of epidemiological and genomic data on infectious diseases would enable more rapid and efficient global outbreak control and tracking of diseases. Several barriers for global sharing exist but, in our opinion, the presumed magnitude of the problems appears larger than they are, and solutions can be found.
Collapse
Affiliation(s)
| | - Marion G Koopmans
- Department of Viroscience, Erasmus Medical Center, Rotterdam, The Netherlands
| |
Collapse
|
22
|
Abstract
The characterization of the human blood-associated viral community (also called blood virome) is essential for epidemiological surveillance and to anticipate new potential threats for blood transfusion safety. Currently, the risk of blood-borne agent transmission of well-known viruses (HBV, HCV, HIV and HTLV) can be considered as under control in high-resource countries. However, other viruses unknown or unsuspected may be transmitted to recipients by blood-derived products. This is particularly relevant considering that a significant proportion of transfused patients are immunocompromised and more frequently subjected to fatal outcomes. Several measures to prevent transfusion transmission of unknown viruses have been implemented including the exclusion of at-risk donors, leukocyte reduction of donor blood, and physicochemical treatment of the different blood components. However, up to now there is no universal method for pathogen inactivation, which would be applicable for all types of blood components and, equally effective for all viral families. In addition, among available inactivation procedures of viral genomes, some of them are recognized to be less effective on non-enveloped viruses, and inadequate to inactivate higher viral titers in plasma pools or derivatives. Given this, there is the need to implement new methodologies for the discovery of unknown viruses that may affect blood transfusion. Viral metagenomics combined with High Throughput Sequencing appears as a promising approach for the identification and global surveillance of new and/or unexpected viruses that could impair blood transfusion safety.
Collapse
Affiliation(s)
- V Sauvage
- Département d'études des agents transmissibles par le sang, Institut national de la transfusion sanguine (INTS), Centre national de référence des hépatites virales B et C et du VIH en transfusion, 75015 Paris, France.
| | - M Eloit
- PathoQuest, bâtiment François-Jacob, 25, rue du Dr-Roux, 75015 Paris, France; Inserm U1117, Biology of Infection Unit, Laboratory of Pathogen Discovery, Institut Pasteur, 28, rue du Docteur-Roux, 75724 Paris, France
| |
Collapse
|