1
|
Blanco-Melo D, Campbell MA, Zhu H, Dennis TPW, Modha S, Lytras S, Hughes J, Gatseva A, Gifford RJ. A novel approach to exploring the dark genome and its application to mapping of the vertebrate virus fossil record. Genome Biol 2024; 25:120. [PMID: 38741126 PMCID: PMC11089739 DOI: 10.1186/s13059-024-03258-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Accepted: 04/22/2024] [Indexed: 05/16/2024] Open
Abstract
BACKGROUND Genomic regions that remain poorly understood, often referred to as the dark genome, contain a variety of functionally relevant and biologically informative features. These include endogenous viral elements (EVEs)-virus-derived sequences that can dramatically impact host biology and serve as a virus fossil record. In this study, we introduce a database-integrated genome screening (DIGS) approach to investigate the dark genome in silico, focusing on EVEs found within vertebrate genomes. RESULTS Using DIGS on 874 vertebrate genomes, we uncover approximately 1.1 million EVE sequences, with over 99% originating from endogenous retroviruses or transposable elements that contain EVE DNA. We show that the remaining 6038 sequences represent over a thousand distinct horizontal gene transfer events across 10 virus families, including some that have not previously been reported as EVEs. We explore the genomic and phylogenetic characteristics of non-retroviral EVEs and determine their rates of acquisition during vertebrate evolution. Our study uncovers novel virus diversity, broadens knowledge of virus distribution among vertebrate hosts, and provides new insights into the ecology and evolution of vertebrate viruses. CONCLUSIONS We comprehensively catalog and analyze EVEs within 874 vertebrate genomes, shedding light on the distribution, diversity, and long-term evolution of viruses and reveal their extensive impact on vertebrate genome evolution. Our results demonstrate the power of linking a relational database management system to a similarity search-based screening pipeline for in silico exploration of the dark genome.
Collapse
Affiliation(s)
- Daniel Blanco-Melo
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA
- Herbold Computational Biology Program, Public Health Sciences Division, Fred Hutchinson Cancer Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA
| | | | - Henan Zhu
- MRC-University of Glasgow Centre for Virus Research, 464 Bearsden Rd, Bearsden, Glasgow, G61 1QH, UK
| | - Tristan P W Dennis
- MRC-University of Glasgow Centre for Virus Research, 464 Bearsden Rd, Bearsden, Glasgow, G61 1QH, UK
| | - Sejal Modha
- MRC-University of Glasgow Centre for Virus Research, 464 Bearsden Rd, Bearsden, Glasgow, G61 1QH, UK
| | - Spyros Lytras
- MRC-University of Glasgow Centre for Virus Research, 464 Bearsden Rd, Bearsden, Glasgow, G61 1QH, UK
| | - Joseph Hughes
- MRC-University of Glasgow Centre for Virus Research, 464 Bearsden Rd, Bearsden, Glasgow, G61 1QH, UK
| | - Anna Gatseva
- MRC-University of Glasgow Centre for Virus Research, 464 Bearsden Rd, Bearsden, Glasgow, G61 1QH, UK
| | - Robert J Gifford
- MRC-University of Glasgow Centre for Virus Research, 464 Bearsden Rd, Bearsden, Glasgow, G61 1QH, UK.
- Centre for Epidemic Response and Innovation (CERI), School of Data Science and Computational Thinking, Stellenbosch University, Stellenbosch, South Africa.
| |
Collapse
|
2
|
Veglia AJ, Bistolas KSI, Voolstra CR, Hume BCC, Ruscheweyh HJ, Planes S, Allemand D, Boissin E, Wincker P, Poulain J, Moulin C, Bourdin G, Iwankow G, Romac S, Agostini S, Banaigs B, Boss E, Bowler C, de Vargas C, Douville E, Flores M, Forcioli D, Furla P, Galand PE, Gilson E, Lombard F, Pesant S, Reynaud S, Sunagawa S, Thomas OP, Troublé R, Zoccola D, Correa AMS, Vega Thurber RL. Endogenous viral elements reveal associations between a non-retroviral RNA virus and symbiotic dinoflagellate genomes. Commun Biol 2023; 6:566. [PMID: 37264063 DOI: 10.1038/s42003-023-04917-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 04/24/2023] [Indexed: 06/03/2023] Open
Abstract
Endogenous viral elements (EVEs) offer insight into the evolutionary histories and hosts of contemporary viruses. This study leveraged DNA metagenomics and genomics to detect and infer the host of a non-retroviral dinoflagellate-infecting +ssRNA virus (dinoRNAV) common in coral reefs. As part of the Tara Pacific Expedition, this study surveyed 269 newly sequenced cnidarians and their resident symbiotic dinoflagellates (Symbiodiniaceae), associated metabarcodes, and publicly available metagenomes, revealing 178 dinoRNAV EVEs, predominantly among hydrocoral-dinoflagellate metagenomes. Putative associations between Symbiodiniaceae and dinoRNAV EVEs were corroborated by the characterization of dinoRNAV-like sequences in 17 of 18 scaffold-scale and one chromosome-scale dinoflagellate genome assembly, flanked by characteristically cellular sequences and in proximity to retroelements, suggesting potential mechanisms of integration. EVEs were not detected in dinoflagellate-free (aposymbiotic) cnidarian genome assemblies, including stony corals, hydrocorals, jellyfish, or seawater. The pervasive nature of dinoRNAV EVEs within dinoflagellate genomes (especially Symbiodinium), as well as their inconsistent within-genome distribution and fragmented nature, suggest ancestral or recurrent integration of this virus with variable conservation. Broadly, these findings illustrate how +ssRNA viruses may obscure their genomes as members of nested symbioses, with implications for host evolution, exaptation, and immunity in the context of reef health and disease.
Collapse
Affiliation(s)
- Alex J Veglia
- BioSciences Department, Rice University, Houston, TX, USA
| | | | | | | | - Hans-Joachim Ruscheweyh
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, Vladimir-Prelog-Weg 4, ETH Zürich, CH-8093, Zürich, Switzerland
| | - Serge Planes
- PSL Research University: EPHE-UPVD-CNRS, USR 3278 CRIOBE, Laboratoire d'Excellence CORAIL, Université de Perpignan, 52 Avenue Paul Alduy, 66860, Perpignan, Cedex, France
| | - Denis Allemand
- Centre Scientifique de Monaco, 8 Quai Antoine Ier, Monaco, MC-98000, Principality of Monaco
| | - Emilie Boissin
- PSL Research University: EPHE-UPVD-CNRS, USR 3278 CRIOBE, Laboratoire d'Excellence CORAIL, Université de Perpignan, 52 Avenue Paul Alduy, 66860, Perpignan, Cedex, France
| | - Patrick Wincker
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
- Research Federation for the study of Global Ocean Systems Ecology and Evolution, FR2022/ Tara Oceans-GOSEE, 3 rue Michel-Ange, 75016, Paris, France
| | - Julie Poulain
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
- Research Federation for the study of Global Ocean Systems Ecology and Evolution, FR2022/ Tara Oceans-GOSEE, 3 rue Michel-Ange, 75016, Paris, France
| | - Clémentine Moulin
- Fondation Tara Océan, Base Tara, 8 rue de Prague, 75012, Paris, France
| | | | - Guillaume Iwankow
- PSL Research University: EPHE-UPVD-CNRS, USR 3278 CRIOBE, Laboratoire d'Excellence CORAIL, Université de Perpignan, 52 Avenue Paul Alduy, 66860, Perpignan, Cedex, France
| | - Sarah Romac
- Sorbonne Université, CNRS, Station Biologique de Roscoff, AD2M, UMR 7144, ECOMAP, Roscoff, France
| | - Sylvain Agostini
- Shimoda Marine Research Center, University of Tsukuba, 5-10-1, Shimoda, Shizuoka, Japan
| | - Bernard Banaigs
- PSL Research University: EPHE-UPVD-CNRS, USR 3278 CRIOBE, Laboratoire d'Excellence CORAIL, Université de Perpignan, 52 Avenue Paul Alduy, 66860, Perpignan, Cedex, France
| | - Emmanuel Boss
- School of Marine Sciences, University of Maine, Orono, ME, USA
| | - Chris Bowler
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, Université PSL, 75005, Paris, France
| | - Colomban de Vargas
- Sorbonne Université, CNRS, Station Biologique de Roscoff, AD2M, UMR 7144, ECOMAP, Roscoff, France
| | - Eric Douville
- Laboratoire des Sciences du Climat et de l'Environnement, LSCE/IPSL, CEA-CNRS-UVSQ, Université Paris-Saclay, F-91191, Gif-sur-Yvette, France
| | - Michel Flores
- Weizmann Institute of Science, Department of Earth and Planetary Sciences, 76100, Rehovot, Israel
| | - Didier Forcioli
- Université Côte d'Azur, CNRS, INSERM, IRCAN, Medical School, Nice, France
- Laboratoire International Associé Université Côte d'Azur-Centre Scientifique de Monaco, LIA ROPSE, Monaco, France
| | - Paola Furla
- Université Côte d'Azur, CNRS, INSERM, IRCAN, Medical School, Nice, France
- Laboratoire International Associé Université Côte d'Azur-Centre Scientifique de Monaco, LIA ROPSE, Monaco, France
| | - Pierre E Galand
- Sorbonne Université, CNRS, Laboratoire d'Ecogéochimie des Environnements Benthiques (LECOB), Observatoire Océanologique de Banyuls, 66650, Banyuls sur mer, France
| | - Eric Gilson
- Université Côte d'Azur, CNRS, INSERM, IRCAN, Medical School, Nice, France
- Department of Medical Genetics, CHU of Nice, Nice, France
| | - Fabien Lombard
- Sorbonne Université, Institut de la Mer de Villefranche sur mer, Laboratoire d'Océanographie de Villefranche, F-06230, Villefranche-sur-Mer, France
| | - Stéphane Pesant
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Stéphanie Reynaud
- Centre Scientifique de Monaco, 8 Quai Antoine Ier, Monaco, MC-98000, Principality of Monaco
| | - Shinichi Sunagawa
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, Vladimir-Prelog-Weg 4, ETH Zürich, CH-8093, Zürich, Switzerland
| | - Olivier P Thomas
- School of Biological and Chemical Sciences, Ryan Institute, University of Galway, University Road H91 TK33, Galway, Ireland
| | - Romain Troublé
- Fondation Tara Océan, Base Tara, 8 rue de Prague, 75012, Paris, France
| | - Didier Zoccola
- Centre Scientifique de Monaco, 8 Quai Antoine Ier, Monaco, MC-98000, Principality of Monaco
| | | | | |
Collapse
|
3
|
Devaux CA, Pontarotti P, Nehari S, Raoult D. 'Cannibalism' of exogenous DNA sequences: The ancestral form of adaptive immunity which entails recognition of danger. Front Immunol 2022; 13:989707. [PMID: 36618387 PMCID: PMC9816338 DOI: 10.3389/fimmu.2022.989707] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Accepted: 12/05/2022] [Indexed: 12/24/2022] Open
Abstract
Adaptive immunity is a sophisticated form of immune response capable of retaining the molecular memory of a very great diversity of target antigens (epitopes) as non-self. It is capable of reactivating itself upon a second encounter with an immunoglobulin or T-cell receptor antigen-binding site with a known epitope that had previously primed the host immune system. It has long been considered that adaptive immunity is a highly evolved form of non-self recognition that appeared quite late in speciation and complemented a more generalist response called innate immunity. Innate immunity offers a relatively non-specific defense (although mediated by sensors that could specifically recognize virus or bacteria compounds) and which does not retain a memory of the danger. But this notion of recent acquisition of adaptive immunity is challenged by the fact that another form of specific recognition mechanisms already existed in prokaryotes that may be able to specifically auto-protect against external danger. This recognition mechanism can be considered a primitive form of specific (adaptive) non-self recognition. It is based on the fact that many archaea and bacteria use a genome editing system that confers the ability to appropriate viral DNA sequences allowing prokaryotes to prevent host damage through a mechanism very similar to adaptive immunity. This is indistinctly called, 'endogenization of foreign DNA' or 'viral DNA predation' or, more pictorially 'DNA cannibalism'. For several years evidence has been accumulating, highlighting the crucial role of endogenization of foreign DNA in the fundamental processes related to adaptive immunity and leading to a change in the dogma that adaptive immunity appeared late in speciation.
Collapse
Affiliation(s)
- Christian A. Devaux
- Aix-Marseille University, Institut de recherche pour le développement (IRD), Assistance Publique Hôpitaux de Marseille (APHM), MEPHI, Institut Hospitalo-universitaire (IHU)-Méditerranée Infection, Marseille, France,Department of Biological Sciences, Centre National de la Recherche Scientifique, Centre National de la Recherche Scientifique (CNRS)-SNC5039, Marseille, France,*Correspondence: Christian A. Devaux,
| | - Pierre Pontarotti
- Aix-Marseille University, Institut de recherche pour le développement (IRD), Assistance Publique Hôpitaux de Marseille (APHM), MEPHI, Institut Hospitalo-universitaire (IHU)-Méditerranée Infection, Marseille, France,Department of Biological Sciences, Centre National de la Recherche Scientifique, Centre National de la Recherche Scientifique (CNRS)-SNC5039, Marseille, France
| | - Sephora Nehari
- Aix-Marseille University, Institut de recherche pour le développement (IRD), Assistance Publique Hôpitaux de Marseille (APHM), MEPHI, Institut Hospitalo-universitaire (IHU)-Méditerranée Infection, Marseille, France
| | - Didier Raoult
- Aix-Marseille University, Institut de recherche pour le développement (IRD), Assistance Publique Hôpitaux de Marseille (APHM), MEPHI, Institut Hospitalo-universitaire (IHU)-Méditerranée Infection, Marseille, France
| |
Collapse
|