1
|
Handa S, Biswas T, Chakraborty J, Ghosh G, Paul BG, Ghosh P. Structural Requirements for Reverse Transcription by a Diversity-generating Retroelement. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.10.23.563531. [PMID: 37961358 PMCID: PMC10634737 DOI: 10.1101/2023.10.23.563531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Diversity-generating retroelements (DGRs) create massive protein sequence variation in ecologically diverse microbes. Variation occurs during reverse transcription of a protein-encoding RNA template coupled to misincorporation at adenosines. In the prototypical Bordetella bacteriophage DGR, the template must be surrounded by upstream and downstream RNA segments for cDNA synthesis by the reverse transcriptase bRT and associated protein Avd. The function of the surrounding RNA was unknown. Cryo-EM revealed that this RNA enveloped bRT and lay over barrel-shaped Avd, forming an intimate ribonucleoprotein (RNP). An abundance of essential interactions between RNA structural elements and bRT-Avd precisely positioned an RNA homoduplex for initiation of cDNA synthesis by cis -priming. Our results explain how the surrounding RNA primes cDNA synthesis, promotes processivity, terminates polymerization, and strictly limits mutagenesis to select proteins through mechanisms that are likely conserved in DGRs from distant taxa.
Collapse
|
2
|
Mohr G, Yao J, Park SK, Markham L, Lambowitz AM. Mechanisms used for cDNA synthesis and site-specific integration of RNA into DNA genomes by a reverse transcriptase-Cas1 fusion protein. SCIENCE ADVANCES 2024; 10:eadk8791. [PMID: 38608016 PMCID: PMC11014452 DOI: 10.1126/sciadv.adk8791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 03/08/2024] [Indexed: 04/14/2024]
Abstract
Reverse transcriptase-Cas1 (RT-Cas1) fusion proteins found in some CRISPR systems enable spacer acquisition from both RNA and DNA, but the mechanism of RNA spacer acquisition has remained unclear. Here, we found that Marinomonas mediterranea RT-Cas1/Cas2 adds short 3'-DNA (dN) tails to RNA protospacers, enabling their direct integration into CRISPR arrays as 3'-dN-RNAs or 3'-dN-RNA/cDNA duplexes at rates comparable to similarly configured DNAs. Reverse transcription of RNA protospacers is initiated at 3' proximal sites by multiple mechanisms, including recently described de novo initiation, protein priming with any dNTP, and use of short exogenous or synthesized DNA oligomer primers, enabling synthesis of near full-length cDNAs of diverse RNAs without fixed sequence requirements. The integration of 3'-dN-RNAs or single-stranded DNAs (ssDNAs) is favored over duplexes at higher protospacer concentrations, potentially relevant to spacer acquisition from abundant pathogen RNAs or ssDNA fragments generated by phage defense nucleases. Our findings reveal mechanisms for site-specifically integrating RNA into DNA genomes with potential biotechnological applications.
Collapse
Affiliation(s)
- Georg Mohr
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | - Jun Yao
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | | | - Laura Markham
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | | |
Collapse
|
3
|
Mohr G, Yao J, Park SK, Markham LM, Lambowitz AM. Mechanisms used for cDNA synthesis and site-specific integration of RNA into DNA genomes by a reverse transcriptase-Cas1 fusion protein. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.01.555893. [PMID: 37693417 PMCID: PMC10491204 DOI: 10.1101/2023.09.01.555893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Reverse transcriptase-Cas1 (RT-Cas1) fusion proteins found in some CRISPR systems enable spacer acquisition from both RNA and DNA, but the mechanism of RNA spacer acquisition has remained unclear. Here, we found Marinomonas mediterranea RT-Cas1/Cas2 adds short 3'-DNA (dN) tails to RNA protospacers enabling their direct integration into CRISPR arrays as 3'-dN-RNA/cDNA duplexes or 3'-dN-RNAs at rates comparable to similarly configured DNAs. Reverse transcription of RNA protospacers occurs by multiple mechanisms, including recently described de novo initiation, protein priming with any dNTP, and use of short exogenous or synthesized DNA oligomer primers, enabling synthesis of cDNAs from diverse RNAs without fixed sequence requirements. The integration of 3'-dN-RNAs or single-stranded (ss) DNAs is favored over duplexes at higher protospacer concentrations, potentially relevant to spacer acquisition from abundant pathogen RNAs or ssDNA fragments generated by phage-defense nucleases. Our findings reveal novel mechanisms for site-specifically integrating RNA into DNA genomes with potential biotechnological applications.
Collapse
Affiliation(s)
- Georg Mohr
- Departments of Molecular Biosciences and Oncology University of Texas at Austin Austin TX, 78712
| | - Jun Yao
- Departments of Molecular Biosciences and Oncology University of Texas at Austin Austin TX, 78712
| | | | - Laura M. Markham
- Departments of Molecular Biosciences and Oncology University of Texas at Austin Austin TX, 78712
| | - Alan M. Lambowitz
- Departments of Molecular Biosciences and Oncology University of Texas at Austin Austin TX, 78712
| |
Collapse
|
4
|
Macadangdang BR, Makanani SK, Miller JF. Accelerated Evolution by Diversity-Generating Retroelements. Annu Rev Microbiol 2022; 76:389-411. [PMID: 35650669 DOI: 10.1146/annurev-micro-030322-040423] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Diversity-generating retroelements (DGRs) create vast amounts of targeted, functional diversity by facilitating the rapid evolution of ligand-binding protein domains. Thousands of DGRs have been identified in bacteria, archaea, and their respective viruses. They are broadly distributed throughout the microbial world, with enrichment observed in certain taxa and environments. The diversification machinery works through a novel mechanism termed mutagenic retrohoming, whereby nucleotide sequence information is copied from an invariant DNA template repeat (TR) into an RNA intermediate, selectively mutagenized at TR adenines during cDNA synthesis by a DGR-encoded reverse transcriptase, and transferred to a variable repeat (VR) region within a variable-protein gene (54). This unidirectional flow of information leaves TR-DNA sequences unmodified, allowing for repeated rounds of mutagenic retrohoming to optimize variable-protein function. DGR target genes are often modular and can encode one or more of a wide variety of discrete functional domains appended to a diversifiable ligand-binding motif. Bacterial variable proteins often localize to cell surfaces, although a subset appear to be cytoplasmic, while phage-encoded DGRs commonly diversify tail fiber-associated receptor-binding proteins. Here, we provide a comprehensive review of the mechanism and consequences of accelerated protein evolution by these unique and beneficial genetic elements. Expected final online publication date for the Annual Review of Microbiology, Volume 76 is September 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Benjamin R Macadangdang
- Division of Neonatology and Developmental Biology, Department of Pediatrics, David Geffen School of Medicine, University of California, Los Angeles, California, USA; .,California NanoSystems Institute, University of California, Los Angeles, California, USA
| | - Sara K Makanani
- California NanoSystems Institute, University of California, Los Angeles, California, USA.,Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California, USA; .,Department of Microbiology, Immunology and Molecular Genetics, David Geffen School of Medicine, University of California, Los Angeles, California, USA;
| | - Jeff F Miller
- California NanoSystems Institute, University of California, Los Angeles, California, USA.,Department of Microbiology, Immunology and Molecular Genetics, David Geffen School of Medicine, University of California, Los Angeles, California, USA; .,Molecular Biology Institute, University of California, Los Angeles, California, USA
| |
Collapse
|
5
|
Benler S, Koonin EV. Recruitment of Mobile Genetic Elements for Diverse Cellular Functions in Prokaryotes. Front Mol Biosci 2022; 9:821197. [PMID: 35402511 PMCID: PMC8987985 DOI: 10.3389/fmolb.2022.821197] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 02/08/2022] [Indexed: 12/15/2022] Open
Abstract
Prokaryotic genomes are replete with mobile genetic elements (MGE) that span a continuum of replication autonomy. On numerous occasions during microbial evolution, diverse MGE lose their autonomy altogether but, rather than being quickly purged from the host genome, assume a new function that benefits the host, rendering the immobilized MGE subject to purifying selection, and resulting in its vertical inheritance. This mini-review highlights the diversity of the repurposed (exapted) MGE as well as the plethora of cellular functions that they perform. The principal contribution of the exaptation of MGE and their components is to the prokaryotic functional systems involved in biological conflicts, and in particular, defense against viruses and other MGE. This evolutionary entanglement between MGE and defense systems appears to stem both from mechanistic similarities and from similar evolutionary predicaments whereby both MGEs and defense systems tend to incur fitness costs to the hosts and thereby evolve mechanisms for survival including horizontal mobility, causing host addiction, and exaptation for functions beneficial to the host. The examples discussed demonstrate that the identity of an MGE, overall mobility and relationship with the host cell (mutualistic, symbiotic, commensal, or parasitic) are all factors that affect exaptation.
Collapse
|
6
|
Paul BG, Eren AM. Eco-evolutionary significance of domesticated retroelements in microbial genomes. Mob DNA 2022; 13:6. [PMID: 35197094 PMCID: PMC8867640 DOI: 10.1186/s13100-022-00262-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 01/03/2022] [Indexed: 01/03/2023] Open
Abstract
Since the first discovery of reverse transcriptase in bacteria, and later in archaea, bacterial and archaeal retroelements have been defined by their common enzyme that coordinates diverse functions. Yet, evolutionary refinement has produced distinct retroelements across the tree of microbial life that are perhaps best described in terms of their programmed RNA—a compact sequence that preserves core information for a sophisticated mechanism. From this perspective, reverse transcriptase has been selected as the modular tool for carrying out nature’s instructions in various RNA templates. Beneficial retroelements—those that can provide a fitness advantage to their host—evolved to their extant forms in a wide array of microorganisms and their viruses, spanning nearly all habitats. Within each specialized retroelement class, several universal features seem to be shared across diverse taxa, while specific functional and mechanistic insights are based on only a few model retroelement systems from clinical isolates. Currently, little is known about the diversity of cellular functions and ecological significance of retroelements across different biomes. With increasing availability of isolate, metagenome-assembled, and single-amplified genomes, the taxonomic and functional breadth of prokaryotic retroelements is coming into clearer view. This review explores the recently characterized classes of beneficial, yet accessory retroelements of bacteria and archaea. We describe how these specialized mechanisms exploit a form of fixed mobility, whereby the retroelements do not appear to proliferate selfishly throughout the genome. Moreover, we discuss computational approaches for systematic identification of retroelements from vast sequence repositories and highlight recent discoveries in terms of their apparent distribution and ecological significance in nature. Lastly, we present a new perspective on the eco-evolutionary significance of these genetic elements in marine bacteria and demonstrate approaches that enable the characterization of their environmental diversity through metagenomics.
Collapse
Affiliation(s)
- Blair G Paul
- Marine Biological Laboratory, Josephine Bay Paul Center, Woods Hole, MA, USA.
| | - A Murat Eren
- Marine Biological Laboratory, Josephine Bay Paul Center, Woods Hole, MA, USA. .,Department of Medicine, University of Chicago, Chicago, IL, USA.
| |
Collapse
|
7
|
Sharifi F, Ye Y. Identification and classification of reverse transcriptases in bacterial genomes and metagenomes. Nucleic Acids Res 2021; 50:e29. [PMID: 34904653 PMCID: PMC8934634 DOI: 10.1093/nar/gkab1207] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 11/17/2021] [Accepted: 11/22/2021] [Indexed: 02/07/2023] Open
Abstract
Reverse transcriptases (RTs) are found in different systems including group II introns, Diversity Generating Retroelements (DGRs), retrons, CRISPR-Cas systems, and Abortive Infection (Abi) systems in prokaryotes. Different classes of RTs can play different roles, such as template switching and mobility in group II introns, spacer acquisition in CRISPR-Cas systems, mutagenic retrohoming in DGRs, programmed cell suicide in Abi systems, and recently discovered phage defense in retrons. While some classes of RTs have been studied extensively, others remain to be characterized. There is a lack of computational tools for identifying and characterizing various classes of RTs. In this study, we built a tool (called myRT) for identification and classification of prokaryotic RTs. In addition, our tool provides information about the genomic neighborhood of each RT, providing potential functional clues. We applied our tool to predict RTs in all complete and draft bacterial genomes, and created a collection that can be used for exploration of putative RTs and their associated protein domains. Application of myRT to metagenomes showed that gut metagenomes encode proportionally more RTs related to DGRs, outnumbering retron-related RTs, as compared to the collection of reference genomes. MyRT is both available as a standalone software (https://github.com/mgtools/myRT) and also through a website (https://omics.informatics.indiana.edu/myRT/).
Collapse
Affiliation(s)
- Fatemeh Sharifi
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN 47408, USA
| | - Yuzhen Ye
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN 47408, USA
| |
Collapse
|
8
|
González-Delgado A, Mestre MR, Martínez-Abarca F, Toro N. Prokaryotic reverse transcriptases: from retroelements to specialized defense systems. FEMS Microbiol Rev 2021; 45:fuab025. [PMID: 33983378 PMCID: PMC8632793 DOI: 10.1093/femsre/fuab025] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Accepted: 05/07/2021] [Indexed: 12/30/2022] Open
Abstract
Reverse transcriptases (RTs) catalyze the polymerization of DNA from an RNA template. These enzymes were first discovered in RNA tumor viruses in 1970, but it was not until 1989 that they were found in prokaryotes as a key component of retrons. Apart from RTs encoded by the 'selfish' mobile retroelements known as group II introns, prokaryotic RTs are extraordinarily diverse, but their function has remained elusive. However, recent studies have revealed that different lineages of prokaryotic RTs, including retrons, those associated with CRISPR-Cas systems, Abi-like RTs and other yet uncharacterized RTs, are key components of different lines of defense against phages and other mobile genetic elements. Prokaryotic RTs participate in various antiviral strategies, including abortive infection (Abi), in which the infected cell is induced to commit suicide to protect the host population, adaptive immunity, in which a memory of previous infection is used to build an efficient defense, and other as yet unidentified mechanisms. These prokaryotic enzymes are attracting considerable attention, both for use in cutting-edge technologies, such as genome editing, and as an emerging research topic. In this review, we discuss what is known about prokaryotic RTs, and the exciting evidence for their domestication from retroelements to create specialized defense systems.
Collapse
Affiliation(s)
- Alejandro González-Delgado
- Department of Soil Microbiology and Symbiotic Systems, Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas, Structure, Dynamics and Function of Rhizobacterial Genomes, Grupo de Ecología Genética de la Rizosfera, C/ Profesor Albareda 1, 18008 Granada, Spain
| | - Mario Rodríguez Mestre
- Department of Soil Microbiology and Symbiotic Systems, Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas, Structure, Dynamics and Function of Rhizobacterial Genomes, Grupo de Ecología Genética de la Rizosfera, C/ Profesor Albareda 1, 18008 Granada, Spain
- Department of Biochemistry, Universidad Autónoma de Madrid and Instituto de Investigaciones Biomédicas “Alberto Sols”, CSIC-UAM, Madrid, Spain
| | - Francisco Martínez-Abarca
- Department of Soil Microbiology and Symbiotic Systems, Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas, Structure, Dynamics and Function of Rhizobacterial Genomes, Grupo de Ecología Genética de la Rizosfera, C/ Profesor Albareda 1, 18008 Granada, Spain
| | - Nicolás Toro
- Department of Soil Microbiology and Symbiotic Systems, Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas, Structure, Dynamics and Function of Rhizobacterial Genomes, Grupo de Ecología Genética de la Rizosfera, C/ Profesor Albareda 1, 18008 Granada, Spain
| |
Collapse
|
9
|
Roux S, Paul BG, Bagby SC, Nayfach S, Allen MA, Attwood G, Cavicchioli R, Chistoserdova L, Gruninger RJ, Hallam SJ, Hernandez ME, Hess M, Liu WT, McAllister TA, O'Malley MA, Peng X, Rich VI, Saleska SR, Eloe-Fadrosh EA. Ecology and molecular targets of hypermutation in the global microbiome. Nat Commun 2021; 12:3076. [PMID: 34031405 PMCID: PMC8144416 DOI: 10.1038/s41467-021-23402-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 04/27/2021] [Indexed: 02/04/2023] Open
Abstract
Changes in the sequence of an organism's genome, i.e., mutations, are the raw material of evolution. The frequency and location of mutations can be constrained by specific molecular mechanisms, such as diversity-generating retroelements (DGRs). DGRs have been characterized from cultivated bacteria and bacteriophages, and perform error-prone reverse transcription leading to mutations being introduced in specific target genes. DGR loci were also identified in several metagenomes, but the ecological roles and evolutionary drivers of these DGRs remain poorly understood. Here, we analyze a dataset of >30,000 DGRs from public metagenomes, establish six major lineages of DGRs including three primarily encoded by phages and seemingly used to diversify host attachment proteins, and demonstrate that DGRs are broadly active and responsible for >10% of all amino acid changes in some organisms. Overall, these results highlight the constraints under which DGRs evolve, and elucidate several distinct roles these elements play in natural communities.
Collapse
Affiliation(s)
- Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Blair G Paul
- Marine Biological Laboratory, Woods Hole, MA, USA
| | - Sarah C Bagby
- Department of Biology, Case Western Reserve University, Cleveland, OH, USA
| | - Stephen Nayfach
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Graeme Attwood
- AgResearch Limited, Grasslands Research Centre, Palmerston North, New Zealand
| | | | | | - Robert J Gruninger
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, Lethbridge, Alberta, Canada
| | - Steven J Hallam
- Department of Microbiology and Immunology, University of British Columbia, Vancouver, Canada
- Graduate Program in Bioinformatics, University of British Columbia, Genome Sciences Centre, Vancouver, Canada
- Genome Science and Technology Program, University of British Columbia, Vancouver, Canada
- Life Sciences Institute, University of British Columbia, Vancouver, Canada
- ECOSCOPE Training Program, University of British Columbia, Vancouver, Canada
| | - Maria E Hernandez
- Instituto de Ecología A.C. Red de Manejo Biotechnológico de Recursos. Xalapa, Veracruz, México
| | | | - Wen-Tso Liu
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Tim A McAllister
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, Lethbridge, Alberta, Canada
| | - Michelle A O'Malley
- Department of Chemical Engineering, University of California Santa Barbara, Santa Barbara, CA, USA
| | - Xuefeng Peng
- Marine Science Institute, University of California Santa Barbara, Santa Barbara, CA, USA
| | | | | | - Emiley A Eloe-Fadrosh
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
10
|
Benler S, Yutin N, Antipov D, Rayko M, Shmakov S, Gussow AB, Pevzner P, Koonin EV. Thousands of previously unknown phages discovered in whole-community human gut metagenomes. MICROBIOME 2021; 9:78. [PMID: 33781338 PMCID: PMC8008677 DOI: 10.1186/s40168-021-01017-w] [Citation(s) in RCA: 90] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 02/02/2021] [Indexed: 05/07/2023]
Abstract
BACKGROUND Double-stranded DNA bacteriophages (dsDNA phages) play pivotal roles in structuring human gut microbiomes; yet, the gut virome is far from being fully characterized, and additional groups of phages, including highly abundant ones, continue to be discovered by metagenome mining. A multilevel framework for taxonomic classification of viruses was recently adopted, facilitating the classification of phages into evolutionary informative taxonomic units based on hallmark genes. Together with advanced approaches for sequence assembly and powerful methods of sequence analysis, this revised framework offers the opportunity to discover and classify unknown phage taxa in the human gut. RESULTS A search of human gut metagenomes for circular contigs encoding phage hallmark genes resulted in the identification of 3738 apparently complete phage genomes that represent 451 putative genera. Several of these phage genera are only distantly related to previously identified phages and are likely to found new families. Two of the candidate families, "Flandersviridae" and "Quimbyviridae", include some of the most common and abundant members of the human gut virome that infect Bacteroides, Parabacteroides, and Prevotella. The third proposed family, "Gratiaviridae," consists of less abundant phages that are distantly related to the families Autographiviridae, Drexlerviridae, and Chaseviridae. Analysis of CRISPR spacers indicates that phages of all three putative families infect bacteria of the phylum Bacteroidetes. Comparative genomic analysis of the three candidate phage families revealed features without precedent in phage genomes. Some "Quimbyviridae" phages possess Diversity-Generating Retroelements (DGRs) that generate hypervariable target genes nested within defense-related genes, whereas the previously known targets of phage-encoded DGRs are structural genes. Several "Flandersviridae" phages encode enzymes of the isoprenoid pathway, a lipid biosynthesis pathway that so far has not been known to be manipulated by phages. The "Gratiaviridae" phages encode a HipA-family protein kinase and glycosyltransferase, suggesting these phages modify the host cell wall, preventing superinfection by other phages. Hundreds of phages in these three and other families are shown to encode catalases and iron-sequestering enzymes that can be predicted to enhance cellular tolerance to reactive oxygen species. CONCLUSIONS Analysis of phage genomes identified in whole-community human gut metagenomes resulted in the delineation of at least three new candidate families of Caudovirales and revealed diverse putative mechanisms underlying phage-host interactions in the human gut. Addition of these phylogenetically classified, diverse, and distinct phages to public databases will facilitate taxonomic decomposition and functional characterization of human gut viromes. Video abstract.
Collapse
Affiliation(s)
- Sean Benler
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland 20894 USA
| | - Natalya Yutin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland 20894 USA
| | - Dmitry Antipov
- Center for Algorithmic Biotechnology, Institute for Translational Biomedicine, St. Petersburg State University, St. Petersburg, 199004 Russia
| | - Mikhail Rayko
- Center for Algorithmic Biotechnology, Institute for Translational Biomedicine, St. Petersburg State University, St. Petersburg, 199004 Russia
| | - Sergey Shmakov
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland 20894 USA
| | - Ayal B. Gussow
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland 20894 USA
| | - Pavel Pevzner
- Center for Algorithmic Biotechnology, Institute for Translational Biomedicine, St. Petersburg State University, St. Petersburg, 199004 Russia
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92093 USA
| | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland 20894 USA
| |
Collapse
|
11
|
Handa S, Reyna A, Wiryaman T, Ghosh P. Determinants of adenine-mutagenesis in diversity-generating retroelements. Nucleic Acids Res 2021; 49:1033-1045. [PMID: 33367793 PMCID: PMC7826257 DOI: 10.1093/nar/gkaa1240] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 12/07/2020] [Accepted: 12/09/2020] [Indexed: 02/01/2023] Open
Abstract
Diversity-generating retroelements (DGRs) vary protein sequences to the greatest extent known in the natural world. These elements are encoded by constituents of the human microbiome and the microbial ‘dark matter’. Variation occurs through adenine-mutagenesis, in which genetic information in RNA is reverse transcribed faithfully to cDNA for all template bases but adenine. We investigated the determinants of adenine-mutagenesis in the prototypical Bordetella bacteriophage DGR through an in vitro system composed of the reverse transcriptase bRT, Avd protein, and a specific RNA. We found that the catalytic efficiency for correct incorporation during reverse transcription by the bRT-Avd complex was strikingly low for all template bases, with the lowest occurring for adenine. Misincorporation across a template adenine was only somewhat lower in efficiency than correct incorporation. We found that the C6, but not the N1 or C2, purine substituent was a key determinant of adenine-mutagenesis. bRT-Avd was insensitive to the C6 amine of adenine but recognized the C6 carbonyl of guanine. We also identified two bRT amino acids predicted to nonspecifically contact incoming dNTPs, R74 and I181, as promoters of adenine-mutagenesis. Our results suggest that the overall low catalytic efficiency of bRT-Avd is intimately tied to its ability to carry out adenine-mutagenesis.
Collapse
Affiliation(s)
- Sumit Handa
- Department of Chemistry & Biochemistry, 9500 Gilman Drive, La Jolla, CA, 92093-0375, USA
| | - Andres Reyna
- Department of Chemistry & Biochemistry, 9500 Gilman Drive, La Jolla, CA, 92093-0375, USA
| | - Timothy Wiryaman
- Department of Chemistry & Biochemistry, 9500 Gilman Drive, La Jolla, CA, 92093-0375, USA
| | - Partho Ghosh
- Department of Chemistry & Biochemistry, 9500 Gilman Drive, La Jolla, CA, 92093-0375, USA
| |
Collapse
|
12
|
Sozhamannan S, Hofmann ER. The State of the Art in Biodefense Related Bacterial Pathogen Detection Using Bacteriophages: How It Started and How It's Going. Viruses 2020; 12:v12121393. [PMID: 33291831 PMCID: PMC7762055 DOI: 10.3390/v12121393] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 11/23/2020] [Accepted: 12/01/2020] [Indexed: 02/07/2023] Open
Abstract
Accurate pathogen detection and diagnosis is paramount in clinical success of treating patients. There are two general paradigms in pathogen detection: molecular and immuno-based, and phage-based detection is a third emerging paradigm due to its sensitivity and selectivity. Molecular detection methods look for genetic material specific for a given pathogen in a sample usually by polymerase chain reaction (PCR). Immuno-methods look at the pathogen components (antigens) by antibodies raised against that pathogen specific antigens. There are different variations and products based on these two paradigms with advantages and disadvantages. The third paradigm at least for bacterial pathogen detection entails bacteriophages specific for a given bacterium. Sensitivity and specificity are the two key parameters in any pathogen detection system. By their very nature, bacteriophages afford the best sensitivity for bacterial detection. Bacteria and bacteriophages form the predator-prey pair in the evolutionary arms race and has coevolved over time to acquire the exquisite specificity of the pair, in some instances at the strain level. This specificity has been exploited for diagnostic purposes of various pathogens of concern in clinical and other settings. Many recent reviews focus on phage-based detection and sensor technologies. In this review, we focus on a very special group of pathogens that are of concern in biodefense because of their potential misuse in bioterrorism and their extremely virulent nature and as such fall under the Centers for Disease and Prevention (CDC) Category A pathogen list. We describe the currently available phage methods that are based on the usual modalities of detection from culture, to molecular and immuno- and fluorescent methods. We further highlight the gaps and the needs for more modern technologies and sensors drawing from technologies existing for detection and surveillance of other pathogens of clinical relevance.
Collapse
Affiliation(s)
- Shanmuga Sozhamannan
- National Security Science & Technology, Management Advisory Services, Logistics Management Institute, 7940 Jones Branch Drive, Tysons, VA 22102, USA;
- Defense Biological Product Assurance Office (DBPAO), Joint Program Executive Office (JPEO) for Chemical, Biological, Radiological and Nuclear Defense (CBRND) Joint Project Lead (JPL) CBRND Enabling Biotechnologies (EB), 110 Thomas Johnson Drive, Suite 250, Frederick, MD 21702, USA
| | - Edward R. Hofmann
- EXCET, Inc., 6225 Brandon Ave #360, Springfield, VA 22150, USA
- US Army Combat Capabilities Development Command, Chemical Biological Center, 8908 Guard St, E3831, Edgewood, MD 21010, USA
- Correspondence:
| |
Collapse
|
13
|
Simon AJ, Ellington AD, Finkelstein IJ. Retrons and their applications in genome engineering. Nucleic Acids Res 2020; 47:11007-11019. [PMID: 31598685 PMCID: PMC6868368 DOI: 10.1093/nar/gkz865] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Revised: 09/19/2019] [Accepted: 10/02/2019] [Indexed: 11/14/2022] Open
Abstract
Precision genome editing technologies have transformed modern biology. These technologies have arisen from the redirection of natural biological machinery, such as bacteriophage lambda proteins for recombineering and CRISPR nucleases for eliciting site-specific double-strand breaks. Less well-known is a widely distributed class of bacterial retroelements, retrons, that employ specialized reverse transcriptases to produce noncoding intracellular DNAs. Retrons' natural function and mechanism of genetic transmission have remained enigmatic. However, recent studies have harnessed their ability to produce DNA in situ for genome editing and evolution. This review describes retron biology and function in both natural and synthetic contexts. We also highlight areas that require further study to advance retron-based precision genome editing platforms.
Collapse
Affiliation(s)
- Anna J Simon
- Center for Systems and Synthetic Biology and Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas 78712, USA
| | - Andrew D Ellington
- Center for Systems and Synthetic Biology and Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas 78712, USA
| | - Ilya J Finkelstein
- Center for Systems and Synthetic Biology and Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas 78712, USA
| |
Collapse
|
14
|
Yan F, Yu X, Duan Z, Lu J, Jia B, Qiao Y, Sun C, Wei C. Discovery and characterization of the evolution, variation and functions of diversity-generating retroelements using thousands of genomes and metagenomes. BMC Genomics 2019; 20:595. [PMID: 31324156 PMCID: PMC6642488 DOI: 10.1186/s12864-019-5951-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2019] [Accepted: 07/02/2019] [Indexed: 12/31/2022] Open
Abstract
Background Diversity-generating retroelements (DGRs) are a unique family of retroelements that generate sequence diversity of DNA to benefit their hosts by introducing variations and accelerating the evolution of target proteins. They exist widely in bacteria, archaea, phage and plasmid. However, our understanding about DGRs in natural environments was still very limited. Results We developed an efficient computational algorithm to identify DGRs, and applied it to characterize DGRs in more than 80,000 sequenced bacterial genomes as well as more than 4,000 human metagenome datasets. In total, we identified 948 non-redundant DGRs, which expanded the number of known DGRs in bacterial genomes and human microbiomes by about 55%, and provided a much more comprehensive reference for the study of DGRs. Phylogenetic analysis was done for identified DGRs. The putative target genes of DGRs were searched, and the functions of these target genes were investigated with a comprehensive alignment against the nr database. Conclusions DGR system is a powerful and universal mechanism to generate diversity. DGR evolution is closely associated with the living environment and their cassette structures. Furthermore, it may impact a wide range of functional processes in addition to receptor-binding. These results significantly improved our understanding about DGRs. Electronic supplementary material The online version of this article (10.1186/s12864-019-5951-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Fazhe Yan
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China
| | - Xuelin Yu
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China
| | - Zhongqu Duan
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China
| | - Jinyuan Lu
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China
| | - Ben Jia
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China.,Shanghai Center for Bioinformation Technology, 1278 Keyuan Road, Pudong District, Shanghai, 201203, China
| | - Yuyang Qiao
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China
| | - Chen Sun
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China
| | - Chaochun Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China. .,Shanghai Center for Bioinformation Technology, 1278 Keyuan Road, Pudong District, Shanghai, 201203, China.
| |
Collapse
|
15
|
Sharifi F, Ye Y. MyDGR: a server for identification and characterization of diversity-generating retroelements. Nucleic Acids Res 2019; 47:W289-W294. [PMID: 31049585 PMCID: PMC6602519 DOI: 10.1093/nar/gkz329] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Revised: 04/05/2019] [Accepted: 04/23/2019] [Indexed: 01/18/2023] Open
Abstract
MyDGR is a web server providing integrated prediction and visualization of Diversity-Generating Retroelements (DGR) systems in query nucleotide sequences. It is built upon an enhanced version of DGRscan, a tool we previously developed for identification of DGR systems. DGR systems are remarkable genetic elements that use error-prone reverse transcriptases to generate vast sequence variants in specific target genes, which have been shown to benefit their hosts (bacteria, archaea or phages). As the first web server for annotation of DGR systems, myDGR is freely available on the web at http://omics.informatics.indiana.edu/myDGR with all major browsers supported. MyDGR accepts query nucleotide sequences in FASTA format, and outputs all the important features of a predicted DGR system, including a reverse transcriptase, a template repeat and one (or more) variable repeats and their alignment featuring A-to-N (N can be C, T or G) substitutions, and VR-containing target gene(s). In addition to providing the results as text files for download, myDGR generates a visual summary of the results for users to explore the predicted DGR systems. Users can also directly access pre-calculated, putative DGR systems identified in currently available reference bacterial genomes and a few other collections of sequences (including human microbiomes).
Collapse
Affiliation(s)
- Fatemeh Sharifi
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN 47405, USA
| | - Yuzhen Ye
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN 47405, USA
| |
Collapse
|
16
|
García-López R, Pérez-Brocal V, Moya A. Beyond cells - The virome in the human holobiont. MICROBIAL CELL (GRAZ, AUSTRIA) 2019; 6:373-396. [PMID: 31528630 PMCID: PMC6717880 DOI: 10.15698/mic2019.09.689] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/21/2019] [Revised: 03/14/2019] [Accepted: 04/03/2019] [Indexed: 01/01/2023]
Abstract
Viromics, or viral metagenomics, is a relatively new and burgeoning field of research that studies the complete collection of viruses forming part of the microbiota in any given niche. It has strong foundations rooted in over a century of discoveries in the field of virology and recent advances in molecular biology and sequencing technologies. Historically, most studies have deconstructed the concept of viruses into a simplified perception of viral agents as mere pathogens, which demerits the scope of large-scale viromic analyses. Viruses are, in fact, much more than regular parasites. They are by far the most dynamic and abundant entity and the greatest killers on the planet, as well as the most effective geo-transforming genetic engineers and resource recyclers, acting on all life strata in any habitat. Yet, most of this uncanny viral world remains vastly unexplored to date, greatly hindered by the bewildering complexity inherent to such studies and the methodological and conceptual limitations. Viromic studies are just starting to address some of these issues but they still lag behind microbial metagenomics. In recent years, however, higher-throughput analysis and resequencing have rekindled interest in a field that is just starting to show its true potential. In this review, we take a look at the scientific and technological developments that led to the advent of viral and bacterial metagenomics with a particular, but not exclusive, focus on human viromics from an ecological perspective. We also address some of the most relevant challenges that current viral studies face and ponder on the future directions of the field.
Collapse
Affiliation(s)
- Rodrigo García-López
- Institute of Evolutionary Systems Biology (I2Sysbio), Universitat de València and CSIC, València, Spain
- CIBER in Epidemiology and Public Health (CIBEResp), Madrid, Spain
- Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO), València, Spain
| | - Vicente Pérez-Brocal
- Institute of Evolutionary Systems Biology (I2Sysbio), Universitat de València and CSIC, València, Spain
- CIBER in Epidemiology and Public Health (CIBEResp), Madrid, Spain
- Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO), València, Spain
| | - Andrés Moya
- Institute of Evolutionary Systems Biology (I2Sysbio), Universitat de València and CSIC, València, Spain
- CIBER in Epidemiology and Public Health (CIBEResp), Madrid, Spain
- Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO), València, Spain
| |
Collapse
|
17
|
Handa S, Shaw KL, Ghosh P. Crystal structure of a Thermus aquaticus diversity-generating retroelement variable protein. PLoS One 2019; 14:e0205618. [PMID: 30629599 PMCID: PMC6328139 DOI: 10.1371/journal.pone.0205618] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Accepted: 12/21/2018] [Indexed: 01/01/2023] Open
Abstract
Diversity-generating retroelements (DGRs) are widely distributed in bacteria, archaea, and microbial viruses, and bring about unparalleled levels of sequence variation in target proteins. While DGR variable proteins share low sequence identity, the structures of several such proteins have revealed the C-type lectin (CLec)-fold as a conserved scaffold for accommodating massive sequence variation. This conservation has led to the suggestion that the CLec-fold may be useful in molecular surface display applications. Thermostability is an attractive feature in such applications, and thus we studied the variable protein of a DGR encoded by a prophage of the thermophile Thermus aquaticus. We report here the 2.8 Å resolution crystal structure of the variable protein from the T. aquaticus DGR, called TaqVP, and confirm that it has a CLec-fold. Remarkably, its variable region is nearly identical in structure to those of several other CLec-fold DGR variable proteins despite low sequence identity among these. TaqVP was found to be thermostable, which appears to be a property shared by several CLec-fold DGR variable proteins. These results provide impetus for the pursuit of the DGR variable protein CLec-fold in molecular display applications.
Collapse
Affiliation(s)
- Sumit Handa
- Department of Chemistry & Biochemistry, University of California San Diego, La Jolla, CA, United States of America
| | - Kharissa L. Shaw
- Department of Chemistry & Biochemistry, University of California San Diego, La Jolla, CA, United States of America
| | - Partho Ghosh
- Department of Chemistry & Biochemistry, University of California San Diego, La Jolla, CA, United States of America
- * E-mail:
| |
Collapse
|