1
|
Oliveira LS, Reyes A, Dutilh BE, Gruber A. Rational Design of Profile HMMs for Sensitive and Specific Sequence Detection with Case Studies Applied to Viruses, Bacteriophages, and Casposons. Viruses 2023; 15:519. [PMID: 36851733 PMCID: PMC9966878 DOI: 10.3390/v15020519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 02/01/2023] [Accepted: 02/09/2023] [Indexed: 02/15/2023] Open
Abstract
Profile hidden Markov models (HMMs) are a powerful way of modeling biological sequence diversity and constitute a very sensitive approach to detecting divergent sequences. Here, we report the development of protocols for the rational design of profile HMMs. These methods were implemented on TABAJARA, a program that can be used to either detect all biological sequences of a group or discriminate specific groups of sequences. By calculating position-specific information scores along a multiple sequence alignment, TABAJARA automatically identifies the most informative sequence motifs and uses them to construct profile HMMs. As a proof-of-principle, we applied TABAJARA to generate profile HMMs for the detection and classification of two viral groups presenting different evolutionary rates: bacteriophages of the Microviridae family and viruses of the Flavivirus genus. We obtained conserved models for the generic detection of any Microviridae or Flavivirus sequence, and profile HMMs that can specifically discriminate Microviridae subfamilies or Flavivirus species. In another application, we constructed Cas1 endonuclease-derived profile HMMs that can discriminate CRISPRs and casposons, two evolutionarily related transposable elements. We believe that the protocols described here, and implemented on TABAJARA, constitute a generic toolbox for generating profile HMMs for the highly sensitive and specific detection of sequence classes.
Collapse
Affiliation(s)
- Liliane S. Oliveira
- Department of Parasitology, Instituto de Ciências Biomédicas, Universidade de São Paulo, São Paulo 05508-000, SP, Brazil
| | - Alejandro Reyes
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá 111711, Colombia
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63108, USA
| | - Bas E. Dutilh
- Institute of Biodiversity, Faculty of Biological Sciences, Cluster of Excellence Balance of the Microverse, Friedrich-Schiller-University Jena, 07743 Jena, Germany
- Theoretical Biology and Bioinformatics, Science for Life, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands
- European Virus Bioinformatics Center, Leutragraben 1, 07743 Jena, Germany
| | - Arthur Gruber
- Department of Parasitology, Instituto de Ciências Biomédicas, Universidade de São Paulo, São Paulo 05508-000, SP, Brazil
- European Virus Bioinformatics Center, Leutragraben 1, 07743 Jena, Germany
| |
Collapse
|
2
|
Smaruj P, Kieliszek M. Casposons - silent heroes of the CRISPR-Cas systems evolutionary history. EXCLI JOURNAL 2023; 22:70-83. [PMID: 36814855 PMCID: PMC9939771 DOI: 10.17179/excli2022-5581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Accepted: 12/19/2022] [Indexed: 02/24/2023]
Abstract
Many archaeal and bacterial organisms possess an adaptive immunity system known as CRISPR-Cas. Its role is to recognize and degrade foreign DNA showing high similarity to repeats within the CRISPR array. In recent years computational techniques have been used to identify cas1 genes that are not associated with CRISPR systems, named cas1-solo. Often, cas1-solo genes are present in a conserved neighborhood of PolB-like polymerase genes, which is a characteristic feature of self-synthesizing, eukaryotic transposons of the Polinton class. Nearly all cas1-polB genomic islands are flanked by terminal inverted repeats and direct repeats which correspond to target site duplications. Considering the patchy taxonomic distribution of the identified islands in archaeal and bacterial genomes, they were characterized as a new superfamily of mobile genetic elements and called casposons. Here, we review recent experiments on casposons' mobility and discuss their discovery, classification, and evolutionary relationship with the CRISPR-Cas systems.
Collapse
Affiliation(s)
- Paulina Smaruj
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, United States of America,College of Inter-Faculty Individual Studies in Mathematics and Natural Sciences, University of Warsaw, 02-097 Warsaw, Poland,*To whom correspondence should be addressed: Paulina Smaruj, Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, United States of America, E-mail:
| | - Marek Kieliszek
- Department of Food Biotechnology and Microbiology, Institute of Food Sciences, Warsaw University of Life Sciences-SGGW, Nowoursynowska 159 C, 02-776 Warsaw, Poland
| |
Collapse
|
3
|
Salgado O, Guajardo-Leiva S, Moya-Beltrán A, Barbosa C, Ridley C, Tamayo-Leiva J, Quatrini R, Mojica FJM, Díez B. Global phylogenomic novelty of the Cas1 gene from hot spring microbial communities. Front Microbiol 2022; 13:1069452. [DOI: 10.3389/fmicb.2022.1069452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 11/17/2022] [Indexed: 12/05/2022] Open
Abstract
The Cas1 protein is essential for the functioning of CRISPR-Cas adaptive systems. However, despite the high prevalence of CRISPR-Cas systems in thermophilic microorganisms, few studies have investigated the occurrence and diversity of Cas1 across hot spring microbial communities. Phylogenomic analysis of 2,150 Cas1 sequences recovered from 48 metagenomes representing hot springs (42–80°C, pH 6–9) from three continents, revealed similar ecological diversity of Cas1 and 16S rRNA associated with geographic location. Furthermore, phylogenetic analysis of the Cas1 sequences exposed a broad taxonomic distribution in thermophilic bacteria, with new clades of Cas1 homologs branching at the root of the tree or at the root of known clades harboring reference Cas1 types. Additionally, a new family of casposases was identified from hot springs, which further completes the evolutionary landscape of the Cas1 superfamily. This ecological study contributes new Cas1 sequences from known and novel locations worldwide, mainly focusing on under-sampled hot spring microbial mat taxa. Results herein show that circumneutral hot springs are environments harboring high diversity and novelty related to adaptive immunity systems.
Collapse
|
4
|
Gehlert FO, Sauerwein T, Weidenbach K, Repnik U, Hallack D, Förstner KU, Schmitz RA. Dual-RNAseq Analysis Unravels Virus-Host Interactions of MetSV and Methanosarcina mazei. Viruses 2022; 14:2585. [PMID: 36423194 PMCID: PMC9694453 DOI: 10.3390/v14112585] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 11/05/2022] [Accepted: 11/19/2022] [Indexed: 11/23/2022] Open
Abstract
Methanosarcina spherical virus (MetSV), infecting Methanosarcina species, encodes 22 genes, but their role in the infection process in combination with host genes has remained unknown. To study the infection process in detail, infected and uninfected M. mazei cultures were compared using dual-RNAseq, qRT-PCRs, and transmission electron microscopy (TEM). The transcriptome analysis strongly indicates a combined role of virus and host genes in replication, virus assembly, and lysis. Thereby, 285 host and virus genes were significantly regulated. Within these 285 regulated genes, a network of the viral polymerase, MetSVORF6, MetSVORF5, MetSVORF2, and the host genes encoding NrdD, NrdG, a CDC48 family protein, and a SSB protein with a role in viral replication was postulated. Ultrastructural analysis at 180 min p.i. revealed many infected cells with virus particles randomly scattered throughout the cytoplasm or attached at the cell surface, and membrane fragments indicating cell lysis. Dual-RNAseq and qRT-PCR analyses suggested a multifactorial lysis reaction in potential connection to the regulation of a cysteine proteinase, a pirin-like protein and a HicB-solo protein. Our study's results led to the first preliminary infection model of MetSV infecting M. mazei, summarizing the key infection steps as follows: replication, assembly, and host cell lysis.
Collapse
Affiliation(s)
- Finn O. Gehlert
- Institute for General Microbiology, Christian Albrechts University, 24118 Kiel, Germany
| | - Till Sauerwein
- ZB MED, Information Centre for Life Sciences, 50931 Cologne, Germany
| | - Katrin Weidenbach
- Institute for General Microbiology, Christian Albrechts University, 24118 Kiel, Germany
| | - Urska Repnik
- Central Microscopy, Christian Albrechts University, 24118 Kiel, Germany
| | - Daniela Hallack
- Institute for General Microbiology, Christian Albrechts University, 24118 Kiel, Germany
| | | | - Ruth A. Schmitz
- Institute for General Microbiology, Christian Albrechts University, 24118 Kiel, Germany
| |
Collapse
|
5
|
Ilyina TS. Adaptive Immunity Systems of Bacteria: Association with Self-Synthesizing Transposons, Polyfunctionality. MOLECULAR GENETICS, MICROBIOLOGY AND VIROLOGY 2022. [DOI: 10.3103/s0891416822030065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
6
|
Wang X, Yuan Q, Zhang W, Ji S, Lv Y, Ren K, Lu M, Xiao Y. Sequence specific integration by the family 1 casposase from Candidatus Nitrosopumilus koreensis AR1. Nucleic Acids Res 2021; 49:9938-9952. [PMID: 34428286 PMCID: PMC8464041 DOI: 10.1093/nar/gkab725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 08/05/2021] [Accepted: 08/10/2021] [Indexed: 11/13/2022] Open
Abstract
Casposase, a homolog of Cas1 integrase, is encoded by a superfamily of mobile genetic elements known as casposons. While family 2 casposase has been well documented in both function and structure, little is known about the other three casposase families. Here, we studied the family 1 casposase lacking the helix-turn-helix (HTH) domain from Candidatus Nitrosopumilus koreensis AR1 (Ca. N. koreensis). The determinants for integration by Ca. N. koreensis casposase were extensively investigated, and it was found that a 13-bp target site duplication (TSD) sequence, a minimal 3-bp leader and three different nucleotides of the TSD sequences are indispensable for target specific integration. Significantly, the casposase can site-specifically integrate a broad range of terminal inverted repeat (TIR)-derived oligonucleotides ranging from 7-nt to ∼4000-bp, and various oligonucleotides lacking the 5′-TTCTA-3′ motif at the 3′ end of TIR sequence can be integrated efficiently. Furthermore, similar to some Cas1 homologs, the casposase utilizes a 5′-ATAA-3′ motif in the TSD as a molecular ruler to dictate nucleophilic attack at 9-bp downstream of the end of the ruler during the spacer-side integration. By characterizing the family 1 Ca. N. koreensis casposase, we have extended our understanding on mechanistic similarities and evolutionary connections between casposons and the adaptation elements of CRISPR-Cas immunity.
Collapse
Affiliation(s)
- Xiaoke Wang
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing210009, China.,Department of Pharmacology, School of Pharmacy, China Pharmaceutical University, Nanjing210009, China
| | - Qinling Yuan
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing210009, China.,Department of Pharmacology, School of Pharmacy, China Pharmaceutical University, Nanjing210009, China
| | - Wenxuan Zhang
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing210009, China.,Department of Pharmacology, School of Pharmacy, China Pharmaceutical University, Nanjing210009, China
| | - Suyu Ji
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing210009, China.,Department of Pharmacology, School of Pharmacy, China Pharmaceutical University, Nanjing210009, China
| | - Yang Lv
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing210009, China.,Department of Pharmacology, School of Pharmacy, China Pharmaceutical University, Nanjing210009, China
| | - Kejing Ren
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing210009, China.,Department of Pharmacology, School of Pharmacy, China Pharmaceutical University, Nanjing210009, China
| | - Meiling Lu
- Department of Biochemistry, School of Life Science and Technology, China Pharmaceutical University, Nanjing210009, China
| | - Yibei Xiao
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing210009, China.,Department of Pharmacology, School of Pharmacy, China Pharmaceutical University, Nanjing210009, China
| |
Collapse
|
7
|
Zink IA, Wimmer E, Schleper C. Heavily Armed Ancestors: CRISPR Immunity and Applications in Archaea with a Comparative Analysis of CRISPR Types in Sulfolobales. Biomolecules 2020; 10:E1523. [PMID: 33172134 PMCID: PMC7694759 DOI: 10.3390/biom10111523] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 10/31/2020] [Accepted: 11/03/2020] [Indexed: 12/13/2022] Open
Abstract
Prokaryotes are constantly coping with attacks by viruses in their natural environments and therefore have evolved an impressive array of defense systems. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is an adaptive immune system found in the majority of archaea and about half of bacteria which stores pieces of infecting viral DNA as spacers in genomic CRISPR arrays to reuse them for specific virus destruction upon a second wave of infection. In detail, small CRISPR RNAs (crRNAs) are transcribed from CRISPR arrays and incorporated into type-specific CRISPR effector complexes which further degrade foreign nucleic acids complementary to the crRNA. This review gives an overview of CRISPR immunity to newcomers in the field and an update on CRISPR literature in archaea by comparing the functional mechanisms and abundances of the diverse CRISPR types. A bigger fraction is dedicated to the versatile and prevalent CRISPR type III systems, as tremendous progress has been made recently using archaeal models in discerning the controlled molecular mechanisms of their unique tripartite mode of action including RNA interference, DNA interference and the unique cyclic-oligoadenylate signaling that induces promiscuous RNA shredding by CARF-domain ribonucleases. The second half of the review spotlights CRISPR in archaea outlining seminal in vivo and in vitro studies in model organisms of the euryarchaeal and crenarchaeal phyla, including the application of CRISPR-Cas for genome editing and gene silencing. In the last section, a special focus is laid on members of the crenarchaeal hyperthermophilic order Sulfolobales by presenting a thorough comparative analysis about the distribution and abundance of CRISPR-Cas systems, including arrays and spacers as well as CRISPR-accessory proteins in all 53 genomes available to date. Interestingly, we find that CRISPR type III and the DNA-degrading CRISPR type I complexes co-exist in more than two thirds of these genomes. Furthermore, we identified ring nuclease candidates in all but two genomes and found that they generally co-exist with the above-mentioned CARF domain ribonucleases Csx1/Csm6. These observations, together with published literature allowed us to draft a working model of how CRISPR-Cas systems and accessory proteins cross talk to establish native CRISPR anti-virus immunity in a Sulfolobales cell.
Collapse
|
8
|
The CARF Protein MM_0565 Affects Transcription of the Casposon-Encoded cas1-solo Gene in Methanosarcina mazei Gö1. Biomolecules 2020; 10:biom10081161. [PMID: 32784796 PMCID: PMC7465815 DOI: 10.3390/biom10081161] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 08/04/2020] [Accepted: 08/04/2020] [Indexed: 12/25/2022] Open
Abstract
Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) loci are found in bacterial and archaeal genomes where they provide the molecular machinery for acquisition of immunity against foreign DNA. In addition to the cas genes fundamentally required for CRISPR activity, a second class of genes is associated with the CRISPR loci, of which many have no reported function in CRISPR-mediated immunity. Here, we characterize MM_0565 associated to the type I-B CRISPR-locus of Methanosarcina mazei Gö1. We show that purified MM_0565 composed of a CRISPR-Cas Associated Rossmann Fold (CARF) and a winged helix-turn-helix domain forms a dimer in solution; in vivo, the dimeric MM_0565 is strongly stabilized under high salt stress. While direct effects on CRISPR-Cas transcription were not detected by genetic approaches, specific binding of MM_0565 to the leader region of both CRISPR-Cas systems was observed by microscale thermophoresis and electromobility shift assays. Moreover, overexpression of MM_0565 strongly induced transcription of the cas1-solo gene located in the recently reported casposon, the gene product of which shows high similarity to classical Cas1 proteins. Based on our findings, and taking the absence of the expressed CRISPR locus-encoded Cas1 protein into account, we hypothesize that MM_0565 might modulate the activity of the CRISPR systems on different levels.
Collapse
|
9
|
Flament-Simon SC, de Toro M, Chuprikova L, Blanco M, Moreno-González J, Salas M, Blanco J, Redrejo-Rodríguez M. High diversity and variability of pipolins among a wide range of pathogenic Escherichia coli strains. Sci Rep 2020; 10:12452. [PMID: 32719405 PMCID: PMC7385651 DOI: 10.1038/s41598-020-69356-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 07/01/2020] [Indexed: 12/24/2022] Open
Abstract
Self-synthesizing transposons are integrative mobile genetic elements (MGEs) that encode their own B-family DNA polymerase (PolB). Discovered a few years ago, they are proposed as key players in the evolution of several groups of DNA viruses and virus–host interaction machinery. Pipolins are the most recent addition to the group, are integrated in the genomes of bacteria from diverse phyla and also present as circular plasmids in mitochondria. Remarkably, pipolins-encoded PolBs are proficient DNA polymerases endowed with DNA priming capacity, hence the name, primer-independent PolB (piPolB). We have now surveyed the presence of pipolins in a collection of 2,238 human and animal pathogenic Escherichia coli strains and found that, although detected in only 25 positive isolates (1.1%), they are present in E. coli strains from a wide variety of pathotypes, serotypes, phylogenetic groups and sequence types. Overall, the pangenome of strains carrying pipolins is highly diverse, despite the fact that a considerable number of strains belong to only three clonal complexes (CC10, CC23 and CC32). Comparative analysis with a set of 67 additional pipolin-harboring genomes from GenBank database spanning strains from diverse origin, further confirmed these results. The genetic structure of pipolins shows great flexibility and variability, with the piPolB gene and the attachment sites being the only common features. Most pipolins contain one or more recombinases that would be involved in excision/integration of the element in the same conserved tRNA gene. This mobilization mechanism might explain the apparent incompatibility of pipolins with other integrative MGEs such as integrons. In addition, analysis of cophylogeny between pipolins and pipolin-harboring strains showed a lack of congruence between several pipolins and their host strains, in agreement with horizontal transfer between hosts. Overall, these results indicate that pipolins can serve as a vehicle for genetic transfer among circulating E. coli and possibly also among other pathogenic bacteria.
Collapse
Affiliation(s)
- Saskia-Camille Flament-Simon
- Laboratorio de Referencia de E. Coli (LREC), Departamento de Microbiología y Parasitología, Facultad de Veterinaria, Universidad de Santiago de Compostela (USC), 27002, Lugo, Spain
| | - María de Toro
- Plataforma de Genómica y Bioinformática, CIBIR (Centro de Investigación Biomédica de La Rioja), La Rioja, 26006, Logroño, Spain
| | - Liubov Chuprikova
- Departamento de Bioquímica & Instituto de Investigaciones Biomédicas "Alberto Sols" CSIC-UAM, Universidad Autónoma de Madrid (UAM), 28029, Madrid, Spain
| | - Miguel Blanco
- Laboratorio de Referencia de E. Coli (LREC), Departamento de Microbiología y Parasitología, Facultad de Veterinaria, Universidad de Santiago de Compostela (USC), 27002, Lugo, Spain
| | - Juan Moreno-González
- Centro de Biología Molecular Severo Ochoa, Consejo Superior de Investigaciones Científicas-Universidad Autónoma de Madrid, 28049, Madrid, Spain
| | - Margarita Salas
- Centro de Biología Molecular Severo Ochoa, Consejo Superior de Investigaciones Científicas-Universidad Autónoma de Madrid, 28049, Madrid, Spain
| | - Jorge Blanco
- Laboratorio de Referencia de E. Coli (LREC), Departamento de Microbiología y Parasitología, Facultad de Veterinaria, Universidad de Santiago de Compostela (USC), 27002, Lugo, Spain
| | - Modesto Redrejo-Rodríguez
- Centro de Biología Molecular Severo Ochoa, Consejo Superior de Investigaciones Científicas-Universidad Autónoma de Madrid, 28049, Madrid, Spain. .,Departamento de Bioquímica & Instituto de Investigaciones Biomédicas "Alberto Sols" CSIC-UAM, Universidad Autónoma de Madrid (UAM), 28029, Madrid, Spain.
| |
Collapse
|
10
|
Wu R, Chai B, Cole JR, Gunturu SK, Guo X, Tian R, Gu JD, Zhou J, Tiedje JM. Targeted assemblies of cas1 suggest CRISPR-Cas's response to soil warming. ISME JOURNAL 2020; 14:1651-1662. [PMID: 32221408 PMCID: PMC7305122 DOI: 10.1038/s41396-020-0635-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 03/03/2020] [Accepted: 03/16/2020] [Indexed: 02/06/2023]
Abstract
There is an increasing interest in the clustered regularly interspaced short palindromic repeats CRISPR-associated protein (CRISPR-Cas) system to reveal potential virus–host dynamics. The universal and most conserved Cas protein, cas1 is an ideal marker to elucidate CRISPR-Cas ecology. We constructed eight Hidden Markov Models (HMMs) and assembled cas1 directly from metagenomes by a targeted-gene assembler, Xander, to improve detection capacity and resolve the diverse CRISPR-Cas systems. The eight HMMs were first validated by recovering all 17 cas1 subtypes from the simulated metagenome generated from 91 prokaryotic genomes across 11 phyla. We challenged the targeted method with 48 metagenomes from a tallgrass prairie in Central Oklahoma recovering 3394 cas1. Among those, 88 were near full length, 5 times more than in de-novo assemblies from the Oklahoma metagenomes. To validate the host assignment by cas1, the targeted-assembled cas1 was mapped to the de-novo assembled contigs. All the phylum assignments of those mapped contigs were assigned independent of CRISPR-Cas genes on the same contigs and consistent with the host taxonomies predicted by the mapped cas1. We then investigated whether 8 years of soil warming altered cas1 prevalence within the communities. A shift in microbial abundances was observed during the year with the biggest temperature differential (mean 4.16 °C above ambient). cas1 prevalence increased and even in the phyla with decreased microbial abundances over the next 3 years, suggesting increasing virus–host interactions in response to soil warming. This targeted method provides an alternative means to effectively mine cas1 from metagenomes and uncover the host communities.
Collapse
Affiliation(s)
- Ruonan Wu
- Laboratory of Environmental Microbiology and Toxicology, School of Biological Sciences, Faculty of Science, The University of Hong Kong, Hong Kong SAR, China.,Center for Microbial Ecology, Michigan State University, East Lansing, MI, USA
| | - Benli Chai
- Center for Microbial Ecology, Michigan State University, East Lansing, MI, USA
| | - James R Cole
- Center for Microbial Ecology, Michigan State University, East Lansing, MI, USA.,Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI, USA
| | - Santosh K Gunturu
- Center for Microbial Ecology, Michigan State University, East Lansing, MI, USA
| | - Xue Guo
- Department of Microbiology & Plant Biology, Institute for Environmental Genomics, and School of Civil Engineering and Environmental Sciences, University of Oklahoma, Norman, OK, USA.,State Key Joint Laboratory of Environment Simulation and Pollution Control, School of Environment, Tsinghua University, Beijing, China
| | - Renmao Tian
- Department of Microbiology & Plant Biology, Institute for Environmental Genomics, and School of Civil Engineering and Environmental Sciences, University of Oklahoma, Norman, OK, USA.,Institute for Food Safety and Health, Illinois Institute of Technology, Chicago, IL, USA
| | - Ji-Dong Gu
- Laboratory of Environmental Microbiology and Toxicology, School of Biological Sciences, Faculty of Science, The University of Hong Kong, Hong Kong SAR, China
| | - Jizhong Zhou
- Department of Microbiology & Plant Biology, Institute for Environmental Genomics, and School of Civil Engineering and Environmental Sciences, University of Oklahoma, Norman, OK, USA.,State Key Joint Laboratory of Environment Simulation and Pollution Control, School of Environment, Tsinghua University, Beijing, China.,Earth and Environmental Sciences, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - James M Tiedje
- Center for Microbial Ecology, Michigan State University, East Lansing, MI, USA. .,Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI, USA.
| |
Collapse
|
11
|
Béguin P, Chekli Y, Sezonov G, Forterre P, Krupovic M. Sequence motifs recognized by the casposon integrase of Aciduliprofundum boonei. Nucleic Acids Res 2020; 47:6386-6395. [PMID: 31114911 PMCID: PMC6614799 DOI: 10.1093/nar/gkz447] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2019] [Revised: 04/17/2019] [Accepted: 05/20/2019] [Indexed: 01/01/2023] Open
Abstract
Casposons are a group of bacterial and archaeal DNA transposons encoding a specific integrase, termed casposase, which is homologous to the Cas1 enzyme responsible for the integration of new spacers into CRISPR loci. Here, we characterized the sequence motifs recognized by the casposase from a thermophilic archaeon Aciduliprofundum boonei. We identified a stretch of residues, located in the leader region upstream of the actual integration site, whose deletion or mutagenesis impaired the concerted integration reaction. However, deletions of two-thirds of the target site were fully functional. Various single-stranded 6-FAM-labelled oligonucleotides derived from casposon terminal inverted repeats were as efficiently incorporated as duplexes into the target site. This result suggests that, as in the case of spacer insertion by the CRISPR Cas1–Cas2 integrase, casposon integration involves splaying of the casposon termini, with single-stranded ends being the actual substrates. The sequence critical for incorporation was limited to the five terminal residues derived from the 3′ end of the casposon. Furthermore, we characterize the casposase from Nitrosopumilus koreensis, a marine member of the phylum Thaumarchaeota, and show that it shares similar properties with the A. boonei enzyme, despite belonging to a different family. These findings further reinforce the mechanistic similarities and evolutionary connection between the casposons and the adaptation module of the CRISPR–Cas systems.
Collapse
Affiliation(s)
- Pierre Béguin
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25-28 rue du Dr. Roux 75724 Paris cedex 15, France
| | - Yankel Chekli
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25-28 rue du Dr. Roux 75724 Paris cedex 15, France
| | - Guennadi Sezonov
- UMRS 1138 - Centre de Recherche des Cordeliers, Sorbonne Université, 15, rue de l'École de Médecine, 75006 Paris, France
| | - Patrick Forterre
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25-28 rue du Dr. Roux 75724 Paris cedex 15, France.,Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris- Sud, Université Paris-Saclay, Gif-sur-Yvette cedex, Paris, France
| | - Mart Krupovic
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25-28 rue du Dr. Roux 75724 Paris cedex 15, France
| |
Collapse
|
12
|
Hickman AB, Kailasan S, Genzor P, Haase AD, Dyda F. Casposase structure and the mechanistic link between DNA transposition and spacer acquisition by CRISPR-Cas. eLife 2020; 9:50004. [PMID: 31913120 PMCID: PMC6977970 DOI: 10.7554/elife.50004] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 01/08/2020] [Indexed: 12/17/2022] Open
Abstract
Key to CRISPR-Cas adaptive immunity is maintaining an ongoing record of invading nucleic acids, a process carried out by the Cas1-Cas2 complex that integrates short segments of foreign genetic material (spacers) into the CRISPR locus. It is hypothesized that Cas1 evolved from casposases, a novel class of transposases. We show here that the Methanosarcina mazei casposase can integrate varied forms of the casposon end in vitro, and recapitulates several properties of CRISPR-Cas integrases including site-specificity. The X-ray structure of the casposase bound to DNA representing the product of integration reveals a tetramer with target DNA bound snugly between two dimers in which single-stranded casposon end binding resembles that of spacer 3'-overhangs. The differences between transposase and CRISPR-Cas integrase are largely architectural, and it appears that evolutionary change involved changes in protein-protein interactions to favor Cas2 binding over tetramerization; this in turn led to preferred integration of single spacers over two transposon ends.
Collapse
Affiliation(s)
- Alison B Hickman
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, United States
| | - Shweta Kailasan
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, United States
| | - Pavol Genzor
- Laboratory of Cell and Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, United States
| | - Astrid D Haase
- Laboratory of Cell and Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, United States
| | - Fred Dyda
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, United States
| |
Collapse
|
13
|
Koonin EV, Makarova KS. Origins and evolution of CRISPR-Cas systems. Philos Trans R Soc Lond B Biol Sci 2019; 374:20180087. [PMID: 30905284 PMCID: PMC6452270 DOI: 10.1098/rstb.2018.0087] [Citation(s) in RCA: 203] [Impact Index Per Article: 40.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/24/2018] [Indexed: 12/11/2022] Open
Abstract
CRISPR-Cas, the bacterial and archaeal adaptive immunity systems, encompass a complex machinery that integrates fragments of foreign nucleic acids, mostly from mobile genetic elements (MGE), into CRISPR arrays embedded in microbial genomes. Transcripts of the inserted segments (spacers) are employed by CRISPR-Cas systems as guide (g)RNAs for recognition and inactivation of the cognate targets. The CRISPR-Cas systems consist of distinct adaptation and effector modules whose evolutionary trajectories appear to be at least partially independent. Comparative genome analysis reveals the origin of the adaptation module from casposons, a distinct type of transposons, which employ a homologue of Cas1 protein, the integrase responsible for the spacer incorporation into CRISPR arrays, as the transposase. The origin of the effector module(s) is far less clear. The CRISPR-Cas systems are partitioned into two classes, class 1 with multisubunit effectors, and class 2 in which the effector consists of a single, large protein. The class 2 effectors originate from nucleases encoded by different MGE, whereas the origin of the class 1 effector complexes remains murky. However, the recent discovery of a signalling pathway built into the type III systems of class 1 might offer a clue, suggesting that type III effector modules could have evolved from a signal transduction system involved in stress-induced programmed cell death. The subsequent evolution of the class 1 effector complexes through serial gene duplication and displacement, primarily of genes for proteins containing RNA recognition motif domains, can be hypothetically reconstructed. In addition to the multiple contributions of MGE to the evolution of CRISPR-Cas, the reverse flow of information is notable, namely, recruitment of minimalist variants of CRISPR-Cas systems by MGE for functions that remain to be elucidated. Here, we attempt a synthesis of the diverse threads that shed light on CRISPR-Cas origins and evolution. This article is part of a discussion meeting issue 'The ecology and evolution of prokaryotic CRISPR-Cas adaptive immune systems'.
Collapse
Affiliation(s)
- Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA
| | | |
Collapse
|
14
|
Krupovic M, Makarova KS, Wolf YI, Medvedeva S, Prangishvili D, Forterre P, Koonin EV. Integrated mobile genetic elements in Thaumarchaeota. Environ Microbiol 2019; 21:2056-2078. [PMID: 30773816 PMCID: PMC6563490 DOI: 10.1111/1462-2920.14564] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 02/10/2019] [Accepted: 02/13/2019] [Indexed: 12/20/2022]
Abstract
To explore the diversity of mobile genetic elements (MGE) associated with archaea of the phylum Thaumarchaeota, we exploited the property of most MGE to integrate into the genomes of their hosts. Integrated MGE (iMGE) were identified in 20 thaumarchaeal genomes amounting to 2 Mbp of mobile thaumarchaeal DNA. These iMGE group into five major classes: (i) proviruses, (ii) casposons, (iii) insertion sequence-like transposons, (iv) integrative-conjugative elements and (v) cryptic integrated elements. The majority of the iMGE belong to the latter category and might represent novel families of viruses or plasmids. The identified proviruses are related to tailed viruses of the order Caudovirales and to tailless icosahedral viruses with the double jelly-roll capsid proteins. The thaumarchaeal iMGE are all connected within a gene sharing network, highlighting pervasive gene exchange between MGE occupying the same ecological niche. The thaumarchaeal mobilome carries multiple auxiliary metabolic genes, including multicopper oxidases and ammonia monooxygenase subunit C (AmoC), and stress response genes, such as those for universal stress response proteins (UspA). Thus, iMGE might make important contributions to the fitness and adaptation of their hosts. We identified several iMGE carrying type I-B CRISPR-Cas systems and spacers matching other thaumarchaeal iMGE, suggesting antagonistic interactions between coexisting MGE and symbiotic relationships with the ir archaeal hosts.
Collapse
Affiliation(s)
- Mart Krupovic
- Institut Pasteur, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, 75015, Paris, France
| | - Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA
| | - Sofia Medvedeva
- Institut Pasteur, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, 75015, Paris, France.,Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, Russia.,Sorbonne Université, Collège doctoral, 75005, Paris, France
| | - David Prangishvili
- Institut Pasteur, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, 75015, Paris, France
| | - Patrick Forterre
- Institut Pasteur, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, 75015, Paris, France.,Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris- Sud, Université Paris-Saclay, Gif-sur-Yvette cedex, Paris, France
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA
| |
Collapse
|
15
|
Petitjean C, Makarova KS, Wolf YI, Koonin EV. Extreme Deviations from Expected Evolutionary Rates in Archaeal Protein Families. Genome Biol Evol 2018; 9:2791-2811. [PMID: 28985292 PMCID: PMC5737733 DOI: 10.1093/gbe/evx189] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/12/2017] [Indexed: 02/07/2023] Open
Abstract
Origin of new biological functions is a complex phenomenon ranging from single-nucleotide substitutions to the gain of new genes via horizontal gene transfer or duplication. Neofunctionalization and subfunctionalization of proteins is often attributed to the emergence of paralogs that are subject to relaxed purifying selection or positive selection and thus evolve at accelerated rates. Such phenomena potentially could be detected as anomalies in the phylogenies of the respective gene families. We developed a computational pipeline to search for such anomalies in 1,834 orthologous clusters of archaeal genes, focusing on lineage-specific subfamilies that significantly deviate from the expected rate of evolution. Multiple potential cases of neofunctionalization and subfunctionalization were identified, including some ancient, house-keeping gene families, such as ribosomal protein S10, general transcription factor TFIIB and chaperone Hsp20. As expected, many cases of apparent acceleration of evolution are associated with lineage-specific gene duplication. On other occasions, long branches in phylogenetic trees correspond to horizontal gene transfer across long evolutionary distances. Significant deceleration of evolution is less common than acceleration, and the underlying causes are not well understood; functional shifts accompanied by increased constraints could be involved. Many gene families appear to be “highly evolvable,” that is, include both long and short branches. Even in the absence of precise functional predictions, this approach allows one to select targets for experimentation in search of new biology.
Collapse
Affiliation(s)
- Celine Petitjean
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
| | - Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
| |
Collapse
|
16
|
Yutin N, Bäckström D, Ettema TJG, Krupovic M, Koonin EV. Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis. Virol J 2018; 15:67. [PMID: 29636073 PMCID: PMC5894146 DOI: 10.1186/s12985-018-0974-y] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Accepted: 03/28/2018] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Analysis of metagenomic sequences has become the principal approach for the study of the diversity of viruses. Many recent, extensive metagenomic studies on several classes of viruses have dramatically expanded the visible part of the virosphere, showing that previously undetected viruses, or those that have been considered rare, actually are important components of the global virome. RESULTS We investigated the provenance of viruses related to tail-less bacteriophages of the family Tectiviridae by searching genomic and metagenomics sequence databases for distant homologs of the tectivirus-like Double Jelly-Roll major capsid proteins (DJR MCP). These searches resulted in the identification of numerous genomes of virus-like elements that are similar in size to tectiviruses (10-15 kilobases) and have diverse gene compositions. By comparison of the gene repertoires, the DJR MCP-encoding genomes were classified into 6 distinct groups that can be predicted to differ in reproduction strategies and host ranges. Only the DJR MCP gene that is present by design is shared by all these genomes, and most also encode a predicted DNA-packaging ATPase; the rest of the genes are present only in subgroups of this unexpectedly diverse collection of DJR MCP-encoding genomes. Only a minority encode a DNA polymerase which is a hallmark of the family Tectiviridae and the putative family "Autolykiviridae". Notably, one of the identified putative DJR MCP viruses encodes a homolog of Cas1 endonuclease, the integrase involved in CRISPR-Cas adaptation and integration of transposon-like elements called casposons. This is the first detected occurrence of Cas1 in a virus. Many of the identified elements are individual contigs flanked by inverted or direct repeats and appear to represent complete, extrachromosomal viral genomes, whereas others are flanked by bacterial genes and thus can be considered as proviruses. These contigs come from metagenomes of widely different environments, some dominated by archaea and others by bacteria, suggesting that collectively, the DJR MCP-encoding elements have a broad host range among prokaryotes. CONCLUSIONS The findings reported here greatly expand the known host range of (putative) viruses of bacteria and archaea that encode a DJR MCP. They also demonstrate the extreme diversity of genome architectures in these viruses that encode no universal proteins other than the capsid protein that was used as the marker for their identification. From a supposedly minor group of bacterial and archaeal viruses, these viruses are emerging as a substantial component of the prokaryotic virome.
Collapse
Affiliation(s)
- Natalya Yutin
- National Center for Biotechnology Information, National Library of Medicine. National Institutes of Health, Bethesda, MD, 20894, USA
| | - Disa Bäckström
- Department of Cell and Molecular Biology, Science for Life Laboratory, Uppsala University, Box 596, -75123, Uppsala, SE, Sweden
| | - Thijs J G Ettema
- Department of Cell and Molecular Biology, Science for Life Laboratory, Uppsala University, Box 596, -75123, Uppsala, SE, Sweden
| | - Mart Krupovic
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Institut Pasteur, Paris, France
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine. National Institutes of Health, Bethesda, MD, 20894, USA.
| |
Collapse
|
17
|
Ishino Y, Krupovic M, Forterre P. History of CRISPR-Cas from Encounter with a Mysterious Repeated Sequence to Genome Editing Technology. J Bacteriol 2018; 200:e00580-17. [PMID: 29358495 PMCID: PMC5847661 DOI: 10.1128/jb.00580-17] [Citation(s) in RCA: 206] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Clustered regularly interspaced short palindromic repeat (CRISPR)-Cas systems are well-known acquired immunity systems that are widespread in archaea and bacteria. The RNA-guided nucleases from CRISPR-Cas systems are currently regarded as the most reliable tools for genome editing and engineering. The first hint of their existence came in 1987, when an unusual repetitive DNA sequence, which subsequently was defined as a CRISPR, was discovered in the Escherichia coli genome during an analysis of genes involved in phosphate metabolism. Similar sequence patterns were then reported in a range of other bacteria as well as in halophilic archaea, suggesting an important role for such evolutionarily conserved clusters of repeated sequences. A critical step toward functional characterization of the CRISPR-Cas systems was the recognition of a link between CRISPRs and the associated Cas proteins, which were initially hypothesized to be involved in DNA repair in hyperthermophilic archaea. Comparative genomics, structural biology, and advanced biochemistry could then work hand in hand, not only culminating in the explosion of genome editing tools based on CRISPR-Cas9 and other class II CRISPR-Cas systems but also providing insights into the origin and evolution of this system from mobile genetic elements denoted casposons. To celebrate the 30th anniversary of the discovery of CRISPR, this minireview briefly discusses the fascinating history of CRISPR-Cas systems, from the original observation of an enigmatic sequence in E. coli to genome editing in humans.
Collapse
Affiliation(s)
- Yoshizumi Ishino
- Unité de Biologie Moléculaire du Gène Chez les Extrêmophiles, Département de Microbiologie, Institut Pasteur, Paris, France
- Department of Bioscience and Biotechnology, Faculty of Agriculture, Kyushu University, Fukuoka, Japan
| | - Mart Krupovic
- Unité de Biologie Moléculaire du Gène Chez les Extrêmophiles, Département de Microbiologie, Institut Pasteur, Paris, France
| | - Patrick Forterre
- Unité de Biologie Moléculaire du Gène Chez les Extrêmophiles, Département de Microbiologie, Institut Pasteur, Paris, France
- Institute of Integrative Cellular Biology, Université Paris Sud, Orsay, France
| |
Collapse
|
18
|
Koonin EV, Makarova KS. Mobile Genetic Elements and Evolution of CRISPR-Cas Systems: All the Way There and Back. Genome Biol Evol 2017; 9:2812-2825. [PMID: 28985291 PMCID: PMC5737515 DOI: 10.1093/gbe/evx192] [Citation(s) in RCA: 88] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/16/2017] [Indexed: 12/13/2022] Open
Abstract
The Clustered Regularly Interspaced Palindromic Repeats (CRISPR)-CRISPR-associated proteins (Cas) systems of bacterial and archaeal adaptive immunity show multifaceted evolutionary relationships with at least five classes of mobile genetic elements (MGE). First, the adaptation module of CRISPR-Cas that is responsible for the formation of the immune memory apparently evolved from a Casposon, a self-synthesizing transposon that employs the Cas1 protein as the integrase and might have brought additional cas genes to the emerging immunity loci. Second, a large subset of type III CRISPR-Cas systems recruited a reverse transcriptase from a Group II intron, providing for spacer acquisition from RNA. Third, effector nucleases of Class 2 CRISPR-Cas systems that are responsible for the recognition and cleavage of the target DNA were derived from transposon-encoded TnpB nucleases, most likely, on several independent occasions. Fourth, accessory nucleases in some variants of types I and III toxin and type VI effectors RNases appear to be ultimately derived from toxin nucleases of microbial toxin-antitoxin modules. Fifth, the opposite direction of evolution is manifested in the recruitment of CRISPR-Cas systems by a distinct family of Tn7-like transposons that probably exploit the capacity of CRISPR-Cas to recognize unique DNA sites to facilitate transposition as well as by bacteriophages that employ them to cope with host defense. Additionally, individual Cas proteins, such as the Cas4 nuclease, were recruited by bacteriophages and transposons. The two-sided evolutionary connection between CRISPR-Cas and MGE fits the "guns for hire" paradigm whereby homologous enzymatic machineries, in particular nucleases, are shuttled between MGE and defense systems and are used alternately as means of offense or defense.
Collapse
Affiliation(s)
- Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
| | - Kira S. Makarova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
| |
Collapse
|
19
|
Siguier P, Gourbeyre E, Chandler M. Known knowns, known unknowns and unknown unknowns in prokaryotic transposition. Curr Opin Microbiol 2017; 38:171-180. [PMID: 28683354 DOI: 10.1016/j.mib.2017.06.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2017] [Revised: 06/15/2017] [Accepted: 06/19/2017] [Indexed: 02/06/2023]
Abstract
Although the phenomenon of transposition has been known for over 60 years, its overarching importance in modifying and streamlining genomes took some time to recognize. In spite of a robust understanding of transposition of some TE, there remain a number of important TE groups with potential high genome impact and unknown transposition mechanisms and yet others, only recently identified by bioinformatics, yet to be formally confirmed as mobile. Here, we point to some areas of limited understanding concerning well established important TE groups with DDE Tpases, to address central gaps in our knowledge of characterised Tn with other types of Tpases and finally, to highlight new potentially mobile DNA species. It is not exhaustive. Examples have been chosen to provide encouragement in the continued exploration of the considerable prokaryotic mobilome especially in light of the current threat to public health posed by the spread of multiple AbR.
Collapse
Affiliation(s)
- Patricia Siguier
- Centre National de la Recherche Scientifique (CNRS), Toulouse, France
| | - Edith Gourbeyre
- Centre National de la Recherche Scientifique (CNRS), Toulouse, France
| | - Michael Chandler
- Centre National de la Recherche Scientifique (CNRS), Toulouse, France; Department of Biochem., Mol. and Cell. Biol. Georgetown University Medical Center, 3900 Reservoir Rd., Washington, DC 20057-1455, USA.
| |
Collapse
|
20
|
A decade of discovery: CRISPR functions and applications. Nat Microbiol 2017; 2:17092. [PMID: 28581505 DOI: 10.1038/nmicrobiol.2017.92] [Citation(s) in RCA: 186] [Impact Index Per Article: 26.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2016] [Accepted: 05/05/2017] [Indexed: 12/26/2022]
Abstract
This year marks the tenth anniversary of the identification of the biological function of CRISPR-Cas as adaptive immune systems in bacteria. In just a decade, the characterization of CRISPR-Cas systems has established a novel means of adaptive immunity in bacteria and archaea and deepened our understanding of the interplay between prokaryotes and their environment, and CRISPR-based molecular machines have been repurposed to enable a genome editing revolution. Here, we look back on the historical milestones that have paved the way for the discovery of CRISPR and its function, and discuss the related technological applications that have emerged, with a focus on microbiology. Lastly, we provide a perspective on the impacts the field has had on science and beyond.
Collapse
|
21
|
Krupovic M, Béguin P, Koonin EV. Casposons: mobile genetic elements that gave rise to the CRISPR-Cas adaptation machinery. Curr Opin Microbiol 2017; 38:36-43. [PMID: 28472712 DOI: 10.1016/j.mib.2017.04.004] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2017] [Revised: 04/04/2017] [Accepted: 04/12/2017] [Indexed: 01/26/2023]
Abstract
A casposon, a member of a distinct superfamily of archaeal and bacterial self-synthesizing transposons that employ a recombinase (casposase) homologous to the Cas1 endonuclease, appears to have given rise to the adaptation module of CRISPR-Cas systems as well as the CRISPR repeats themselves. Comparison of the mechanistic features of the reactions catalyzed by the casposase and the Cas1-Cas2 heterohexamer, the CRISPR integrase, reveals close similarity but also important differences that explain the requirement of Cas2 for integration of short DNA fragments, the CRISPR spacers.
Collapse
Affiliation(s)
- Mart Krupovic
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris, France.
| | - Pierre Béguin
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris, France
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA
| |
Collapse
|
22
|
Jackson SA, McKenzie RE, Fagerlund RD, Kieper SN, Fineran PC, Brouns SJJ. CRISPR-Cas: Adapting to change. Science 2017; 356:356/6333/eaal5056. [PMID: 28385959 DOI: 10.1126/science.aal5056] [Citation(s) in RCA: 249] [Impact Index Per Article: 35.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Bacteria and archaea are engaged in a constant arms race to defend against the ever-present threats of viruses and invasion by mobile genetic elements. The most flexible weapons in the prokaryotic defense arsenal are the CRISPR-Cas adaptive immune systems. These systems are capable of selective identification and neutralization of foreign DNA and/or RNA. CRISPR-Cas systems rely on stored genetic memories to facilitate target recognition. Thus, to keep pace with a changing pool of hostile invaders, the CRISPR memory banks must be regularly updated with new information through a process termed CRISPR adaptation. In this Review, we outline the recent advances in our understanding of the molecular mechanisms governing CRISPR adaptation. Specifically, the conserved protein machinery Cas1-Cas2 is the cornerstone of adaptive immunity in a range of diverse CRISPR-Cas systems.
Collapse
Affiliation(s)
- Simon A Jackson
- Department of Microbiology and Immunology, University of Otago, Post Office Box 56, Dunedin 9054, New Zealand
| | - Rebecca E McKenzie
- Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, Netherlands
| | - Robert D Fagerlund
- Department of Microbiology and Immunology, University of Otago, Post Office Box 56, Dunedin 9054, New Zealand
| | - Sebastian N Kieper
- Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, Netherlands
| | - Peter C Fineran
- Department of Microbiology and Immunology, University of Otago, Post Office Box 56, Dunedin 9054, New Zealand. .,Bio-Protection Research Centre, University of Otago, Post Office Box 56, Dunedin 9054, New Zealand
| | - Stan J J Brouns
- Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, Netherlands. .,Laboratory of Microbiology, Wageningen University, Wageningen, Netherlands
| |
Collapse
|
23
|
Arensburger P, Piégu B, Bigot Y. The future of transposable element annotation and their classification in the light of functional genomics - what we can learn from the fables of Jean de la Fontaine? Mob Genet Elements 2016; 6:e1256852. [PMID: 28090383 DOI: 10.1080/2159256x.2016.1256852] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2016] [Accepted: 10/31/2016] [Indexed: 12/19/2022] Open
Abstract
Transposable element (TE) science has been significantly influenced by the pioneering ideas of David Finnegan near the end of the last century, as well as by the classification systems that were subsequently developed. Today, whole genome TE annotation is mostly done using tools that were developed to aid gene annotation rather than to specifically study TEs. We argue that further progress in the TE field is impeded both by current TE classification schemes and by a failure to recognize that TE biology is fundamentally different from that of multicellular organisms. Novel genome wide TE annotation methods are helping to redefine our understanding of TE sequence origins and evolution. We briefly discuss some of these new methods as well as ideas for possible alternative classification schemes. Our hope is to encourage the formation of a society to organize a larger debate on these questions and to promote the adoption of standards for annotation and an improved TE classification.
Collapse
Affiliation(s)
- Peter Arensburger
- Biological Sciences Department, California State Polytechnic University , Pomona, CA, USA
| | - Benoît Piégu
- Physiologie de la reproduction et des Comportements, UMR INRA-CNRS 7247, PRC , Nouzilly, France
| | - Yves Bigot
- Physiologie de la reproduction et des Comportements, UMR INRA-CNRS 7247, PRC , Nouzilly, France
| |
Collapse
|
24
|
Westra ER, Dowling AJ, Broniewski JM, van Houte S. Evolution and Ecology of CRISPR. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2016. [DOI: 10.1146/annurev-ecolsys-121415-032428] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Edze R. Westra
- Environment and Sustainability Institute and Centre for Ecology and Conservation, Biosciences, University of Exeter, Tremough Campus, Penryn TR10 9FE, United Kingdom;
| | - Andrea J. Dowling
- Environment and Sustainability Institute and Centre for Ecology and Conservation, Biosciences, University of Exeter, Tremough Campus, Penryn TR10 9FE, United Kingdom;
| | - Jenny M. Broniewski
- Environment and Sustainability Institute and Centre for Ecology and Conservation, Biosciences, University of Exeter, Tremough Campus, Penryn TR10 9FE, United Kingdom;
| | - Stineke van Houte
- Environment and Sustainability Institute and Centre for Ecology and Conservation, Biosciences, University of Exeter, Tremough Campus, Penryn TR10 9FE, United Kingdom;
| |
Collapse
|
25
|
Béguin P, Charpin N, Koonin EV, Forterre P, Krupovic M. Casposon integration shows strong target site preference and recapitulates protospacer integration by CRISPR-Cas systems. Nucleic Acids Res 2016; 44:10367-10376. [PMID: 27655632 PMCID: PMC5137440 DOI: 10.1093/nar/gkw821] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Revised: 09/05/2016] [Accepted: 09/06/2016] [Indexed: 12/13/2022] Open
Abstract
Casposons are a recently discovered group of large DNA transposons present in diverse bacterial and archaeal genomes. For integration into the host chromosome, casposons employ an endonuclease that is homologous to the Cas1 protein involved in protospacer integration by the CRISPR-Cas adaptive immune system. Here we describe the site-preference of integration by the Cas1 integrase (casposase) encoded by the casposon of the archaeon Aciduliprofundum boonei. Oligonucleotide duplexes derived from the terminal inverted repeats (TIR) of the A. boonei casposon as well as mini-casposons flanked by the TIR inserted preferentially at a site reconstituting the original A. boonei target site. As in the A. boonei genome, the insertion was accompanied by a 15-bp direct target site duplication (TSD). The minimal functional target consisted of the 15-bp TSD segment and the adjacent 18-bp sequence which comprises the 3′ end of the tRNA-Pro gene corresponding to the TΨC loop. The functional casposase target site bears clear resemblance to the leader sequence-repeat junction which is the target for protospacer integration catalyzed by the Cas1–Cas2 adaptation module of CRISPR-Cas. These findings reinforce the mechanistic similarities and evolutionary connection between the casposons and the adaptation module of the prokaryotic adaptive immunity systems.
Collapse
Affiliation(s)
- Pierre Béguin
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris
| | - Nicole Charpin
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine Bethesda, MD 20894, USA
| | - Patrick Forterre
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris
| | - Mart Krupovic
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris
| |
Collapse
|
26
|
Mohanraju P, Makarova KS, Zetsche B, Zhang F, Koonin EV, van der Oost J. Diverse evolutionary roots and mechanistic variations of the CRISPR-Cas systems. Science 2016; 353:aad5147. [PMID: 27493190 DOI: 10.1126/science.aad5147] [Citation(s) in RCA: 407] [Impact Index Per Article: 50.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Adaptive immunity had been long thought of as an exclusive feature of animals. However, the discovery of the CRISPR-Cas defense system, present in almost half of prokaryotic genomes, proves otherwise. Because of the everlasting parasite-host arms race, CRISPR-Cas has rapidly evolved through horizontal transfer of complete loci or individual modules, resulting in extreme structural and functional diversity. CRISPR-Cas systems are divided into two distinct classes that each consist of three types and multiple subtypes. We discuss recent advances in CRISPR-Cas research that reveal elaborate molecular mechanisms and provide for a plausible scenario of CRISPR-Cas evolution. We also briefly describe the latest developments of a wide range of CRISPR-based applications.
Collapse
Affiliation(s)
- Prarthana Mohanraju
- Laboratory of Microbiology, Department of Agrotechnology and Food Sciences, Wageningen University, 6703 HB Wageningen, Netherlands
| | - Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA
| | - Bernd Zetsche
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Feng Zhang
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA
| | - John van der Oost
- Laboratory of Microbiology, Department of Agrotechnology and Food Sciences, Wageningen University, 6703 HB Wageningen, Netherlands.
| |
Collapse
|
27
|
Krupovic M, Koonin EV. Self-synthesizing transposons: unexpected key players in the evolution of viruses and defense systems. Curr Opin Microbiol 2016; 31:25-33. [PMID: 26836982 DOI: 10.1016/j.mib.2016.01.006] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2015] [Revised: 01/13/2016] [Accepted: 01/14/2016] [Indexed: 12/20/2022]
Abstract
Self-synthesizing transposons are the largest known transposable elements that encode their own DNA polymerases (DNAP). The Polinton/Maverick family of self-synthesizing transposons is widespread in eukaryotes and abundant in the genomes of some protists. In addition to the DNAP and a retrovirus-like integrase, most of the polintons encode homologs of the major and minor jelly-roll capsid proteins, DNA-packaging ATPase and capsid maturation protease. Therefore, polintons are predicted to alternate between the transposon and viral lifestyles although virion formation remains to be demonstrated. Polintons are related to a group of eukaryotic viruses known as virophages that parasitize on giant viruses of the family Mimiviridae and another recently identified putative family of polinton-like viruses (PLV) predicted to lead a similar, dual life style. Comparative genomic analysis of polintons, virophages, PLV and the other viruses with double-stranded (ds)DNA genomes infecting eukaryotes and prokaryotes suggests that the polintons evolved from bacterial tectiviruses and could have been the ancestors of a broad range of eukaryotic viruses including adenoviruses and members of the proposed order 'Megavirales' as well as linear cytoplasmic plasmids. Recently, a group of predicted self-synthesizing transposons was discovered also in prokaryotes. These elements, denoted casposons, encode a DNAP and a homolog of the CRISPR-associated Cas1 endonuclease that has an integrase activity but no capsid proteins. Thus, unlike polintons, casposons appear to be limited to the transposon life style although they could have evolved from viruses. The casposons are thought to have played a pivotal role in the origin of the prokaryotic adaptive immunity, giving rise to the adaptation module of the CRISPR-Cas systems.
Collapse
Affiliation(s)
- Mart Krupovic
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Institut Pasteur, Paris, France.
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| |
Collapse
|