1
|
History, evolution and classification of CRISPR-Cas associated systems. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2021; 179:11-76. [PMID: 33785174 DOI: 10.1016/bs.pmbts.2020.12.012] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
This chapter provides a detailed description of the history of CRISPR-Cas and its evolution into one of the most efficient genome-editing strategies. The chapter begins by providing information on early findings that were critical in deciphering the role of CRISPR-Cas associated systems in prokaryotes. It then describes how CRISPR-Cas had been evolved into an efficient genome-editing strategy. In the subsequent section, latest developments in the genome-editing approaches based on CRISPR-Cas are discussed. The chapter ends with the recent classification and possible evolution of CRISPR-Cas systems.
Collapse
|
2
|
Jablonska J, Matelska D, Steczkiewicz K, Ginalski K. Systematic classification of the His-Me finger superfamily. Nucleic Acids Res 2017; 45:11479-11494. [PMID: 29040665 PMCID: PMC5714182 DOI: 10.1093/nar/gkx924] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Accepted: 09/29/2017] [Indexed: 02/06/2023] Open
Abstract
The His-Me finger endonucleases, also known as HNH or ββα-metal endonucleases, form a large and diverse protein superfamily. The His-Me finger domain can be found in proteins that play an essential role in cells, including genome maintenance, intron homing, host defense and target offense. Its overall structural compactness and non-specificity make it a perfectly-tailored pathogenic module that participates on both sides of inter- and intra-organismal competition. An extremely low sequence similarity across the superfamily makes it difficult to identify and classify new His-Me fingers. Using state-of-the-art distant homology detection methods, we provide an updated and systematic classification of His-Me finger proteins. In this work, we identified over 100 000 proteins and clustered them into 38 groups, of which three groups are new and cannot be found in any existing public domain database of protein families. Based on an analysis of sequences, structures, domain architectures, and genomic contexts, we provide a careful functional annotation of the poorly characterized members of this superfamily. Our results may inspire further experimental investigations that should address the predicted activity and clarify the potential substrates, to provide more detailed insights into the fundamental biological roles of these proteins.
Collapse
Affiliation(s)
- Jagoda Jablonska
- Laboratory of Bioinformatics and Systems Biology, Centre of New Technologies, University of Warsaw, Zwirki i Wigury 93, 02-089 Warsaw, Poland
| | - Dorota Matelska
- Laboratory of Bioinformatics and Systems Biology, Centre of New Technologies, University of Warsaw, Zwirki i Wigury 93, 02-089 Warsaw, Poland
| | - Kamil Steczkiewicz
- Laboratory of Bioinformatics and Systems Biology, Centre of New Technologies, University of Warsaw, Zwirki i Wigury 93, 02-089 Warsaw, Poland
| | - Krzysztof Ginalski
- Laboratory of Bioinformatics and Systems Biology, Centre of New Technologies, University of Warsaw, Zwirki i Wigury 93, 02-089 Warsaw, Poland
| |
Collapse
|
3
|
Gholizadeh P, Aghazadeh M, Asgharzadeh M, Kafil HS. Suppressing the CRISPR/Cas adaptive immune system in bacterial infections. Eur J Clin Microbiol Infect Dis 2017; 36:2043-2051. [PMID: 28601970 DOI: 10.1007/s10096-017-3036-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2017] [Accepted: 05/31/2017] [Indexed: 12/26/2022]
Abstract
Clustered regularly interspaced short palindromic repeats (CRISPR) coupled with CRISPR-associated (Cas) proteins (CRISPR/Cas) are the adaptive immune system of eubacteria and archaebacteria. This system provides protection of bacteria against invading foreign DNA, such as transposons, bacteriophages and plasmids. Three-stage processes in this system for immunity against foreign DNAs are defined as adaptation, expression and interference. Recent studies suggested a correlation between the interfering of the CRISPR/Cas locus, acquisition of antibiotic resistance and pathogenicity island. In this review article, we demonstrate and discuss the CRISPR/Cas system's roles in interference with acquisition of antibiotic resistance and pathogenicity island in some eubacteria. Totally, these systems function as the adaptive immune system of bacteria against invading foreign DNA, blocking the acquisition of antibiotic resistance and virulence factor, detecting serotypes, indirect effects of CRISPR self-targeting, associating with physiological functions, associating with infections in humans at the transmission stage, interfering with natural transformation, a tool for genome editing in genome engineering, monitoring foodborne pathogens etc. These results showed that the CRISPR/Cas system might prevent the emergence of virulence both in vitro and in vivo. Moreover, this system was shown to be a strong selective pressure for the acquisition of antibiotic resistance and virulence factor in bacterial pathogens.
Collapse
Affiliation(s)
- P Gholizadeh
- Hematology and Oncology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.,Student Research Committee, Tabriz University of Medical Sciences, Tabriz, Iran
| | - M Aghazadeh
- Biotechnology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - M Asgharzadeh
- Infectious and Tropical Disease Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - H S Kafil
- Drug Applied Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.
| |
Collapse
|
4
|
Elimination of inter-domain interactions increases the cleavage fidelity of the restriction endonuclease DraIII. Protein Cell 2014; 5:357-68. [PMID: 24733184 PMCID: PMC3996161 DOI: 10.1007/s13238-014-0038-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2014] [Accepted: 02/18/2014] [Indexed: 11/24/2022] Open
Abstract
DraIII is a type IIP restriction endonucleases (REases) that recognizes and creates a double strand break within the gapped palindromic sequence CAC↑NNN↓GTG of double-stranded DNA (↑ indicates nicking on the bottom strand; ↓ indicates nicking on the top strand). However, wild type DraIII shows significant star activity. In this study, it was found that the prominent star site is CAT↑GTT↓GTG, consisting of a star 5′ half (CAT) and a canonical 3′ half (GTG). DraIII nicks the 3′ canonical half site at a faster rate than the 5′ star half site, in contrast to the similar rate with the canonical full site. The crystal structure of the DraIII protein was solved. It indicated, as supported by mutagenesis, that DraIII possesses a ββα-metal HNH active site. The structure revealed extensive intra-molecular interactions between the N-terminal domain and the C-terminal domain containing the HNH active site. Disruptions of these interactions through site-directed mutagenesis drastically increased cleavage fidelity. The understanding of fidelity mechanisms will enable generation of high fidelity REases.
Collapse
|
5
|
Kleinstiver BP, Wolfs JM, Edgell DR. The monomeric GIY-YIG homing endonuclease I-BmoI uses a molecular anchor and a flexible tether to sequentially nick DNA. Nucleic Acids Res 2013; 41:5413-27. [PMID: 23558745 PMCID: PMC3664794 DOI: 10.1093/nar/gkt186] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The GIY-YIG nuclease domain is found within protein scaffolds that participate in diverse cellular pathways and contains a single active site that hydrolyzes DNA by a one-metal ion mechanism. GIY-YIG homing endonucleases (GIY-HEs) are two-domain proteins with N-terminal GIY-YIG nuclease domains connected to C-terminal DNA-binding and they are thought to function as monomers. Using I-BmoI as a model GIY-HE, we test mechanisms by which the single active site is used to generate a double-strand break. We show that I-BmoI is partially disordered in the absence of substrate, and that the GIY-YIG domain alone has weak affinity for DNA. Significantly, we show that I-BmoI functions as a monomer at all steps of the reaction pathway and does not transiently dimerize or use sequential transesterification reactions to cleave substrate. Our results are consistent with the I-BmoI DNA-binding domain acting as a molecular anchor to tether the GIY-YIG domain to substrate, permitting rotation of the GIY-YIG domain to sequentially nick each DNA strand. These data highlight the mechanistic differences between monomeric GIY-HEs and dimeric or tetrameric GIY-YIG restriction enzymes, and they have implications for the use of the GIY-YIG domain in genome-editing applications.
Collapse
Affiliation(s)
- Benjamin P Kleinstiver
- Department of Biochemistry, Schulich School of Medicine and Dentistry, Western University, London, Ontario N6A 5C1, Canada
| | | | | |
Collapse
|
6
|
Czene A, Németh E, Zóka IG, Jakab-Simon NI, Körtvélyesi T, Nagata K, Christensen HEM, Gyurcsik B. The role of the N-terminal loop in the function of the colicin E7 nuclease domain. J Biol Inorg Chem 2013; 18:309-21. [DOI: 10.1007/s00775-013-0975-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2012] [Accepted: 12/31/2012] [Indexed: 01/10/2023]
|
7
|
Towards artificial metallonucleases for gene therapy: recent advances and new perspectives. Future Med Chem 2011; 3:1935-66. [DOI: 10.4155/fmc.11.139] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
The process of DNA targeting or repair of mutated genes within the cell, induced by specifically positioned double-strand cleavage of DNA near the mutated sequence, can be applied for gene therapy of monogenic diseases. For this purpose, highly specific artificial metallonucleases are developed. They are expected to be important future tools of modern genetics. The present state of art and strategies of research are summarized, including protein engineering and artificial ‘chemical’ nucleases. From the results, we learn about the basic role of the metal ions and the various ligands, and about the DNA binding and cleavage mechanism. The results collected provide useful guidance for engineering highly controlled enzymes for use in gene therapy.
Collapse
|
8
|
Makarova KS, Aravind L, Wolf YI, Koonin EV. Unification of Cas protein families and a simple scenario for the origin and evolution of CRISPR-Cas systems. Biol Direct 2011; 6:38. [PMID: 21756346 PMCID: PMC3150331 DOI: 10.1186/1745-6150-6-38] [Citation(s) in RCA: 335] [Impact Index Per Article: 25.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2011] [Accepted: 07/14/2011] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND The CRISPR-Cas adaptive immunity systems that are present in most Archaea and many Bacteria function by incorporating fragments of alien genomes into specific genomic loci, transcribing the inserts and using the transcripts as guide RNAs to destroy the genome of the cognate virus or plasmid. This RNA interference-like immune response is mediated by numerous, diverse and rapidly evolving Cas (CRISPR-associated) proteins, several of which form the Cascade complex involved in the processing of CRISPR transcripts and cleavage of the target DNA. Comparative analysis of the Cas protein sequences and structures led to the classification of the CRISPR-Cas systems into three Types (I, II and III). RESULTS A detailed comparison of the available sequences and structures of Cas proteins revealed several unnoticed homologous relationships. The Repeat-Associated Mysterious Proteins (RAMPs) containing a distinct form of the RNA Recognition Motif (RRM) domain, which are major components of the CRISPR-Cas systems, were classified into three large groups, Cas5, Cas6 and Cas7. Each of these groups includes many previously uncharacterized proteins now shown to adopt the RAMP structure. Evidence is presented that large subunits contained in most of the CRISPR-Cas systems could be homologous to Cas10 proteins which contain a polymerase-like Palm domain and are predicted to be enzymatically active in Type III CRISPR-Cas systems but inactivated in Type I systems. These findings, the fact that the CRISPR polymerases, RAMPs and Cas2 all contain core RRM domains, and distinct gene arrangements in the three types of CRISPR-Cas systems together provide for a simple scenario for origin and evolution of the CRISPR-Cas machinery. Under this scenario, the CRISPR-Cas system originated in thermophilic Archaea and subsequently spread horizontally among prokaryotes. CONCLUSIONS Because of the extreme diversity of CRISPR-Cas systems, in-depth sequence and structure comparison continue to reveal unexpected homologous relationship among Cas proteins. Unification of Cas protein families previously considered unrelated provides for improvement in the classification of CRISPR-Cas systems and a reconstruction of their evolution. OPEN PEER REVIEW This article was reviewed by Malcolm White (nominated by Purficacion Lopez-Garcia), Frank Eisenhaber and Igor Zhulin. For the full reviews, see the Reviewers' Comments section.
Collapse
Affiliation(s)
- Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
9
|
Abstract
The CRISPR-Cas (clustered regularly interspaced short palindromic repeats-CRISPR-associated proteins) modules are adaptive immunity systems that are present in many archaea and bacteria. These defence systems are encoded by operons that have an extraordinarily diverse architecture and a high rate of evolution for both the cas genes and the unique spacer content. Here, we provide an updated analysis of the evolutionary relationships between CRISPR-Cas systems and Cas proteins. Three major types of CRISPR-Cas system are delineated, with a further division into several subtypes and a few chimeric variants. Given the complexity of the genomic architectures and the extremely dynamic evolution of the CRISPR-Cas systems, a unified classification of these systems should be based on multiple criteria. Accordingly, we propose a 'polythetic' classification that integrates the phylogenies of the most common cas genes, the sequence and organization of the CRISPR repeats and the architecture of the CRISPR-cas loci.
Collapse
|
10
|
Abstract
Nucleases cleave the phosphodiester bonds of nucleic acids and may be endo or exo, DNase or RNase, topoisomerases, recombinases, ribozymes, or RNA splicing enzymes. In this review, I survey nuclease activities with known structures and catalytic machinery and classify them by reaction mechanism and metal-ion dependence and by their biological function ranging from DNA replication, recombination, repair, RNA maturation, processing, interference, to defense, nutrient regeneration or cell death. Several general principles emerge from this analysis. There is little correlation between catalytic mechanism and biological function. A single catalytic mechanism can be adapted in a variety of reactions and biological pathways. Conversely, a single biological process can often be accomplished by multiple tertiary and quaternary folds and by more than one catalytic mechanism. Two-metal-ion-dependent nucleases comprise the largest number of different tertiary folds and mediate the most diverse set of biological functions. Metal-ion-dependent cleavage is exclusively associated with exonucleases producing mononucleotides and endonucleases that cleave double- or single-stranded substrates in helical and base-stacked conformations. All metal-ion-independent RNases generate 2',3'-cyclic phosphate products, and all metal-ion-independent DNases form phospho-protein intermediates. I also find several previously unnoted relationships between different nucleases and shared catalytic configurations.
Collapse
|
11
|
Shen BW, Heiter DF, Chan SH, Wang H, Xu SY, Morgan RD, Wilson GG, Stoddard BL. Unusual target site disruption by the rare-cutting HNH restriction endonuclease PacI. Structure 2010; 18:734-43. [PMID: 20541511 DOI: 10.1016/j.str.2010.03.009] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2010] [Revised: 03/25/2010] [Accepted: 03/27/2010] [Indexed: 01/31/2023]
Abstract
The crystal structure of the rare-cutting HNH restriction endonuclease PacI in complex with its eight-base-pair target recognition sequence 5'-TTAATTAA-3' has been determined to 1.9 A resolution. The enzyme forms an extended homodimer, with each subunit containing two zinc-bound motifs surrounding a betabetaalpha-metal catalytic site. The latter is unusual in that a tyrosine residue likely initiates strand cleavage. PacI dramatically distorts its target sequence from Watson-Crick duplex DNA base pairing, with every base separated from its original partner. Two bases on each strand are unpaired, four are engaged in noncanonical A:A and T:T base pairs, and the remaining two bases are matched with new Watson-Crick partners. This represents a highly unusual DNA binding mechanism for a restriction endonuclease, and implies that initial recognition of the target site might involve significantly different contacts from those visualized in the DNA-bound cocrystal structures.
Collapse
Affiliation(s)
- Betty W Shen
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N. A3-025, Seattle, WA 98109, USA
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Vasu K, Saravanan M, Rajendra BVRN, Nagaraja V. Generation of a Manganese Specific Restriction Endonuclease with Nicking Activity. Biochemistry 2010; 49:8425-33. [DOI: 10.1021/bi101035k] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Kommireddy Vasu
- Department of Microbiology and Cell Biology, Indian Institute of Science, Bangalore 560012, India
| | - Matheshwaran Saravanan
- Department of Microbiology and Cell Biology, Indian Institute of Science, Bangalore 560012, India
| | | | - Valakunja Nagaraja
- Department of Microbiology and Cell Biology, Indian Institute of Science, Bangalore 560012, India
- Jawaharlal Nehru Centre for Advanced Scientific Research, Bangalore 560012, India
| |
Collapse
|
13
|
Marcaida MJ, Muñoz IG, Blanco FJ, Prieto J, Montoya G. Homing endonucleases: from basics to therapeutic applications. Cell Mol Life Sci 2010; 67:727-48. [PMID: 19915993 PMCID: PMC11115532 DOI: 10.1007/s00018-009-0188-y] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2009] [Revised: 10/16/2009] [Accepted: 10/19/2009] [Indexed: 10/20/2022]
Abstract
Homing endonucleases (HE) are double-stranded DNAses that target large recognition sites (12-40 bp). HE-encoding sequences are usually embedded in either introns or inteins. Their recognition sites are extremely rare, with none or only a few of these sites present in a mammalian-sized genome. However, these enzymes, unlike standard restriction endonucleases, tolerate some sequence degeneracy within their recognition sequence. Several members of this enzyme family have been used as templates to engineer tools to cleave DNA sequences that differ from their original wild-type targets. These custom HEs can be used to stimulate double-strand break homologous recombination in cells, to induce the repair of defective genes with very low toxicity levels. The use of tailored HEs opens up new possibilities for gene therapy in patients with monogenic diseases that can be treated ex vivo. This review provides an overview of recent advances in this field.
Collapse
Affiliation(s)
- Maria J. Marcaida
- Macromolecular Crystallography Group, Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), c/Melchor Fdez. Almagro 3, 28029 Madrid, Spain
| | - Inés G. Muñoz
- Macromolecular Crystallography Group, Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), c/Melchor Fdez. Almagro 3, 28029 Madrid, Spain
| | - Francisco J. Blanco
- Ikerbasque Professor Structural Biology Unit, CIC bioGUNE, Parque Tecnológico de Vizcaya, 48160 Derio, Spain
| | - Jesús Prieto
- Macromolecular Crystallography Group, Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), c/Melchor Fdez. Almagro 3, 28029 Madrid, Spain
| | - Guillermo Montoya
- Macromolecular Crystallography Group, Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), c/Melchor Fdez. Almagro 3, 28029 Madrid, Spain
| |
Collapse
|
14
|
Chan SH, Opitz L, Higgins L, O'loane D, Xu SY. Cofactor requirement of HpyAV restriction endonuclease. PLoS One 2010; 5:e9071. [PMID: 20140205 PMCID: PMC2816704 DOI: 10.1371/journal.pone.0009071] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2009] [Accepted: 01/14/2010] [Indexed: 01/28/2023] Open
Abstract
Background Helicobacter pylori is the etiologic agent of common gastritis and a risk factor for gastric cancer. It is also one of the richest sources of Type II restriction-modification (R-M) systems in microorganisms. Principal Findings We have cloned, expressed and purified a new restriction endonuclease HpyAV from H. pylori strain 26695. We determined the HpyAV DNA recognition sequence and cleavage site as CCTTC 6/5. In addition, we found that HpyAV has a unique metal ion requirement: its cleavage activity is higher with transition metal ions than in Mg++. The special metal ion requirement of HpyAV can be attributed to the presence of a HNH catalytic site similar to ColE9 nuclease instead of the canonical PD-X-D/EXK catalytic site found in many other REases. Site-directed mutagenesis was carried out to verify the catalytic residues of HpyAV. Mutation of the conserved metal-binding Asn311 and His320 to alanine eliminated cleavage activity. HpyAV variant H295A displayed approximately 1% of wt activity. Conclusions/Significance Some HNH-type endonucleases have unique metal ion cofactor requirement for optimal activities. Homology modeling and site-directed mutagenesis confirmed that HpyAV is a member of the HNH nuclease family. The identification of catalytic residues in HpyAV paved the way for further engineering of the metal binding site. A survey of sequenced microbial genomes uncovered 10 putative R-M systems that show high sequence similarity to the HpyAV system, suggesting lateral transfer of a prototypic HpyAV-like R-M system among these microorganisms.
Collapse
Affiliation(s)
- Siu-Hong Chan
- Research Department, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
| | - Lars Opitz
- Research Department, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
| | - Lauren Higgins
- Research Department, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
| | - Diana O'loane
- Research Department, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
| | - Shuang-yong Xu
- Research Department, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
15
|
Type II restriction endonuclease R.Hpy188I belongs to the GIY-YIG nuclease superfamily, but exhibits an unusual active site. BMC STRUCTURAL BIOLOGY 2008; 8:48. [PMID: 19014591 PMCID: PMC2630997 DOI: 10.1186/1472-6807-8-48] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/06/2008] [Accepted: 11/14/2008] [Indexed: 11/10/2022]
Abstract
BACKGROUND Catalytic domains of Type II restriction endonucleases (REases) belong to a few unrelated three-dimensional folds. While the PD-(D/E)XK fold is most common among these enzymes, crystal structures have been also determined for single representatives of two other folds: PLD (R.BfiI) and half-pipe (R.PabI). Bioinformatics analyses supported by mutagenesis experiments suggested that some REases belong to the HNH fold (e.g. R.KpnI), and that a small group represented by R.Eco29kI belongs to the GIY-YIG fold. However, for a large fraction of REases with known sequences, the three-dimensional fold and the architecture of the active site remain unknown, mostly due to extreme sequence divergence that hampers detection of homology to enzymes with known folds. RESULTS R.Hpy188I is a Type II REase with unknown structure. PSI-BLAST searches of the non-redundant protein sequence database reveal only 1 homolog (R.HpyF17I, with nearly identical amino acid sequence and the same DNA sequence specificity). Standard application of state-of-the-art protein fold-recognition methods failed to predict the relationship of R.Hpy188I to proteins with known structure or to other protein families. In order to increase the amount of evolutionary information in the multiple sequence alignment, we have expanded our sequence database searches to include sequences from metagenomics projects. This search resulted in identification of 23 further members of R.Hpy188I family, both from metagenomics and the non-redundant database. Moreover, fold-recognition analysis of the extended R.Hpy188I family revealed its relationship to the GIY-YIG domain and allowed for computational modeling of the R.Hpy188I structure. Analysis of the R.Hpy188I model in the light of sequence conservation among its homologs revealed an unusual variant of the active site, in which the typical Tyr residue of the YIG half-motif had been substituted by a Lys residue. Moreover, some of its homologs have the otherwise invariant Arg residue in a non-homologous position in sequence that nonetheless allows for spatial conservation of the guanidino group potentially involved in phosphate binding. CONCLUSION The present study eliminates a significant "white spot" on the structural map of REases. It also provides important insight into sequence-structure-function relationships in the GIY-YIG nuclease superfamily. Our results reveal that in the case of proteins with no or few detectable homologs in the standard "non-redundant" database, it is useful to expand this database by adding the metagenomic sequences, which may provide evolutionary linkage to detect more remote homologs.
Collapse
|
16
|
Jakubauskas A, Sasnauskas G, Giedriene J, Janulaitis A. Domain organization and functional analysis of type IIS restriction endonuclease Eco31I. Biochemistry 2008; 47:8546-56. [PMID: 18642930 DOI: 10.1021/bi800660u] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Type IIS restriction endonuclease Eco31I harbors a single HNH active site and cleaves both DNA strands close to its recognition sequence, 5'-GGTCTC(1/5). A two-domain organization of Eco31I was determined by limited proteolysis. Analysis of proteolytic fragments revealed that the N-terminal domain of Eco31I is responsible for the specific DNA binding, while the C-terminal domain contains the HNH nuclease-like active site. Gel-shift and gel-filtration experiments revealed that a monomer of the N-terminal domain of Eco31I is able to bind a single copy of cognate DNA. However, in contrast to other studied type IIS enzymes, the isolated catalytic domain of Eco31I was inactive. Steady-state and transient kinetic analysis of Eco31I reactions was inconsistent with dimerization of Eco31I on DNA. Thus, we propose that Eco31I interacts with individual copies of its recognition sequence in its monomeric form and presumably remains a monomer as it cleaves both strands of double-stranded DNA. The domain organization and reaction mechanism established for Eco31I should be common for a group of evolutionary related type IIS restriction endonucleases Alw26I, BsaI, BsmAI, BsmBI and Esp3I that recognize DNA sequences bearing the common pentanucleotide 5'-GTCTC.
Collapse
|
17
|
Orlowski J, Bujnicki JM. Structural and evolutionary classification of Type II restriction enzymes based on theoretical and experimental analyses. Nucleic Acids Res 2008; 36:3552-69. [PMID: 18456708 PMCID: PMC2441816 DOI: 10.1093/nar/gkn175] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
For a very long time, Type II restriction enzymes (REases) have been a paradigm of ORFans: proteins with no detectable similarity to each other and to any other protein in the database, despite common cellular and biochemical function. Crystallographic analyses published until January 2008 provided high-resolution structures for only 28 of 1637 Type II REase sequences available in the Restriction Enzyme database (REBASE). Among these structures, all but two possess catalytic domains with the common PD-(D/E)XK nuclease fold. Two structures are unrelated to the others: R.BfiI exhibits the phospholipase D (PLD) fold, while R.PabI has a new fold termed 'half-pipe'. Thus far, bioinformatic studies supported by site-directed mutagenesis have extended the number of tentatively assigned REase folds to five (now including also GIY-YIG and HNH folds identified earlier in homing endonucleases) and provided structural predictions for dozens of REase sequences without experimentally solved structures. Here, we present a comprehensive study of all Type II REase sequences available in REBASE together with their homologs detectable in the nonredundant and environmental samples databases at the NCBI. We present the summary and critical evaluation of structural assignments and predictions reported earlier, new classification of all REase sequences into families, domain architecture analysis and new predictions of three-dimensional folds. Among 289 experimentally characterized (not putative) Type II REases, whose apparently full-length sequences are available in REBASE, we assign 199 (69%) to contain the PD-(D/E)XK domain. The HNH domain is the second most common, with 24 (8%) members. When putative REases are taken into account, the fraction of PD-(D/E)XK and HNH folds changes to 48% and 30%, respectively. Fifty-six characterized (and 521 predicted) REases remain unassigned to any of the five REase folds identified so far, and may exhibit new architectures. These enzymes are proposed as the most interesting targets for structure determination by high-resolution experimental methods. Our analysis provides the first comprehensive map of sequence-structure relationships among Type II REases and will help to focus the efforts of structural and functional genomics of this large and biotechnologically important class of enzymes.
Collapse
Affiliation(s)
- Jerzy Orlowski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | | |
Collapse
|
18
|
Gasiunas G, Sasnauskas G, Tamulaitis G, Urbanke C, Razaniene D, Siksnys V. Tetrameric restriction enzymes: expansion to the GIY-YIG nuclease family. Nucleic Acids Res 2007; 36:938-49. [PMID: 18086711 PMCID: PMC2241918 DOI: 10.1093/nar/gkm1090] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The GIY-YIG nuclease domain was originally identified in homing endonucleases and enzymes involved in DNA repair and recombination. Many of the GIY-YIG family enzymes are functional as monomers. We show here that the Cfr42I restriction endonuclease which belongs to the GIY-YIG family and recognizes the symmetric sequence 5′-CCGC/GG-3′ (‘/’ indicates the cleavage site) is a tetramer in solution. Moreover, biochemical and kinetic studies provided here demonstrate that the Cfr42I tetramer is catalytically active only upon simultaneous binding of two copies of its recognition sequence. In that respect Cfr42I resembles the homotetrameric Type IIF restriction enzymes that belong to the distinct PD-(E/D)XK nuclease superfamily. Unlike the PD-(E/D)XK enzymes, the GIY-YIG nuclease Cfr42I accommodates an extremely wide selection of metal-ion cofactors, including Mg2+, Mn2+, Co2+, Zn2+, Ni2+, Cu2+ and Ca2+. To our knowledge, Cfr42I is the first tetrameric GIY-YIG family enzyme. Similar structural arrangement and phenotypes displayed by restriction enzymes of the PD-(E/D)XK and GIY-YIG nuclease families point to the functional significance of tetramerization.
Collapse
Affiliation(s)
- Giedrius Gasiunas
- Institute of Biotechnology, Graiciuno 8, LT-02241 Vilnius, Lithuania
| | | | | | | | | | | |
Collapse
|