1
|
Tamulaitiene G, Manakova E, Jovaisaite V, Tamulaitis G, Grazulis S, Bochtler M, Siksnys V. Unique mechanism of target recognition by PfoI restriction endonuclease of the CCGG-family. Nucleic Acids Res 2019; 47:997-1010. [PMID: 30445642 PMCID: PMC6344858 DOI: 10.1093/nar/gky1137] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2018] [Accepted: 10/26/2018] [Indexed: 01/23/2023] Open
Abstract
Restriction endonucleases (REs) of the CCGG-family recognize a set of 4–8 bp target sequences that share a common CCGG or CCNGG core and possess PD…D/ExK nuclease fold. REs that interact with 5 bp sequence 5′-CCNGG flip the central N nucleotides and ‘compress’ the bound DNA to stack the inner base pairs to mimic the CCGG sequence. PfoI belongs to the CCGG-family and cleaves the 7 bp sequence 5′-T|CCNGGA ("|" designates cleavage position). We present here crystal structures of PfoI in free and DNA-bound forms that show unique active site arrangement and mechanism of sequence recognition. Structures and mutagenesis indicate that PfoI features a permuted E…ExD…K active site that differs from the consensus motif characteristic to other family members. Although PfoI also flips the central N nucleotides of the target sequence it does not ‘compress’ the bound DNA. Instead, PfoI induces a drastic change in DNA backbone conformation that shortens the distance between scissile phosphates to match that in the unperturbed CCGG sequence. Our data demonstrate the diversity and versatility of structural mechanisms employed by restriction enzymes for recognition of related DNA sequences.
Collapse
Affiliation(s)
- Giedre Tamulaitiene
- Institute of Biotechnology, Vilnius University, Sauletekio al. 7, LT-10257 Vilnius, Lithuania
| | - Elena Manakova
- Institute of Biotechnology, Vilnius University, Sauletekio al. 7, LT-10257 Vilnius, Lithuania
| | - Virginija Jovaisaite
- Institute of Biotechnology, Vilnius University, Sauletekio al. 7, LT-10257 Vilnius, Lithuania
| | - Gintautas Tamulaitis
- Institute of Biotechnology, Vilnius University, Sauletekio al. 7, LT-10257 Vilnius, Lithuania
| | - Saulius Grazulis
- Institute of Biotechnology, Vilnius University, Sauletekio al. 7, LT-10257 Vilnius, Lithuania
| | - Matthias Bochtler
- Laboratory of Structural Biology, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland.,Dept. of Bioinformatics, Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5a, 02-106 Warsaw, Poland
| | - Virginijus Siksnys
- Institute of Biotechnology, Vilnius University, Sauletekio al. 7, LT-10257 Vilnius, Lithuania
| |
Collapse
|
2
|
Pingoud A, Wilson GG, Wende W. Type II restriction endonucleases--a historical perspective and more. Nucleic Acids Res 2014; 42:7489-527. [PMID: 24878924 PMCID: PMC4081073 DOI: 10.1093/nar/gku447] [Citation(s) in RCA: 169] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Revised: 05/02/2014] [Accepted: 05/07/2014] [Indexed: 12/17/2022] Open
Abstract
This article continues the series of Surveys and Summaries on restriction endonucleases (REases) begun this year in Nucleic Acids Research. Here we discuss 'Type II' REases, the kind used for DNA analysis and cloning. We focus on their biochemistry: what they are, what they do, and how they do it. Type II REases are produced by prokaryotes to combat bacteriophages. With extreme accuracy, each recognizes a particular sequence in double-stranded DNA and cleaves at a fixed position within or nearby. The discoveries of these enzymes in the 1970s, and of the uses to which they could be put, have since impacted every corner of the life sciences. They became the enabling tools of molecular biology, genetics and biotechnology, and made analysis at the most fundamental levels routine. Hundreds of different REases have been discovered and are available commercially. Their genes have been cloned, sequenced and overexpressed. Most have been characterized to some extent, but few have been studied in depth. Here, we describe the original discoveries in this field, and the properties of the first Type II REases investigated. We discuss the mechanisms of sequence recognition and catalysis, and the varied oligomeric modes in which Type II REases act. We describe the surprising heterogeneity revealed by comparisons of their sequences and structures.
Collapse
Affiliation(s)
- Alfred Pingoud
- Institute of Biochemistry, Justus-Liebig-University Giessen, Heinrich-Buff-Ring 58, D-35392 Giessen, Germany
| | - Geoffrey G Wilson
- New England Biolabs Inc., 240 County Road, Ipswich, MA 01938-2723, USA
| | - Wolfgang Wende
- Institute of Biochemistry, Justus-Liebig-University Giessen, Heinrich-Buff-Ring 58, D-35392 Giessen, Germany
| |
Collapse
|
3
|
Skowronek K, Boniecki MJ, Kluge B, Bujnicki JM. Rational engineering of sequence specificity in R.MwoI restriction endonuclease. Nucleic Acids Res 2012; 40:8579-92. [PMID: 22735699 PMCID: PMC3458533 DOI: 10.1093/nar/gks570] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
R.MwoI is a Type II restriction endonucleases enzyme (REase), which specifically recognizes a palindromic interrupted DNA sequence 5′-GCNNNNNNNGC-3′ (where N indicates any nucleotide), and hydrolyzes the phosphodiester bond in the DNA between the 7th and 8th base in both strands. R.MwoI exhibits remote sequence similarity to R.BglI, a REase with known structure, which recognizes an interrupted palindromic target 5′-GCCNNNNNGGC-3′. A homology model of R.MwoI in complex with DNA was constructed and used to predict functionally important amino acid residues that were subsequently targeted by mutagenesis. The model, together with the supporting experimental data, revealed regions important for recognition of the common bases in DNA sequences recognized by R.BglI and R.MwoI. Based on the bioinformatics analysis, we designed substitutions of the S310 residue in R.MwoI to arginine or glutamic acid, which led to enzyme variants with altered sequence selectivity compared with the wild-type enzyme. The S310R variant of R.MwoI preferred the 5′-GCCNNNNNGGC-3′ sequence as a target, similarly to R.BglI, whereas the S310E variant preferentially cleaved a subset of the MwoI sites, depending on the identity of the 3rd and 9th nucleotide residues. Our results represent a case study of a REase sequence specificity alteration by a single amino acid substitution, based on a theoretical model in the absence of a crystal structure.
Collapse
Affiliation(s)
- Krzysztof Skowronek
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland.
| | | | | | | |
Collapse
|
4
|
Orlowski J, Bujnicki JM. Structural and evolutionary classification of Type II restriction enzymes based on theoretical and experimental analyses. Nucleic Acids Res 2008; 36:3552-69. [PMID: 18456708 PMCID: PMC2441816 DOI: 10.1093/nar/gkn175] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
For a very long time, Type II restriction enzymes (REases) have been a paradigm of ORFans: proteins with no detectable similarity to each other and to any other protein in the database, despite common cellular and biochemical function. Crystallographic analyses published until January 2008 provided high-resolution structures for only 28 of 1637 Type II REase sequences available in the Restriction Enzyme database (REBASE). Among these structures, all but two possess catalytic domains with the common PD-(D/E)XK nuclease fold. Two structures are unrelated to the others: R.BfiI exhibits the phospholipase D (PLD) fold, while R.PabI has a new fold termed 'half-pipe'. Thus far, bioinformatic studies supported by site-directed mutagenesis have extended the number of tentatively assigned REase folds to five (now including also GIY-YIG and HNH folds identified earlier in homing endonucleases) and provided structural predictions for dozens of REase sequences without experimentally solved structures. Here, we present a comprehensive study of all Type II REase sequences available in REBASE together with their homologs detectable in the nonredundant and environmental samples databases at the NCBI. We present the summary and critical evaluation of structural assignments and predictions reported earlier, new classification of all REase sequences into families, domain architecture analysis and new predictions of three-dimensional folds. Among 289 experimentally characterized (not putative) Type II REases, whose apparently full-length sequences are available in REBASE, we assign 199 (69%) to contain the PD-(D/E)XK domain. The HNH domain is the second most common, with 24 (8%) members. When putative REases are taken into account, the fraction of PD-(D/E)XK and HNH folds changes to 48% and 30%, respectively. Fifty-six characterized (and 521 predicted) REases remain unassigned to any of the five REase folds identified so far, and may exhibit new architectures. These enzymes are proposed as the most interesting targets for structure determination by high-resolution experimental methods. Our analysis provides the first comprehensive map of sequence-structure relationships among Type II REases and will help to focus the efforts of structural and functional genomics of this large and biotechnologically important class of enzymes.
Collapse
Affiliation(s)
- Jerzy Orlowski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | | |
Collapse
|
5
|
Kosinski J, Kubareva E, Bujnicki JM. A model of restriction endonuclease MvaI in complex with DNA: a template for interpretation of experimental data and a guide for specificity engineering. Proteins 2007; 68:324-36. [PMID: 17407166 DOI: 10.1002/prot.21460] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
R.MvaI is a Type II restriction enzyme (REase), which specifically recognizes the pentanucleotide DNA sequence 5'-CCWGG-3' (W indicates A or T). It belongs to a family of enzymes, which recognize related sequences, including 5'-CCSGG-3' (S indicates G or C) in the case of R.BcnI, or 5'-CCNGG-3' (where N indicates any nucleoside) in the case of R.ScrFI. REases from this family hydrolyze the phosphodiester bond in the DNA between the 2nd and 3rd base in both strands, thereby generating a double strand break with 5'-protruding single nucleotides. So far, no crystal structures of REases with similar cleavage patterns have been solved. Characterization of sequence-structure-function relationships in this family would facilitate understanding of evolution of sequence specificity among REases and could aid in engineering of enzymes with new specificities. However, sequences of R.MvaI or its homologs show no significant similarity to any proteins with known structures, thus precluding straightforward comparative modeling. We used a fold recognition approach to identify a remote relationship between R.MvaI and the structure of DNA repair enzyme MutH, which belongs to the PD-(D/E)XK superfamily together with many other REases. We constructed a homology model of R.MvaI and used it to predict functionally important amino acid residues and the mode of interaction with the DNA. In particular, we predict that only one active site of R.MvaI interacts with the DNA target at a time, and the cleavage of both strands (5'-CCAGG-3' and 5'-CCTGG-3') is achieved by two independent catalytic events. The model is in good agreement with the available experimental data and will serve as a template for further analyses of R.MvaI, R.BcnI, R.ScrFI and other related enzymes.
Collapse
Affiliation(s)
- Jan Kosinski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland.
| | | | | |
Collapse
|
6
|
Type II restriction endonuclease R.Eco29kI is a member of the GIY-YIG nuclease superfamily. BMC STRUCTURAL BIOLOGY 2007; 7:48. [PMID: 17626614 PMCID: PMC1952068 DOI: 10.1186/1472-6807-7-48] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/12/2007] [Accepted: 07/12/2007] [Indexed: 01/21/2023]
Abstract
Background The majority of experimentally determined crystal structures of Type II restriction endonucleases (REases) exhibit a common PD-(D/E)XK fold. Crystal structures have been also determined for single representatives of two other folds: PLD (R.BfiI) and half-pipe (R.PabI), and bioinformatics analyses supported by mutagenesis suggested that some REases belong to the HNH fold. Our previous bioinformatic analysis suggested that REase R.Eco29kI shares sequence similarities with one more unrelated nuclease superfamily, GIY-YIG, however so far no experimental data were available to support this prediction. The determination of a crystal structure of the GIY-YIG domain of homing endonuclease I-TevI provided a template for modeling of R.Eco29kI and prompted us to validate the model experimentally. Results Using protein fold-recognition methods we generated a new alignment between R.Eco29kI and I-TevI, which suggested a reassignment of one of the putative catalytic residues. A theoretical model of R.Eco29kI was constructed to illustrate its predicted three-dimensional fold and organization of the active site, comprising amino acid residues Y49, Y76, R104, H108, E142, and N154. A series of mutants was constructed to generate amino acid substitutions of selected residues (Y49A, R104A, H108F, E142A and N154L) and the mutant proteins were examined for their ability to bind the DNA containing the Eco29kI site 5'-CCGCGG-3' and to catalyze the cleavage reaction. Experimental data reveal that residues Y49, R104, E142, H108, and N154 are important for the nuclease activity of R.Eco29kI, while H108 and N154 are also important for specific DNA binding by this enzyme. Conclusion Substitutions of residues Y49, R104, H108, E142 and N154 predicted by the model to be a part of the active site lead to mutant proteins with strong defects in the REase activity. These results are in very good agreement with the structural model presented in this work and with our prediction that R.Eco29kI belongs to the GIY-YIG superfamily of nucleases. Our study provides the first experimental evidence for a Type IIP REase that does not belong to the PD-(D/E)XK or HNH superfamilies of nucleases, and is instead a member of the unrelated GIY-YIG superfamily.
Collapse
|
7
|
Sukackaite R, Lagunavicius A, Stankevicius K, Urbanke C, Venclovas Č, Siksnys V. Restriction endonuclease BpuJI specific for the 5'-CCCGT sequence is related to the archaeal Holliday junction resolvase family. Nucleic Acids Res 2007; 35:2377-89. [PMID: 17392342 PMCID: PMC1874659 DOI: 10.1093/nar/gkm164] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Type IIS restriction endonucleases (REases) recognize asymmetric DNA sequences and cleave both DNA strands at fixed positions downstream of the recognition site. REase BpuJI recognizes the asymmetric sequence 5′-CCCGT, however it cuts at multiple sites in the vicinity of the target sequence. We show that BpuJI is a dimer, which has two DNA binding surfaces and displays optimal catalytic activity when bound to two recognition sites. BpuJI is cleaved by chymotrypsin into an N-terminal domain (NTD), which lacks catalytic activity but binds specifically to the recognition sequence as a monomer, and a C-terminal domain (CTD), which forms a dimer with non-specific nuclease activity. Fold recognition approach reveals that the CTD of BpuJI is structurally related to archaeal Holliday junction resolvases (AHJR). We demonstrate that the isolated catalytic CTD of BpuJI possesses end-directed nuclease activity and preferentially cuts 3 nt from the 3′-terminus of blunt-ended DNA. The nuclease activity of the CTD is repressed in the apo-enzyme and becomes activated upon specific DNA binding by the NTDs. This leads to a complicated pattern of specific DNA cleavage in the vicinity of the target site. Bioinformatics analysis identifies the AHJR-like domain in the putative Type III enzymes and functionally uncharacterized proteins.
Collapse
Affiliation(s)
- Rasa Sukackaite
- Institute of Biotechnology, Graičiūno 8, LT-02241 Vilnius, Lithuania and Strukturanalyse, Medizinische Hochschule Hannover, Carl Neuberg Strasse 1, D-30632 Hannover, Germany
| | - Arunas Lagunavicius
- Institute of Biotechnology, Graičiūno 8, LT-02241 Vilnius, Lithuania and Strukturanalyse, Medizinische Hochschule Hannover, Carl Neuberg Strasse 1, D-30632 Hannover, Germany
| | - Kornelijus Stankevicius
- Institute of Biotechnology, Graičiūno 8, LT-02241 Vilnius, Lithuania and Strukturanalyse, Medizinische Hochschule Hannover, Carl Neuberg Strasse 1, D-30632 Hannover, Germany
| | - Claus Urbanke
- Institute of Biotechnology, Graičiūno 8, LT-02241 Vilnius, Lithuania and Strukturanalyse, Medizinische Hochschule Hannover, Carl Neuberg Strasse 1, D-30632 Hannover, Germany
| | - Česlovas Venclovas
- Institute of Biotechnology, Graičiūno 8, LT-02241 Vilnius, Lithuania and Strukturanalyse, Medizinische Hochschule Hannover, Carl Neuberg Strasse 1, D-30632 Hannover, Germany
| | - Virginijus Siksnys
- Institute of Biotechnology, Graičiūno 8, LT-02241 Vilnius, Lithuania and Strukturanalyse, Medizinische Hochschule Hannover, Carl Neuberg Strasse 1, D-30632 Hannover, Germany
- *To whom correspondence should be addressed.
| |
Collapse
|
8
|
Niv MY, Ripoll DR, Vila JA, Liwo A, Vanamee ES, Aggarwal AK, Weinstein H, Scheraga HA. Topology of Type II REases revisited; structural classes and the common conserved core. Nucleic Acids Res 2007; 35:2227-37. [PMID: 17369272 PMCID: PMC1874628 DOI: 10.1093/nar/gkm045] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Type II restriction endonucleases (REases) are deoxyribonucleases that cleave DNA sequences with remarkable specificity. Type II REases are highly divergent in sequence as well as in topology, i.e. the connectivity of secondary structure elements. A widely held assumption is that a structural core of five β-strands flanked by two α-helices is common to these enzymes. We introduce a systematic procedure to enumerate secondary structure elements in an unambiguous and reproducible way, and use it to analyze the currently available X-ray structures of Type II REases. Based on this analysis, we propose an alternative definition of the core, which we term the αβα-core. The αβα-core includes the most frequently observed secondary structure elements and is not a sandwich, as it consists of a five-strand β-sheet and two α-helices on the same face of the β-sheet. We use the αβα-core connectivity as a basis for grouping the Type II REases into distinct structural classes. In these new structural classes, the connectivity correlates with the angles between the secondary structure elements and with the cleavage patterns of the REases. We show that there exists a substructure of the αβα-core, namely a common conserved core, ccc, defined here as one α-helix and four β-strands common to all Type II REase of known structure.
Collapse
Affiliation(s)
- Masha Y Niv
- Department of Physiology and Biophysics, Weill Medical College of Cornell University, New York, NY 10021, USA.
| | | | | | | | | | | | | | | |
Collapse
|
9
|
Kaus-Drobek M, Czapinska H, Sokołowska M, Tamulaitis G, Szczepanowski RH, Urbanke C, Siksnys V, Bochtler M. Restriction endonuclease MvaI is a monomer that recognizes its target sequence asymmetrically. Nucleic Acids Res 2007; 35:2035-46. [PMID: 17344322 PMCID: PMC1874612 DOI: 10.1093/nar/gkm064] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Restriction endonuclease MvaI recognizes the sequence CC/WGG (W stands for A or T, '/' designates the cleavage site) and generates products with single nucleotide 5'-overhangs. The enzyme has been noted for its tolerance towards DNA modifications. Here, we report a biochemical characterization and crystal structures of MvaI in an apo-form and in a complex with target DNA at 1.5 A resolution. Our results show that MvaI is a monomer and recognizes its pseudosymmetric target sequence asymmetrically. The enzyme consists of two lobes. The catalytic lobe anchors the active site residues Glu36, Asp50, Glu55 and Lys57 and contacts the bases from the minor grove side. The recognition lobe mediates all major grove interactions with the bases. The enzyme in the crystal is bound to the strand with T at the center of the recognition sequence. The crystal structure with calcium ions and DNA mimics the prereactive state. MvaI shows structural similarities to BcnI, which cleaves the related sequence CC/SGG and to MutH enzyme, which is a component of the DNA repair machinery, and nicks one DNA strand instead of making a double-strand break.
Collapse
Affiliation(s)
- Magdalena Kaus-Drobek
- International Institute of Molecular and Cell Biology, ul. Trojdena 4, 02-109 Warsaw, Poland, Max-Planck-Institute for Molecular Cell Biology and Genetics, Pfotenhauerstr. 108, 01309 Dresden, Germany, Institute of Biotechnology, Graiciuno 8, LT-02241, Vilnius, Lithuania and Medizinische Hochschule, Abteilung Strukturanalyse OE 8830, Carl Neuberg Str. 1, 30625 Hannover, Germany
| | - Honorata Czapinska
- International Institute of Molecular and Cell Biology, ul. Trojdena 4, 02-109 Warsaw, Poland, Max-Planck-Institute for Molecular Cell Biology and Genetics, Pfotenhauerstr. 108, 01309 Dresden, Germany, Institute of Biotechnology, Graiciuno 8, LT-02241, Vilnius, Lithuania and Medizinische Hochschule, Abteilung Strukturanalyse OE 8830, Carl Neuberg Str. 1, 30625 Hannover, Germany
| | - Monika Sokołowska
- International Institute of Molecular and Cell Biology, ul. Trojdena 4, 02-109 Warsaw, Poland, Max-Planck-Institute for Molecular Cell Biology and Genetics, Pfotenhauerstr. 108, 01309 Dresden, Germany, Institute of Biotechnology, Graiciuno 8, LT-02241, Vilnius, Lithuania and Medizinische Hochschule, Abteilung Strukturanalyse OE 8830, Carl Neuberg Str. 1, 30625 Hannover, Germany
| | - Gintautas Tamulaitis
- International Institute of Molecular and Cell Biology, ul. Trojdena 4, 02-109 Warsaw, Poland, Max-Planck-Institute for Molecular Cell Biology and Genetics, Pfotenhauerstr. 108, 01309 Dresden, Germany, Institute of Biotechnology, Graiciuno 8, LT-02241, Vilnius, Lithuania and Medizinische Hochschule, Abteilung Strukturanalyse OE 8830, Carl Neuberg Str. 1, 30625 Hannover, Germany
| | - Roman H. Szczepanowski
- International Institute of Molecular and Cell Biology, ul. Trojdena 4, 02-109 Warsaw, Poland, Max-Planck-Institute for Molecular Cell Biology and Genetics, Pfotenhauerstr. 108, 01309 Dresden, Germany, Institute of Biotechnology, Graiciuno 8, LT-02241, Vilnius, Lithuania and Medizinische Hochschule, Abteilung Strukturanalyse OE 8830, Carl Neuberg Str. 1, 30625 Hannover, Germany
| | - Claus Urbanke
- International Institute of Molecular and Cell Biology, ul. Trojdena 4, 02-109 Warsaw, Poland, Max-Planck-Institute for Molecular Cell Biology and Genetics, Pfotenhauerstr. 108, 01309 Dresden, Germany, Institute of Biotechnology, Graiciuno 8, LT-02241, Vilnius, Lithuania and Medizinische Hochschule, Abteilung Strukturanalyse OE 8830, Carl Neuberg Str. 1, 30625 Hannover, Germany
| | - Virginijus Siksnys
- International Institute of Molecular and Cell Biology, ul. Trojdena 4, 02-109 Warsaw, Poland, Max-Planck-Institute for Molecular Cell Biology and Genetics, Pfotenhauerstr. 108, 01309 Dresden, Germany, Institute of Biotechnology, Graiciuno 8, LT-02241, Vilnius, Lithuania and Medizinische Hochschule, Abteilung Strukturanalyse OE 8830, Carl Neuberg Str. 1, 30625 Hannover, Germany
| | - Matthias Bochtler
- International Institute of Molecular and Cell Biology, ul. Trojdena 4, 02-109 Warsaw, Poland, Max-Planck-Institute for Molecular Cell Biology and Genetics, Pfotenhauerstr. 108, 01309 Dresden, Germany, Institute of Biotechnology, Graiciuno 8, LT-02241, Vilnius, Lithuania and Medizinische Hochschule, Abteilung Strukturanalyse OE 8830, Carl Neuberg Str. 1, 30625 Hannover, Germany
- *To whom correspondence should be addressed. 0048 22 59707320048 22 5970715
| |
Collapse
|
10
|
Miyazono KI, Watanabe M, Kosinski J, Ishikawa K, Kamo M, Sawasaki T, Nagata K, Bujnicki JM, Endo Y, Tanokura M, Kobayashi I. Novel protein fold discovered in the PabI family of restriction enzymes. Nucleic Acids Res 2007; 35:1908-18. [PMID: 17332011 PMCID: PMC1874622 DOI: 10.1093/nar/gkm091] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2006] [Revised: 02/01/2007] [Accepted: 02/01/2007] [Indexed: 11/14/2022] Open
Abstract
Although structures of many DNA-binding proteins have been solved, they fall into a limited number of folds. Here, we describe an approach that led to the finding of a novel DNA-binding fold. Based on the behavior of Type II restriction-modification gene complexes as mobile elements, our earlier work identified a restriction enzyme, R.PabI, and its cognate modification enzyme in Pyrococcus abyssi through comparison of closely related genomes. While the modification methyltransferase was easily recognized, R.PabI was predicted to have a novel 3D structure. We expressed cytotoxic R.PabI in a wheat-germ-based cell-free translation system and determined its crystal structure. R.PabI turned out to adopt a novel protein fold. Homodimeric R.PabI has a curved anti-parallel beta-sheet that forms a 'half pipe'. Mutational and in silico DNA-binding analyses have assigned it as the double-strand DNA-binding site. Unlike most restriction enzymes analyzed, R.PabI is able to cleave DNA in the absence of Mg(2+). These results demonstrate the value of genome comparison and the wheat-germ-based system in finding a novel DNA-binding motif in mobile DNases and, in general, a novel protein fold in horizontally transferred genes.
Collapse
Affiliation(s)
- Ken-ichi Miyazono
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Miki Watanabe
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Jan Kosinski
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Ken Ishikawa
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Masayuki Kamo
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Tatsuya Sawasaki
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Koji Nagata
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Janusz M. Bujnicki
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Yaeta Endo
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Masaru Tanokura
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| | - Ichizo Kobayashi
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, 113-8657, Japan, Department of Medical Genome Sciences, Graduate School of Frontier Science, University of Tokyo, Tokyo 108-8639, Japan, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Department of Applied Chemistry, Faculty of Engineering, Ehime University, Matsuyama 790-8577, Japan, Graduate Program in Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo 108-8639, Japan and Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan
| |
Collapse
|
11
|
Dunin-Horkawicz S, Feder M, Bujnicki JM. Phylogenomic analysis of the GIY-YIG nuclease superfamily. BMC Genomics 2006; 7:98. [PMID: 16646971 PMCID: PMC1564403 DOI: 10.1186/1471-2164-7-98] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2006] [Accepted: 04/28/2006] [Indexed: 11/28/2022] Open
Abstract
Background The GIY-YIG domain was initially identified in homing endonucleases and later in other selfish mobile genetic elements (including restriction enzymes and non-LTR retrotransposons) and in enzymes involved in DNA repair and recombination. However, to date no systematic search for novel members of the GIY-YIG superfamily or comparative analysis of these enzymes has been reported. Results We carried out database searches to identify all members of known GIY-YIG nuclease families. Multiple sequence alignments together with predicted secondary structures of identified families were represented as Hidden Markov Models (HMM) and compared by the HHsearch method to the uncharacterized protein families gathered in the COG, KOG, and PFAM databases. This analysis allowed for extending the GIY-YIG superfamily to include members of COG3680 and a number of proteins not classified in COGs and to predict that these proteins may function as nucleases, potentially involved in DNA recombination and/or repair. Finally, all old and new members of the GIY-YIG superfamily were compared and analyzed to infer the phylogenetic tree. Conclusion An evolutionary classification of the GIY-YIG superfamily is presented for the very first time, along with the structural annotation of all (sub)families. It provides a comprehensive picture of sequence-structure-function relationships in this superfamily of nucleases, which will help to design experiments to study the mechanism of action of known members (especially the uncharacterized ones) and will facilitate the prediction of function for the newly discovered ones.
Collapse
Affiliation(s)
- Stanislaw Dunin-Horkawicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | - Marcin Feder
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| |
Collapse
|
12
|
Skowronek KJ, Kosinski J, Bujnicki JM. Theoretical model of restriction endonuclease HpaI in complex with DNA, predicted by fold recognition and validated by site-directed mutagenesis. Proteins 2006; 63:1059-68. [PMID: 16498623 DOI: 10.1002/prot.20920] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Type II restriction enzymes are commercially important deoxyribonucleases and very attractive targets for protein engineering of new specificities. At the same time they are a very challenging test bed for protein structure prediction methods. Typically, enzymes that recognize different sequences show little or no amino acid sequence similarity to each other and to other proteins. Based on crystallographic analyses that revealed the same PD-(D/E)XK fold for more than a dozen case studies, they were nevertheless considered to be related until the combination of bioinformatics and mutational analyses has demonstrated that some of these proteins belong to other, unrelated folds PLD, HNH, and GIY-YIG. As a part of a large-scale project aiming at identification of a three-dimensional fold for all type II REases with known sequences (currently approximately 1000 proteins), we carried out preliminary structure prediction and selected candidates for experimental validation. Here, we present the analysis of HpaI REase, an ORFan with no detectable homologs, for which we detected a structural template by protein fold recognition, constructed a model using the FRankenstein monster approach and identified a number of residues important for the DNA binding and catalysis. These predictions were confirmed by site-directed mutagenesis and in vitro analysis of the mutant proteins. The experimentally validated model of HpaI will serve as a low-resolution structural platform for evolutionary considerations in the subgroup of blunt-cutting REases with different specificities. The research protocol developed in the course of this work represents a streamlined version of the previously used techniques and can be used in a high-throughput fashion to build and validate models for other enzymes, especially ORFans that exhibit no sequence similarity to any other protein in the database.
Collapse
|
13
|
Tamulaitis G, Mucke M, Siksnys V. Biochemical and mutational analysis ofEcoRII functional domains reveals evolutionary links between restriction enzymes. FEBS Lett 2006; 580:1665-71. [PMID: 16497303 DOI: 10.1016/j.febslet.2006.02.010] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2005] [Revised: 01/16/2006] [Accepted: 02/07/2006] [Indexed: 11/23/2022]
Abstract
The archetypal Type IIE restriction endonuclease EcoRII is a dimer that has a modular structure. DNA binding studies indicate that the isolated C-terminal domain dimer has an interface that binds a single cognate DNA molecule whereas the N-terminal domain is a monomer that also binds a single copy of cognate DNA. Hence, the full-length EcoRII contains three putative DNA binding interfaces: one at the C-terminal domain dimer and two at each of the N-terminal domains. Mutational analysis indicates that the C-terminal domain shares conserved active site architecture and DNA binding elements with the tetrameric restriction enzyme NgoMIV. Data provided here suggest possible evolutionary relationships between different subfamilies of restriction enzymes.
Collapse
|
14
|
Grazulis S, Manakova E, Roessle M, Bochtler M, Tamulaitiene G, Huber R, Siksnys V. Structure of the metal-independent restriction enzyme BfiI reveals fusion of a specific DNA-binding domain with a nonspecific nuclease. Proc Natl Acad Sci U S A 2005; 102:15797-802. [PMID: 16247004 PMCID: PMC1266039 DOI: 10.1073/pnas.0507949102] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Among all restriction endonucleases known to date, BfiI is unique in cleaving DNA in the absence of metal ions. BfiI represents a different evolutionary lineage of restriction enzymes, as shown by its crystal structure at 1.9-A resolution. The protein consists of two structural domains. The N-terminal catalytic domain is similar to Nuc, an EDTA-resistant nuclease from the phospholipase D superfamily. The C-terminal DNA-binding domain of BfiI exhibits a beta-barrel-like structure very similar to the effector DNA-binding domain of the Mg(2+)-dependent restriction enzyme EcoRII and to the B3-like DNA-binding domain of plant transcription factors. BfiI presumably evolved through domain fusion of a DNA-recognition element to a nonspecific nuclease akin to Nuc and elaborated a mechanism to limit DNA cleavage to a single double-strand break near the specific recognition sequence. The crystal structure suggests that the interdomain linker may act as an autoinhibitor controlling BfiI catalytic activity in the absence of a specific DNA sequence. A psi-blast search identified a BfiI homologue in a Mesorhizobium sp. BNC1 bacteria strain, a plant symbiont isolated from an EDTA-rich environment.
Collapse
Affiliation(s)
- Saulius Grazulis
- Laboratory of Protein-DNA Interaction, Institute of Biotechnology, Graiciuno 8, LT-02241 Vilnius, Lithuania.
| | | | | | | | | | | | | |
Collapse
|
15
|
Sacchi CT, Alber D, Dull P, Mothershed EA, Whitney AM, Barnett GA, Popovic T, Mayer LW. High level of sequence diversity in the 16S rRNA genes of Haemophilus influenzae isolates is useful for molecular subtyping. J Clin Microbiol 2005; 43:3734-42. [PMID: 16081903 PMCID: PMC1233939 DOI: 10.1128/jcm.43.8.3734-3742.2005] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
A molecular typing method based on the 16S rRNA sequence diversity was developed for Haemophilus influenzae isolates. A total of 330 H. influenzae isolates were analyzed, representing a diverse collection of U.S. isolates. We found a high level of 16S rRNA sequence heterogeneity (up to 2.73%) and observed an exclusive correlation between 16S types and serotypes (a to f); no 16S type was found in more than one serotype. Similarly, no multilocus sequence typing (MLST) sequence type (ST) was found in more than one serotype. Our 16S typing and MLST results are in agreement with those of previous studies showing that serotypable H. influenzae isolates behave as highly clonal populations and emphasize the lack of clonality of nontypable (NT) H. influenzae isolates. There was not a 1:1 correlation between 16S types and STs, but all H. influenzae serotypable isolates clustered similarly. This correlation was not observed for NT H. influenzae; the two methods clustered NT H. influenzae isolates differently. 16S rRNA gene sequencing alone provides a level of discrimination similar to that obtained with the analysis of seven genes for MLST. We demonstrated that 16S typing is an additional and complementary approach to MLST, particularly for NT H. influenzae isolates, and is potentially useful for outbreak investigation.
Collapse
Affiliation(s)
- Claudio T Sacchi
- Meningitis and Special Pathogens Branch, Division of Bacterial and Mycotic Diseases, National Center for Infectious Diseases, CDC, MS D-11, 1600 Clifton Road, NE, Atlanta, GA 30333, USA.
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Kosinski J, Feder M, Bujnicki JM. The PD-(D/E)XK superfamily revisited: identification of new members among proteins involved in DNA metabolism and functional predictions for domains of (hitherto) unknown function. BMC Bioinformatics 2005; 6:172. [PMID: 16011798 PMCID: PMC1189080 DOI: 10.1186/1471-2105-6-172] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2005] [Accepted: 07/12/2005] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND The PD-(D/E)XK nuclease superfamily, initially identified in type II restriction endonucleases and later in many enzymes involved in DNA recombination and repair, is one of the most challenging targets for protein sequence analysis and structure prediction. Typically, the sequence similarity between these proteins is so low, that most of the relationships between known members of the PD-(D/E)XK superfamily were identified only after the corresponding structures were determined experimentally. Thus, it is tempting to speculate that among the uncharacterized protein families, there are potential nucleases that remain to be discovered, but their identification requires more sensitive tools than traditional PSI-BLAST searches. RESULTS The low degree of amino acid conservation hampers the possibility of identification of new members of the PD-(D/E)XK superfamily based solely on sequence comparisons to known members. Therefore, we used a recently developed method HHsearch for sensitive detection of remote similarities between protein families represented as profile Hidden Markov Models enhanced by secondary structure. We carried out a comparison of known families of PD-(D/E)XK nucleases to the database comprising the COG and PFAM profiles corresponding to both functionally characterized as well as uncharacterized protein families to detect significant similarities. The initial candidates for new nucleases were subsequently verified by sequence-structure threading, comparative modeling, and identification of potential active site residues. CONCLUSION In this article, we report identification of the PD-(D/E)XK nuclease domain in numerous proteins implicated in interactions with DNA but with unknown structure and mechanism of action (such as putative recombinase RmuC, DNA competence factor CoiA, a DNA-binding protein SfsA, a large human protein predicted to be a DNA repair enzyme, predicted archaeal transcription regulators, and the head completion protein of phage T4) and in proteins for which no function was assigned to date (such as YhcG, various phage proteins, novel candidates for restriction enzymes). Our results contributes to the reduction of "white spaces" on the sequence-structure-function map of the protein universe and will help to jump-start the experimental characterization of new nucleases, of which many may be of importance for the complete understanding of mechanisms that govern the evolution and stability of the genome.
Collapse
Affiliation(s)
- Jan Kosinski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, PL-02-109 Warsaw, Poland
| | - Marcin Feder
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, PL-02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, PL-02-109 Warsaw, Poland
| |
Collapse
|
17
|
Chmiel AA, Radlinska M, Pawlak SD, Krowarsch D, Bujnicki JM, Skowronek KJ. A theoretical model of restriction endonuclease NlaIV in complex with DNA, predicted by fold recognition and validated by site-directed mutagenesis and circular dichroism spectroscopy. Protein Eng Des Sel 2005; 18:181-9. [PMID: 15849215 DOI: 10.1093/protein/gzi019] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Restriction enzymes (REases) are commercial reagents commonly used in DNA manipulations and mapping. They are regarded as very attractive models for studying protein-DNA interactions and valuable targets for protein engineering. Their amino acid sequences usually show no similarities to other proteins, with rare exceptions of other REases that recognize identical or very similar sequences. Hence, they are extremely hard targets for structure prediction and modeling. NlaIV is a Type II REase, which recognizes the interrupted palindromic sequence GGNNCC (where N indicates any base) and cleaves it in the middle, leaving blunt ends. NlaIV shows no sequence similarity to other proteins and virtually nothing is known about its sequence-structure-function relationships. Using protein fold recognition, we identified a remote relationship between NlaIV and EcoRV, an extensively studied REase, which recognizes the GATATC sequence and whose crystal structure has been determined. Using the 'FRankenstein's monster' approach we constructed a comparative model of NlaIV based on the EcoRV template and used it to predict the catalytic and DNA-binding residues. The model was validated by site-directed mutagenesis and analysis of the activity of the mutants in vivo and in vitro as well as structural characterization of the wild-type enzyme and two mutants by circular dichroism spectroscopy. The structural model of the NlaIV-DNA complex suggests regions of the protein sequence that may interact with the 'non-specific' bases of the target and thus it provides insight into the evolution of sequence specificity in restriction enzymes and may help engineer REases with novel specificities. Before this analysis was carried out, neither the three-dimensional fold of NlaIV, its evolutionary relationships or its catalytic or DNA-binding residues were known. Hence our analysis may be regarded as a paradigm for studies aiming at reducing 'white spaces' on the evolutionary landscape of sequence-function relationships by combining bioinformatics with simple experimental assays.
Collapse
Affiliation(s)
- Agnieszka A Chmiel
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, ul. ks. Trojdena 4, 02-109 Warsaw, Poland
| | | | | | | | | | | |
Collapse
|
18
|
Feder M, Bujnicki JM. Identification of a new family of putative PD-(D/E)XK nucleases with unusual phylogenomic distribution and a new type of the active site. BMC Genomics 2005; 6:21. [PMID: 15720711 PMCID: PMC551604 DOI: 10.1186/1471-2164-6-21] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2004] [Accepted: 02/18/2005] [Indexed: 12/18/2022] Open
Abstract
Background Prediction of structure and function for uncharacterized protein families by identification of evolutionary links to characterized families and known structures is one of the cornerstones of genomics. Theoretical assignment of three-dimensional folds and prediction of protein function even at a very general level can facilitate the experimental determination of the molecular mechanism of action and the role that members of a given protein family fulfill in the cell. Here, we predict the three-dimensional fold and study the phylogenomic distribution of members of a large family of uncharacterized proteins classified in the Clusters of Orthologous Groups database as COG4636. Results Using protein fold-recognition we found that members of COG4636 are remotely related to Holliday junction resolvases and other nucleases from the PD-(D/E)XK superfamily. Structure modeling and sequence analyses suggest that most members of COG4636 exhibit a new, unusual variant of the putative active site, in which the catalytic Lys residue migrated in the sequence, but retained similar spatial position with respect to other functionally important residues. Sequence analyses revealed that members of COG4636 and their homologs are found mainly in Cyanobacteria, but also in other bacterial phyla. They undergo horizontal transfer and extensive proliferation in the colonized genomes; for instance in Gloeobacter violaceus PCC 7421 they comprise over 2% of all protein-encoding genes. Thus, members of COG4636 appear to be a new type of selfish genetic elements, which may fulfill an important role in the genome dynamics of Cyanobacteria and other species they invaded. Our analyses provide a platform for experimental determination of the molecular and cellular function of members of this large protein family. Conclusion After submission of this manuscript, a crystal structure of one of the COG4636 members was released in the Protein Data Bank (code 1wdj; Idaka, M., Wada, T., Murayama, K., Terada, T., Kuramitsu, S., Shirouzu, M., Yokoyama, S.: Crystal structure of Tt1808 from Thermus thermophilus Hb8, to be published). Our analysis of the Tt1808 structure reveals that we correctly predicted all functionally important features of the COG4636 family, including the membership in the PD-(D/E)xK superfamily of nucleases, the three-dimensional fold, the putative catalytic residues, and the unusual configuration of the active site.
Collapse
Affiliation(s)
- Marcin Feder
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| |
Collapse
|
19
|
Saravanan M, Bujnicki JM, Cymerman IA, Rao DN, Nagaraja V. Type II restriction endonuclease R.KpnI is a member of the HNH nuclease superfamily. Nucleic Acids Res 2004; 32:6129-35. [PMID: 15562004 PMCID: PMC534630 DOI: 10.1093/nar/gkh951] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The restriction endonuclease (REase) R.KpnI is an orthodox Type IIP enzyme, which binds to DNA in the absence of metal ions and cleaves the DNA sequence 5'-GGTAC--C-3' in the presence of Mg2+ as shown generating 3' four base overhangs. Bioinformatics analysis reveals that R.KpnI contains a betabetaalpha-Me-finger fold, which is characteristic of many HNH-superfamily endonucleases, including homing endonuclease I-HmuI, structure-specific T4 endonuclease VII, colicin E9, sequence non-specific Serratia nuclease and sequence-specific homing endonuclease I-PpoI. According to our homology model of R.KpnI, D148, H149 and Q175 correspond to the critical D, H and N or H residues of the HNH nucleases. Substitutions of these three conserved residues lead to the loss of the DNA cleavage activity by R.KpnI, confirming their importance. The mutant Q175E fails to bind DNA at the standard conditions, although the DNA binding and cleavage can be rescued at pH 6.0, indicating a role for Q175 in DNA binding and cleavage. Our study provides the first experimental evidence for a Type IIP REase that does not belong to the PD...D/EXK superfamily of nucleases, instead is a member of the HNH superfamily.
Collapse
Affiliation(s)
- Matheshwaran Saravanan
- Microbiology and Cell Biology Department, Indian Institute of Science, Bangalore 560 012, India
| | | | | | | | | |
Collapse
|