51
|
Comparative genomic analyses of attenuated strains of Mycoplasma gallisepticum. Infect Immun 2010; 78:1760-71. [PMID: 20123709 DOI: 10.1128/iai.01172-09] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Mycoplasma gallisepticum is a significant respiratory and reproductive pathogen of domestic poultry. While the complete genomic sequence of the virulent, low-passage M. gallisepticum strain R (R(low)) has been reported, genomic determinants responsible for differences in virulence and host range remain to be completely identified. Here, we utilize genome sequencing and microarray-based comparative genomic data to identify these genomic determinants of virulence and to elucidate genomic variability among strains of M. gallisepticum. Analysis of the high-passage, attenuated derivative of R(low), R(high), indicated that relatively few total genomic changes (64 loci) occurred, yet they are potentially responsible for the observed attenuation of this strain. In addition to previously characterized mutations in cytadherence-related proteins, changes included those in coding sequences of genes involved in sugar metabolism. Analyses of the genome of the M. gallisepticum vaccine strain F revealed numerous differences relative to strain R, including a highly divergent complement of vlhA surface lipoprotein genes, and at least 16 genes absent or significantly fragmented relative to strain R. Notably, an R(low) isogenic mutant in one of these genes (MGA_1107) caused significantly fewer severe tracheal lesions in the natural host compared to virulent M. gallisepticum R(low). Comparative genomic hybridizations indicated few genetic loci commonly affected in F and vaccine strains ts-11 and 6/85, which would correlate with proteins affecting strain R virulence. Together, these data provide novel insights into inter- and intrastrain M. gallisepticum genomic variability and the genetic basis of M. gallisepticum virulence.
Collapse
|
52
|
Smith RM, Josephsen J, Szczelkun MD. An Mrr-family nuclease motif in the single polypeptide restriction-modification enzyme LlaGI. Nucleic Acids Res 2010; 37:7231-8. [PMID: 19793866 PMCID: PMC2790908 DOI: 10.1093/nar/gkp795] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Bioinformatic analysis of the putative nuclease domain of the single polypeptide restriction–modification enzyme LlaGI reveals amino acid motifs characteristic of the Escherichia coli methylated DNA-specific Mrr endonuclease. Using mutagenesis, we examined the role of the conserved residues in both DNA translocation and cleavage. Mutations in those residues predicted to play a role in DNA hydrolysis produced enzymes that could translocate on DNA but were either unable to cleave the polynucleotide track or had reduced nuclease activity. Cleavage by LlaGI is not targeted to methylated DNA, suggesting that the conserved motifs in the Mrr domain are a conventional sub-family of the PD-(D/E)XK superfamily of DNA nucleases.
Collapse
Affiliation(s)
- Rachel M Smith
- DNA-Protein Interactions Unit, Department of Biochemistry, School of Medical Sciences, University of Bristol, Bristol, BS8 1TD, UK
| | | | | |
Collapse
|
53
|
Khan F, Furuta Y, Kawai M, Kaminska KH, Ishikawa K, Bujnicki JM, Kobayashi I. A putative mobile genetic element carrying a novel type IIF restriction-modification system (PluTI). Nucleic Acids Res 2010; 38:3019-30. [PMID: 20071747 PMCID: PMC2875022 DOI: 10.1093/nar/gkp1221] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Genome comparison and genome context analysis were used to find a putative mobile element in the genome of Photorhabdus luminescens, an entomopathogenic bacterium. The element is composed of 16-bp direct repeats in the terminal regions, which are identical to a part of insertion sequences (ISs), a DNA methyltransferase gene homolog, two genes of unknown functions and an open reading frame (ORF) (plu0599) encoding a protein with no detectable sequence similarity to any known protein. The ORF (plu0599) product showed DNA endonuclease activity, when expressed in a cell-free expression system. Subsequently, the protein, named R.PluTI, was expressed in vivo, purified and found to be a novel type IIF restriction enzyme that recognizes 5′-GGCGC/C-3′ (/ indicates position of cleavage). R.PluTI cleaves a two-site supercoiled substrate at both the sites faster than a one-site supercoiled substrate. The modification enzyme homolog encoded by plu0600, named M.PluTI, was expressed in Escherichia coli and shown to protect DNA from R.PluTI cleavage in vitro, and to suppress the lethal effects of R.PluTI expression in vivo. These results suggested that they constitute a restriction–modification system, present on the putative mobile element. Our approach thus allowed detection of a previously uncharacterized family of DNA-interacting proteins.
Collapse
Affiliation(s)
- Feroz Khan
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, University of Tokyo, Japan
| | | | | | | | | | | | | |
Collapse
|
54
|
Bertrand L, Leiva-Torres GA, Hyjazie H, Pearson A. Conserved residues in the UL24 protein of herpes simplex virus 1 are important for dispersal of the nucleolar protein nucleolin. J Virol 2010; 84:109-18. [PMID: 19864385 PMCID: PMC2798432 DOI: 10.1128/jvi.01428-09] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2009] [Accepted: 10/20/2009] [Indexed: 12/13/2022] Open
Abstract
The UL24 family of proteins is widely conserved among herpesviruses. We demonstrated previously that UL24 of herpes simplex virus 1 (HSV-1) is important for the dispersal of nucleolin from nucleolar foci throughout the nuclei of infected cells. Furthermore, the N-terminal portion of UL24 localizes to nuclei and can disperse nucleolin in the absence of any other viral proteins. In this study, we tested the hypothesis that highly conserved residues in UL24 are important for the ability of the protein to modify the nuclear distribution of nucleolin. We constructed a panel of substitution mutations in UL24 and tested their effects on nucleolin staining patterns. We found that modified UL24 proteins exhibited a range of subcellular distributions. Mutations associated with a wild-type localization pattern for UL24 correlated with high levels of nucleolin dispersal. Interestingly, mutations targeting two regions, namely, within the first homology domain and overlapping or near the previously identified PD-(D/E)XK endonuclease motif, caused the most altered UL24 localization pattern and the most drastic reduction in its ability to disperse nucleolin. Viral mutants corresponding to the substitutions G121A and E99A/K101A both exhibited a syncytial plaque phenotype at 39 degrees C. vUL24-E99A/K101A replicated to lower titers than did vUL24-G121A or KOS. Furthermore, the E99A/K101A mutation caused the greatest impairment of HSV-1-induced dispersal of nucleolin. Our results identified residues in UL24 that are critical for the ability of UL24 to alter nucleoli and further support the notion that the endonuclease motif is important for the function of UL24 during infection.
Collapse
Affiliation(s)
- Luc Bertrand
- INRS-Institut Armand-Frappier, Université du Québec, Laval, Québec, Canada
| | | | - Huda Hyjazie
- INRS-Institut Armand-Frappier, Université du Québec, Laval, Québec, Canada
| | - Angela Pearson
- INRS-Institut Armand-Frappier, Université du Québec, Laval, Québec, Canada
| |
Collapse
|
55
|
Siew VK, Duh CY, Wang SK. Human cytomegalovirus UL76 induces chromosome aberrations. J Biomed Sci 2009; 16:107. [PMID: 19930723 PMCID: PMC2788540 DOI: 10.1186/1423-0127-16-107] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2009] [Accepted: 11/25/2009] [Indexed: 12/26/2022] Open
Abstract
Background Human cytomegalovirus (HCMV) is known to induce chromosome aberrations in infected cells, which can lead to congenital abnormalities in infected fetuses. HCMV UL76 belongs to a conserved protein family from herpesviruses. Some reported roles among UL76 family members include involvement in virulence determination, lytic replication, reactivation of latent virus, modulation of gene expression, induction of apoptosis, and perturbation of cell cycle progression, as well as potential nuclease activity. Previously, we have shown that stable expression of UL76 inhibits HCMV replication in glioblastoma cells. Methods To examine chromosomal integrity and the DNA damage signal γ-H2AX in cells constitutively expressing UL76, immunofluorescent cell staining and Western blotting were performed. The comet assay was employed to assess DNA breaks in cells transiently expressing UL76. Results We report that stably transfected cells expressing UL76 developed chromosome aberrations including micronuclei and misaligned chromosomes, lagging and bridging. In mitotic cells expressing UL76, aberrant spindles were increased compared to control cells. However, cells with supernumerary centrosomes were marginally increased in UL76-expressing cells relative to control cells. We further demonstrated that UL76-expressing cells activated the DNA damage signal γ-H2AX and caused foci formation in nuclei. In addition, the number of cells with DNA breaks increased in proportion to UL76 protein levels. Conclusion Our findings suggest that the virus-associated protein UL76 induces DNA damage and the accumulation of chromosome aberrations.
Collapse
Affiliation(s)
- Voon-Kwan Siew
- Department of Microbiology, Institute of Medicine, College of Medicine, Kaohsiung Medical University, 100 Shih-Chuan 1st Road, Kaohsiung 80708, Taiwan.
| | | | | |
Collapse
|
56
|
Dahlroth SL, Gurmu D, Schmitzberger F, Engman H, Haas J, Erlandsen H, Nordlund P. Crystal structure of the shutoff and exonuclease protein from the oncogenic Kaposi's sarcoma-associated herpesvirus. FEBS J 2009; 276:6636-45. [PMID: 19843164 DOI: 10.1111/j.1742-4658.2009.07374.x] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The Kaposi's sarcoma-associated herpesvirus protein SOX (shut off and exonuclease) and its Epstein-Barr virus homolog, BGLF5, are active during the early lytic phase and belong to the alkaline nuclease family. Both proteins have been shown to be bifunctional, being responsible for DNA maturation as well as host shutoff at the mRNA level. We present the crystal structure of SOX determined at 1.85 A resolution. By modeling DNA binding, we have identified catalytic residues that explain the preferred 5'-exonuclease activity of the alkaline nucleases. The presence of a crevice suitable for binding duplex DNA supports a role for herpes alkaline nucleases in recombination events preceding packaging of viral DNA. Direct interaction with dsDNA is supported by oligonucleotide binding data. Mutations specifically affecting host shutoff map to a surface region of the N-terminal domain, implying an essential role in protein-protein interactions, and link the RNase activity of the enzyme to mRNA degradation pathways.
Collapse
Affiliation(s)
- Sue-Li Dahlroth
- Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | | | | | | | | | | | | |
Collapse
|
57
|
Hu P, Janga SC, Babu M, Díaz-Mejía JJ, Butland G, Yang W, Pogoutse O, Guo X, Phanse S, Wong P, Chandran S, Christopoulos C, Nazarians-Armavil A, Nasseri NK, Musso G, Ali M, Nazemof N, Eroukova V, Golshani A, Paccanaro A, Greenblatt JF, Moreno-Hagelsieb G, Emili A. Global functional atlas of Escherichia coli encompassing previously uncharacterized proteins. PLoS Biol 2009; 7:e96. [PMID: 19402753 PMCID: PMC2672614 DOI: 10.1371/journal.pbio.1000096] [Citation(s) in RCA: 268] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2008] [Accepted: 03/16/2009] [Indexed: 12/28/2022] Open
Abstract
One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans). Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans' biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a "systems-wide" functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins.
Collapse
Affiliation(s)
- Pingzhao Hu
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Sarath Chandra Janga
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
- Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
| | - Mohan Babu
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - J. Javier Díaz-Mejía
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
- Department of Biology, Wilfrid Laurier University, Waterloo, Ontario, Canada
| | - Gareth Butland
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Wenhong Yang
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Oxana Pogoutse
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Xinghua Guo
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Sadhna Phanse
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Peter Wong
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Shamanta Chandran
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Constantine Christopoulos
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Anaies Nazarians-Armavil
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Negin Karimi Nasseri
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Gabriel Musso
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Mehrab Ali
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Nazila Nazemof
- Department of Biology and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Canada
| | - Veronika Eroukova
- Department of Biology and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Canada
| | - Ashkan Golshani
- Department of Biology and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Canada
| | - Alberto Paccanaro
- Department of Computer Science, Royal Holloway, University of London, Egham, United Kingdom
| | - Jack F Greenblatt
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Gabriel Moreno-Hagelsieb
- Department of Biology, Wilfrid Laurier University, Waterloo, Ontario, Canada
- * To whom correspondence should be addressed. E-mail: (GM-H); (AE)
| | - Andrew Emili
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
- * To whom correspondence should be addressed. E-mail: (GM-H); (AE)
| |
Collapse
|
58
|
Zylicz-Stachula A, Bujnicki JM, Skowron PM. Cloning and analysis of a bifunctional methyltransferase/restriction endonuclease TspGWI, the prototype of a Thermus sp. enzyme family. BMC Mol Biol 2009; 10:52. [PMID: 19480701 PMCID: PMC2700111 DOI: 10.1186/1471-2199-10-52] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2008] [Accepted: 05/29/2009] [Indexed: 01/09/2023] Open
Abstract
Background Restriction-modification systems are a diverse class of enzymes. They are classified into four major types: I, II, III and IV. We have previously proposed the existence of a Thermus sp. enzyme family, which belongs to type II restriction endonucleases (REases), however, it features also some characteristics of types I and III. Members include related thermophilic endonucleases: TspGWI, TaqII, TspDTI, and Tth111II. Results Here we describe cloning, mutagenesis and analysis of the prototype TspGWI enzyme that recognises the 5'-ACGGA-3' site and cleaves 11/9 nt downstream. We cloned, expressed, and mutagenised the tspgwi gene and investigated the properties of its product, the bifunctional TspGWI restriction/modification enzyme. Since TspGWI does not cleave DNA completely, a cloning method was devised, based on amino acid sequencing of internal proteolytic fragments. The deduced amino acid sequence of the enzyme shares significant sequence similarity with another representative of the Thermus sp. family – TaqII. Interestingly, these enzymes recognise similar, yet different sequences in the DNA. Both enzymes cleave DNA at the same distance, but differ in their ability to cleave single sites and in the requirement of S-adenosylmethionine as an allosteric activator for cleavage. Both the restriction endonuclease (REase) and methyltransferase (MTase) activities of wild type (wt) TspGWI (either recombinant or isolated from Thermus sp.) are dependent on the presence of divalent cations. Conclusion TspGWI is a bifunctional protein comprising a tandem arrangement of Type I-like domains; particularly noticeable is the central HsdM-like module comprising a helical domain and a highly conserved S-adenosylmethionine-binding/catalytic MTase domain, containing DPAVGTG and NPPY motifs. TspGWI also possesses an N-terminal PD-(D/E)XK nuclease domain related to the corresponding domains in HsdR subunits, but lacks the ATP-dependent translocase module of the HsdR subunit and the additional domains that are involved in subunit-subunit interactions in Type I systems. The MTase and REase activities of TspGWI are autonomous and can be uncoupled. Structurally and functionally, the TspGWI protomer appears to be a streamlined 'half' of a Type I enzyme.
Collapse
Affiliation(s)
- Agnieszka Zylicz-Stachula
- Division of Environmental Molecular Biotechnology, Department of Chemistry, University of Gdansk, Sobieskiego 18, Gdansk 80-952, Poland.
| | | | | |
Collapse
|
59
|
Skrabanek L, Niv MY. Scan2S: increasing the precision of PROSITE pattern motifs using secondary structure constraints. Proteins 2009; 72:1138-47. [PMID: 18320586 DOI: 10.1002/prot.22008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Sequence signature databases such as PROSITE, which include protein pattern motifs indicative of a protein's function, are widely used for function prediction studies, cellular localization annotation, and sequence classification. Correct annotation relies on high precision of the motifs. We present a new and general approach for increasing the precision of established protein pattern motifs by including secondary structure constraints (SSCs). We use Scan2S, the first sequence motif-scanning program to optionally include SSCs, to augment PROSITE pattern motifs. The constraints were derived from either the DSSP secondary structure assignment or the PSIPRED predictions for PROSITE-documented true positive hits. The secondary structure-augmented motifs were scanned against all SwissProt sequences, for which secondary structure predictions were precalculated. Against this dataset, motifs with PSIPRED-derived SSCs exhibited improved performance over motifs with DSSP-derived constraints. The precision of 763 of the 782 PSIPRED-augmented motifs remained unchanged or increased compared to the original motifs; 26 motifs showed an absolute precision increase of 10-30%. We provide the complete set of augmented motifs and the Scan2S program at http://physiology.med.cornell.edu/go/scan2s. Our results suggest a general protocol for increasing the precision of protein pattern detection via the inclusion of SSCs.
Collapse
Affiliation(s)
- Lucy Skrabanek
- Department of Physiology and Biophysics, Weill Medical College of Cornell University, New York, New York 10021, USA.
| | | |
Collapse
|
60
|
Nakonieczna J, Kaczorowski T, Obarska-Kosinska A, Bujnicki JM. Functional analysis of MmeI from methanol utilizer Methylophilus methylotrophus, a subtype IIC restriction-modification enzyme related to type I enzymes. Appl Environ Microbiol 2009; 75:212-23. [PMID: 18997032 PMCID: PMC2612229 DOI: 10.1128/aem.01322-08] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2008] [Accepted: 10/29/2008] [Indexed: 11/20/2022] Open
Abstract
MmeI from Methylophilus methylotrophus belongs to the type II restriction-modification enzymes. It recognizes an asymmetric DNA sequence, 5'-TCCRAC-3' (R indicates G or A), and cuts both strands at fixed positions downstream of the specific site. This particular feature has been exploited in transcript profiling of complex genomes (using serial analysis of gene expression technology). We have shown previously that the endonucleolytic activity of MmeI is strongly dependent on the presence of S-adenosyl-l-methionine (J. Nakonieczna, J. W. Zmijewski, B. Banecki, and A. J. Podhajska, Mol. Biotechnol. 37:127-135, 2007), which puts MmeI in subtype IIG. The same cofactor is used by MmeI as a methyl group donor for modification of an adenine in the upper strand of the recognition site to N(6)-methyladenine. Both enzymatic activities reside in a single polypeptide (919 amino acids [aa]), which puts MmeI also in subtype IIC of the restriction-modification systems. Based on a molecular model, generated with the use of bioinformatic tools and validated by site-directed mutagenesis, we were able to localize three functional domains in the structure of the MmeI enzyme: (i) the N-terminal portion containing the endonucleolytic domain with the catalytic Mg2+-binding motif D(70)-X(9)-EXK(82), characteristic for the PD-(D/E)XK superfamily of nucleases; (ii) a central portion (aa 310 to 610) containing nine sequence motifs conserved among N(6)-adenine gamma-class DNA methyltransferases; (iii) the C-terminal portion (aa 610 to 919) containing a putative target recognition domain. Interestingly, all three domains showed highest similarity to the corresponding elements of type I enzymes rather than to classical type II enzymes. We have found that MmeI variants deficient in restriction activity (D70A, E80A, and K82A) can bind and methylate specific nucleotide sequence. This suggests that domains of MmeI responsible for DNA restriction and modification can act independently. Moreover, we have shown that a single amino acid residue substitution within the putative target recognition domain (S807A) resulted in a MmeI variant with a higher endonucleolytic activity than the wild-type enzyme.
Collapse
Affiliation(s)
- Joanna Nakonieczna
- Department of Biotechnology, Intercollegiate Faculty of Biotechnology, University of Gdansk, and Medical University of Gdansk, Kladki 24, 80-822 Gdansk, Poland.
| | | | | | | |
Collapse
|
61
|
Orlowski J, Mebrhatu MT, Michiels CW, Bujnicki JM, Aertsen A. Mutational analysis and a structural model of methyl-directed restriction enzyme Mrr. Biochem Biophys Res Commun 2008; 377:862-6. [DOI: 10.1016/j.bbrc.2008.10.064] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2008] [Accepted: 10/15/2008] [Indexed: 11/29/2022]
|
62
|
Type II restriction endonuclease R.Hpy188I belongs to the GIY-YIG nuclease superfamily, but exhibits an unusual active site. BMC STRUCTURAL BIOLOGY 2008; 8:48. [PMID: 19014591 PMCID: PMC2630997 DOI: 10.1186/1472-6807-8-48] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/06/2008] [Accepted: 11/14/2008] [Indexed: 11/10/2022]
Abstract
BACKGROUND Catalytic domains of Type II restriction endonucleases (REases) belong to a few unrelated three-dimensional folds. While the PD-(D/E)XK fold is most common among these enzymes, crystal structures have been also determined for single representatives of two other folds: PLD (R.BfiI) and half-pipe (R.PabI). Bioinformatics analyses supported by mutagenesis experiments suggested that some REases belong to the HNH fold (e.g. R.KpnI), and that a small group represented by R.Eco29kI belongs to the GIY-YIG fold. However, for a large fraction of REases with known sequences, the three-dimensional fold and the architecture of the active site remain unknown, mostly due to extreme sequence divergence that hampers detection of homology to enzymes with known folds. RESULTS R.Hpy188I is a Type II REase with unknown structure. PSI-BLAST searches of the non-redundant protein sequence database reveal only 1 homolog (R.HpyF17I, with nearly identical amino acid sequence and the same DNA sequence specificity). Standard application of state-of-the-art protein fold-recognition methods failed to predict the relationship of R.Hpy188I to proteins with known structure or to other protein families. In order to increase the amount of evolutionary information in the multiple sequence alignment, we have expanded our sequence database searches to include sequences from metagenomics projects. This search resulted in identification of 23 further members of R.Hpy188I family, both from metagenomics and the non-redundant database. Moreover, fold-recognition analysis of the extended R.Hpy188I family revealed its relationship to the GIY-YIG domain and allowed for computational modeling of the R.Hpy188I structure. Analysis of the R.Hpy188I model in the light of sequence conservation among its homologs revealed an unusual variant of the active site, in which the typical Tyr residue of the YIG half-motif had been substituted by a Lys residue. Moreover, some of its homologs have the otherwise invariant Arg residue in a non-homologous position in sequence that nonetheless allows for spatial conservation of the guanidino group potentially involved in phosphate binding. CONCLUSION The present study eliminates a significant "white spot" on the structural map of REases. It also provides important insight into sequence-structure-function relationships in the GIY-YIG nuclease superfamily. Our results reveal that in the case of proteins with no or few detectable homologs in the standard "non-redundant" database, it is useful to expand this database by adding the metagenomic sequences, which may provide evolutionary linkage to detect more remote homologs.
Collapse
|
63
|
Niv MY, Skrabanek L, Roberts RJ, Scheraga HA, Weinstein H. Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S--a novel motif scan algorithm with optional secondary structure constraints. Proteins 2008; 71:631-40. [PMID: 17972284 DOI: 10.1002/prot.21777] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering.
Collapse
Affiliation(s)
- Masha Y Niv
- Department of Physiology and Biophysics, Weill Medical College of Cornell University, 1300 York Ave., New York, New York 10021, USA.
| | | | | | | | | |
Collapse
|
64
|
Orlowski J, Bujnicki JM. Structural and evolutionary classification of Type II restriction enzymes based on theoretical and experimental analyses. Nucleic Acids Res 2008; 36:3552-69. [PMID: 18456708 PMCID: PMC2441816 DOI: 10.1093/nar/gkn175] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
For a very long time, Type II restriction enzymes (REases) have been a paradigm of ORFans: proteins with no detectable similarity to each other and to any other protein in the database, despite common cellular and biochemical function. Crystallographic analyses published until January 2008 provided high-resolution structures for only 28 of 1637 Type II REase sequences available in the Restriction Enzyme database (REBASE). Among these structures, all but two possess catalytic domains with the common PD-(D/E)XK nuclease fold. Two structures are unrelated to the others: R.BfiI exhibits the phospholipase D (PLD) fold, while R.PabI has a new fold termed 'half-pipe'. Thus far, bioinformatic studies supported by site-directed mutagenesis have extended the number of tentatively assigned REase folds to five (now including also GIY-YIG and HNH folds identified earlier in homing endonucleases) and provided structural predictions for dozens of REase sequences without experimentally solved structures. Here, we present a comprehensive study of all Type II REase sequences available in REBASE together with their homologs detectable in the nonredundant and environmental samples databases at the NCBI. We present the summary and critical evaluation of structural assignments and predictions reported earlier, new classification of all REase sequences into families, domain architecture analysis and new predictions of three-dimensional folds. Among 289 experimentally characterized (not putative) Type II REases, whose apparently full-length sequences are available in REBASE, we assign 199 (69%) to contain the PD-(D/E)XK domain. The HNH domain is the second most common, with 24 (8%) members. When putative REases are taken into account, the fraction of PD-(D/E)XK and HNH folds changes to 48% and 30%, respectively. Fifty-six characterized (and 521 predicted) REases remain unassigned to any of the five REase folds identified so far, and may exhibit new architectures. These enzymes are proposed as the most interesting targets for structure determination by high-resolution experimental methods. Our analysis provides the first comprehensive map of sequence-structure relationships among Type II REases and will help to focus the efforts of structural and functional genomics of this large and biotechnologically important class of enzymes.
Collapse
Affiliation(s)
- Jerzy Orlowski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | | |
Collapse
|
65
|
Obarska-Kosinska A, Taylor JEN, Callow P, Orlowski J, Bujnicki JM, Kneale GG. HsdR subunit of the type I restriction-modification enzyme EcoR124I: biophysical characterisation and structural modelling. J Mol Biol 2008; 376:438-452. [PMID: 18164032 PMCID: PMC2878639 DOI: 10.1016/j.jmb.2007.11.024] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2007] [Revised: 11/08/2007] [Accepted: 11/09/2007] [Indexed: 01/19/2023]
Abstract
Type I restriction-modification (RM) systems are large, multifunctional enzymes composed of three different subunits. HsdS and HsdM form a complex in which HsdS recognizes the target DNA sequence, and HsdM carries out methylation of adenosine residues. The HsdR subunit, when associated with the HsdS-HsdM complex, translocates DNA in an ATP-dependent process and cleaves unmethylated DNA at a distance of several thousand base-pairs from the recognition site. The molecular mechanism by which these enzymes translocate the DNA is not fully understood, in part because of the absence of crystal structures. To date, crystal structures have been determined for the individual HsdS and HsdM subunits and models have been built for the HsdM-HsdS complex with the DNA. However, no structure is available for the HsdR subunit. In this work, the gene coding for the HsdR subunit of EcoR124I was re-sequenced, which showed that there was an error in the published sequence. This changed the position of the stop codon and altered the last 17 amino acid residues of the protein sequence. An improved purification procedure was developed to enable HsdR to be purified efficiently for biophysical and structural analysis. Analytical ultracentrifugation shows that HsdR is monomeric in solution, and the frictional ratio of 1.21 indicates that the subunit is globular and fairly compact. Small angle neutron-scattering of the HsdR subunit indicates a radius of gyration of 3.4 nm and a maximum dimension of 10 nm. We constructed a model of the HsdR using protein fold-recognition and homology modelling to model individual domains, and small-angle neutron scattering data as restraints to combine them into a single molecule. The model reveals an ellipsoidal shape of the enzymatic core comprising the N-terminal and central domains, and suggests conformational heterogeneity of the C-terminal region implicated in binding of HsdR to the HsdS-HsdM complex.
Collapse
Affiliation(s)
- Agnieszka Obarska-Kosinska
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | - James E N Taylor
- Biophysics Laboratories, Institute of Biomedical and Biomolecular Sciences, University of Portsmouth, PO1 2DT, UK
| | - Philip Callow
- EPSAM and ISTM Research Institutes, Keele University, Staffordshire ST5 5BG, UK; ILL-EMBL Deuteration Laboratory, Partnership for Structural Biology, Institut Laue Langevin, 38042 Grenoble Cedex 9, Grenoble, France
| | - Jerzy Orlowski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland.
| | - G Geoff Kneale
- Biophysics Laboratories, Institute of Biomedical and Biomolecular Sciences, University of Portsmouth, PO1 2DT, UK.
| |
Collapse
|
66
|
Kosinski J, Kubareva E, Bujnicki JM. A model of restriction endonuclease MvaI in complex with DNA: a template for interpretation of experimental data and a guide for specificity engineering. Proteins 2007; 68:324-36. [PMID: 17407166 DOI: 10.1002/prot.21460] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
R.MvaI is a Type II restriction enzyme (REase), which specifically recognizes the pentanucleotide DNA sequence 5'-CCWGG-3' (W indicates A or T). It belongs to a family of enzymes, which recognize related sequences, including 5'-CCSGG-3' (S indicates G or C) in the case of R.BcnI, or 5'-CCNGG-3' (where N indicates any nucleoside) in the case of R.ScrFI. REases from this family hydrolyze the phosphodiester bond in the DNA between the 2nd and 3rd base in both strands, thereby generating a double strand break with 5'-protruding single nucleotides. So far, no crystal structures of REases with similar cleavage patterns have been solved. Characterization of sequence-structure-function relationships in this family would facilitate understanding of evolution of sequence specificity among REases and could aid in engineering of enzymes with new specificities. However, sequences of R.MvaI or its homologs show no significant similarity to any proteins with known structures, thus precluding straightforward comparative modeling. We used a fold recognition approach to identify a remote relationship between R.MvaI and the structure of DNA repair enzyme MutH, which belongs to the PD-(D/E)XK superfamily together with many other REases. We constructed a homology model of R.MvaI and used it to predict functionally important amino acid residues and the mode of interaction with the DNA. In particular, we predict that only one active site of R.MvaI interacts with the DNA target at a time, and the cleavage of both strands (5'-CCAGG-3' and 5'-CCTGG-3') is achieved by two independent catalytic events. The model is in good agreement with the available experimental data and will serve as a template for further analyses of R.MvaI, R.BcnI, R.ScrFI and other related enzymes.
Collapse
Affiliation(s)
- Jan Kosinski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland.
| | | | | |
Collapse
|
67
|
Type II restriction endonuclease R.Eco29kI is a member of the GIY-YIG nuclease superfamily. BMC STRUCTURAL BIOLOGY 2007; 7:48. [PMID: 17626614 PMCID: PMC1952068 DOI: 10.1186/1472-6807-7-48] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/12/2007] [Accepted: 07/12/2007] [Indexed: 01/21/2023]
Abstract
Background The majority of experimentally determined crystal structures of Type II restriction endonucleases (REases) exhibit a common PD-(D/E)XK fold. Crystal structures have been also determined for single representatives of two other folds: PLD (R.BfiI) and half-pipe (R.PabI), and bioinformatics analyses supported by mutagenesis suggested that some REases belong to the HNH fold. Our previous bioinformatic analysis suggested that REase R.Eco29kI shares sequence similarities with one more unrelated nuclease superfamily, GIY-YIG, however so far no experimental data were available to support this prediction. The determination of a crystal structure of the GIY-YIG domain of homing endonuclease I-TevI provided a template for modeling of R.Eco29kI and prompted us to validate the model experimentally. Results Using protein fold-recognition methods we generated a new alignment between R.Eco29kI and I-TevI, which suggested a reassignment of one of the putative catalytic residues. A theoretical model of R.Eco29kI was constructed to illustrate its predicted three-dimensional fold and organization of the active site, comprising amino acid residues Y49, Y76, R104, H108, E142, and N154. A series of mutants was constructed to generate amino acid substitutions of selected residues (Y49A, R104A, H108F, E142A and N154L) and the mutant proteins were examined for their ability to bind the DNA containing the Eco29kI site 5'-CCGCGG-3' and to catalyze the cleavage reaction. Experimental data reveal that residues Y49, R104, E142, H108, and N154 are important for the nuclease activity of R.Eco29kI, while H108 and N154 are also important for specific DNA binding by this enzyme. Conclusion Substitutions of residues Y49, R104, H108, E142 and N154 predicted by the model to be a part of the active site lead to mutant proteins with strong defects in the REase activity. These results are in very good agreement with the structural model presented in this work and with our prediction that R.Eco29kI belongs to the GIY-YIG superfamily of nucleases. Our study provides the first experimental evidence for a Type IIP REase that does not belong to the PD-(D/E)XK or HNH superfamilies of nucleases, and is instead a member of the unrelated GIY-YIG superfamily.
Collapse
|
68
|
Guzzo CR, Nagem RAP, Barbosa JARG, Farah CS. Structure of Xanthomonas axonopodis pv. citri YaeQ reveals a new compact protein fold built around a variation of the PD-(D/E)XK nuclease motif. Proteins 2007; 69:644-51. [PMID: 17623842 DOI: 10.1002/prot.21556] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
The YaeQ family of proteins are found in many Gram-negative and a few Gram-positive bacteria. We have determined the first structure of a member of the YaeQ family by X-ray crystallography. Comparisons with other structures indicate that YaeQ represents a new compact protein fold built around a variation of the PD-(D/E)XK nuclease motif found in type II endonucleases and enzymes involved in DNA replication, repair, and recombination. We show that catalytically important residues in the PD-(D/E)XK nuclease superfamily are spatially conserved in YaeQ and other highly conserved YaeQ residues may be poised to interact with nucleic acid structures.
Collapse
Affiliation(s)
- Cristiane R Guzzo
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Avenida Prof. Lineu Prestes 748, São Paulo, SP, CEP 05508-000, Brazil
| | | | | | | |
Collapse
|
69
|
Realm of PD-(D/E)XK nuclease superfamily revisited: detection of novel families with modified transitive meta profile searches. BMC STRUCTURAL BIOLOGY 2007; 7:40. [PMID: 17584917 PMCID: PMC1913061 DOI: 10.1186/1472-6807-7-40] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/13/2007] [Accepted: 06/20/2007] [Indexed: 11/30/2022]
Abstract
Background PD-(D/E)XK nucleases constitute a large and highly diverse superfamily of enzymes that display little sequence similarity despite retaining a common core fold and a few critical active site residues. This makes identification of new PD-(D/E)XK nuclease families a challenging task as they usually escape detection with standard sequence-based methods. We developed a modified transitive meta profile search approach and to consider the structural diversity of PD-(D/E)XK nuclease fold more thoroughly we analyzed also lower than threshold Meta-BASIC hits to select potentially correct predictions placed among unreliable or incorrect ones. Results Application of a modified transitive Meta-BASIC searches on updated PFAM families and PDB structures resulted in detection of five new PD-(D/E)XK nuclease families encompassing hundreds of so far uncharacterized and poorly annotated proteins. These include four families catalogued in PFAM database as domains of unknown function (DUF506, DUF524, DUF1626 and DUF1703) and YhgA-like family of putative transposases. Three of these families represent extremely distant homologs (DUF506, DUF524, and YhgA-like), while two are newly defined in updated database (DUF1626 and DUF1703). In addition, we also confidently identified an extended AAA-ATPase domain in the N-terminal region of DUF1703 family proteins. Conclusion Obtained results suggest that detailed analysis of below threshold Meta-BASIC hits may push limits further for distant homology detection in the 'midnight zone' of homology. All identified families conserve the core evolutionary fold, secondary structure and hydrophobic patterns common to existing PD-(D/E)XK nucleases and maintain critical active site motifs that contribute to nucleic acid cleavage. Further experimental investigations should address the predicted activity and clarify potential substrates providing further insight into detailed biological role of these newly detected nucleases.
Collapse
|
70
|
Eastberg JH, Eklund J, Monnat R, Stoddard BL. Mutability of an HNH Nuclease Imidazole General Base and Exchange of a Deprotonation Mechanism. Biochemistry 2007; 46:7215-25. [PMID: 17516660 DOI: 10.1021/bi700418d] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Several unique protein folds that catalyze the hydrolysis of phosphodiester bonds have arisen independently in nature, including the PD(D/E)XK superfamily (typified by type II restriction endonucleases and many recombination and repair enzymes) and the HNH superfamily (found in an equally wide array of enzymes, including bacterial colicins and homing endonucleases). Whereas the identity and position of catalytic residues within the PD(D/E)XK superfamily are highly variable, the active sites of HNH nucleases are much more strongly conserved. In this study, the ability of an HNH nuclease to tolerate a mutation of its most conserved catalytic residue (its histidine general base), and the mechanism of the most active enzyme variant, were characterized. Conversion of this residue into several altered chemistries, glutamine, lysine, or glutamate, resulted in measurable activity. The histidine to glutamine mutant displays the highest residual activity and a pH profile similar to that of the wild-type enzyme. This activity is dependent on the presence of a neighboring imidazole ring, which has taken over as a less efficient general base for the reaction. This result implies that mutational pathways to alternative HNH-derived catalytic sites do exist but are not as extensively or successfully diverged or reoptimized in nature as variants of the PD(D/E)XK nuclease superfamily. This is possibly due to multiple steric constraints placed on the compact HNH motif, which is simultaneously involved in protein folding, DNA binding, and catalysis, as well as the use of a planar, aromatic imidazole group as a general base.
Collapse
Affiliation(s)
- Jennifer H Eastberg
- Fred Hutchinson Cancer Research Center and Graduate Program in Molecular and Cellular Biology, University of Washington, 1100 Fairview Avenue North, A3-025, Seattle, Washington 98109, USA
| | | | | | | |
Collapse
|
71
|
Niv MY, Ripoll DR, Vila JA, Liwo A, Vanamee ES, Aggarwal AK, Weinstein H, Scheraga HA. Topology of Type II REases revisited; structural classes and the common conserved core. Nucleic Acids Res 2007; 35:2227-37. [PMID: 17369272 PMCID: PMC1874628 DOI: 10.1093/nar/gkm045] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Type II restriction endonucleases (REases) are deoxyribonucleases that cleave DNA sequences with remarkable specificity. Type II REases are highly divergent in sequence as well as in topology, i.e. the connectivity of secondary structure elements. A widely held assumption is that a structural core of five β-strands flanked by two α-helices is common to these enzymes. We introduce a systematic procedure to enumerate secondary structure elements in an unambiguous and reproducible way, and use it to analyze the currently available X-ray structures of Type II REases. Based on this analysis, we propose an alternative definition of the core, which we term the αβα-core. The αβα-core includes the most frequently observed secondary structure elements and is not a sandwich, as it consists of a five-strand β-sheet and two α-helices on the same face of the β-sheet. We use the αβα-core connectivity as a basis for grouping the Type II REases into distinct structural classes. In these new structural classes, the connectivity correlates with the angles between the secondary structure elements and with the cleavage patterns of the REases. We show that there exists a substructure of the αβα-core, namely a common conserved core, ccc, defined here as one α-helix and four β-strands common to all Type II REase of known structure.
Collapse
Affiliation(s)
- Masha Y Niv
- Department of Physiology and Biophysics, Weill Medical College of Cornell University, New York, NY 10021, USA.
| | | | | | | | | | | | | | | |
Collapse
|
72
|
Structural and evolutionary bioinformatics of the SPOUT superfamily of methyltransferases. BMC Bioinformatics 2007; 8:73. [PMID: 17338813 PMCID: PMC1829167 DOI: 10.1186/1471-2105-8-73] [Citation(s) in RCA: 130] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2006] [Accepted: 03/05/2007] [Indexed: 11/29/2022] Open
Abstract
Background SPOUT methyltransferases (MTases) are a large class of S-adenosyl-L-methionine-dependent enzymes that exhibit an unusual alpha/beta fold with a very deep topological knot. In 2001, when no crystal structures were available for any of these proteins, Anantharaman, Koonin, and Aravind identified homology between SpoU and TrmD MTases and defined the SPOUT superfamily. Since then, multiple crystal structures of knotted MTases have been solved and numerous new homologous sequences appeared in the databases. However, no comprehensive comparative analysis of these proteins has been carried out to classify them based on structural and evolutionary criteria and to guide functional predictions. Results We carried out extensive searches of databases of protein structures and sequences to collect all members of previously identified SPOUT MTases, and to identify previously unknown homologs. Based on sequence clustering, characterization of domain architecture, structure predictions and sequence/structure comparisons, we re-defined families within the SPOUT superfamily and predicted putative active sites and biochemical functions for the so far uncharacterized members. We have also delineated the common core of SPOUT MTases and inferred a multiple sequence alignment for the conserved knot region, from which we calculated the phylogenetic tree of the superfamily. We have also studied phylogenetic distribution of different families, and used this information to infer the evolutionary history of the SPOUT superfamily. Conclusion We present the first phylogenetic tree of the SPOUT superfamily since it was defined, together with a new scheme for its classification, and discussion about conservation of sequence and structure in different families, and their functional implications. We identified four protein families as new members of the SPOUT superfamily. Three of these families are functionally uncharacterized (COG1772, COG1901, and COG4080), and one (COG1756 represented by Nep1p) has been already implicated in RNA metabolism, but its biochemical function has been unknown. Based on the inference of orthologous and paralogous relationships between all SPOUT families we propose that the Last Universal Common Ancestor (LUCA) of all extant organisms contained at least three SPOUT members, ancestors of contemporary RNA MTases that carry out m1G, m3U, and 2'O-ribose methylation, respectively. In this work we also speculate on the origin of the knot and propose possible 'unknotted' ancestors. The results of our analysis provide a comprehensive 'roadmap' for experimental characterization of SPOUT MTases and interpretation of functional studies in the light of sequence-structure relationships.
Collapse
|
73
|
Cymerman IA, Obarska A, Skowronek KJ, Lubys A, Bujnicki JM. Identification of a new subfamily of HNH nucleases and experimental characterization of a representative member, HphI restriction endonuclease. Proteins 2007; 65:867-76. [PMID: 17029241 DOI: 10.1002/prot.21156] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The restriction endonuclease (REase) R. HphI is a Type IIS enzyme that recognizes the asymmetric target DNA sequence 5'-GGTGA-3' and in the presence of Mg(2+) hydrolyzes phosphodiester bonds in both strands of the DNA at a distance of 8 nucleotides towards the 3' side of the target, producing a 1 nucleotide 3'-staggered cut in an unspecified sequence at this position. REases are typically ORFans that exhibit little similarity to each other and to any proteins in the database. However, bioinformatics analyses revealed that R.HphI is a member of a relatively big sequence family with a conserved C-terminal domain and a variable N-terminal domain. We predict that the C-terminal domains of proteins from this family correspond to the nuclease domain of the HNH superfamily rather than to the most common PD-(D/E)XK superfamily of nucleases. We constructed a three-dimensional model of the R.HphI catalytic domain and validated our predictions by site-directed mutagenesis and studies of DNA-binding and catalytic activities of the mutant proteins. We also analyzed the genomic neighborhood of R.HphI homologs and found that putative nucleases accompanied by a DNA methyltransferase (i.e. predicted REases) do not form a single group on a phylogenetic tree, but are dispersed among free-standing putative nucleases. This suggests that nucleases from the HNH superfamily were independently recruited to become REases in the context of RM systems multiple times in the evolution and that members of the HNH superfamily may be much more frequent among the so far unassigned REase sequences than previously thought.
Collapse
Affiliation(s)
- Iwona A Cymerman
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | | | | | | | | |
Collapse
|
74
|
Bist P, Madhusoodanan UK, Rao DN. A Mutation in the Mod Subunit of EcoP15I Restriction Enzyme Converts the DNA Methyltransferase to a Site-specific Endonuclease. J Biol Chem 2007; 282:3520-30. [PMID: 17148461 DOI: 10.1074/jbc.m603250200] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
A closer inspection of the amino acid sequence of EcoP15I DNA methyltransferase revealed a region of similarity to the PDXn(D/E)XK catalytic site of type II restriction endonucleases, except for methionine in EcoP15I DNA methyltransferase instead of proline. Substitution of methionine at position 357 by proline converts EcoP15I DNA methyltransferase to a site-specific endonuclease. EcoP15I-M357P DNA methyltransferase specifically binds to the recognition sequence 5'-CAGCAG-3' and cleaves DNA asymmetrically EcoP151-M357P.DNA methyltransferase specifically binds to the recognition sequence 5'-CAGCAG-3' and cleaves DNA asymmetrically, 5'-CAGCAG(N)(10)-3', as indicated by the arrows, in presence of magnesium ions.
Collapse
Affiliation(s)
- Pradeep Bist
- Department of Biochemistry, Indian Institute of Science, Bangalore 560 012, India
| | | | | |
Collapse
|
75
|
Orlowski J, Boniecki M, Bujnicki JM. I-Ssp6803I: the first homing endonuclease from the PD-(D/E)XK superfamily exhibits an unusual mode of DNA recognition. Bioinformatics 2007; 23:527-30. [PMID: 17242028 DOI: 10.1093/bioinformatics/btm007] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Restriction endonucleases (REases) and homing endonucleases (HEases) are biotechnologically important enzymes. Nearly all structurally characterized REases belong to the PD-(D/E)XK superfamily of nucleases, while most HEases belong to an unrelated LAGLIDADG superfamily. These two protein folds are typically associated with very different modes of protein-DNA recognition, consistent with the different mechanisms of action required to achieve high specificity. REases recognize short DNA sequences using multiple contacts per base pair, while HEases recognize very long sites using a few contacts per base pair, thereby allowing for partial degeneracy of the target sequence. Thus far, neither REases with the LAGLIDADG fold, nor HEases with the PD-(D/E)XK fold, have been found. RESULTS Using protein fold recognition, we have identified the first member of the PD-(D/E)XK superfamily among homing endonucleases, a cyanobacterial enzyme I-Ssp6803I. We present a model of the I-Ssp6803I-DNA complex based on the structure of Type II restriction endonuclease R.BglI and predict the active site and residues involved in specific DNA sequence recognition by I-Ssp6803I. Our finding reveals a new unexpected evolutionary link between HEases and REases and suggests how PD-(D/E)XK nucleases may develop a 'HEase-like' way of interacting with the extended DNA sequence. This in turn may be exploited to study the evolution of DNA sequence specificity and to engineer nucleases with new substrate specificities.
Collapse
Affiliation(s)
- Jerzy Orlowski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | | | | |
Collapse
|
76
|
van de Werken HJG, Verhees CH, Akerboom J, de Vos WM, van der Oost J. Identification of a glycolytic regulon in the archaeaPyrococcusandThermococcus. FEMS Microbiol Lett 2006; 260:69-76. [PMID: 16790020 DOI: 10.1111/j.1574-6968.2006.00292.x] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
The glycolytic pathway of the hyperthermophilic archaea that belong to the order Thermococcales (Pyrococcus, Thermococcus and Palaeococcus) differs significantly from the canonical Embden-Meyerhof pathway in bacteria and eukarya. This archaeal glycolysis variant consists of several novel enzymes, some of which catalyze unique conversions. Moreover, the enzymes appear not to be regulated allosterically, but rather at transcriptional level. To elucidate details of the gene expression control, the transcription initiation sites of the glycolytic genes in Pyrococcus furiosus have been mapped by primer extension analysis and the obtained promoter sequences have been compared with upstream regions of non-glycolytic genes. Apart from consensus sequences for the general transcription factors (TATA-box and BRE) this analysis revealed the presence of a potential transcription factor binding site (TATCAC-N(5)-GTGATA) in glycolytic and starch utilizing promoters of P. furiosus and several thermococcal species. The absence of this inverted repeat in Pyrococcus abyssi and Pyrococcus horikoshii probably reflects that their reduced catabolic capacity does not require this regulatory system. Moreover, this phyletic pattern revealed a TrmB-like regulator (PF0124 and TK1769) which may be involved in recognizing the repeat. This Thermococcales glycolytic regulon, with more than 20 genes, is the largest regulon that has yet been described for Archaea.
Collapse
|
77
|
Knizewski L, Kinch L, Grishin NV, Rychlewski L, Ginalski K. Human herpesvirus 1 UL24 gene encodes a potential PD-(D/E)XK endonuclease. J Virol 2006; 80:2575-7. [PMID: 16474163 PMCID: PMC1395385 DOI: 10.1128/jvi.80.5.2575-2577.2006] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Using Meta-BASIC, a highly sensitive method for detection of distant similarity between proteins, we have identified another potential PD-(D/E)XK endonuclease in human herpesvirus 1 (HHV-1) encoded by the UL24 gene. The universal presence of UL24 in completed herpesviral genomes of three major subfamilies, Alphaherpesvirinae, Betaherpesvirinae, and Gammaherpesvirinae, suggests a fundamental role for this predicted PD-(D/E)XK endonuclease activity in the viral life cycle.
Collapse
Affiliation(s)
- Lukasz Knizewski
- Interdisciplinary Centre for Mathematical and Computational Modelling, Warsaw University, Poland
| | | | | | | | | |
Collapse
|