Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Feder M, Bujnicki JM. Identification of a new family of putative PD-(D/E)XK nucleases with unusual phylogenomic distribution and a new type of the active site. BMC Genomics 2005;6:21. [PMID: 15720711 PMCID: PMC551604 DOI: 10.1186/1471-2164-6-21] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Received: 11/11/2004] [Accepted: 02/18/2005] [Indexed: 12/18/2022] Open

Cited by Other Article(s)

Wang S, Sun E, Liu Y, Yin B, Zhang X, Li M, Huang Q, Tan C, Qian P, Rao VB, Tao P. Landscape of New Nuclease-Containing Antiphage Systems in Escherichia coli and the Counterdefense Roles of Bacteriophage T4 Genome Modifications. J Virol 2023;97:e0059923. [PMID: 37306585 PMCID: PMC10308915 DOI: 10.1128/jvi.00599-23] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 04/22/2023] [Accepted: 05/19/2023] [Indexed: 06/13/2023] Open

Abstract

Many phages, such as T4, protect their genomes against the nucleases of bacterial restriction-modification (R-M) and CRISPR-Cas systems through covalent modification of their genomes. Recent studies have revealed many novel nuclease-containing antiphage systems, raising the question of the role of phage genome modifications in countering these systems. Here, by focusing on phage T4 and its host Escherichia coli, we depicted the landscape of the new nuclease-containing systems in E. coli and demonstrated the roles of T4 genome modifications in countering these systems. Our analysis identified at least 17 nuclease-containing defense systems in E. coli, with type III Druantia being the most abundant system, followed by Zorya, Septu, Gabija, AVAST type 4, and qatABCD. Of these, 8 nuclease-containing systems were found to be active against phage T4 infection. During T4 replication in E. coli, 5-hydroxymethyl dCTP is incorporated into the newly synthesized DNA instead of dCTP. The 5-hydroxymethylcytosines (hmCs) are further modified by glycosylation to form glucosyl-5-hydroxymethylcytosine (ghmC). Our data showed that the ghmC modification of the T4 genome abolished the defense activities of Gabija, Shedu, Restriction-like, type III Druantia, and qatABCD systems. The anti-phage T4 activities of the last two systems can also be counteracted by hmC modification. Interestingly, the Restriction-like system specifically restricts phage T4 containing an hmC-modified genome. The ghmC modification cannot abolish the anti-phage T4 activities of Septu, SspBCDE, and mzaABCDE, although it reduces their efficiency. Our study reveals the multidimensional defense strategies of E. coli nuclease-containing systems and the complex roles of T4 genomic modification in countering these defense systems.

IMPORTANCE

Cleavage of foreign DNA is a well-known mechanism used by bacteria to protect themselves from phage infections. Two well-known bacterial defense systems, R-M and CRISPR-Cas, both contain nucleases that cleave the phage genomes through specific mechanisms. However, phages have evolved different strategies to modify their genomes to prevent cleavage. Recent studies have revealed many novel nuclease-containing antiphage systems from various bacteria and archaea. However, no studies have systematically investigated the nuclease-containing antiphage systems of a specific bacterial species. In addition, the role of phage genome modifications in countering these systems remains unknown. Here, by focusing on phage T4 and its host Escherichia coli, we depicted the landscape of the new nuclease-containing systems in E. coli using all 2,289 genomes available in NCBI. Our studies reveal the multidimensional defense strategies of E. coli nuclease-containing systems and the complex roles of genomic modification of phage T4 in countering these defense systems.

Affiliation(s)

Shuangshuang Wang: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China; Hubei Hongshan Lab, Wuhan, Hubei, China
Erchao Sun: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China; Hubei Hongshan Lab, Wuhan, Hubei, China
Yuepeng Liu: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China; Hubei Hongshan Lab, Wuhan, Hubei, China
Baoqi Yin: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China; Hubei Hongshan Lab, Wuhan, Hubei, China
Xueqi Zhang: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China; Hubei Hongshan Lab, Wuhan, Hubei, China
Mengling Li: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China; Hubei Hongshan Lab, Wuhan, Hubei, China
Qi Huang: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China
Chen Tan: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China; Hubei Hongshan Lab, Wuhan, Hubei, China
Ping Qian: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China
Venigalla B. Rao: Bacteriophage Medical Research Center, Department of Biology, The Catholic University of America, Washington, DC, USA
Pan Tao: State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China; Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, Hubei, China; Hubei Hongshan Lab, Wuhan, Hubei, China

Chen H, Zhang M, Hochstrasser M. The Biochemistry of Cytoplasmic Incompatibility Caused by Endosymbiotic Bacteria. Genes (Basel) 2020;11:genes11080852. [PMID: 32722516 PMCID: PMC7465683 DOI: 10.3390/genes11080852] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Received: 06/23/2020] [Revised: 07/19/2020] [Accepted: 07/20/2020] [Indexed: 12/29/2022] Open

Abstract

Many species of arthropods carry maternally inherited bacterial endosymbionts that can influence host sexual reproduction to benefit the bacterium. The most well-known of such reproductive parasites is Wolbachia pipientis. Wolbachia are obligate intracellular α-proteobacteria found in nearly half of all arthropod species. This success has been attributed in part to their ability to manipulate host reproduction to favor infected females. Cytoplasmic incompatibility (CI), a phenomenon wherein Wolbachia infection renders males sterile when they mate with uninfected females, but not infected females (the rescue mating), appears to be the most common. CI provides a reproductive advantage to infected females in the presence of a threshold level of infected males. The molecular mechanisms of CI and other reproductive manipulations, such as male killing, parthenogenesis, and feminization, have remained mysterious for many decades. It had been proposed by Werren more than two decades ago that CI is caused by a Wolbachia-mediated sperm modification and that rescue is achieved by a Wolbachia-encoded rescue factor in the infected egg. In the past few years, new research has highlighted a set of syntenic Wolbachia gene pairs encoding CI-inducing factors (Cifs) as the key players for the induction of CI and its rescue. Within each Cif pair, the protein encoded by the upstream gene is denoted A and the downstream gene B. To date, two types of Cifs have been characterized based on the enzymatic activity identified in the B protein of each protein pair; one type encodes a deubiquitylase (thus named CI-inducing deubiquitylase or cid), and a second type encodes a nuclease (named CI-inducing nuclease or cin). The CidA and CinA proteins bind tightly and specifically to their respective CidB and CinB partners. 
In transgenic Drosophila melanogaster, the expression of either the Cid or Cin protein pair in the male germline induces CI and the expression of the cognate A protein in females is sufficient for rescue. With the identity of the Wolbachia CI induction and rescue factors now known, research in the field has turned to directed studies on the molecular mechanisms of CI, which we review here.

Jana B, Fridman CM, Bosis E, Salomon D. A modular effector with a DNase domain and a marker for T6SS substrates. Nat Commun 2019;10:3595. [PMID: 31399579 PMCID: PMC6688995 DOI: 10.1038/s41467-019-11546-6] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Received: 01/15/2019] [Accepted: 07/16/2019] [Indexed: 12/30/2022] Open

Characterization of a DUF820 family protein Alr3200 of the cyanobacterium Anabaena sp. strain PCC7120. J Biosci 2016;41:589-600. [DOI: 10.1007/s12038-016-9646-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Indexed: 01/19/2023]

Afanas’ev MV, Balakhonov SV, Tokmakova EG, Polovinkina VS, Sidorova EA, Sinkov VV. Analysis of complete sequence of cryptic plasmid pTP33 from Yersinia pestis isolated in Tuva natural focus of plague. Russ J Genet 2016. [DOI: 10.1134/s1022795416090027] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 11/23/2022]

Johnson PM, Gucinski GC, Garza-Sánchez F, Wong T, Hung LW, Hayes CS, Goulding CW. Functional Diversity of Cytotoxic tRNase/Immunity Protein Complexes from Burkholderia pseudomallei. J Biol Chem 2016;291:19387-400. [PMID: 27445337 DOI: 10.1074/jbc.m116.736074] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Received: 05/03/2016] [Indexed: 12/23/2022] Open

Lopes-Kulishev CO, Alves IR, Valencia EY, Pidhirnyj MI, Fernández-Silva FS, Rodrigues TR, Guzzo CR, Galhardo RS. Functional characterization of two SOS-regulated genes involved in mitomycin C resistance in Caulobacter crescentus. DNA Repair (Amst) 2015;33:78-89. [DOI: 10.1016/j.dnarep.2015.06.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 11/04/2014] [Revised: 06/24/2015] [Accepted: 06/26/2015] [Indexed: 10/23/2022]

Hooton SPT, Timms AR, Cummings NJ, Moreton J, Wilson R, Connerton IF. The complete plasmid sequences of Salmonella enterica serovar Typhimurium U288. Plasmid 2014;76:32-9. [PMID: 25175817 DOI: 10.1016/j.plasmid.2014.08.002] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Received: 10/24/2013] [Revised: 08/11/2014] [Accepted: 08/21/2014] [Indexed: 12/20/2022]

Mukha DV, Pasyukova EG, Kapelinskaya TV, Kagramanova AS. Endonuclease domain of the Drosophila melanogaster R2 non-LTR retrotransposon and related retroelements: a new model for transposition. Front Genet 2013;4:63. [PMID: 23637706 PMCID: PMC3636483 DOI: 10.3389/fgene.2013.00063] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Received: 11/11/2012] [Accepted: 04/05/2013] [Indexed: 01/25/2023] Open

Steczkiewicz K, Muszewska A, Knizewski L, Rychlewski L, Ginalski K. Sequence, structure and functional diversity of PD-(D/E)XK phosphodiesterase superfamily. Nucleic Acids Res 2012;40:7016-45. [PMID: 22638584 PMCID: PMC3424549 DOI: 10.1093/nar/gks382] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.1] [Indexed: 01/04/2023] Open

Zylicz-Stachula A, Zolnierkiewicz O, Lubys A, Ramanauskaite D, Mitkaite G, Bujnicki JM, Skowron PM. Related bifunctional restriction endonuclease-methyltransferase triplets: TspDTI, Tth111II/TthHB27I and TsoI with distinct specificities. BMC Mol Biol 2012;13:13. [PMID: 22489904 PMCID: PMC3384240 DOI: 10.1186/1471-2199-13-13] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Received: 01/19/2012] [Accepted: 04/10/2012] [Indexed: 01/05/2023] Open

Laganeckas M, Margelevicius M, Venclovas C. Identification of new homologs of PD-(D/E)XK nucleases by support vector machines trained on data derived from profile-profile alignments. Nucleic Acids Res 2010;39:1187-96. [PMID: 20961958 PMCID: PMC3045609 DOI: 10.1093/nar/gkq958] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Indexed: 01/09/2023] Open

The crystal structure of D212 from Sulfolobus spindle-shaped virus Ragged Hills reveals a new member of the PD-(D/E)XK nuclease superfamily. J Virol 2010;84:5890-7. [PMID: 20375162 PMCID: PMC2876643 DOI: 10.1128/jvi.01663-09] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Indexed: 01/07/2023] Open

Bertrand L, Leiva-Torres GA, Hyjazie H, Pearson A. Conserved residues in the UL24 protein of herpes simplex virus 1 are important for dispersal of the nucleolar protein nucleolin. J Virol 2010;84:109-18. [PMID: 19864385 PMCID: PMC2798432 DOI: 10.1128/jvi.01428-09] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Received: 07/10/2009] [Accepted: 10/20/2009] [Indexed: 12/13/2022] Open

Makarova KS, Wolf YI, Koonin EV. Comprehensive comparative-genomic analysis of type 2 toxin-antitoxin systems and related mobile stress response systems in prokaryotes. Biol Direct 2009;4:19. [PMID: 19493340 PMCID: PMC2701414 DOI: 10.1186/1745-6150-4-19] [Citation(s) in RCA: 318] [Impact Index Per Article: 21.2] [Received: 05/26/2009] [Accepted: 06/03/2009] [Indexed: 11/13/2022] Open

Abstract

Background

The prokaryotic toxin-antitoxin systems (TAS, also referred to as TA loci) are widespread, mobile two-gene modules that can be viewed as selfish genetic elements because they evolved mechanisms to become addictive for replicons and cells in which they reside, but also possess "normal" cellular functions in various forms of stress response and management of prokaryotic population. Several distinct TAS of type 1, where the toxin is a protein and the antitoxin is an antisense RNA, and numerous, unrelated TAS of type 2, in which both the toxin and the antitoxin are proteins, have been experimentally characterized, and it is suspected that many more remain to be identified.

Results

We report a comprehensive comparative-genomic analysis of Type 2 toxin-antitoxin systems in prokaryotes. Using sensitive methods for distant sequence similarity search, genome context analysis and a new approach for the identification of mobile two-component systems, we identified numerous, previously unnoticed protein families that are homologous to toxins and antitoxins of known type 2 TAS. In addition, we predict 12 new families of toxins and 13 families of antitoxins, and also, predict a TAS or TAS-like activity for several gene modules that were not previously suspected to function in that capacity. In particular, we present indications that the two-gene module that encodes a minimal nucleotidyl transferase and the accompanying HEPN protein, and is extremely abundant in many archaea and bacteria, especially, thermophiles might comprise a novel TAS. We present a survey of previously known and newly predicted TAS in 750 complete genomes of archaea and bacteria, quantitatively demonstrate the exceptional mobility of the TAS, and explore the network of toxin-antitoxin pairings that combines plasticity with selectivity.

Conclusion

The defining properties of the TAS, namely, the typically small size of the toxin and antitoxin genes, fast evolution, and extensive horizontal mobility, make the task of comprehensive identification of these systems particularly challenging. However, these same properties can be exploited to develop context-based computational approaches which, combined with exhaustive analysis of subtle sequence similarities were employed in this work to substantially expand the current collection of TAS by predicting both previously unnoticed, derived versions of known toxins and antitoxins, and putative novel TAS-like systems. In a broader context, the TAS belong to the resistome domain of the prokaryotic mobilome which includes partially selfish, addictive gene cassettes involved in various aspects of stress response and organized under the same general principles as the TAS. The "selfish altruism", or "responsible selfishness", of TAS-like systems appears to be a defining feature of the resistome and an important characteristic of the entire prokaryotic pan-genome given that in the prokaryotic world the mobilome and the "stable" chromosomes form a dynamic continuum.

Reviewers

This paper was reviewed by Kenn Gerdes (nominated by Arcady Mushegian), Daniel Haft, Arcady Mushegian, and Andrei Osterman. For full reviews, go to the Reviewers' Reports section.

Zylicz-Stachula A, Bujnicki JM, Skowron PM. Cloning and analysis of a bifunctional methyltransferase/restriction endonuclease TspGWI, the prototype of a Thermus sp. enzyme family. BMC Mol Biol 2009;10:52. [PMID: 19480701 PMCID: PMC2700111 DOI: 10.1186/1471-2199-10-52] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Received: 12/31/2008] [Accepted: 05/29/2009] [Indexed: 01/09/2023] Open

Abstract

Background

Restriction-modification systems are a diverse class of enzymes. They are classified into four major types: I, II, III and IV. We have previously proposed the existence of a Thermus sp. enzyme family, which belongs to type II restriction endonucleases (REases), however, it features also some characteristics of types I and III. Members include related thermophilic endonucleases: TspGWI, TaqII, TspDTI, and Tth111II.

Results

Here we describe cloning, mutagenesis and analysis of the prototype TspGWI enzyme that recognises the 5'-ACGGA-3' site and cleaves 11/9 nt downstream. We cloned, expressed, and mutagenised the tspgwi gene and investigated the properties of its product, the bifunctional TspGWI restriction/modification enzyme. Since TspGWI does not cleave DNA completely, a cloning method was devised, based on amino acid sequencing of internal proteolytic fragments. The deduced amino acid sequence of the enzyme shares significant sequence similarity with another representative of the Thermus sp. family – TaqII. Interestingly, these enzymes recognise similar, yet different sequences in the DNA. Both enzymes cleave DNA at the same distance, but differ in their ability to cleave single sites and in the requirement of S-adenosylmethionine as an allosteric activator for cleavage. Both the restriction endonuclease (REase) and methyltransferase (MTase) activities of wild type (wt) TspGWI (either recombinant or isolated from Thermus sp.) are dependent on the presence of divalent cations.

Conclusion

TspGWI is a bifunctional protein comprising a tandem arrangement of Type I-like domains; particularly noticeable is the central HsdM-like module comprising a helical domain and a highly conserved S-adenosylmethionine-binding/catalytic MTase domain, containing DPAVGTG and NPPY motifs. TspGWI also possesses an N-terminal PD-(D/E)XK nuclease domain related to the corresponding domains in HsdR subunits, but lacks the ATP-dependent translocase module of the HsdR subunit and the additional domains that are involved in subunit-subunit interactions in Type I systems. The MTase and REase activities of TspGWI are autonomous and can be uncoupled. Structurally and functionally, the TspGWI protomer appears to be a streamlined 'half' of a Type I enzyme.

Type II restriction endonuclease R.Hpy188I belongs to the GIY-YIG nuclease superfamily, but exhibits an unusual active site. BMC Struct Biol 2008;8:48. [PMID: 19014591 PMCID: PMC2630997 DOI: 10.1186/1472-6807-8-48] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Received: 07/06/2008] [Accepted: 11/14/2008] [Indexed: 11/10/2022]

Abstract

BACKGROUND

Catalytic domains of Type II restriction endonucleases (REases) belong to a few unrelated three-dimensional folds. While the PD-(D/E)XK fold is most common among these enzymes, crystal structures have been also determined for single representatives of two other folds: PLD (R.BfiI) and half-pipe (R.PabI). Bioinformatics analyses supported by mutagenesis experiments suggested that some REases belong to the HNH fold (e.g. R.KpnI), and that a small group represented by R.Eco29kI belongs to the GIY-YIG fold. However, for a large fraction of REases with known sequences, the three-dimensional fold and the architecture of the active site remain unknown, mostly due to extreme sequence divergence that hampers detection of homology to enzymes with known folds.

RESULTS

R.Hpy188I is a Type II REase with unknown structure. PSI-BLAST searches of the non-redundant protein sequence database reveal only 1 homolog (R.HpyF17I, with nearly identical amino acid sequence and the same DNA sequence specificity). Standard application of state-of-the-art protein fold-recognition methods failed to predict the relationship of R.Hpy188I to proteins with known structure or to other protein families. In order to increase the amount of evolutionary information in the multiple sequence alignment, we have expanded our sequence database searches to include sequences from metagenomics projects. This search resulted in identification of 23 further members of R.Hpy188I family, both from metagenomics and the non-redundant database. Moreover, fold-recognition analysis of the extended R.Hpy188I family revealed its relationship to the GIY-YIG domain and allowed for computational modeling of the R.Hpy188I structure. Analysis of the R.Hpy188I model in the light of sequence conservation among its homologs revealed an unusual variant of the active site, in which the typical Tyr residue of the YIG half-motif had been substituted by a Lys residue. Moreover, some of its homologs have the otherwise invariant Arg residue in a non-homologous position in sequence that nonetheless allows for spatial conservation of the guanidino group potentially involved in phosphate binding.

CONCLUSION

The present study eliminates a significant "white spot" on the structural map of REases. It also provides important insight into sequence-structure-function relationships in the GIY-YIG nuclease superfamily. Our results reveal that in the case of proteins with no or few detectable homologs in the standard "non-redundant" database, it is useful to expand this database by adding the metagenomic sequences, which may provide evolutionary linkage to detect more remote homologs.

Orlowski J, Bujnicki JM. Structural and evolutionary classification of Type II restriction enzymes based on theoretical and experimental analyses. Nucleic Acids Res 2008;36:3552-69. [PMID: 18456708 PMCID: PMC2441816 DOI: 10.1093/nar/gkn175] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Indexed: 01/05/2023] Open

Abstract

For a very long time, Type II restriction enzymes (REases) have been a paradigm of ORFans: proteins with no detectable similarity to each other and to any other protein in the database, despite common cellular and biochemical function. Crystallographic analyses published until January 2008 provided high-resolution structures for only 28 of 1637 Type II REase sequences available in the Restriction Enzyme database (REBASE). Among these structures, all but two possess catalytic domains with the common PD-(D/E)XK nuclease fold. Two structures are unrelated to the others: R.BfiI exhibits the phospholipase D (PLD) fold, while R.PabI has a new fold termed 'half-pipe'. Thus far, bioinformatic studies supported by site-directed mutagenesis have extended the number of tentatively assigned REase folds to five (now including also GIY-YIG and HNH folds identified earlier in homing endonucleases) and provided structural predictions for dozens of REase sequences without experimentally solved structures. Here, we present a comprehensive study of all Type II REase sequences available in REBASE together with their homologs detectable in the nonredundant and environmental samples databases at the NCBI. We present the summary and critical evaluation of structural assignments and predictions reported earlier, new classification of all REase sequences into families, domain architecture analysis and new predictions of three-dimensional folds. Among 289 experimentally characterized (not putative) Type II REases, whose apparently full-length sequences are available in REBASE, we assign 199 (69%) to contain the PD-(D/E)XK domain. The HNH domain is the second most common, with 24 (8%) members. When putative REases are taken into account, the fraction of PD-(D/E)XK and HNH folds changes to 48% and 30%, respectively. Fifty-six characterized (and 521 predicted) REases remain unassigned to any of the five REase folds identified so far, and may exhibit new architectures. 
These enzymes are proposed as the most interesting targets for structure determination by high-resolution experimental methods. Our analysis provides the first comprehensive map of sequence-structure relationships among Type II REases and will help to focus the efforts of structural and functional genomics of this large and biotechnologically important class of enzymes.
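
The fold-assignment percentages quoted in the abstract above can be checked with a few lines of arithmetic. The sketch below is illustrative only: the counts (289, 199, 24, 56) are taken from the abstract, while the variable names and the remainder obtained by subtraction are assumptions, since the abstract does not state how many characterized REases fall into the other assigned folds.

```python
# Counts quoted in the abstract (experimentally characterized Type II REases).
TOTAL_CHARACTERIZED = 289
assigned = {
    "PD-(D/E)XK": 199,
    "HNH": 24,
    "unassigned": 56,
}


def percent(count, total):
    """Return count as a percentage of total, rounded to the nearest integer."""
    return round(100 * count / total)


shares = {fold: percent(n, TOTAL_CHARACTERIZED) for fold, n in assigned.items()}

# Inferred by subtraction (an assumption, not stated in the abstract):
# whatever is left would belong to the other assigned folds.
remainder = TOTAL_CHARACTERIZED - sum(assigned.values())

for fold, share in shares.items():
    print(f"{fold}: {assigned[fold]} of {TOTAL_CHARACTERIZED} = {share}%")
print(f"remaining (other folds, inferred): {remainder}")
```

The rounded shares for PD-(D/E)XK and HNH agree with the 69% and 8% given in the abstract.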

Obarska-Kosinska A, Taylor JEN, Callow P, Orlowski J, Bujnicki JM, Kneale GG. HsdR subunit of the type I restriction-modification enzyme EcoR124I: biophysical characterisation and structural modelling. J Mol Biol 2008;376:438-452. [PMID: 18164032 PMCID: PMC2878639 DOI: 10.1016/j.jmb.2007.11.024] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Received: 10/10/2007] [Revised: 11/08/2007] [Accepted: 11/09/2007] [Indexed: 01/19/2023]

Abstract

Type I restriction-modification (RM) systems are large, multifunctional enzymes composed of three different subunits. HsdS and HsdM form a complex in which HsdS recognizes the target DNA sequence, and HsdM carries out methylation of adenosine residues. The HsdR subunit, when associated with the HsdS-HsdM complex, translocates DNA in an ATP-dependent process and cleaves unmethylated DNA at a distance of several thousand base-pairs from the recognition site. The molecular mechanism by which these enzymes translocate the DNA is not fully understood, in part because of the absence of crystal structures. To date, crystal structures have been determined for the individual HsdS and HsdM subunits and models have been built for the HsdM-HsdS complex with the DNA. However, no structure is available for the HsdR subunit. In this work, the gene coding for the HsdR subunit of EcoR124I was re-sequenced, which showed that there was an error in the published sequence. This changed the position of the stop codon and altered the last 17 amino acid residues of the protein sequence. An improved purification procedure was developed to enable HsdR to be purified efficiently for biophysical and structural analysis. Analytical ultracentrifugation shows that HsdR is monomeric in solution, and the frictional ratio of 1.21 indicates that the subunit is globular and fairly compact. Small angle neutron-scattering of the HsdR subunit indicates a radius of gyration of 3.4 nm and a maximum dimension of 10 nm. We constructed a model of the HsdR using protein fold-recognition and homology modelling to model individual domains, and small-angle neutron scattering data as restraints to combine them into a single molecule. The model reveals an ellipsoidal shape of the enzymatic core comprising the N-terminal and central domains, and suggests conformational heterogeneity of the C-terminal region implicated in binding of HsdR to the HsdS-HsdM complex.

Functional differentiation of proteins: implications for structural genomics. Structure 2007;15:405-15. [PMID: 17437713 DOI: 10.1016/j.str.2007.02.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Received: 12/27/2006] [Revised: 02/15/2007] [Accepted: 02/16/2007] [Indexed: 01/06/2023]

Guzzo CR, Nagem RAP, Barbosa JARG, Farah CS. Structure of Xanthomonas axonopodis pv. citri YaeQ reveals a new compact protein fold built around a variation of the PD-(D/E)XK nuclease motif. Proteins 2007;69:644-51. [PMID: 17623842 DOI: 10.1002/prot.21556] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Indexed: 01/29/2023]

Cymerman IA, Obarska A, Skowronek KJ, Lubys A, Bujnicki JM. Identification of a new subfamily of HNH nucleases and experimental characterization of a representative member, HphI restriction endonuclease. Proteins 2007;65:867-76. [PMID: 17029241 DOI: 10.1002/prot.21156] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Indexed: 12/18/2022]

Chovancová E, Kosinski J, Bujnicki JM, Damborský J. Phylogenetic analysis of haloalkane dehalogenases. Proteins 2007;67:305-16. [PMID: 17295320 DOI: 10.1002/prot.21313] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.3] [Indexed: 01/30/2023]

Abstract

Haloalkane dehalogenases (HLDs) are enzymes that catalyze the cleavage of carbon-halogen bonds by a hydrolytic mechanism. Although comparative biochemical analyses have been published, no classification system that reconciles the phylogenetic and functional relationships of HLDs has been proposed to date. In the study presented here, we have analyzed all sequences and structures of genuine HLDs and their homologs detectable by database searches. Phylogenetic analyses revealed that the HLD family can be divided into three subfamilies denoted HLD-I, HLD-II, and HLD-III, of which HLD-I and HLD-III are predicted to be sister groups. A mismatch between the HLD protein tree and the tree of species, as well as the presence of more than one HLD gene in a few genomes, suggests that horizontal gene transfers, and perhaps also multiple gene duplications and losses, have been involved in the evolution of this family. Most of the biochemically characterized HLDs are found in the HLD-II subfamily. The dehalogenating activity of two members of the newly identified HLD-III subfamily has only recently been confirmed, in a study motivated by this phylogenetic analysis. A novel type of catalytic pentad (Asp-His-Asp+Asn-Trp) was predicted for members of the HLD-III subfamily. Calculation of the evolutionary rates and lineage-specific innovations revealed a common conserved core as well as a set of residues that characterizes each HLD subfamily. The N-terminal part of the cap domain is one of the most variable regions within the whole family as well as within individual subfamilies, and serves as a preferential site for relatively long insertions. The highest variability of discrete sites was observed among residues that are structural components of the access channels. Mutations at these sites modify the anatomy of the channels, which are important for the exchange of ligands between the buried active site and the bulk solvent, thus creating a structural basis for the molecular evolution of new substrate specificities. Our analysis sheds light on the evolutionary history of HLDs and provides a structural framework for designing enzymes with new specificities.

Tamulaitiene G, Jakubauskas A, Urbanke C, Huber R, Grazulis S, Siksnys V. The crystal structure of the rare-cutting restriction enzyme SdaI reveals unexpected domain architecture. Structure 2006;14:1389-400. [PMID: 16962970 DOI: 10.1016/j.str.2006.07.002] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Received: 04/24/2006] [Revised: 07/04/2006] [Accepted: 07/05/2006] [Indexed: 01/31/2023]

Koliński A, Bujnicki JM. Generalized protein structure prediction based on combination of fold-recognition with de novo folding and evaluation of models. Proteins 2006;61 Suppl 7:84-90. [PMID: 16187348 DOI: 10.1002/prot.20723] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.7] [Indexed: 12/23/2022]

Abstract

To predict the tertiary structure of full-length sequences of all targets in CASP6, regardless of their potential category (from easy comparative modeling to fold recognition to apparent new folds), we used a novel combination of two very different approaches developed independently in our laboratories, which ranked quite well in different categories in CASP5. First, the GeneSilico metaserver was used to identify domains, predict secondary structure, and generate fold recognition (FR) alignments, which were converted to full-atom models using the "FRankenstein's Monster" approach for comparative modeling (CM) by recombination of protein fragments. Additional models generated "de novo" by fully automated servers were obtained from the CASP website. All these models were evaluated by VERIFY3D, and residues with scores better than 0.2 were used as a source of spatial restraints. Second, a new implementation of the lattice-based protein modeling tool CABS was used to carry out folding guided by the above-mentioned restraints with the Replica Exchange Monte Carlo sampling technique. Decoys generated in the course of simulation were subjected to average-linkage hierarchical clustering. For a representative decoy from each cluster, a full-atom model was rebuilt. Finally, five models were selected for submission based on a combination of various criteria, including the size, density, and average energy of the corresponding cluster, and the visual evaluation of the full-atom structures and their relationship to the original templates. The combination of FRankenstein and CABS was one of the best-performing algorithms over all categories in CASP6 (it is important to note that our human intervention was very limited, and all steps in our method can be easily automated). We were able to generate a number of very good models, especially in the Comparative Modeling and New Folds categories. Frequently, the best models were closer to the native structure than any of the templates used. The main problem we encountered was in the ranking of the final models (the only step of significant human intervention), due to insufficient computational power, which precluded full-atom refinement and energy-based evaluation.
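
The restraint-selection step described in this abstract (keep only residues whose VERIFY3D score is better than 0.2, then derive spatial restraints from them) might be sketched roughly as below. This is only an illustration under stated assumptions, not the authors' implementation: the data layout (one score and one C-alpha coordinate per residue), the function names, and the 10 Å distance cutoff are all hypothetical; only the 0.2 score threshold comes from the abstract.

```python
from itertools import combinations
from math import dist

def reliable_residues(scores, threshold=0.2):
    """Indices of residues whose per-residue score exceeds the threshold (0.2 in the abstract)."""
    return [i for i, s in enumerate(scores) if s > threshold]

def distance_restraints(coords, scores, threshold=0.2, max_dist=10.0):
    """Pairwise distance restraints between reliable residues.

    coords   : list of (x, y, z) tuples, one per residue (hypothetical layout)
    max_dist : assumed cutoff in angstroms; not taken from the source
    """
    keep = reliable_residues(scores, threshold)
    restraints = []
    for i, j in combinations(keep, 2):
        d = dist(coords[i], coords[j])
        if d <= max_dist:
            restraints.append((i, j, d))
    return restraints
```

For example, `distance_restraints([(0, 0, 0), (3, 4, 0), (0, 0, 20)], [0.5, 0.3, 0.1])` keeps only the first two residues and yields a single restraint between them.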

Dunin-Horkawicz S, Feder M, Bujnicki JM. Phylogenomic analysis of the GIY-YIG nuclease superfamily. BMC Genomics 2006;7:98. [PMID: 16646971 PMCID: PMC1564403 DOI: 10.1186/1471-2164-7-98] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.3] [Received: 01/29/2006] [Accepted: 04/28/2006] [Indexed: 11/28/2022] Open

Tress ML, Cozzetto D, Tramontano A, Valencia A. An analysis of the Sargasso Sea resource and the consequences for database composition. BMC Bioinformatics 2006;7:213. [PMID: 16623953 PMCID: PMC1513258 DOI: 10.1186/1471-2105-7-213] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Received: 12/22/2005] [Accepted: 04/19/2006] [Indexed: 01/20/2023] Open

Skowronek KJ, Kosinski J, Bujnicki JM. Theoretical model of restriction endonuclease HpaI in complex with DNA, predicted by fold recognition and validated by site-directed mutagenesis. Proteins 2006;63:1059-68. [PMID: 16498623 DOI: 10.1002/prot.20920] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Indexed: 12/20/2022]

Abstract

Type II restriction enzymes are commercially important deoxyribonucleases and very attractive targets for protein engineering of new specificities. At the same time they are a very challenging test bed for protein structure prediction methods. Typically, enzymes that recognize different sequences show little or no amino acid sequence similarity to each other and to other proteins. Based on crystallographic analyses that revealed the same PD-(D/E)XK fold for more than a dozen case studies, they were nevertheless considered to be related, until a combination of bioinformatics and mutational analyses demonstrated that some of these proteins belong to other, unrelated folds: PLD, HNH, and GIY-YIG. As part of a large-scale project aiming at identification of a three-dimensional fold for all type II REases with known sequences (currently approximately 1000 proteins), we carried out preliminary structure prediction and selected candidates for experimental validation. Here, we present the analysis of HpaI REase, an ORFan with no detectable homologs, for which we detected a structural template by protein fold recognition, constructed a model using the "FRankenstein's monster" approach, and identified a number of residues important for DNA binding and catalysis. These predictions were confirmed by site-directed mutagenesis and in vitro analysis of the mutant proteins. The experimentally validated model of HpaI will serve as a low-resolution structural platform for evolutionary considerations in the subgroup of blunt-cutting REases with different specificities. The research protocol developed in the course of this work represents a streamlined version of the previously used techniques and can be used in a high-throughput fashion to build and validate models for other enzymes, especially ORFans that exhibit no sequence similarity to any other protein in the database.

Zhao F, Zhang X, Liang C, Wu J, Bao Q, Qin S. Genome-wide analysis of restriction-modification system in unicellular and filamentous cyanobacteria. Physiol Genomics 2005;24:181-90. [PMID: 16368872 DOI: 10.1152/physiolgenomics.00255.2005] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Indexed: 01/02/2023] Open

Armalyte E, Bujnicki JM, Giedriene J, Gasiunas G, Kosiński J, Lubys A. Mva1269I: a monomeric type IIS restriction endonuclease from Micrococcus varians with two EcoRI- and FokI-like catalytic domains. J Biol Chem 2005;280:41584-94. [PMID: 16223716 DOI: 10.1074/jbc.m506775200] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Indexed: 01/04/2023] Open

Kosinski J, Feder M, Bujnicki JM. The PD-(D/E)XK superfamily revisited: identification of new members among proteins involved in DNA metabolism and functional predictions for domains of (hitherto) unknown function. BMC Bioinformatics 2005;6:172. [PMID: 16011798 PMCID: PMC1189080 DOI: 10.1186/1471-2105-6-172] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Received: 04/13/2005] [Accepted: 07/12/2005] [Indexed: 01/02/2023] Open

Abstract

BACKGROUND

The PD-(D/E)XK nuclease superfamily, initially identified in type II restriction endonucleases and later in many enzymes involved in DNA recombination and repair, is one of the most challenging targets for protein sequence analysis and structure prediction. Typically, the sequence similarity between these proteins is so low that most of the relationships between known members of the PD-(D/E)XK superfamily were identified only after the corresponding structures were determined experimentally. Thus, it is tempting to speculate that among the uncharacterized protein families there are potential nucleases that remain to be discovered, but their identification requires more sensitive tools than traditional PSI-BLAST searches.

RESULTS

The low degree of amino acid conservation hampers the identification of new members of the PD-(D/E)XK superfamily based solely on sequence comparisons to known members. Therefore, we used a recently developed method, HHsearch, for sensitive detection of remote similarities between protein families represented as profile Hidden Markov Models enhanced by secondary structure. To detect significant similarities, we compared known families of PD-(D/E)XK nucleases to a database comprising the COG and PFAM profiles corresponding to both functionally characterized and uncharacterized protein families. The initial candidates for new nucleases were subsequently verified by sequence-structure threading, comparative modeling, and identification of potential active site residues.

CONCLUSION

In this article, we report identification of the PD-(D/E)XK nuclease domain in numerous proteins implicated in interactions with DNA but with unknown structure and mechanism of action (such as putative recombinase RmuC, DNA competence factor CoiA, a DNA-binding protein SfsA, a large human protein predicted to be a DNA repair enzyme, predicted archaeal transcription regulators, and the head completion protein of phage T4) and in proteins for which no function has been assigned to date (such as YhcG, various phage proteins, and novel candidates for restriction enzymes). Our results contribute to the reduction of "white spaces" on the sequence-structure-function map of the protein universe and will help to jump-start the experimental characterization of new nucleases, many of which may be important for a complete understanding of the mechanisms that govern the evolution and stability of the genome.

Rigden DJ. An inactivated nuclease-like domain in RecC with novel function: implications for evolution. BMC Struct Biol 2005;5:9. [PMID: 15985153 PMCID: PMC1185551 DOI: 10.1186/1472-6807-5-9] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Received: 04/15/2005] [Accepted: 06/28/2005] [Indexed: 02/03/2023]

Chmiel AA, Radlinska M, Pawlak SD, Krowarsch D, Bujnicki JM, Skowronek KJ. A theoretical model of restriction endonuclease NlaIV in complex with DNA, predicted by fold recognition and validated by site-directed mutagenesis and circular dichroism spectroscopy. Protein Eng Des Sel 2005;18:181-9. [PMID: 15849215 DOI: 10.1093/protein/gzi019] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Indexed: 01/10/2023] Open

Abstract

Restriction enzymes (REases) are commercial reagents commonly used in DNA manipulations and mapping. They are regarded as very attractive models for studying protein-DNA interactions and valuable targets for protein engineering. Their amino acid sequences usually show no similarities to other proteins, with rare exceptions of other REases that recognize identical or very similar sequences. Hence, they are extremely hard targets for structure prediction and modeling. NlaIV is a Type II REase, which recognizes the interrupted palindromic sequence GGNNCC (where N indicates any base) and cleaves it in the middle, leaving blunt ends. NlaIV shows no sequence similarity to other proteins and virtually nothing is known about its sequence-structure-function relationships. Using protein fold recognition, we identified a remote relationship between NlaIV and EcoRV, an extensively studied REase, which recognizes the GATATC sequence and whose crystal structure has been determined. Using the 'FRankenstein's monster' approach we constructed a comparative model of NlaIV based on the EcoRV template and used it to predict the catalytic and DNA-binding residues. The model was validated by site-directed mutagenesis and analysis of the activity of the mutants in vivo and in vitro as well as structural characterization of the wild-type enzyme and two mutants by circular dichroism spectroscopy. The structural model of the NlaIV-DNA complex suggests regions of the protein sequence that may interact with the 'non-specific' bases of the target and thus it provides insight into the evolution of sequence specificity in restriction enzymes and may help engineer REases with novel specificities. Before this analysis was carried out, neither the three-dimensional fold of NlaIV, nor its evolutionary relationships, nor its catalytic or DNA-binding residues were known. Hence our analysis may be regarded as a paradigm for studies aiming at reducing 'white spaces' on the evolutionary landscape of sequence-function relationships by combining bioinformatics with simple experimental assays.
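
As a small illustration of the specificity described in this abstract (NlaIV recognizes GGNNCC and cuts in the middle, leaving blunt ends), the following sketch locates sites on a linear top-strand sequence and splits it at the predicted cut points. The function names are hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence; it models only the sequence pattern, not the enzyme's actual behaviour on real substrates.

```python
import re

# Recognition site from the abstract: GGNNCC, where N is any base.
# A lookahead is used so that overlapping sites are all reported.
SITE = re.compile(r"(?=GG[ACGT]{2}CC)")

def nlaiv_cut_positions(seq: str) -> list[int]:
    """0-based indices at which the sequence is cut (after the third base of each site)."""
    return [m.start() + 3 for m in SITE.finditer(seq.upper())]

def digest(seq: str) -> list[str]:
    """Split a linear sequence into blunt-ended fragments at every predicted cut."""
    seq = seq.upper()
    bounds = [0, *nlaiv_cut_positions(seq), len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]
```

For instance, `digest("AAGGATCCTT")` finds the single site GGATCC and returns the two fragments `AAGGA` and `TCCTT`. Because the site is palindromic and the cut is central, the same position applies to both strands, which is why a single-strand scan suffices here.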
