1
|
Ding F, Liu F, Shao W, Chu J, Wu B, He B. Efficient Synthesis of Crocins from Crocetin by a Microbial Glycosyltransferase from Bacillus subtilis 168. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2018; 66:11701-11708. [PMID: 30350978 DOI: 10.1021/acs.jafc.8b04274] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Crocins are the most important active ingredient found in Crocus sativus, a well-known "plant gold". The glycosyltransferase-catalyzed glycosylation of crocetin is the last step of biosynthesizing crocins and contributes to their structural diversity. Crocin biosynthesis is now hampered by the lack of efficient glycosyltransferases with activity toward crocetin. In this study, two microbial glycosyltransferases (Bs-GT and Bc-GTA) were successfully mined based on the comprehensive analysis of the PSPG motif and the N-terminal motif of the target plant-derived UGT75L6 and Cs-GT2. Bs-GT from Bacillus subtilis 168, an enzyme with a higher activity of glycosylation toward crocetin than that of Bc-GTA, was characterized. The efficient synthesis of crocins from crocetin catalyzed by microbial GT (Bs-GT) was first reported with a high molecular conversion rate of 81.9%, resulting in the production of 476.8 mg/L of crocins. The glycosylation of crocetin on its carboxyl groups by Bs-GT specifically produced crocin-5 and crocin-3, the important rare crocins.
Collapse
Affiliation(s)
- Fangyu Ding
- College of Biotechnology and Pharmaceutical Engineering , Nanjing Tech University , No. 30 Puzhu South Road , Nanjing 211816 , China
| | - Feng Liu
- College of Biotechnology and Pharmaceutical Engineering , Nanjing Tech University , No. 30 Puzhu South Road , Nanjing 211816 , China
| | - Wenming Shao
- College of Biotechnology and Pharmaceutical Engineering , Nanjing Tech University , No. 30 Puzhu South Road , Nanjing 211816 , China
| | - Jianlin Chu
- School of Pharmaceutical Sciences , Nanjing Tech University , No. 30 Puzhu South Road , Nanjing 211816 , China
- Jiangsu National Synergetic Innovation Center for Advanced Materials , 30 Puzhunan Road , Nanjing 211816 , China
| | - Bin Wu
- College of Biotechnology and Pharmaceutical Engineering , Nanjing Tech University , No. 30 Puzhu South Road , Nanjing 211816 , China
| | - Bingfang He
- College of Biotechnology and Pharmaceutical Engineering , Nanjing Tech University , No. 30 Puzhu South Road , Nanjing 211816 , China
- School of Pharmaceutical Sciences , Nanjing Tech University , No. 30 Puzhu South Road , Nanjing 211816 , China
| |
Collapse
|
2
|
Tomar JS, Peddinti RK. Optimized method for TAG protein homology modeling: In silico and experimental structural characterization. Int J Biol Macromol 2016; 88:102-12. [DOI: 10.1016/j.ijbiomac.2016.03.047] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Revised: 03/21/2016] [Accepted: 03/22/2016] [Indexed: 01/03/2023]
|
3
|
Liang J, Blumenthal RM. Naturally-occurring, dually-functional fusions between restriction endonucleases and regulatory proteins. BMC Evol Biol 2013; 13:218. [PMID: 24083337 PMCID: PMC3850674 DOI: 10.1186/1471-2148-13-218] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2013] [Accepted: 10/01/2013] [Indexed: 01/03/2023] Open
Abstract
Background Restriction-modification (RM) systems appear to play key roles in modulating gene flow among bacteria and archaea. Because the restriction endonuclease (REase) is potentially lethal to unmethylated new host cells, regulation to ensure pre-expression of the protective DNA methyltransferase (MTase) is essential to the spread of RM genes. This is particularly true for Type IIP RM systems, in which the REase and MTase are separate, independently-active proteins. A substantial subset of Type IIP RM systems are controlled by an activator-repressor called C protein. In these systems, C controls the promoter for its own gene, and for the downstream REase gene that lacks its own promoter. Thus MTase is expressed immediately after the RM genes enter a new cell, while expression of REase is delayed until sufficient C protein accumulates. To study the variation in and evolution of this regulatory mechanism, we searched for RM systems closely related to the well-studied C protein-dependent PvuII RM system. Unexpectedly, among those found were several in which the C protein and REase genes were fused. Results The gene for CR.NsoJS138I fusion protein (nsoJS138ICR, from the bacterium Niabella soli) was cloned, and the fusion protein produced and partially purified. Western blots provided no evidence that, under the conditions tested, anything other than full-length fusion protein is produced. This protein had REase activity in vitro and, as expected from the sequence similarity, its specificity was indistinguishable from that for PvuII REase, though the optimal reaction conditions were different. Furthermore, the fusion was active as a C protein, as revealed by in vivo activation of a lacZ reporter fusion to the promoter region for the nsoJS138ICR gene. Conclusions Fusions between C proteins and REases have not previously been characterized, though other fusions have (such as between REases and MTases). These results reinforce the evidence for impressive modularity among RM system proteins, and raise important questions about the implications of the C-REase fusions on expression kinetics of these RM systems.
Collapse
Affiliation(s)
- Jixiao Liang
- Department of Medical Microbiology & Immunology, College of Medicine and Life Sciences, University of Toledo, 3100 Transverse Drive, Toledo, OH 43614, USA.
| | | |
Collapse
|
4
|
Steczkiewicz K, Muszewska A, Knizewski L, Rychlewski L, Ginalski K. Sequence, structure and functional diversity of PD-(D/E)XK phosphodiesterase superfamily. Nucleic Acids Res 2012; 40:7016-45. [PMID: 22638584 PMCID: PMC3424549 DOI: 10.1093/nar/gks382] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Proteins belonging to PD-(D/E)XK phosphodiesterases constitute a functionally diverse superfamily with representatives involved in replication, restriction, DNA repair and tRNA-intron splicing. Their malfunction in humans triggers severe diseases, such as Fanconi anemia and Xeroderma pigmentosum. To date there have been several attempts to identify and classify new PD-(D/E)KK phosphodiesterases using remote homology detection methods. Such efforts are complicated, because the superfamily exhibits extreme sequence and structural divergence. Using advanced homology detection methods supported with superfamily-wide domain architecture and horizontal gene transfer analyses, we provide a comprehensive reclassification of proteins containing a PD-(D/E)XK domain. The PD-(D/E)XK phosphodiesterases span over 21,900 proteins, which can be classified into 121 groups of various families. Eleven of them, including DUF4420, DUF3883, DUF4263, COG5482, COG1395, Tsp45I, HaeII, Eco47II, ScaI, HpaII and Replic_Relax, are newly assigned to the PD-(D/E)XK superfamily. Some groups of PD-(D/E)XK proteins are present in all domains of life, whereas others occur within small numbers of organisms. We observed multiple horizontal gene transfers even between human pathogenic bacteria or from Prokaryota to Eukaryota. Uncommon domain arrangements greatly elaborate the PD-(D/E)XK world. These include domain architectures suggesting regulatory roles in Eukaryotes, like stress sensing and cell-cycle regulation. Our results may inspire further experimental studies aimed at identification of exact biological functions, specific substrates and molecular mechanisms of reactions performed by these highly diverse proteins.
Collapse
Affiliation(s)
- Kamil Steczkiewicz
- Laboratory of Bioinformatics and Systems Biology, CENT, University of Warsaw, Zwirki i Wigury 93, 02-089 Warsaw, Poland
| | | | | | | | | |
Collapse
|
5
|
Lama D, Sankararamakrishnan R. Identification of Core Structural Residues in the Sequentially Diverse and Structurally Homologous Bcl-2 Family of Proteins. Biochemistry 2010; 49:2574-84. [DOI: 10.1021/bi100029k] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Affiliation(s)
- Dilraj Lama
- Department of Biological Sciences and Bioengineering, Indian Institute of Technology-Kanpur, Kanpur 208016, India
| | | |
Collapse
|
6
|
Smith RM, Josephsen J, Szczelkun MD. An Mrr-family nuclease motif in the single polypeptide restriction-modification enzyme LlaGI. Nucleic Acids Res 2010; 37:7231-8. [PMID: 19793866 PMCID: PMC2790908 DOI: 10.1093/nar/gkp795] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Bioinformatic analysis of the putative nuclease domain of the single polypeptide restriction–modification enzyme LlaGI reveals amino acid motifs characteristic of the Escherichia coli methylated DNA-specific Mrr endonuclease. Using mutagenesis, we examined the role of the conserved residues in both DNA translocation and cleavage. Mutations in those residues predicted to play a role in DNA hydrolysis produced enzymes that could translocate on DNA but were either unable to cleave the polynucleotide track or had reduced nuclease activity. Cleavage by LlaGI is not targeted to methylated DNA, suggesting that the conserved motifs in the Mrr domain are a conventional sub-family of the PD-(D/E)XK superfamily of DNA nucleases.
Collapse
Affiliation(s)
- Rachel M Smith
- DNA-Protein Interactions Unit, Department of Biochemistry, School of Medical Sciences, University of Bristol, Bristol, BS8 1TD, UK
| | | | | |
Collapse
|
7
|
Morgan RD, Dwinell EA, Bhatia TK, Lang EM, Luyten YA. The MmeI family: type II restriction-modification enzymes that employ single-strand modification for host protection. Nucleic Acids Res 2009; 37:5208-21. [PMID: 19578066 PMCID: PMC2731913 DOI: 10.1093/nar/gkp534] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
The type II restriction endonucleases form one of the largest families of biochemically-characterized proteins. These endonucleases typically share little sequence similarity, except among isoschizomers that recognize the same sequence. MmeI is an unusual type II restriction endonuclease that combines endonuclease and methyltransferase activities in a single polypeptide. MmeI cuts DNA 20 bases from its recognition sequence and modifies just one DNA strand for host protection. Using MmeI as query we have identified numerous putative genes highly similar to MmeI in database sequences. We have cloned and characterized 20 of these MmeI homologs. Each cuts DNA at the same distance as MmeI and each modifies a conserved adenine on only one DNA strand for host protection. However each enzyme recognizes a unique DNA sequence, suggesting these enzymes are undergoing rapid evolution of DNA specificity. The MmeI family thus provides a rich source of novel endonucleases while affording an opportunity to observe the evolution of DNA specificity. Because the MmeI family enzymes employ modification of only one DNA strand for host protection, unlike previously described type II systems, we propose that such single-strand modification systems be classified as a new subgroup, the type IIL enzymes, for Lone strand DNA modification.
Collapse
|
8
|
Orlowski J, Bujnicki JM. Structural and evolutionary classification of Type II restriction enzymes based on theoretical and experimental analyses. Nucleic Acids Res 2008; 36:3552-69. [PMID: 18456708 PMCID: PMC2441816 DOI: 10.1093/nar/gkn175] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
For a very long time, Type II restriction enzymes (REases) have been a paradigm of ORFans: proteins with no detectable similarity to each other and to any other protein in the database, despite common cellular and biochemical function. Crystallographic analyses published until January 2008 provided high-resolution structures for only 28 of 1637 Type II REase sequences available in the Restriction Enzyme database (REBASE). Among these structures, all but two possess catalytic domains with the common PD-(D/E)XK nuclease fold. Two structures are unrelated to the others: R.BfiI exhibits the phospholipase D (PLD) fold, while R.PabI has a new fold termed 'half-pipe'. Thus far, bioinformatic studies supported by site-directed mutagenesis have extended the number of tentatively assigned REase folds to five (now including also GIY-YIG and HNH folds identified earlier in homing endonucleases) and provided structural predictions for dozens of REase sequences without experimentally solved structures. Here, we present a comprehensive study of all Type II REase sequences available in REBASE together with their homologs detectable in the nonredundant and environmental samples databases at the NCBI. We present the summary and critical evaluation of structural assignments and predictions reported earlier, new classification of all REase sequences into families, domain architecture analysis and new predictions of three-dimensional folds. Among 289 experimentally characterized (not putative) Type II REases, whose apparently full-length sequences are available in REBASE, we assign 199 (69%) to contain the PD-(D/E)XK domain. The HNH domain is the second most common, with 24 (8%) members. When putative REases are taken into account, the fraction of PD-(D/E)XK and HNH folds changes to 48% and 30%, respectively. Fifty-six characterized (and 521 predicted) REases remain unassigned to any of the five REase folds identified so far, and may exhibit new architectures. These enzymes are proposed as the most interesting targets for structure determination by high-resolution experimental methods. Our analysis provides the first comprehensive map of sequence-structure relationships among Type II REases and will help to focus the efforts of structural and functional genomics of this large and biotechnologically important class of enzymes.
Collapse
Affiliation(s)
- Jerzy Orlowski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | | |
Collapse
|
9
|
Kosinski J, Kubareva E, Bujnicki JM. A model of restriction endonuclease MvaI in complex with DNA: a template for interpretation of experimental data and a guide for specificity engineering. Proteins 2007; 68:324-36. [PMID: 17407166 DOI: 10.1002/prot.21460] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
R.MvaI is a Type II restriction enzyme (REase), which specifically recognizes the pentanucleotide DNA sequence 5'-CCWGG-3' (W indicates A or T). It belongs to a family of enzymes, which recognize related sequences, including 5'-CCSGG-3' (S indicates G or C) in the case of R.BcnI, or 5'-CCNGG-3' (where N indicates any nucleoside) in the case of R.ScrFI. REases from this family hydrolyze the phosphodiester bond in the DNA between the 2nd and 3rd base in both strands, thereby generating a double strand break with 5'-protruding single nucleotides. So far, no crystal structures of REases with similar cleavage patterns have been solved. Characterization of sequence-structure-function relationships in this family would facilitate understanding of evolution of sequence specificity among REases and could aid in engineering of enzymes with new specificities. However, sequences of R.MvaI or its homologs show no significant similarity to any proteins with known structures, thus precluding straightforward comparative modeling. We used a fold recognition approach to identify a remote relationship between R.MvaI and the structure of DNA repair enzyme MutH, which belongs to the PD-(D/E)XK superfamily together with many other REases. We constructed a homology model of R.MvaI and used it to predict functionally important amino acid residues and the mode of interaction with the DNA. In particular, we predict that only one active site of R.MvaI interacts with the DNA target at a time, and the cleavage of both strands (5'-CCAGG-3' and 5'-CCTGG-3') is achieved by two independent catalytic events. The model is in good agreement with the available experimental data and will serve as a template for further analyses of R.MvaI, R.BcnI, R.ScrFI and other related enzymes.
Collapse
Affiliation(s)
- Jan Kosinski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland.
| | | | | |
Collapse
|
10
|
Abstract
Incorporation of the epsilon subunit into the GABAA receptor has been suggested to confer unusual, but variable, biophysical and pharmacological characteristics to both recombinant and native receptors. Due to their structural similarity with the gamma subunits, epsilon subunits have been assumed to substitute at the single position of the gamma subunit in assembled receptors. However, prior work suggests that functional variability in epsilon-containing receptors may reflect alternative sites of incorporation and of not just one, but possibly multiple epsilon subunits in the pentameric receptor complex. Here we present data indicating that increased expression of epsilon, in conjunction with alpha2 and beta3 subunits, results in expression of GABAA receptors with correspondingly altered rectification, deactivation and levels of spontaneous openings, but not increased total current density. We also provide data that the epsilon subunit, like the beta3 subunit, can self-export and data from chimeric receptors suggesting that similarities between the assembly domains of the beta3 and the epsilon subunits may allow the epsilon subunit to replace the beta, as well as the gamma, subunit. The substitution of an epsilon for a beta, as well as the gamma subunit and formation of receptors with alternative patterns of assembly with respect to epsilon incorporation may underlie the observed variability in both biophysical and pharmacological properties noted not only in recombinant, but also in native receptors.
Collapse
Affiliation(s)
- Brian L Jones
- Department of Physiology, Dartmouth Medical School, Hanover, New Hampshire 03755, USA
| | | |
Collapse
|
11
|
Niv MY, Ripoll DR, Vila JA, Liwo A, Vanamee ES, Aggarwal AK, Weinstein H, Scheraga HA. Topology of Type II REases revisited; structural classes and the common conserved core. Nucleic Acids Res 2007; 35:2227-37. [PMID: 17369272 PMCID: PMC1874628 DOI: 10.1093/nar/gkm045] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Type II restriction endonucleases (REases) are deoxyribonucleases that cleave DNA sequences with remarkable specificity. Type II REases are highly divergent in sequence as well as in topology, i.e. the connectivity of secondary structure elements. A widely held assumption is that a structural core of five β-strands flanked by two α-helices is common to these enzymes. We introduce a systematic procedure to enumerate secondary structure elements in an unambiguous and reproducible way, and use it to analyze the currently available X-ray structures of Type II REases. Based on this analysis, we propose an alternative definition of the core, which we term the αβα-core. The αβα-core includes the most frequently observed secondary structure elements and is not a sandwich, as it consists of a five-strand β-sheet and two α-helices on the same face of the β-sheet. We use the αβα-core connectivity as a basis for grouping the Type II REases into distinct structural classes. In these new structural classes, the connectivity correlates with the angles between the secondary structure elements and with the cleavage patterns of the REases. We show that there exists a substructure of the αβα-core, namely a common conserved core, ccc, defined here as one α-helix and four β-strands common to all Type II REase of known structure.
Collapse
Affiliation(s)
- Masha Y Niv
- Department of Physiology and Biophysics, Weill Medical College of Cornell University, New York, NY 10021, USA.
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Cymerman IA, Obarska A, Skowronek KJ, Lubys A, Bujnicki JM. Identification of a new subfamily of HNH nucleases and experimental characterization of a representative member, HphI restriction endonuclease. Proteins 2007; 65:867-76. [PMID: 17029241 DOI: 10.1002/prot.21156] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The restriction endonuclease (REase) R. HphI is a Type IIS enzyme that recognizes the asymmetric target DNA sequence 5'-GGTGA-3' and in the presence of Mg(2+) hydrolyzes phosphodiester bonds in both strands of the DNA at a distance of 8 nucleotides towards the 3' side of the target, producing a 1 nucleotide 3'-staggered cut in an unspecified sequence at this position. REases are typically ORFans that exhibit little similarity to each other and to any proteins in the database. However, bioinformatics analyses revealed that R.HphI is a member of a relatively big sequence family with a conserved C-terminal domain and a variable N-terminal domain. We predict that the C-terminal domains of proteins from this family correspond to the nuclease domain of the HNH superfamily rather than to the most common PD-(D/E)XK superfamily of nucleases. We constructed a three-dimensional model of the R.HphI catalytic domain and validated our predictions by site-directed mutagenesis and studies of DNA-binding and catalytic activities of the mutant proteins. We also analyzed the genomic neighborhood of R.HphI homologs and found that putative nucleases accompanied by a DNA methyltransferase (i.e. predicted REases) do not form a single group on a phylogenetic tree, but are dispersed among free-standing putative nucleases. This suggests that nucleases from the HNH superfamily were independently recruited to become REases in the context of RM systems multiple times in the evolution and that members of the HNH superfamily may be much more frequent among the so far unassigned REase sequences than previously thought.
Collapse
Affiliation(s)
- Iwona A Cymerman
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | | | | | | | | |
Collapse
|
13
|
Skowronek KJ, Kosinski J, Bujnicki JM. Theoretical model of restriction endonuclease HpaI in complex with DNA, predicted by fold recognition and validated by site-directed mutagenesis. Proteins 2006; 63:1059-68. [PMID: 16498623 DOI: 10.1002/prot.20920] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Type II restriction enzymes are commercially important deoxyribonucleases and very attractive targets for protein engineering of new specificities. At the same time they are a very challenging test bed for protein structure prediction methods. Typically, enzymes that recognize different sequences show little or no amino acid sequence similarity to each other and to other proteins. Based on crystallographic analyses that revealed the same PD-(D/E)XK fold for more than a dozen case studies, they were nevertheless considered to be related until the combination of bioinformatics and mutational analyses has demonstrated that some of these proteins belong to other, unrelated folds PLD, HNH, and GIY-YIG. As a part of a large-scale project aiming at identification of a three-dimensional fold for all type II REases with known sequences (currently approximately 1000 proteins), we carried out preliminary structure prediction and selected candidates for experimental validation. Here, we present the analysis of HpaI REase, an ORFan with no detectable homologs, for which we detected a structural template by protein fold recognition, constructed a model using the FRankenstein monster approach and identified a number of residues important for the DNA binding and catalysis. These predictions were confirmed by site-directed mutagenesis and in vitro analysis of the mutant proteins. The experimentally validated model of HpaI will serve as a low-resolution structural platform for evolutionary considerations in the subgroup of blunt-cutting REases with different specificities. The research protocol developed in the course of this work represents a streamlined version of the previously used techniques and can be used in a high-throughput fashion to build and validate models for other enzymes, especially ORFans that exhibit no sequence similarity to any other protein in the database.
Collapse
|
14
|
Armalyte E, Bujnicki JM, Giedriene J, Gasiunas G, Kosiński J, Lubys A. Mva1269I: a monomeric type IIS restriction endonuclease from Micrococcus varians with two EcoRI- and FokI-like catalytic domains. J Biol Chem 2005; 280:41584-94. [PMID: 16223716 DOI: 10.1074/jbc.m506775200] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Type II restriction endonuclease Mva1269I recognizes an asymmetric DNA sequence 5'-GAATGCN / -3'/5'-NG / CATTC-3' and cuts top and bottom DNA strands at positions, indicated by the "/" symbol. Most restriction endonucleases require dimerization to cleave both strands of DNA. We found that Mva1269I is a monomer both in solution and upon binding of cognate DNA. Protein fold-recognition analysis revealed that Mva1269I comprises two "PD-(D/E)XK" domains. The N-terminal domain is related to the 5'-GAATTC-3'-specific restriction endonuclease EcoRI, whereas the C-terminal one resembles the nonspecific nuclease domain of restriction endonuclease FokI. Inactivation of the C-terminal catalytic site transformed Mva1269I into a very active bottom strand-nicking enzyme, whereas mutants in the N-terminal domain nicked the top strand, but only at elevated enzyme concentrations. We found that the cleavage of the bottom strand is a prerequisite for the cleavage of the top strand. We suggest that Mva1269I evolved the ability to recognize and to cleave its asymmetrical target by a fusion of an EcoRI-like domain, which incises the bottom strand within the target, and a FokI-like domain that completes the cleavage within the nonspecific region outside the target sequence. Our results have implications for the molecular evolution of restriction endonucleases, as well as for perspectives of engineering new restriction and nicking enzymes with asymmetric target sites.
Collapse
Affiliation(s)
- Elena Armalyte
- Institute of Biotechnology, Graiciuno 8, Vilnius LT-02241, Lithuania
| | | | | | | | | | | |
Collapse
|
15
|
Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 2005; 33:W244-8. [PMID: 15980461 PMCID: PMC1160169 DOI: 10.1093/nar/gki408] [Citation(s) in RCA: 2797] [Impact Index Per Article: 147.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
HHpred is a fast server for remote protein homology detection and structure prediction and is the first to implement pairwise comparison of profile hidden Markov models (HMMs). It allows to search a wide choice of databases, such as the PDB, SCOP, Pfam, SMART, COGs and CDD. It accepts a single query sequence or a multiple alignment as input. Within only a few minutes it returns the search results in a user-friendly format similar to that of PSI-BLAST. Search options include local or global alignment and scoring secondary structure similarity. HHpred can produce pairwise query-template alignments, multiple alignments of the query with a set of templates selected from the search results, as well as 3D structural models that are calculated by the MODELLER software from these alignments. A detailed help facility is available. As a demonstration, we analyze the sequence of SpoVT, a transcriptional regulator from Bacillus subtilis. HHpred can be accessed at http://protevo.eb.tuebingen.mpg.de/hhpred.
Collapse
Affiliation(s)
- Johannes Söding
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology Spemannstrasse 35, 72076 Tübingen, Germany.
| | | | | |
Collapse
|
16
|
Kosinski J, Feder M, Bujnicki JM. The PD-(D/E)XK superfamily revisited: identification of new members among proteins involved in DNA metabolism and functional predictions for domains of (hitherto) unknown function. BMC Bioinformatics 2005; 6:172. [PMID: 16011798 PMCID: PMC1189080 DOI: 10.1186/1471-2105-6-172] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2005] [Accepted: 07/12/2005] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND The PD-(D/E)XK nuclease superfamily, initially identified in type II restriction endonucleases and later in many enzymes involved in DNA recombination and repair, is one of the most challenging targets for protein sequence analysis and structure prediction. Typically, the sequence similarity between these proteins is so low, that most of the relationships between known members of the PD-(D/E)XK superfamily were identified only after the corresponding structures were determined experimentally. Thus, it is tempting to speculate that among the uncharacterized protein families, there are potential nucleases that remain to be discovered, but their identification requires more sensitive tools than traditional PSI-BLAST searches. RESULTS The low degree of amino acid conservation hampers the possibility of identification of new members of the PD-(D/E)XK superfamily based solely on sequence comparisons to known members. Therefore, we used a recently developed method HHsearch for sensitive detection of remote similarities between protein families represented as profile Hidden Markov Models enhanced by secondary structure. We carried out a comparison of known families of PD-(D/E)XK nucleases to the database comprising the COG and PFAM profiles corresponding to both functionally characterized as well as uncharacterized protein families to detect significant similarities. The initial candidates for new nucleases were subsequently verified by sequence-structure threading, comparative modeling, and identification of potential active site residues. CONCLUSION In this article, we report identification of the PD-(D/E)XK nuclease domain in numerous proteins implicated in interactions with DNA but with unknown structure and mechanism of action (such as putative recombinase RmuC, DNA competence factor CoiA, a DNA-binding protein SfsA, a large human protein predicted to be a DNA repair enzyme, predicted archaeal transcription regulators, and the head completion protein of phage T4) and in proteins for which no function was assigned to date (such as YhcG, various phage proteins, novel candidates for restriction enzymes). Our results contributes to the reduction of "white spaces" on the sequence-structure-function map of the protein universe and will help to jump-start the experimental characterization of new nucleases, of which many may be of importance for the complete understanding of mechanisms that govern the evolution and stability of the genome.
Collapse
Affiliation(s)
- Jan Kosinski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, PL-02-109 Warsaw, Poland
| | - Marcin Feder
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, PL-02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, PL-02-109 Warsaw, Poland
| |
Collapse
|
17
|
Chmiel AA, Radlinska M, Pawlak SD, Krowarsch D, Bujnicki JM, Skowronek KJ. A theoretical model of restriction endonuclease NlaIV in complex with DNA, predicted by fold recognition and validated by site-directed mutagenesis and circular dichroism spectroscopy. Protein Eng Des Sel 2005; 18:181-9. [PMID: 15849215 DOI: 10.1093/protein/gzi019] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Restriction enzymes (REases) are commercial reagents commonly used in DNA manipulations and mapping. They are regarded as very attractive models for studying protein-DNA interactions and valuable targets for protein engineering. Their amino acid sequences usually show no similarities to other proteins, with rare exceptions of other REases that recognize identical or very similar sequences. Hence, they are extremely hard targets for structure prediction and modeling. NlaIV is a Type II REase, which recognizes the interrupted palindromic sequence GGNNCC (where N indicates any base) and cleaves it in the middle, leaving blunt ends. NlaIV shows no sequence similarity to other proteins and virtually nothing is known about its sequence-structure-function relationships. Using protein fold recognition, we identified a remote relationship between NlaIV and EcoRV, an extensively studied REase, which recognizes the GATATC sequence and whose crystal structure has been determined. Using the 'FRankenstein's monster' approach we constructed a comparative model of NlaIV based on the EcoRV template and used it to predict the catalytic and DNA-binding residues. The model was validated by site-directed mutagenesis and analysis of the activity of the mutants in vivo and in vitro as well as structural characterization of the wild-type enzyme and two mutants by circular dichroism spectroscopy. The structural model of the NlaIV-DNA complex suggests regions of the protein sequence that may interact with the 'non-specific' bases of the target and thus it provides insight into the evolution of sequence specificity in restriction enzymes and may help engineer REases with novel specificities. Before this analysis was carried out, neither the three-dimensional fold of NlaIV, its evolutionary relationships or its catalytic or DNA-binding residues were known. Hence our analysis may be regarded as a paradigm for studies aiming at reducing 'white spaces' on the evolutionary landscape of sequence-function relationships by combining bioinformatics with simple experimental assays.
Collapse
Affiliation(s)
- Agnieszka A Chmiel
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, ul. ks. Trojdena 4, 02-109 Warsaw, Poland
| | | | | | | | | | | |
Collapse
|