1
Inteins in Science: Evolution to Application. Microorganisms 2020; 8:microorganisms8122004. [PMID: 33339089 PMCID: PMC7765530 DOI: 10.3390/microorganisms8122004] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Received: 11/12/2020] [Revised: 12/09/2020] [Accepted: 12/09/2020] [Indexed: 12/20/2022] Open
Abstract
Inteins are mobile genetic elements that apply standard enzymatic strategies to excise themselves post-translationally from the precursor protein via protein splicing. Since their discovery in the 1990s, recent advances in intein technology allow for them to be implemented as a modern biotechnological contrivance. Radical improvement in the structure and catalytic framework of cis- and trans-splicing inteins devised the development of engineered inteins that contribute to various efficient downstream techniques. Previous literature indicates that implementation of intein-mediated splicing has been extended to in vivo systems. Besides, the homing endonuclease domain also acts as a versatile biotechnological tool involving genetic manipulation and control of monogenic diseases. This review orients the understanding of inteins by sequentially studying the distribution and evolution pattern of intein, thereby highlighting a role in genetic mobility. Further, we include an in-depth summary of specific applications branching from protein purification using self-cleaving tags to protein modification, post-translational processing and labelling, followed by the development of intein-based biosensors. These engineered inteins offer a disruptive approach towards research avenues like biomaterial construction, metabolic engineering and synthetic biology. Therefore, this linear perspective allows for a more comprehensive understanding of intein function and its diverse applications.
2
Hoffmann S, Terhorst TME, Singh RK, Kümmel D, Pietrokovski S, Mootz HD. Biochemical and Structural Characterization of an Unusual and Naturally Split Class 3 Intein. Chembiochem 2020; 22:364-373. [PMID: 32813312 PMCID: PMC7891396 DOI: 10.1002/cbic.202000509] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 07/24/2020] [Revised: 08/19/2020] [Indexed: 12/31/2022]
Abstract
Split inteins are indispensable tools for protein engineering because their ligation and cleavage reactions enable unique modifications of the polypeptide backbone. Three different classes of inteins have been identified according to the nature of the covalent intermediates resulting from the acyl rearrangements in the multistep protein‐splicing pathway. Class 3 inteins employ a characteristic internal cysteine for a branched thioester intermediate. A bioinformatic database search of non‐redundant protein sequences revealed the absence of split variants in 1701 class 3 inteins. We have discovered the first reported split class 3 intein in a metagenomics data set and report its biochemical, mechanistic and structural analysis. The AceL NrdHF intein exhibits low sequence conservation with other inteins and marked deviations in residues at conserved key positions, including a variation of the typical class‐3 WCT triplet motif. Nevertheless, functional analysis confirmed the class 3 mechanism of the intein and revealed excellent splicing yields within a few minutes over a wide range of conditions and with barely detectable cleavage side reactions. A high‐resolution crystal structure of the AceL NrdHF precursor and a mutagenesis study explained the importance and roles of several residues at the key positions. Tolerated substitutions in the flanking extein residues and a high affinity between the split intein fragments further underline the intein's future potential as a ligation tool.
Affiliation(s)
- Simon Hoffmann
  - Institute of Biochemistry, University of Muenster, Corrensstraße 36, 48149, Münster, Germany
- Tobias M E Terhorst
  - Institute of Biochemistry, University of Muenster, Corrensstraße 36, 48149, Münster, Germany
- Rohit K Singh
  - Institute of Biochemistry, University of Muenster, Corrensstraße 36, 48149, Münster, Germany
- Daniel Kümmel
  - Institute of Biochemistry, University of Muenster, Corrensstraße 36, 48149, Münster, Germany
- Shmuel Pietrokovski
  - Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, 76100, Israel
- Henning D Mootz
  - Institute of Biochemistry, University of Muenster, Corrensstraße 36, 48149, Münster, Germany
3
A mesophilic cysteine-less split intein for protein trans-splicing applications under oxidizing conditions. Proc Natl Acad Sci U S A 2019; 116:22164-22172. [PMID: 31611397 DOI: 10.1073/pnas.1909825116] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Indexed: 02/06/2023] Open
Abstract
Split intein-mediated protein trans-splicing has found extensive applications in chemical biology, protein chemistry, and biotechnology. However, an enduring limitation of all well-established split inteins has been the requirement to carry out the reaction in a reducing environment due to the presence of 1 or 2 catalytic cysteines that need to be in a reduced state for splicing to occur. The concomitant exposure of the fused proteins to reducing agents severely limits the scope of protein trans-splicing by excluding proteins sensitive to reducing conditions, such as those containing critical disulfide bonds. Here we report the discovery, characterization, and engineering of a completely cysteine-less split intein (CL intein) that is capable of efficient trans-splicing at ambient temperatures, without a denaturation step, and in the absence of reducing agents. We demonstrate its utility for the site-specific chemical modification of nanobodies and an antibody Fc fragment by N- and C-terminal trans-splicing with short peptide tags (CysTag) that consist of only a few amino acids and have been prelabeled on a single cysteine using classical cysteine bioconjugation. We also synthesized the short N-terminal fragment of the atypically split CL intein by solid-phase peptide synthesis. Furthermore, using the CL intein in combination with a nanobody-epitope pair as a high-affinity mediator, we showed chemical labeling of the extracellular domain of a cell surface receptor on living mammalian cells with a short CysTag containing a synthetic fluorophore. The CL intein thus greatly expands the scope of applications for protein trans-splicing.
4
Sarmiento C, Camarero JA. Biotechnological Applications of Protein Splicing. Curr Protein Pept Sci 2019; 20:408-424. [PMID: 30734675 PMCID: PMC7135711 DOI: 10.2174/1389203720666190208110416] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Received: 08/01/2018] [Revised: 12/22/2018] [Accepted: 12/25/2018] [Indexed: 12/12/2022]
Abstract
Protein splicing domains, also called inteins, have become a powerful biotechnological tool for applications involving molecular biology and protein engineering. Early applications of inteins focused on self-cleaving affinity tags, generation of recombinant polypeptide α-thioesters for the production of semisynthetic proteins and backbone cyclized polypeptides. The discovery of naturally occurring split-inteins has allowed the development of novel approaches for the selective modification of proteins both in vitro and in vivo. This review gives a general introduction to protein splicing with a focus on their role in expanding the applications of intein-based technologies in protein engineering and chemical biology.
Affiliation(s)
- Corina Sarmiento
  - Department of Pharmacology and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA9033 USA
- Julio A. Camarero
  - Department of Pharmacology and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA9033 USA
  - Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA9033 USA
  - Department of Chemistry, University of Southern California, Los Angeles, California 90089-9121, USA
5
Tori K, Perler F. Sequential formation of two branched intermediates during protein splicing of class three inteins. Extremophiles 2016; 21:41-49. [PMID: 27704298 PMCID: PMC5222942 DOI: 10.1007/s00792-016-0876-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Received: 07/15/2016] [Accepted: 09/24/2016] [Indexed: 11/25/2022]
Abstract
Inteins are the protein equivalent of introns. They are seamlessly removed during post-translational maturation of their host protein (extein). Inteins from extremophiles played a key role in understanding intein-mediated protein splicing. There are currently three classes of inteins defined by catalytic mechanism and sequence signatures. This study demonstrates splicing of three class 3 mini-inteins: Burkholderia vietnamiensis G4 Bvi IcmO intein, Mycobacterium smegmatis MC2 155 Msm DnaB-1 intein and Mycobacterium leprae strain TN Mle DnaB intein. B. vietnamiensis has a broad ecological range and remediates trichloroethene. M. smegmatis is a biofilm forming soil bacteria. Although other intein classes have only a single branched intermediate at the C-terminal splice junction, the class 3 intein reaction pathway includes two branched intermediates. The class 3 specific branched intermediate is formed by an internal cysteine, while the C-terminal branch intermediate is at a serine or threonine in all class 3 inteins except the Bvi IcmO intein, where it is a cysteine. This latter cysteine was unable to compensate for mutation of the class 3-specific internal catalytic cysteine despite the Bvi IcmO intein having an N-terminal splice junction naturally tuned for a cysteine nucleophile, demonstrating the mandatory order of branch intermediates in class 3 inteins.
Affiliation(s)
- Kazuo Tori
  - New England Biolabs, Inc., Ipswich, MA 01938 USA
  - Takara Bio USA, Inc., 1290 Terra Bella Ave., Mountain View, CA 94043 USA
- Francine Perler
  - New England Biolabs, Inc., Ipswich, MA 01938 USA
  - Perls of Wisdom Biotech Consulting, Brookline, MA 02446 USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520 USA
6
Miraula M, Enculescu C, Schenk G, Mitić N. Inteins—A Focus on the Biotechnological Applications of Splicing-Promoting Proteins. American Journal of Molecular Biology 2015. [DOI: 10.4236/ajmb.2015.52005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Indexed: 11/20/2022]
7
Abstract
Inteins are nature's escape artists; they facilitate their excision from flanking polypeptides (exteins) concomitant with extein ligation to produce a mature host protein. Splicing requires sequential nucleophilic displacement reactions catalyzed by strategies similar to proteases and asparagine lyases. Inteins require precise reaction coordination rather than rapid turnover or tight substrate binding because they are single turnover enzymes with covalently linked substrates. This has allowed inteins to explore alternative mechanisms with different steps or to use different methods for activation and coordination of the steps. Pressing issues include understanding the underlying details of catalysis and how the splicing steps are controlled.
Affiliation(s)
- Kenneth V Mills
  - Department of Chemistry, College of the Holy Cross, Worcester, Massachusetts 01610
- Margaret A Johnson
  - Department of Chemistry, University of Alabama at Birmingham, Birmingham, Alabama 35294
8
Abstract
Inteins are auto-processing domains found in organisms from all domains of life. These proteins carry out a process known as protein splicing, which is a multi-step biochemical reaction comprised of both the cleavage and formation of peptide bonds. While the endogenous substrates of protein splicing are specific essential proteins found in intein-containing host organisms, inteins are also functional in exogenous contexts and can be used to chemically manipulate virtually any polypeptide backbone. Given this, protein chemists have exploited various facets of intein reactivity to modify proteins in myriad ways for both basic biological research as well as potential therapeutic applications. Here, we review the intein field, first focusing on the biological context and phylogenetic diversity of inteins, followed by a description of intein structure and biochemical function. Finally, we discuss prevalent intein-based technologies, focusing on their applications in chemical biology, followed by persistent caveats of intein chemistry and approaches to alleviate these shortcomings. The findings summarized herein describe two and a half decades of research, leading from a biochemical curiosity to the development of powerful protein engineering tools.
Affiliation(s)
- Neel H Shah
  - Department of Chemistry, Princeton University, Frick Laboratory, Princeton, NJ 08544, United States
- Tom W Muir
  - Department of Chemistry, Princeton University, Frick Laboratory, Princeton, NJ 08544, United States
9
Lin Y, Li M, Song H, Xu L, Meng Q, Liu XQ. Protein trans-splicing of multiple atypical split inteins engineered from natural inteins. PLoS One 2013; 8:e59516. [PMID: 23593141 PMCID: PMC3620165 DOI: 10.1371/journal.pone.0059516] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Received: 09/26/2012] [Accepted: 02/15/2013] [Indexed: 11/30/2022] Open
Abstract
Protein trans-splicing by split inteins has many uses in protein production and research. Splicing proteins with synthetic peptides, which employs atypical split inteins, is particularly useful for site-specific protein modifications and labeling, because the synthetic peptide can be made to contain a variety of unnatural amino acids and chemical modifications. For this purpose, atypical split inteins need to be engineered to have a small N-intein or C-intein fragment that can be more easily included in a synthetic peptide that also contains a small extein to be trans-spliced onto target proteins. Here we have successfully engineered multiple atypical split inteins capable of protein trans-splicing, by modifying and testing more than a dozen natural inteins. These included both S1 split inteins having a very small (11–12 aa) N-intein fragment and S11 split inteins having a very small (6 aa) C-intein fragment. Four of the new S1 and S11 split inteins showed high efficiencies (85–100%) of protein trans-splicing both in E. coli cells and in vitro. Under in vitro conditions, they exhibited reaction rate constants ranging from ~1.7×10⁻⁴ s⁻¹ to ~3.8×10⁻⁴ s⁻¹, which are comparable to or higher than those of previously reported atypical split inteins. These findings should facilitate a more general use of trans-splicing between proteins and synthetic peptides, by expanding the availability of different atypical split inteins. They also have implications on understanding the structure-function relationship of atypical split inteins, particularly in terms of intein fragment complementation.
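To put the rate constants quoted in this abstract into more familiar terms, the minimal sketch below converts them to reaction half-lives. It assumes simple first-order kinetics, which is consistent with the s⁻¹ units but is an assumption here, since the abstract does not state the kinetic model. The function name `half_life_minutes` is illustrative only and does not come from the paper.

```python
import math


def half_life_minutes(k_per_second: float) -> float:
    """Half-life in minutes of a first-order reaction with rate constant k (s^-1).

    Assumes first-order kinetics: t_1/2 = ln(2) / k.
    """
    if k_per_second <= 0:
        raise ValueError("rate constant must be positive")
    return math.log(2) / k_per_second / 60.0


# Lower and upper rate constants quoted in the abstract (s^-1).
for k in (1.7e-4, 3.8e-4):
    print(f"k = {k:.1e} s^-1 -> t1/2 of about {half_life_minutes(k):.0f} min")
```

Under that assumption, the quoted range corresponds to half-lives of roughly 68 and 30 minutes, respectively.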
Affiliation(s)
- Ying Lin
  - Institute of Biological Sciences and Biotechnology, Donghua University, Shanghai, P.R. China
- Mengmeng Li
  - Institute of Biological Sciences and Biotechnology, Donghua University, Shanghai, P.R. China
- Huiling Song
  - Institute of Biological Sciences and Biotechnology, Donghua University, Shanghai, P.R. China
  - Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- Lingling Xu
  - Institute of Biological Sciences and Biotechnology, Donghua University, Shanghai, P.R. China
  - Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- Qing Meng
  - Institute of Biological Sciences and Biotechnology, Donghua University, Shanghai, P.R. China
- Xiang-Qin Liu
  - Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
10
Cheriyan M, Pedamallu CS, Tori K, Perler F. Faster protein splicing with the Nostoc punctiforme DnaE intein using non-native extein residues. J Biol Chem 2013; 288:6202-11. [PMID: 23306197 PMCID: PMC3585056 DOI: 10.1074/jbc.m112.433094] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Indexed: 11/06/2022] Open
Abstract
Inteins are naturally occurring intervening sequences that catalyze a protein splicing reaction resulting in intein excision and concatenation of the flanking polypeptides (exteins) with a native peptide bond. Inteins display a diversity of catalytic mechanisms within a highly conserved fold that is shared with hedgehog autoprocessing proteins. The unusual chemistry of inteins has afforded powerful biotechnology tools for controlling enzyme function upon splicing and allowing peptides of different origins to be coupled in a specific, time-defined manner. The extein sequences immediately flanking the intein affect splicing and can be defined as the intein substrate. Because of the enormous potential complexity of all possible flanking sequences, studying intein substrate specificity has been difficult. Therefore, we developed a genetic selection for splicing-dependent kanamycin resistance with no significant bias when six amino acids that immediately flanked the intein insertion site were randomized. We applied this selection to examine the sequence space of residues flanking the Nostoc punctiforme Npu DnaE intein and found that this intein efficiently splices a much wider range of sequences than previously thought, with little N-extein specificity and only two important C-extein positions. The novel selected extein sequences were sufficient to promote splicing in three unrelated proteins, confirming the generalizable nature of the specificity data and defining new potential insertion sites for any target. Kinetic analysis showed splicing rates with the selected exteins that were as fast or faster than the native extein, refuting past assumptions that the naturally selected flanking extein sequences are optimal for splicing.
Affiliation(s)
- Manoj Cheriyan
  - New England Biolabs, Inc, Ipswich, Massachusetts 01938, USA
11
Tori K, Perler FB. The Arthrobacter species FB24 Arth_1007 (DnaB) intein is a pseudogene. PLoS One 2011; 6:e26361. [PMID: 22028863 PMCID: PMC3196547 DOI: 10.1371/journal.pone.0026361] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 08/17/2011] [Accepted: 09/25/2011] [Indexed: 02/05/2023] Open
Abstract
An Arthrobacter species FB24 gene (locus tag Arth_1007) was previously annotated as a putative intein-containing DnaB helicase of phage origin (Arsp-FB24 DnaB intein). However, it is not a helicase gene because the sequence similarity is limited to inteins. In fact, the flanking exteins total only 66 amino acids. Therefore, the intein should be referred to as the Arsp-FB24 Arth_1007 intein. The Arsp-FB24 Arth_1007 intein failed to splice in its native precursor and in a model precursor. We previously noted that the Arsp-FB24 Arth_1007 intein is the only putative Class 3 intein that is missing the catalytically essential Cys at position 4 of intein Motif F, which is one of the three defining signature residues of this class. Additionally, a catalytically essential His in position 10 of intein Motif B is also absent; this His is the most conserved residue amongst all inteins. Splicing activity was not rescued when these two catalytically important positions were 'reverted' back to their consensus residues. This study restores the unity of the Class 3 intein signature sequence in active inteins by demonstrating that the Arsp-FB24 Arth_1007 intein is an inactive pseudogene.
Affiliation(s)
- Kazuo Tori
  - New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
- Francine B. Perler
  - New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
12
13
Appleby JH, Zhou K, Volkmann G, Liu XQ. Novel Split Intein for trans-Splicing Synthetic Peptide onto C Terminus of Protein. J Biol Chem 2009; 284:6194-9. [DOI: 10.1074/jbc.m805474200] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Indexed: 11/06/2022] Open
14
Bürglin TR. Evolution of hedgehog and hedgehog-related genes, their origin from Hog proteins in ancestral eukaryotes and discovery of a novel Hint motif. BMC Genomics 2008; 9:127. [PMID: 18334026 PMCID: PMC2362128 DOI: 10.1186/1471-2164-9-127] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Received: 03/16/2007] [Accepted: 03/11/2008] [Indexed: 11/18/2022] Open
Abstract
Background: The Hedgehog (Hh) signaling pathway plays important roles in human and animal development as well as in carcinogenesis. Hh molecules have been found in both protostomes and deuterostomes, but curiously the nematode Caenorhabditis elegans lacks a bona-fide Hh. Instead a series of Hh-related proteins are found, which share the Hint/Hog domain with Hh, but have distinct N-termini.
Results: We performed extensive genome searches such as the cnidarian Nematostella vectensis and several nematodes to gain further insights into Hh evolution. We found six genes in N. vectensis with a relationship to Hh: two Hh genes, one gene with a Hh N-terminal domain fused to a Willebrand factor type A domain (VWA), and three genes containing Hint/Hog domains with distinct novel N-termini. In the nematode Brugia malayi we find the same types of hh-related genes as in C. elegans. In the more distantly related Enoplea nematodes Xiphinema and Trichinella spiralis we find a bona-fide Hh. In addition, T. spiralis also has a quahog gene like C. elegans, and there are several additional hh-related genes, some of which have secreted N-terminal domains of only 15 to 25 residues. Examination of other Hh pathway components revealed that T. spiralis - like C. elegans - lacks some of these components. Extending our search to all eukaryotes, we recovered genes containing a Hog domain similar to Hh from many different groups of protists. In addition, we identified a novel Hint gene family present in many eukaryote groups that encodes a VWA domain fused to a distinct Hint domain we call Vint. Further members of a poorly characterized Hint family were also retrieved from bacteria.
Conclusion: In Cnidaria and nematodes the evolution of hh genes occurred in parallel to the evolution of other genes that contain a Hog domain but have different N-termini. The fact that Hog genes comprising a secreted N-terminus and a Hog domain are found in many protists indicates that this gene family must have arisen in very early eukaryotic evolution, and gave rise eventually to hh and hh-related genes in animals. The results indicate a hitherto unsuspected ability of Hog domain encoding genes to evolve new N-termini. In one instance in Cnidaria, the Hh N-terminal signaling domain is associated with a VWA domain and lacks a Hog domain, suggesting a modular mode of evolution also for the N-terminal domain. The Hog domain proteins, the inteins and VWA-Vint proteins are three families of Hint domain proteins that evolved in parallel in eukaryotes.
Affiliation(s)
- Thomas R Bürglin
  - Dept. of Biosciences and Nutrition, Karolinska Institutet & School of Life Sciences, Södertörns Högskola, Alfred Nobels Allé 7, SE-141 89 Huddinge, Sweden.
15
Mian IS, Worthey EA, Salavati R. Taking U out, with two nucleases? BMC Bioinformatics 2006; 7:305. [PMID: 16780580 PMCID: PMC1525001 DOI: 10.1186/1471-2105-7-305] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Received: 04/04/2006] [Accepted: 06/16/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND: REX1 and REX2 are protein components of the RNA editing complex (the editosome) and function as exouridylylases. The exact roles of REX1 and REX2 in the editosome are unclear and the consequences of the presence of two related proteins are not fully understood. Here, a variety of computational studies were performed to enhance understanding of the structure and function of REX proteins in Trypanosoma and Leishmania species.
RESULTS: Sequence analysis and homology modeling of the Endonuclease/Exonuclease/Phosphatase (EEP) domain at the C-terminus of REX1 and REX2 highlights a common active site shared by all EEP domains. Phylogenetic analysis indicates that REX proteins contain a distinct subfamily of EEP domains. Inspection of three-dimensional models of the EEP domain in Trypanosoma brucei REX1 and REX2, and Leishmania major REX1 suggests variations of previously characterized key residues likely to be important in catalysis and determining substrate specificity.
CONCLUSION: We have identified features of the REX EEP domain that distinguish it from other family members and hence subfamily specific determinants of catalysis and substrate binding. The results provide specific guidance for experimental investigations about the role(s) of REX proteins in RNA editing.
Affiliation(s)
- I Saira Mian
  - Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720-8265, USA
- Reza Salavati
  - Seattle Biomedical Research Institute, Seattle, Washington, 98109, USA
  - McGill University, Institute of Parasitology, Ste.-Anne-De-Bellevue, Quebec, H9X 3V9, Canada
16
Hiraga K, Derbyshire V, Dansereau JT, Van Roey P, Belfort M. Minimization and stabilization of the Mycobacterium tuberculosis recA intein. J Mol Biol 2005; 354:916-26. [PMID: 16288917 DOI: 10.1016/j.jmb.2005.09.088] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Received: 07/29/2005] [Revised: 09/01/2005] [Accepted: 09/09/2005] [Indexed: 10/25/2022]
Abstract
Many naturally occurring inteins consist of two functionally independent domains, a protein-splicing domain and an endonuclease domain. In a previous study, a 168 amino acid residue mini-intein was generated by removal of the central endonuclease domain of the 440 residue Mycobacterium tuberculosis (Mtu) recA intein. In addition, directed evolution experiments identified a mutation, V67L, that improved the activity of the mini-intein significantly. A recent crystal structure shows that the loop connecting two beta-strands from the N-terminal and C-terminal intein subdomains of the mini-intein is disordered. The goals of the present study were to generate smaller mini-intein derivatives and to understand the basis for reversal of the splicing defect by the V67L mutation. Guided by the structural information, we generated a number of derivatives 135 to 152 residues in length, with V67 or L67. All of the new minimal inteins are functional in splicing. In vivo selection experiments for function showed that by removal of the loop region, 137 residues may be the lower limit for full protein-splicing activity. In addition, the activation effect of the V67L mutation was observed to be universal for mini-inteins longer than 137 residues. Structural and functional analyses indicate that the role of the mutation is in stabilization of the mini-intein core.
Affiliation(s)
- Kaori Hiraga
  - Wadsworth Center, New York State Department of Health, Center for Medical Science, 150 New Scotland Avenue, Albany, NY 12208, USA
17
Edgell DR, Stanger MJ, Belfort M. Coincidence of cleavage sites of intron endonuclease I-TevI and critical sequences of the host thymidylate synthase gene. J Mol Biol 2004; 343:1231-41. [PMID: 15491609 DOI: 10.1016/j.jmb.2004.09.005] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Received: 04/13/2004] [Revised: 08/25/2004] [Accepted: 09/02/2004] [Indexed: 12/01/2022]
Abstract
To maximize spread of their host intron or intein, many homing endonucleases recognize nucleotides that code for important and conserved amino acid residues of the target gene. Here, we examine the cleavage requirements for I-TevI, which binds a stretch of thymidylate synthase (TS) DNA that codes for functionally critical residues in the TS active site. Using an in vitro selection scheme, we identified two base-pairs in the I-TevI cleavage site region as important for cleavage efficiency. These were confirmed by comparison of I-TevI cleavage efficiencies on mutant and on wild-type substrates. We also showed that nicking of the bottom strand by I-TevI is not affected by mutation of residues surrounding the bottom-strand cleavage site, unlike other homing endonucleases. One of these two base-pairs is universally conserved in all TS sequences, and is identical with a previously identified cleavage determinant of I-BmoI, a related GIY-YIG endonuclease that binds a homologous stretch of TS-encoding DNA. The other base-pair is conserved only in a subset of TS genes that includes the I-TevI, but not the I-BmoI, target sequence. Both the I-TevI and I-BmoI cleavage site requirements correspond to functionally critical residues involved in an extensive hydrogen bond network within the TS active site. Remarkably, these cleavage requirements correlate with TS phylogeny in bacteria, suggesting that each endonuclease has individually adapted to efficiently cleave distinct TS substrates.
Affiliation(s)
- David R Edgell
  - Molecular Genetics Program, Wadsworth Center, New York State Department of Health, PO Box 22002, Albany, NY 12201-2002, USA.
18
Abstract
Secreted signaling proteins function in a diverse array of essential patterning events during metazoan development, ranging from embryonic segmentation in insects to neural tube differentiation in vertebrates. These proteins generally are expressed in a localized manner, and they may elicit distinct concentration-dependent responses in the cells of surrounding tissues and structures, thus functioning as morphogens that specify the pattern of cellular responses by their tissue distribution. Given the importance of signal distribution, it is notable that the Hedgehog (Hh) and Wnt proteins, two of the most important families of such signals, are known to be covalently modified by lipid moieties, the membrane-anchoring properties of which are not consistent with passive models of protein mobilization within tissues. This review focuses on the mechanisms underlying biogenesis of the mature Hh proteins, which are dually modified by cholesteryl and palmitoyl adducts, as well as on the relationship between Hh proteins and the self-splicing proteins (i.e., proteins containing inteins) and the Hh-like proteins of nematodes. We further discuss the cellular mechanisms that have evolved to handle lipidated Hh proteins in the spatial deployment of the signal in developing tissues and the more recent findings that implicate palmitate modification as an important feature of Wnt signaling proteins.
Affiliation(s)
- Randall K Mann
- Department of Molecular Biology and Genetics and Howard Hughes Medical Institute, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA.
|
19
|
Posey KL, Koufopanou V, Burt A, Gimble FS. Evolution of divergent DNA recognition specificities in VDE homing endonucleases from two yeast species. Nucleic Acids Res 2004; 32:3947-56. [PMID: 15280510 PMCID: PMC506816 DOI: 10.1093/nar/gkh734] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Indexed: 11/12/2022]
Abstract
Homing endonuclease genes (HEGs) are mobile DNA elements that are thought to confer no benefit to their host. They encode site-specific DNA endonucleases that perpetuate the element within a species population by homing and disseminate it between species by horizontal transfer. Several yeast species contain the VMA1 HEG that encodes the intein-associated VMA1-derived endonuclease (VDE). The evolutionary state of VDEs from 12 species was assessed by assaying their endonuclease activities. Only two enzymes are active, PI-ZbaI from Zygosaccharomyces bailii and PI-ScaI from Saccharomyces cariocanus. PI-ZbaI cleaves the Z. bailii recognition sequence significantly faster than the Saccharomyces cerevisiae site, which differs at six nucleotide positions. A mutational analysis indicates that PI-ZbaI cleaves the S. cerevisiae substrate poorly due to the absence of a contact that is analogous to one made in PI-SceI between Gln-55 and nucleotides +9/+10. PI-ZbaI cleaves the Z. bailii substrate primarily due to a single base-pair substitution (A/T+5 → T/A+5). Structural modeling of the PI-ZbaI/DNA complex suggests that Arg-331, which is absent in PI-SceI, contacts T/A+5, and the reduced activity observed in a PI-ZbaI R331A mutant provides evidence for this interaction. These data illustrate that homing endonucleases evolve altered specificity as they adapt to recognize alternative target sites.
Affiliation(s)
- Karen L Posey
- Center for Genome Research, Institute of Biosciences and Technology, Texas A&M University System Health Science Center, 2121 W. Holcombe Blvd, Houston, TX 77030, USA
|
20
|
Dassa B, Haviv H, Amitai G, Pietrokovski S. Protein splicing and auto-cleavage of bacterial intein-like domains lacking a C'-flanking nucleophilic residue. J Biol Chem 2004; 279:32001-7. [PMID: 15150275 DOI: 10.1074/jbc.m404562200] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Indexed: 11/06/2022]
Abstract
Bacterial intein-like (BIL) domains are newly identified homologs of intein protein-splicing domains. The two known types of BIL domains together with inteins and hedgehog (Hog) auto-processing domains form the Hog/intein (HINT) superfamily. BIL domains are distinct from inteins and Hogs in sequence, phylogenetic distribution, and host protein type, but little is known about their biochemical activity. Here we experimentally study the auto-processing activity of four BIL domains. An A-type BIL domain from Clostridium thermocellum showed both protein-splicing and auto-cleavage activities. The splicing is notable, because this domain has a native Ala C'-flanking residue rather than a nucleophilic residue, which is absolutely necessary for intein protein splicing. B-type BIL domains from Rhodobacter sphaeroides and Rhodobacter capsulatus cleaved their N' or C' ends. We propose an alternative protein-splicing mechanism for the A-type BIL domains. After an initial N-S acyl shift, creating a thioester bond at the N' end of the domain, the C' end of the domain is cleaved by Asn cyclization. The resulting amino end of the C'-flank attacks the thioester bond next at the N' end of the domain. This aminolysis step splices the two flanks of the domain. The B-type BIL domain cleavage activity is explained in the context of the canonical intein protein-splicing mechanism. Our results suggest that the different HINT domains have related biochemical activities of proteolytic cleavages, ligation and splicing. Yet the predominant reactions diverged in each HINT type according to their specific biological roles. We suggest that the BIL domain cleavage and splicing reactions are mechanisms for post-translationally generating protein variability, particularly in extracellular bacterial proteins.
Affiliation(s)
- Bareket Dassa
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel 76100
|
21
|
Gogarten JP, Senejani AG, Zhaxybayeva O, Olendzenski L, Hilario E. Inteins: structure, function, and evolution. Annu Rev Microbiol 2003; 56:263-87. [PMID: 12142479 DOI: 10.1146/annurev.micro.56.012302.160741] [Citation(s) in RCA: 156] [Impact Index Per Article: 7.4] [Indexed: 11/09/2022]
Abstract
Inteins are genetic elements that disrupt the coding sequence of genes. However, in contrast to introns, inteins are transcribed and translated together with their host protein. Inteins appear most frequently in Archaea, but they are found in organisms belonging to all three domains of life and in viral and phage proteins. Most inteins consist of two domains: One is involved in autocatalytic splicing, and the other is an endonuclease that is important in the spread of inteins. This review focuses on the evolution and technical application of inteins and only briefly summarizes recent advances in the study of the catalytic activities and structures of inteins. In particular, this review considers inteins as selfish or parasitic genetic elements, a point of view that explains many otherwise puzzling aspects of inteins.
Affiliation(s)
- J Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, 75 North Eagleville Road, Storrs 06269-3044, USA.
|
22
|
Amitai G, Belenkiy O, Dassa B, Shainskaya A, Pietrokovski S. Distribution and function of new bacterial intein-like protein domains. Mol Microbiol 2003; 47:61-73. [PMID: 12492854 DOI: 10.1046/j.1365-2958.2003.03283.x] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.6] [Indexed: 12/18/2022]
Abstract
Hint protein domains appear in inteins and in the C-terminal region of Hedgehog and Hedgehog-like animal developmental proteins. Intein Hint domains are responsible and sufficient for protein-splicing of their host-protein flanks. In Hedgehog proteins the Hint domain autocatalyses its cleavage from the N-terminal domain of the Hedgehog protein by attaching a cholesterol molecule to it. We identified two new types of Hint domains. Both types have active site sequence features of Hint domains but also possess distinguishing sequence features. The new domains appear in more than 50 different proteins from diverse bacteria, including pathogenic species of humans and plants, such as Neisseria meningitidis and Pseudomonas syringae. These new domains are termed bacterial intein-like (BIL) domains. Bacterial intein-like domains are present in variable protein regions and are typically flanked by domains that also appear in secreted proteins such as filamentous haemagglutinin and calcium-binding RTX repeats. Phylogenetic and genomic analysis of BIL sequences suggests that they were positively selected for in different lineages. We cloned two BIL domains of different types and showed them to be active. One of the domains efficiently cleaved itself from its C-terminal flank and could also protein-splice its two flanks, in E. coli and in a cell-free system. We discuss several possible biological roles for BIL domains including microevolution and post-translational modification for generating protein variability.
Affiliation(s)
- Gil Amitai
- Molecular Genetics Department and Mass Spectrometry Unit, The Weizmann Institute of Science, Rehovot 76100, Israel
|
23
|
Fitzsimons Hall M, Noren CJ, Perler FB, Schildkraut I. Creation of an artificial bifunctional intein by grafting a homing endonuclease into a mini-intein. J Mol Biol 2002; 323:173-9. [PMID: 12381313 DOI: 10.1016/s0022-2836(02)00912-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Indexed: 11/21/2022]
Abstract
The majority of inteins are comprised of a protein splicing domain and a homing endonuclease domain. Experimental evidence has demonstrated that the splicing domain and the endonuclease domain in a bifunctional intein are largely independent of each other with respect to both structure and activity. Here, an artificial bifunctional intein has been created through the insertion of an existing homing endonuclease into a mini-intein that is naturally lacking this functionality. The gene for I-CreI, an intron-encoded homing endonuclease, was grafted into the monofunctional Mycobacterium xenopi GyrA intein at the putative site of the missing endonuclease. The resulting fusion protein was found to be capable of protein splicing similar to that of the parent intein. In addition, the protein demonstrated site-specific endonuclease activity that is characteristic of the I-CreI homing endonuclease. The function of each domain therefore remained unaffected by the presence of the other domain. This artificial fusion of the two domains is a potential novel mobile genetic element.
|
24
|
Landthaler M, Begley U, Lau NC, Shub DA. Two self-splicing group I introns in the ribonucleotide reductase large subunit gene of Staphylococcus aureus phage Twort. Nucleic Acids Res 2002; 30:1935-43. [PMID: 11972330 PMCID: PMC113830 DOI: 10.1093/nar/30.9.1935] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.1] [Indexed: 11/13/2022]
Abstract
We have recently described three group I introns inserted into a single gene, orf142, of the staphylococcal bacteriophage Twort and suggested the presence of at least two additional self-splicing introns in this phage genome. Here we report that two previously uncharacterized introns, 429 and 1087 nt in length, interrupt the Twort gene coding for the large subunit of ribonucleotide reductase (nrdE). Reverse transcription-polymerase chain reaction (RT-PCR) of RNA isolated from Staphylococcus aureus after phage infection indicates that the introns are removed from the primary transcript in vivo. Both nrdE introns show sequence similarity to the Twort orf142 introns I2 and I3, suggesting either a common origin of these introns or shuffling of intron structural elements. Intron 2 encodes a DNA endonuclease, I-TwoI, with similarity to homing endonucleases of the HNH family. Like I-HmuI and I-HmuII, intron-encoded HNH endonucleases in Bacillus subtilis phages SPO1 and SP82, I-TwoI nicks only one strand of its DNA recognition sequence. However, whereas I-HmuI and I-HmuII cleave the template strand in exon 2, I-TwoI cleaves the coding strand in exon 1. In each case, the 3' OH created on the cut strand is positioned to prime DNA synthesis towards the intron, suggesting that this reaction contributes to the mechanism of intron homing. Both nrdE introns are inserted in highly conserved regions of the ribonucleotide reductase gene, next to codons for functionally important residues.
Affiliation(s)
- Markus Landthaler
- Department of Biological Sciences and Center for Molecular Genetics, University at Albany, State University of New York, 1400 Washington Avenue, Albany, NY 12222, USA
|
25
|
Abstract
Selfish genes of no function other than self-propagation are susceptible to degeneration if they become fixed in a population, and regular transfer to new species may be the only means for their long-term persistence. To test this idea we surveyed 24 species of yeast for VDE, a nuclear, intein-associated homing endonuclease gene (HEG) originally discovered in Saccharomyces cerevisiae. Phylogenetic analyses show that horizontal transmission has been a regular occurrence in its evolutionary history. Moreover, VDE appears to be specifically adapted for horizontal transmission. Its 31-bp recognition sequence is an unusually well-conserved region in an unusually well-conserved gene. In addition, the nine nucleotide sites most critical for homing are also unusually well conserved. Such adaptation for horizontal transmission presumably arose as a consequence of selection, both among HEGs at different locations in the genome and among variants at the same location. The frequency of horizontal transmission must therefore be a key feature constraining the distribution and abundance of these genes.
|
26
|
Affiliation(s)
- I Giriat
- Rockefeller University, 1230 York Avenue, New York, NY 10021, USA
|
27
|
Belle A, Landthaler M, Shub DA. Intronless homing: site-specific endonuclease SegF of bacteriophage T4 mediates localized marker exclusion analogous to homing endonucleases of group I introns. Genes Dev 2002; 16:351-62. [PMID: 11825876 PMCID: PMC155333 DOI: 10.1101/gad.960302] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.5] [Indexed: 11/24/2022]
Abstract
All genetic markers from phage T2 are partially excluded from the progeny of mixed infections with the related phage T4 (general, or phage exclusion). Several loci, including gene 56 of T2, are more dramatically excluded, being present in only approximately 1% of the progeny. This phenomenon is referred to as localized marker exclusion. Gene 69 is adjacent to gene 56 of T4 but is absent in T2, being replaced by completely nonhomologous DNA. We describe SegF, a novel site-specific DNA endonuclease encoded by gene 69, which is similar to GIY-YIG homing endonucleases of group I introns. Interestingly, SegF preferentially cleaves gene 56 of T2, both in vitro and in vivo, compared with that of phage T4. Repair of the double-strand break (DSB) results in the predominance of T4 genes 56 and segF in the progeny, with exclusion of the corresponding T2 sequences. Localized exclusion of T2 gene 56 is dependent on full-length SegF and is likely analogous to group I intron homing, in which repair of a DSB results in coconversion of markers in the flanking DNA. Phage T4 has many optional homing endonuclease genes similar to segF, whereas similar endonuclease genes are relatively rare in other members of the T-even family of bacteriophages. We propose that the general advantage enjoyed by T4 phage, over almost all of its relatives, is a cumulative effect of many of these localized events.
Affiliation(s)
- Archana Belle
- Department of Biological Sciences and Center for Molecular Genetics, University at Albany, State University of New York, Albany, New York 12222, USA
|
28
|
Lew BM, Paulus H. An in vivo screening system against protein splicing useful for the isolation of non-splicing mutants or inhibitors of the RecA intein of Mycobacterium tuberculosis. Gene 2002; 282:169-77. [PMID: 11814689 DOI: 10.1016/s0378-1119(01)00836-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Indexed: 10/27/2022]
Abstract
Protein splicing involves the self-catalyzed excision of an intervening sequence, the intein, from a precursor protein, with the concomitant ligation of the flanking extein sequences to yield a new polypeptide. The ability of inteins to promote protein splicing even when inserted into a foreign context has facilitated the study of the modulation of protein splicing. In this paper, we describe an in vivo screening system for the isolation of mutations or inhibitors that interfere with protein splicing mediated by the RecA intein of Mycobacterium tuberculosis. It involves the activation of the cytotoxic CcdB protein by protein splicing, such that host cells survive in the presence of inducer only when protein splicing is blocked. The coding sequence for the RecA intein was inserted in-frame into the polylinker region of an inducible lacZ alpha-ccdB fusion vector, leading to inactivation of the CcdB toxin unless the intein is excised by protein splicing. Depending on the objective of the screening procedure, its stringency can be modified by altering the level of expression of the intein-CcdB fusion protein. To induce large amounts of CcdB fusion proteins, the fusion protein is expressed from a high-copy-number plasmid. Such a screening system detects even low levels of protein splicing and we have used it to show that protein splicing of the RecA intein is compatible with any amino acid in the extein position adjacent to the N-terminal splice junction. In order to search for protein splicing inhibitors, which may attenuate protein splicing by less than an order of magnitude, we have also constructed a low-copy-number intein-CcdB plasmid so that the host cells can survive when splicing of the expressed CcdB fusion protein is only moderately suppressed. We anticipate that the CcdB-based in vivo screening system will find uses in the analysis of structural and mechanistic aspects of protein splicing.
Affiliation(s)
- Belinda M Lew
- Boston Biomedical Research Institute, 64 Grove Street, Watertown, MA 02472, USA
|
29
|
Senejani AG, Hilario E, Gogarten JP. The intein of the Thermoplasma A-ATPase A subunit: structure, evolution and expression in E. coli. BMC Biochemistry 2001; 2:13. [PMID: 11722801 PMCID: PMC60005 DOI: 10.1186/1471-2091-2-13] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Received: 10/17/2001] [Accepted: 11/14/2001] [Indexed: 11/28/2022]
Abstract
BACKGROUND: Inteins are selfish genetic elements that excise themselves from the host protein during post-translational processing, and religate the host protein with a peptide bond. In addition to this splicing activity, most reported inteins also contain an endonuclease domain that is important in intein propagation. RESULTS: The gene encoding the Thermoplasma acidophilum A-ATPase catalytic subunit A is the only one in the entire T. acidophilum genome that has been identified to contain an intein. This intein is inserted in the same position as the inteins found in the ATPase A-subunit genes of Pyrococcus abyssi, P. furiosus and P. horikoshii and is found 20 amino acids upstream of the intein in the homologous vma-1 gene in Saccharomyces cerevisiae. In contrast to the other inteins in catalytic ATPase subunits, the T. acidophilum intein does not contain an endonuclease domain. T. acidophilum has different codon usage frequencies as compared to Escherichia coli. Initially, the low abundance of rare tRNAs prevented expression of the T. acidophilum A-ATPase A subunit in E. coli. Using a strain of E. coli that expresses additional tRNAs for rare codons, the T. acidophilum A-ATPase A subunit was successfully expressed in E. coli. CONCLUSIONS: Despite differences in pH and temperature between the E. coli and the T. acidophilum cytoplasms, the T. acidophilum intein retains efficient self-splicing activity when expressed in E. coli. The small intein in the Thermoplasma A-ATPase is closely related to the endonuclease-containing intein in the Pyrococcus A-ATPase. Phylogenetic analyses suggest that this intein was horizontally transferred between Pyrococcus and Thermoplasma, and that the small intein has persisted in Thermoplasma apparently without homing.
Affiliation(s)
- Alireza G Senejani
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3044, USA
- Elena Hilario
- Current address: HortResearch, 120 Mt Albert Road, Private Bag 92, 169 Mt Albert, Auckland, New Zealand
- J Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, 75 North Eagleville Rd. Storrs, CT 06269-3044, USA
|
30
|
Mian IS, Dubchak I. Representing and reasoning about protein families using generative and discriminative methods. J Comput Biol 2001; 7:849-62. [PMID: 11382366 DOI: 10.1089/10665270050514972] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Indexed: 11/12/2022]
Abstract
This work addresses the issues of data representation and incorporation of domain knowledge into the design of learning systems for reasoning about protein families. Given the limited expressive capacity of a particular method, a mixture of protein annotation and fold recognition experts, each implementing a different underlying representation, should provide a robust method for assigning sequences to families. These ideas are illustrated using two data-driven learning methods that make use of different prior information and employ independent, yet complementary, projections of a family: hidden Markov models (HMMs) based on a multiple sequence alignment and neural networks (NNs) based on global sequence descriptors of proteins. Examination of seven protein families indicates that combining a generative (HMM) and a discriminative (NN) method is better than either method on its own. Biologically, human 4-hydroxyphenylpyruvic acid dioxygenase, involved in tyrosinemia type 3, is predicted to be structurally and functionally related to the glyoxalase I family.
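The idea of pooling a generative and a discriminative expert can be illustrated with a minimal sketch. It assumes that each expert already reports a family-membership probability for a sequence. The function name `combine_experts`, the log-odds averaging rule, the equal default weighting and the example scores are all hypothetical illustrations; the abstract does not state how the two methods were actually combined.

```python
from math import exp, log


def logit(p: float) -> float:
    """Log-odds of a probability strictly between 0 and 1."""
    return log(p / (1.0 - p))


def combine_experts(p_hmm: float, p_nn: float, w_hmm: float = 0.5) -> float:
    """Blend two experts' probabilities by weighted averaging in log-odds space.

    p_hmm: hypothetical probability from a generative expert (e.g. an HMM).
    p_nn:  hypothetical probability from a discriminative expert (e.g. an NN).
    w_hmm: weight given to the generative expert; the remainder goes to the NN.
    """
    z = w_hmm * logit(p_hmm) + (1.0 - w_hmm) * logit(p_nn)
    return 1.0 / (1.0 + exp(-z))


if __name__ == "__main__":
    # Illustrative scores only: the HMM is unsure, the NN is confident.
    print(round(combine_experts(0.55, 0.90), 3))
```

With equal weights the pooled score always lies between the two input scores, so a confident expert can pull an uncertain one toward a decision without overriding it outright.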
Affiliation(s)
- I S Mian
- Department of Molecular and Cell Biology (MS 74-197), Radiation Biology and Environmental Toxicology Group, Life Sciences Division, Lawrence Berkeley National Laboratory, Cyclotron Road, Berkeley, CA 94720, USA
|
31
|
Perler FB. Hyperthermophilic inteins. Methods Enzymol 2001; 334:270-80. [PMID: 11398469 DOI: 10.1016/s0076-6879(01)34475-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/26/2022]
Affiliation(s)
- F B Perler
- New England BioLabs, Inc., Beverly, Massachusetts 01915, USA
|
32
|
Abstract
Inteins are the protein equivalent of introns and have been discovered in increasing numbers of organisms and host proteins. A self-splicing intein catalyzes its own removal from the host protein through a posttranslational process of protein splicing. A mobile intein displays a site-specific endonuclease activity that confers genetic mobility to the intein through intein homing. Recent findings on intein structure and the mechanism of protein splicing have illuminated how inteins work and yielded clues regarding the origin, spread, and evolution of inteins. Inteins can evolve into new structures and new functions, such as split inteins that perform trans-splicing. The structural basis of intein function needs to be identified for a full understanding of the origin and evolution of this marvelous genetic element.
Affiliation(s)
- X Q Liu
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada.
|
33
|
Bonocora RP, Shub DA. A novel group I intron-encoded endonuclease specific for the anticodon region of tRNA(fMet) genes. Mol Microbiol 2001; 39:1299-306. [PMID: 11251845 DOI: 10.1111/j.1365-2958.2001.02318.x] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.4] [Indexed: 11/30/2022]
Abstract
Open reading frames (ORFs) are frequently inserted into group I self-splicing introns. These ORFs encode either maturases that are required for splicing of the intron or DNA endonucleases that promote intron mobility. A self-splicing intron in the tRNA(fMet) gene of Synechocystis PCC 6803, which has been proposed to have moved laterally within the cyanobacteria, contains an ORF that is unrelated to known intron-encoded endonucleases or maturases. Here, using an in vitro transcription-translation system, we show that this intronic ORF encodes a double-strand DNA endonuclease, I-Ssp6803I. I-Ssp6803I cleaves each strand of the intronless tRNA(fMet) gene adjacent to the anticodon triplet leaving 3 bp 3' extensions and has no activity at intron-exon boundaries. Using an in vitro cleavage assay and scanning deletion mutants of the intronless target site, the minimal recognition site was determined to be a partially palindromic 20 bp region encompassing the entire anticodon stem and loop of the tRNA(fMet) gene. I-Ssp6803I represents a novel intron-encoded DNA endonuclease and is the first example of a chromosomally encoded group I intron endonuclease in bacteria.
Affiliation(s)
- R P Bonocora
- Department of Biological Sciences and Center for Molecular Genetics, University at Albany, State University of New York, 12222, USA
|
34
|
Abstract
Protein splicing is a form of posttranslational processing that consists of the excision of an intervening polypeptide sequence, the intein, from a protein, accompanied by the concomitant joining of the flanking polypeptide sequences, the exteins, by a peptide bond. It requires neither cofactors nor auxiliary enzymes and involves a series of four intramolecular reactions, the first three of which occur at a single catalytic center of the intein. Protein splicing can be modulated by mutation and converted to highly specific self-cleavage and protein ligation reactions that are useful protein engineering tools. Some of the reactions characteristic of protein splicing also occur in other forms of protein autoprocessing, ranging from peptide bond cleavage to conjugation with nonprotein moieties. These mechanistic similarities may be the result of convergent evolution, but in at least one case, hedgehog protein autoprocessing, there is definitely a close evolutionary relationship to protein splicing.
Affiliation(s)
- H Paulus
- Boston Biomedical Research Institute, 64 Grove Street, Watertown, Massachusetts 02472, USA.
|
35
|
Abstract
The demonstration over 30 years ago that inhibitors of cholesterol biosynthesis disrupt animal development suggested an intriguing connection between fundamental cellular metabolic processes and the more global processes of embryonic tissue patterning. Adding a new dimension to this relationship is the more recent finding that the Hedgehog family of tissue patterning factors are covalently modified by cholesterol. Here we review the mechanism of the Hedgehog autoprocessing reaction that results in this modification, and compare this reaction to that undergone by other autoprocessing proteins. We also discuss the biological consequences of cholesterol modification, in particular the use of cholesterol as a molecular handle in the spatial deployment of the protein signal in developing tissues. Finally, the developmental consequences of chemical and genetic disruption of cholesterol homeostasis are summarized, along with the potential importance of cholesterol-rich lipid rafts in production of and response to the Hh signal.
Affiliation(s)
- R K Mann
- Department of Molecular Biology and Genetics and Howard Hughes Medical Institute, The Johns Hopkins University, School of Medicine, Baltimore, MD 21205, USA
|
36
|
Southworth MW, Benner J, Perler FB. An alternative protein splicing mechanism for inteins lacking an N-terminal nucleophile. EMBO J 2000; 19:5019-26. [PMID: 10990465 PMCID: PMC314217 DOI: 10.1093/emboj/19.18.5019] [Citation(s) in RCA: 83] [Impact Index Per Article: 3.5] [Indexed: 01/27/2023]
Abstract
Variations in the intein-mediated protein splicing mechanism are becoming more apparent as polymorphisms in conserved catalytic residues are identified. The conserved Ser or Cys at the intein N-terminus and the conserved intein penultimate His are absent in the KlbA family of inteins. These inteins were predicted to be inactive, since an N-terminal Ala cannot perform the initial reaction of the standard protein splicing pathway to yield the requisite N-terminal splice junction (thio)ester. Despite the presence of an N-terminal Ala and a penultimate Ser, the KlbA inteins splice efficiently using an alternative protein splicing mechanism. In this non-canonical pathway, the C-extein nucleophile attacks a peptide bond at the N-terminal splice junction rather than a (thio)ester bond, alleviating the need to form the initial (thio)ester at the N-terminal splice junction. The remainder of the two pathways is the same: branch resolution by Asn cyclization is followed by an acyl rearrangement to form a native peptide bond between the ligated exteins.
Affiliation(s)
- M W Southworth
- New England BioLabs, 32 Tozer Road, Beverly, MA 01915, USA
|
37
|
Chen L, Benner J, Perler FB. Protein splicing in the absence of an intein penultimate histidine. J Biol Chem 2000; 275:20431-5. [PMID: 10770923 DOI: 10.1074/jbc.m000178200] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.6] [Indexed: 11/06/2022]
Abstract
Protein splicing is a self-catalytic process in which an intervening sequence, termed an intein, is excised from a protein precursor, and the flanking polypeptides are religated. The conserved intein penultimate His facilitates this reaction by assisting in Asn cyclization, which results in C-terminal splice junction cleavage. However, many inteins do not have a penultimate His. Previous splicing studies with 2 such inteins yielded contradictory results. To resolve this issue, the splicing capacity of 2 more inteins without penultimate His residues was examined. Both the Methanococcus jannaschii phosphoenolpyruvate synthase and RNA polymerase subunit A' inteins spliced. Splicing of the phosphoenolpyruvate synthase intein improved when its penultimate Phe was changed to His, but splicing of the RNA polymerase subunit A' intein was inhibited when its penultimate Gly was changed to His. We propose that inteins lacking a penultimate His (i) arose by mutation from ancestors in which a penultimate His facilitated splicing, (ii) that loss of this His inhibited, but may not have blocked, splicing, and (iii) that selective pressure for efficient expression of the RNA polymerase yielded an intein that utilizes another residue to assist Asn cyclization, changing the intein active site so that a penultimate His now inhibits splicing.
Affiliation(s)
- L Chen
- New England BioLabs Inc., Beverly, Massachusetts 01915, USA
|
38
|
Abstract
Persistence of a mobile DNA element in a population reflects a balance between the ability of the host to eliminate the element and the ability of the element to survive and to disseminate to other individuals. In each of the three biological kingdoms, several families of a mobile DNA element have been identified which encode a single protein that acts on nucleic acids. Collectively termed homing endonuclease genes (HEGs), these elements employ varied strategies to ensure their survival. Some members of the HEG families have a minimal impact on host fitness because they associate with genes having self-splicing introns or inteins that remove the HEGs at the RNA or protein level. The HEG and the intron/intein gene spread throughout the population by a gene conversion process initiated by the HEG-encoded endonuclease called 'homing' in which the HEG and intron/intein genes are copied to cognate alleles that lack them. The endonuclease activity also contributes to a high frequency of lateral transmission of HEGs between species as has been documented in plants and other systems. Other HEGs have positive selection value because the proteins have evolved activities that benefit their host organisms. The success of HEGs in colonizing diverse genetic niches results from the flexibility of the encoded endonucleases in adopting new specificities.
Affiliation(s)
- F S Gimble
- Center for Genome Research, Institute of Biosciences and Technology, The Texas A&M University System Health Science Center, 2121 W. Holcombe Blvd., Texas A&M University, Houston, TX, USA.
|
39
|
|
40
|
Hu D, Crist M, Duan X, Quiocho FA, Gimble FS. Probing the structure of the PI-SceI-DNA complex by affinity cleavage and affinity photocross-linking. J Biol Chem 2000; 275:2705-12. [PMID: 10644733 DOI: 10.1074/jbc.275.4.2705] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Indexed: 11/06/2022]
Abstract
The PI-SceI protein is an intein-encoded homing endonuclease that initiates the mobility of its gene by making a double strand break at a single site in the yeast genome. The PI-SceI protein splicing and endonucleolytic active sites are separately located in each of two domains in the PI-SceI structure. To determine the spatial relationship between bases in the PI-SceI recognition sequence and selected PI-SceI amino acids, the PI-SceI-DNA complex was probed by photocross-linking and affinity cleavage methods. Unique solvent-accessible cysteine residues were introduced into the two PI-SceI domains at positions 91, 97, 170, 230, 376, and 378, and the mutant proteins were modified with either 4-azidophenacyl bromide or iron (S)-1-(p-bromoacetamidobenzyl)-ethylenediaminetetraacetate (FeBABE). The phenyl azide-coupled proteins cross-linked to the PI-SceI target sequence, and the FeBABE-modified proteins cleaved the DNA proximal to the derivatized amino acid. The results suggest that an extended beta-hairpin loop in the endonuclease domain that contains residues 376 and 378 contacts the major groove near the PI-SceI cleavage site. Conversely, residues 91, 97, and 170 in the protein splicing domain are in close proximity to a distant region of the substrate. To interpret our results, we used a new PI-SceI structure that is ordered in regions of the protein that bind DNA. The data strongly support a model of the PI-SceI-DNA complex derived from this structure.
Affiliation(s)
- D Hu
- Center for Genome Research, Institute of Biosciences and Technology, Department of Medical Biochemistry, The Texas A&M University System Health Science Center, Houston, Texas 77030, USA
|
41
|
Wood DW, Wu W, Belfort G, Derbyshire V, Belfort M. A genetic system yields self-cleaving inteins for bioseparations. Nat Biotechnol 1999; 17:889-92. [PMID: 10471931 DOI: 10.1038/12879] [Citation(s) in RCA: 205] [Impact Index Per Article: 8.2] [Indexed: 11/08/2022]
Abstract
A self-cleaving element for use in bioseparations has been derived from a naturally occurring, 43 kDa protein splicing element (intein) through a combination of protein engineering and random mutagenesis. A mini-intein (18 kDa) previously engineered for reduced size had compromised activity and was therefore subjected to random mutagenesis and genetic selection. In one selection a mini-intein was isolated with restored splicing activity, while in another, a mutant was isolated with enhanced, pH-sensitive C-terminal cleavage activity. The enhanced-cleavage mutant has utility in affinity fusion-based protein purification. These mutants also provide new insights into the structural and functional roles of some conserved residues in protein splicing.
Affiliation(s)
- D W Wood
- Wadsworth Center, New York State Department of Health, and School of Public Health, State University of New York at Albany, Albany, NY 12201-2002, USA
|
42
|
Affiliation(s)
- F B Perler
- New England Biolabs Inc., 32 Tozer Rd, Beverly, MA 01915, USA.
|
43
|
Silva GH, Dalgaard JZ, Belfort M, Van Roey P. Crystal structure of the thermostable archaeal intron-encoded endonuclease I-DmoI. J Mol Biol 1999; 286:1123-36. [PMID: 10047486 DOI: 10.1006/jmbi.1998.2519] [Citation(s) in RCA: 76] [Impact Index Per Article: 3.0] [Indexed: 11/22/2022]
Abstract
I-DmoI is a 22 kDa endonuclease encoded by an intron in the 23S rRNA gene of the hyperthermophilic archaeon Desulfurococcus mobilis. The structure of I-DmoI has been determined to 2.2 Å resolution using multi-wavelength anomalous diffraction techniques. I-DmoI, a protein of the LAGLIDADG motif family, represents the first structure of a freestanding endonuclease with two LAGLIDADG motifs, and the first of a thermostable homing endonuclease. I-DmoI consists of two similar α/β domains (αββαββα) related by pseudo 2-fold symmetry. The LAGLIDADG motifs are located at the carboxy-terminal end of the first α-helix of each domain. These helices form a two-helix bundle at the interface between the domains and are perpendicular to a saddle-shaped DNA binding surface, formed by two four-stranded antiparallel β-sheets. Despite substantially different sequences, the overall fold of I-DmoI is similar to that of two other LAGLIDADG proteins for which the structures are known, I-CreI and the endonuclease domain of PI-SceI. The three structures differ most in the loops connecting the β-strands, relating to the respective DNA target site sizes and geometries. In addition, the absence of conserved residues surrounding the active site, other than those within the LAGLIDADG motif, is of mechanistic importance. Finally, the carboxy-terminal domain of I-DmoI is smaller and has a more irregular fold than the amino-terminal domain, which is more similar to I-CreI, a symmetric homodimeric endonuclease. This is reversed compared to PI-SceI, where the amino-terminal domain is more similar to the carboxy-terminal domain of I-DmoI and to I-CreI, with interesting evolutionary implications.
Affiliation(s)
- G H Silva
- Wadsworth Center, New York State Department of Health, Albany, NY, 12201-0509, USA
|
44
|
Wu H, Xu MQ, Liu XQ. Protein trans-splicing and functional mini-inteins of a cyanobacterial dnaB intein. Biochim Biophys Acta 1998; 1387:422-32. [PMID: 9748659 DOI: 10.1016/s0167-4838(98)00157-5] [Citation(s) in RCA: 135] [Impact Index Per Article: 5.2] [Indexed: 11/21/2022]
Abstract
A 429 aa theoretical intein is encoded in the dnaB gene (DNA helicase) of the cyanobacterium Synechocystis sp. strain PCC6803. This intein is shown to be capable of protein splicing with or without its native exteins when tested in E. coli cells. A centrally located 275 amino acid sequence (residues 107-381) of this intein can be deleted without loss of the protein splicing activity, resulting in a functional mini-intein of 154 aa in size. Efficient in vivo protein trans-splicing was observed when this mini-intein was split into a 106 aa N-terminal fragment containing intein motifs A and B, and a 48 aa C-terminal fragment containing intein motifs F and G. These results indicate that the N- and C-terminal regions of the Ssp DnaB intein, whether covalently linked with each other or not, can come together through non-covalent interaction to form a protein splicing domain that is functionally sufficient and structurally independent from the centrally located endonuclease domain of the intein.
Affiliation(s)
- H Wu
- Biochemistry Department, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada
|
45
|
Wu H, Hu Z, Liu XQ. Protein trans-splicing by a split intein encoded in a split DnaE gene of Synechocystis sp. PCC6803. Proc Natl Acad Sci U S A 1998; 95:9226-31. [PMID: 9689062 PMCID: PMC21320 DOI: 10.1073/pnas.95.16.9226] [Citation(s) in RCA: 289] [Impact Index Per Article: 11.1] [Received: 04/06/1998] [Indexed: 02/08/2023]
Abstract
A split intein capable of protein trans-splicing is identified in a DnaE protein of the cyanobacterium Synechocystis sp. strain PCC6803. The N- and C-terminal halves of DnaE (catalytic subunit alpha of DNA polymerase III) are encoded by two separate genes, dnaE-n and dnaE-c, respectively. These two genes are located 745,226 bp apart in the genome and on opposite DNA strands. The dnaE-n product consists of an N-extein sequence followed by a 123-aa intein sequence, whereas the dnaE-c product consists of a 36-aa intein sequence followed by a C-extein sequence. The N- and C-extein sequences together reconstitute a complete DnaE sequence that is interrupted by the intein sequences inside the beta- and tau-binding domains. The two intein sequences together reconstitute a split mini-intein that not only has intein-like sequence features but also exhibited protein trans-splicing activity when tested in Escherichia coli cells.
Affiliation(s)
- H Wu
- Biochemistry Department, Dalhousie University, Halifax, Nova Scotia, B3H 4H7, Canada
|
46
|
Mian IS, Moser MJ, Holley WR, Chatterjee A. Statistical modelling and phylogenetic analysis of a deaminase domain. J Comput Biol 1998; 5:57-72. [PMID: 9541871 DOI: 10.1089/cmb.1998.5.57] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.9] [Indexed: 02/07/2023]
Abstract
Deamination reactions are catalyzed by a variety of enzymes including those involved in nucleoside/nucleotide metabolism and cytosine to uracil (C→U) and adenosine to inosine (A→I) mRNA editing. The active site of the deaminase (DM) domain in these enzymes contains a conserved histidine (or rarely cysteine), two cysteines and a glutamate proposed to act as a proton shuttle during deamination. Here, a statistical model, a hidden Markov model (HMM), of the DM domain has been created which identifies currently known DM domains and suggests new DM domains in viral, bacterial and eucaryotic proteins. However, no DM domains were identified in the currently predicted proteins from the archaeon Methanococcus jannaschii, and possible causes for, and a potential means to ameliorate, this situation are discussed. In some of the newly identified DM domains, the glutamate is changed to a residue that could not function as a proton shuttle and in one instance (Mus musculus spermatid protein TENR) the cysteines are also changed to lysine and serine. These may be non-competent DM domains able to bind but not act upon their substrate. Phylogenetic analysis using an HMM-generated alignment of DM domains reveals three branches with clear substructure in each branch. The results suggest DM domains that are candidates for yeast, platyhelminth, plant and mammalian C→U and A→I mRNA editing enzymes. Some bacterial and eucaryotic DM domains form distinct branches in the phylogenetic tree suggesting the existence of common, novel substrates.
Affiliation(s)
- I S Mian
- Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
|
47
|
Grindl W, Wende W, Pingoud V, Pingoud A. The protein splicing domain of the homing endonuclease PI-sceI is responsible for specific DNA binding. Nucleic Acids Res 1998; 26:1857-62. [PMID: 9518476 PMCID: PMC147489 DOI: 10.1093/nar/26.8.1857] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.1] [Indexed: 02/06/2023]
Abstract
The homing endonuclease PI-SceI consists of a protein splicing domain (I) and an endonucleolytic domain (II). To characterize the two domains with respect to their contribution to DNA recognition we cloned, purified and characterized the isolated domains. Both domains have no detectable endonucleolytic activity. Domain I binds specifically to the PI-SceI recognition sequence, whereas domain II displays only weak non-specific DNA binding. In the specific complex with domain I the DNA is bent to a similar extent as observed with the initial complex formed between PI-SceI and DNA. Our results indicate that protein splicing domain I is also involved in recognition of the DNA substrate.
Affiliation(s)
- W Grindl
- Institut für Biochemie, Justus-Liebig-Universität, Heinrich-Buff-Ring 58, D-35392 Giessen, Germany
|
48
|
Abstract
Previous analyses have shown that inteins (protein splicing elements) employ two structural organizations: the 'canonical' Nintein-Dod-inteinC found in dozens of inteins and a 'non-canonical' Nintein-inteinC described in two inteins, where Nintein at the N-terminus and inteinC at the C-terminus are conserved domains involved in self-splicing and Dod is the Dod DNA endonuclease (DNase). In this study, four non-canonical inteins, each with unique structural features, have been identified using alignment-based Hidden Markov Models. A Nintein-inteinC intein, carrying an unprecedented replacement of the N-terminal catalytic Cys(Ser) by Ala, is described in a putative ATPase encoded by Methanococcus jannaschii. Three replicative proteins of Synechocystis spp. contain inteins with the organizations: (i) Nintein minus X minus inteinC over Dod, where X is an uncharacterized domain and Dod DNase is located in an alternative open reading frame (ORF) being embedded between two novel CG and YK domains; (ii) Nintein-HN-inteinC, where HN stands for phage-like DNase from the EX1H-HX3H family; (iii) Nintein>|<inteinC, where >|< indicates that the intein domains are associated with a disrupted host protein encoded by two spatially separated ORFs. The expression of some of these newly identified inteins may affect the intein hosts. The variety of structural forms of inteins could have evolved through invasion of self-splicing proteases by different mobile DNases or the departure of mobile DNases from canonical inteins.
Affiliation(s)
- A E Gorbalenya
- M. P. Chumakov Institute of Poliomyelitis and Viral Encephalitides, Russian Academy of Medical Sciences, 142782 Moscow Region, Russia.
|
49
|
Chute IC, Hu Z, Liu XQ. A topA intein in Pyrococcus furiosus and its relatedness to the r-gyr intein of Methanococcus jannaschii. Gene 1998; 210:85-92. [PMID: 9524230 DOI: 10.1016/s0378-1119(98)00044-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Indexed: 02/06/2023]
Abstract
A new intein coding sequence was found in a topA (DNA topoisomerase I) gene by cloning and sequencing this gene from the hyperthermophilic Archaeon Pyrococcus furiosus. The predicted Pfu topA intein sequence is 373 amino acids long and located two residues away from the catalytic tyrosine of the topoisomerase. It contains putative intein sequence blocks (C, E, and H) associated with intein endonuclease activity, in addition to intein sequence blocks (A, B, F, and G) that are necessary for protein splicing. This DNA topoisomerase I intein is most related to a reverse gyrase intein from the methanogenic Archaeon Methanococcus jannaschii. These two inteins share 31% amino acid sequence identity and, more importantly, have the same insertion sites in their respective host proteins. It is suggested that these two inteins are homologous inteins present in structurally related, but functionally distinct, proteins, with implications on intein evolution and intein homing.
Affiliation(s)
- I C Chute
- Biochemistry Department, Dalhousie University, Halifax, Nova Scotia, B3H 4H7, Canada
|
50
|
Abstract
Computational analysis of the Fanconi anemia (FA) complementation group A protein suggests that it contains a peroxidase domain. FA proteins may be part of a general mechanism that protects cells from oxidative damage.
Affiliation(s)
- I S Mian
- Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
|