101
|
Koike R, Kinoshita K, Kidera A. Probabilistic description of protein alignments for sequences and structures. Proteins 2004; 56:157-66. [PMID: 15162495 DOI: 10.1002/prot.20067] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
A number of equally optimal alignments inherently exist in the sequence and structure comparisons among proteins. To represent the sub-optimal alignments systematically, we have developed a method of generating probabilistic alignments for sequences and structures, by which the correspondence between pairs of residues is evaluated in a probabilistic manner. Our method uses the periodic boundary condition to avoid the entropy artifact favoring full-length matches. In the structure comparison, the environmental effects are incorporated by the mean-field approximation. We applied this method in comparisons of two pairs of proteins with internal symmetry; the first set were proteins of TIM-barrel fold and the second were beta-trefoil fold. These pairs are expected to have distinct sub-optimal alignments suitable for probabilistic description with the periodic boundary. It was shown that the sequence and structure alignments are consistent with each other and that the alignments with the highest probability represent circular permutation.
Collapse
Affiliation(s)
- Ryotaro Koike
- Department of Chemistry, Graduate School of Science, Kyoto University, Kitashirakawa-Oiwake-cho, Sakyo-ku, Kyoto 606-8502, Japan
| | | | | |
Collapse
|
102
|
Leopoldseder S, Claren J, Jürgens C, Sterner R. Interconverting the Catalytic Activities of (βα)8-barrel Enzymes from Different Metabolic Pathways: Sequence Requirements and Molecular Analysis. J Mol Biol 2004; 337:871-9. [PMID: 15033357 DOI: 10.1016/j.jmb.2004.01.062] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2003] [Revised: 01/26/2004] [Accepted: 01/26/2004] [Indexed: 11/30/2022]
Abstract
The (betaalpha)(8)-barrel enzymes N'-[(5'-phosphoribosyl)formimino]-5-aminoimidazole-4-carboxamide ribonucleotide isomerase (tHisA) and imidazole glycerol phosphate synthase (tHisF) from Thermotoga maritima catalyze two successive reactions in the biosynthesis of histidine. In both enzymes, aspartate residues at the C-terminal end of beta-strand 1 (Asp8 in tHisA and Asp11 in tHisF) and beta-strand 5 (Asp127 in tHisA and Asp130 in tHisF) are essential for catalytic activity. It was demonstrated earlier that in tHisA the substitution of Asp127 by valine (tHisA-D127V) generates phosphoribosylanthranilate isomerase (TrpF) activity, a related (betaalpha)(8)-barrel enzyme participating in tryptophan biosynthesis. It is shown here that in tHisF the corresponding substitution of Asp130 by valine (tHisF-D130V) also generates TrpF activity. To determine the effectiveness of individual amino acid exchanges in these conversions, each of the 20 standard amino acid residues was introduced at position 127 of tHisA and 130 of tHisF by saturation random mutagenesis. The tHisA-D127X and tHisF-D130X variants with TrpF activity were identified by selection in vivo, and the proteins purified and characterized. The results obtained show that removal of the negatively charged carboxylate side-chain at the C-terminal end of beta-strand 5 is sufficient to establish TrpF activity in tHisA and tHisF, presumably because it allows the binding of the negatively charged TrpF substrate, phosphoribosylanthranilate. In contrast, the double mutants tHisA-D8N+D127V and tHisF-D11N+D130V did not show detectable activity, demonstrating that the aspartate residues at the C-terminal end of beta-strand 1 are essential for catalysis of the TrpF reaction. The ease with which TrpF activity can be established on both the tHisA and tHisF scaffolds supports the evolutionary relationship of these three enzymes and highlights the functional plasticity of the (betaalpha)(8)-barrel enzyme fold.
Collapse
Affiliation(s)
- Sonja Leopoldseder
- Universität zu Köln, Institut für Biochemie, Otto-Fischer-Strasse 12-14, D-50674 Köln, Germany
| | | | | | | |
Collapse
|
103
|
Andi B, West AH, Cook PF. Stabilization and characterization of histidine-tagged homocitrate synthase from Saccharomyces cerevisiae. Arch Biochem Biophys 2004; 421:243-54. [PMID: 14984204 DOI: 10.1016/j.abb.2003.11.005] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Histidine-tagged homocitrate synthase from Saccharomyces cerevisiae was purified to about 98% using a Ni-NTA resin and stabilized using a combination of 100 mM guanidine hydrochloride, 100 mM alpha-cyclodextrin, and 600 mM ammonium sulfate. The enzyme was assayed using dichlorophenol indophenol (DCPIP) as an oxidant to oxidize the CoASH produced in the reaction. A stoichiometry of 1:1 was obtained between DCPIP and CoASH. Kinetic parameters for the stable enzyme at pH 7.5 are: Km (AcCoA), 24 microM: Km (alpha-kg), 1.3 mM; and kcat, 37 min(-1). The enzyme, in the absence of reactants, self-associates, as suggested by size exclusion chromatography. Fluorescence and circular dichroic spectra suggested a partially exposed tryptophan residue and a mixed (alpha/beta) secondary structure for the enzyme. Fluorescence quenching studies with KI, CsCl, and acrylamide suggest that the microenvironment around the single tryptophan residue of the enzyme has some positive charge.
Collapse
Affiliation(s)
- Babak Andi
- Department of Chemistry and Biochemistry, University of Oklahoma, 620 Parrington Oval, Norman, Oklahoma 73019, USA
| | | | | |
Collapse
|
104
|
Light S, Kraulis P. Network analysis of metabolic enzyme evolution in Escherichia coli. BMC Bioinformatics 2004; 5:15. [PMID: 15113413 PMCID: PMC394313 DOI: 10.1186/1471-2105-5-15] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2003] [Accepted: 02/18/2004] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The two most common models for the evolution of metabolism are the patchwork evolution model, where enzymes are thought to diverge from broad to narrow substrate specificity, and the retrograde evolution model, according to which enzymes evolve in response to substrate depletion. Analysis of the distribution of homologous enzyme pairs in the metabolic network can shed light on the respective importance of the two models. We here investigate the evolution of the metabolism in E. coli viewed as a single network using EcoCyc. RESULTS Sequence comparison between all enzyme pairs was performed and the minimal path length (MPL) between all enzyme pairs was determined. We find a strong over-representation of homologous enzymes at MPL 1. We show that the functionally similar and functionally undetermined enzyme pairs are responsible for most of the over-representation of homologous enzyme pairs at MPL 1. CONCLUSIONS The retrograde evolution model predicts that homologous enzymes pairs are at short metabolic distances from each other. In general agreement with previous studies we find that homologous enzymes occur close to each other in the network more often than expected by chance, which lends some support to the retrograde evolution model. However, we show that the homologous enzyme pairs which may have evolved through retrograde evolution, namely the pairs that are functionally dissimilar, show a weaker over-representation at MPL 1 than the functionally similar enzyme pairs. Our study indicates that, while the retrograde evolution model may have played a small part, the patchwork evolution model is the predominant process of metabolic enzyme evolution.
Collapse
Affiliation(s)
- Sara Light
- Stockholm Bioinformatics Center, Department of Biochemistry and Biophysics, Stockholm Center for Physics, Astronomy and Biotechnology, Stockholm University, Stockholm SE-10691, Sweden
| | - Per Kraulis
- Stockholm Bioinformatics Center, Department of Biochemistry and Biophysics, Stockholm Center for Physics, Astronomy and Biotechnology, Stockholm University, Stockholm SE-10691, Sweden
| |
Collapse
|
105
|
Soberón X, Fuentes-Gallego P, Saab-Rincón G. In vivo fragment complementation of a (β/α)8barrel protein: generation of variability by recombination. FEBS Lett 2004; 560:167-72. [PMID: 14988017 DOI: 10.1016/s0014-5793(04)00098-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2003] [Revised: 01/08/2004] [Accepted: 01/15/2004] [Indexed: 11/25/2022]
Abstract
The high representation of the TIM barrel as a scaffold for enzymatic proteins makes it an interesting model for protein engineering. Based on previous reports of folding mechanisms of TIM barrels that suggest an independent folding unit formed by six (beta/alpha) subunits, we interrupted the gene of phosphoribosylanthranilate isomerase (PRAI) from Escherichia coli at three different positions to yield fragments with different combinations of (beta/alpha) subunits. When these constructions were expressed as polycistrons in a TrpF-E. coli strain, complementation of the function only occurred with fragments beta1-alpha4 and beta5-alpha8, demonstrating that (beta/alpha)(4) subunits are stable enough to survive in vivo conditions and to assemble to yield a functional enzyme. The expression of these fragments in a separated plasmid/phagemid system to complement the function gave a slower complementation in the TrpF-E. coli strain; this was overcome by introducing extra secondary elements to the structure that reinforce their interaction.
Collapse
Affiliation(s)
- Xavier Soberón
- Instituto de Biotecnología, UNAM, Apartado Postal 510-3, Cuernavaca, Morelos 62271, Mexico
| | | | | |
Collapse
|
106
|
Lorentzen E, Pohl E, Zwart P, Stark A, Russell RB, Knura T, Hensel R, Siebers B. Crystal structure of an archaeal class I aldolase and the evolution of (betaalpha)8 barrel proteins. J Biol Chem 2003; 278:47253-60. [PMID: 12941964 DOI: 10.1074/jbc.m305922200] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Fructose-1,6-bisphosphate aldolase (FBPA) catalyzes the reversible cleavage of fructose 1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate in the glycolytic pathway. FBPAs from archaeal organisms have recently been identified and characterized as a divergent family of proteins. Here, we report the first crystal structure of an archaeal FBPA at 1.9-A resolution. The structure of this 280-kDa protein complex was determined using single wavelength anomalous dispersion followed by 10-fold non-crystallographic symmetry averaging and refined to an R-factor of 14.9% (Rfree 17.9%). The protein forms a dimer of pentamers, consisting of subunits adopting the ubiquitous (betaalpha)8 barrel fold. Additionally, a crystal structure of the archaeal FBPA covalently bound to dihydroxyacetone phosphate was solved at 2.1-A resolution. Comparison of the active site residues with those of classical FBPAs, which share no significant sequence identity but display the same overall fold, reveals a common ancestry between these two families of FBPAs. Structural comparisons, furthermore, establish an evolutionary link to the triosephosphate isomerases, a superfamily hitherto considered independent from the superfamily of aldolases.
Collapse
Affiliation(s)
- Esben Lorentzen
- European Molecular Biology Laboratory, Hamburg Outstation, Notkestrasse 85, D-22603 Hamburg, Germany
| | | | | | | | | | | | | | | |
Collapse
|
107
|
Wise EL, Yew WS, Gerlt JA, Rayment I. Structural Evidence for a 1,2-Enediolate Intermediate in the Reaction Catalyzed by 3-Keto-l-Gulonate 6-Phosphate Decarboxylase, a Member of the Orotidine 5‘-Monophosphate Decarboxylase Suprafamily†,‡. Biochemistry 2003; 42:12133-42. [PMID: 14567674 DOI: 10.1021/bi0348819] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
3-Keto-L-gulonate 6-phosphate decarboxylase (KGPDC) and orotidine 5'-phosphate decarboxylase (OMPDC) are members of an enzyme suprafamily, the OMPDC suprafamily, because they are homologous enzymes that catalyze mechanistically distinct reactions using different substrates. KGPDC catalyzes the Mg(2+) ion-dependent decarboxylation of 3-keto-L-gulonate 6-phosphate to yield L-xylulose 5-phosphate and CO(2); OMPDC catalyzes the metal ion-independent decarboxylation of OMP to UMP and CO(2). Structural studies have shown that KGPDC and OMPDC share several strictly conserved active site residues that are used differently by each enzyme to catalyze their mechanistically distinct reactions. Although the mechanism of the KGPDC-catalyzed reaction has yet to be elucidated, it is thought to proceed via a Mg(2+) ion-stabilized 1,2-enediolate intermediate. Here we report the crystal structures of KGPDC complexed with L-gulonate 6-phosphate, L-threonohydroxamate 4-phosphate, and L-xylitol 5-phosphate, analogues of the substrate, enediolate intermediate, and product, as well as with the product, L-xylulose 5-phosphate, at 1.2, 1.8, 1.7, and 1.8 A resolution, respectively. These structures support a mechanism that involves the formation of a cis-1,2-enediolate intermediate. Contrary to expectations, the geometry of the intermediate does not involve bidentate coordination of both enediolate oxygen atoms to the Mg(2+) ion but rather involves only the coordination of the oxygen on C2 to the Mg(2+) ion. The oxygen atom on C1 instead forms hydrogen bonds to both Lys64 and Asp67, two strictly conserved active site residues. Lys64 also interacts with the oxygen on C2 and may serve to stabilize a cis conformation of the 1,2-enediolate. These structures also implicate His136 to be the general acid that protonates the 1,2-enediolate intermediate. This study further demonstrates that multiple unrelated enzyme functions can evolve from a single active site architecture without regard for substrate binding affinity or mechanism.
Collapse
Affiliation(s)
- Eric L Wise
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin 53706, USA
| | | | | | | |
Collapse
|
108
|
Xie G, Keyhani NO, Bonner CA, Jensen RA. Ancient origin of the tryptophan operon and the dynamics of evolutionary change. Microbiol Mol Biol Rev 2003; 67:303-42, table of contents. [PMID: 12966138 PMCID: PMC193870 DOI: 10.1128/mmbr.67.3.303-342.2003] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The seven conserved enzymatic domains required for tryptophan (Trp) biosynthesis are encoded in seven genetic regions that are organized differently (whole-pathway operons, multiple partial-pathway operons, and dispersed genes) in prokaryotes. A comparative bioinformatics evaluation of the conservation and organization of the genes of Trp biosynthesis in prokaryotic operons should serve as an excellent model for assessing the feasibility of predicting the evolutionary histories of genes and operons associated with other biochemical pathways. These comparisons should provide a better understanding of possible explanations for differences in operon organization in different organisms at a genomics level. These analyses may also permit identification of some of the prevailing forces that dictated specific gene rearrangements during the course of evolution. Operons concerned with Trp biosynthesis in prokaryotes have been in a dynamic state of flux. Analysis of closely related organisms among the Bacteria at various phylogenetic nodes reveals many examples of operon scission, gene dispersal, gene fusion, gene scrambling, and gene loss from which the direction of evolutionary events can be deduced. Two milestone evolutionary events have been mapped to the 16S rRNA tree of Bacteria, one splitting the operon in two, and the other rejoining it by gene fusion. The Archaea, though less resolved due to a lesser genome representation, appear to exhibit more gene scrambling than the Bacteria. The trp operon appears to have been an ancient innovation; it was already present in the common ancestor of Bacteria and Archaea. Although the operon has been subjected, even in recent times, to dynamic changes in gene rearrangement, the ancestral gene order can be deduced with confidence. The evolutionary history of the genes of the pathway is discernible in rough outline as a vertical line of descent, with events of lateral gene transfer or paralogy enriching the analysis as interesting features that can be distinguished. As additional genomes are thoroughly analyzed, an increasingly refined resolution of the sequential evolutionary steps is clearly possible. These comparisons suggest that present-day trp operons that possess finely tuned regulatory features are under strong positive selection and are able to resist the disruptive evolutionary events that may be experienced by simpler, poorly regulated operons.
Collapse
Affiliation(s)
- Gary Xie
- Department of Microbiology and Cell Science, University of Florida, Gainesville, Florida 32611, USA
| | | | | | | |
Collapse
|
109
|
Sandhya S, Kishore S, Sowdhamini R, Srinivasan N. Effective detection of remote homologues by searching in sequence dataset of a protein domain fold. FEBS Lett 2003; 552:225-30. [PMID: 14527691 DOI: 10.1016/s0014-5793(03)00929-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Profile matching methods are commonly used in searches in protein sequence databases to detect evolutionary relationships. We describe here a sensitive protocol, which detects remote similarities by searching in a specialized database of sequences belonging to a fold. We have assessed this protocol by exploring the relationships we detect among sequences known to belong to specific folds. We find that searches within sequences adopting a fold are more effective in detecting remote similarities and evolutionary connections than searches in a database of all sequences. We also discuss the implications of using this strategy to link sequence and structure space.
Collapse
Affiliation(s)
- S Sandhya
- Molecular Biophysics Unit, Indian Institute of Science, 560 012 Bangalore, India
| | | | | | | |
Collapse
|
110
|
Affiliation(s)
- David B Searls
- Bioinformatics Division, Genetics Research, GlaxoSmithKline Pharmaceuticals, 709 Swedeland Road, P.O. Box 1539, King of Prussia, Pennsylvania 19406, USA.
| |
Collapse
|
111
|
Casals N, Gómez-Puertas P, Pié J, Mir C, Roca R, Puisac B, Aledo R, Clotet J, Menao S, Serra D, Asins G, Till J, Elias-Jones AC, Cresto JC, Chamoles NA, Abdenur JE, Mayatepek E, Besley G, Valencia A, Hegardt FG. Structural (betaalpha)8 TIM barrel model of 3-hydroxy-3-methylglutaryl-coenzyme A lyase. J Biol Chem 2003; 278:29016-23. [PMID: 12746442 DOI: 10.1074/jbc.m304276200] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
This study describes three novel homozygous missense mutations (S75R, S201Y, and D204N) in the 3-hydroxy-3-methylglutaryl-CoA (HMG-CoA) lyase gene, which caused 3-hydroxy-3-methylglutaric aciduria in patients from Germany, England, and Argentina. Expression studies in Escherichia coli show that S75R and S201Y substitutions completely abolished the HMG-CoA lyase activity, whereas D204N reduced catalytic efficiency to 6.6% of the wild type. We also propose a three-dimensional model for human HMG-CoA lyase containing a (betaalpha)8 (TIM) barrel structure. The model is supported by the similarity with analogous TIM barrel structures of functionally related proteins, by the localization of catalytic amino acids at the active site, and by the coincidence between the shape of the substrate (HMG-CoA) and the predicted inner cavity. The three novel mutations explain the lack of HMG-CoA lyase activity on the basis of the proposed structure: in S75R and S201Y because the new amino acid residues occlude the substrate cavity, and in D204N because the mutation alters the electrochemical environment of the active site. We also report the localization of all missense mutations reported to date and show that these mutations are located in the beta-sheets around the substrate cavity.
Collapse
Affiliation(s)
- Núria Casals
- Unit of Biochemistry and Molecular Biology, International University of Catalonia, Spain
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
112
|
Liu J, Mushegian A. Three monophyletic superfamilies account for the majority of the known glycosyltransferases. Protein Sci 2003; 12:1418-31. [PMID: 12824488 PMCID: PMC2323934 DOI: 10.1110/ps.0302103] [Citation(s) in RCA: 171] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Sixty-five families of glycosyltransferases (EC 2.4.x.y) have been recognized on the basis of high-sequence similarity to a founding member with experimentally demonstrated enzymatic activity. Although distant sequence relationships between some of these families have been reported, the natural history of glycosyltransferases is poorly understood. We used iterative searches of sequence databases, motif extraction, structural comparison, and analysis of completely sequenced genomes to track the origins of modern-type glycosyltransferases. We show that >75% of recognized glycosyltransferase families belong to one of only three monophyletic superfamilies of proteins, namely, (1) a recently described GPGTF/GT-B superfamily; (2) a nucleoside-diphosphosugar transferase (GT-A) superfamily, which is characterized by a DxD sequence signature and also includes nucleotidyltransferases; and (3) a GT-C superfamily of integral membrane glycosyltransferases with a modified DxD signature in the first extracellular loop. Several developmental regulators in Metazoans, including Fringe and Egghead homologs, belong to the second superfamily. Interestingly, Tout-velu/Exostosin family of developmental proteins found in all multicellular eukaryotes, contains separate domains belonging to the first and the second superfamilies, explaining multiple glycosyltransferase activities in one protein.
Collapse
Affiliation(s)
- Jing Liu
- Stowers Institute for Medical Research, Kansas City, Missouri 64110, USADepartment of Microbiology, Molecular Genetics, and Immunology, University of Kansas Medical Center, Kansas City, Kansas 66160, USA
| | - Arcady Mushegian
- Stowers Institute for Medical Research, Kansas City, Missouri 64110, USADepartment of Microbiology, Molecular Genetics, and Immunology, University of Kansas Medical Center, Kansas City, Kansas 66160, USA
| |
Collapse
|
113
|
Abstract
The evolution of enzymes and pathways is under debate. Recent studies show that recruitment of single enzymes from different pathways could be the driving force for pathway evolution. Other mechanisms of evolution, such as pathway duplication, enzyme specialization, de novo invention of pathways or retro-evolution of pathways, appear to be less abundant. Twenty percent of enzyme superfamilies are quite variable, not only in changing reaction chemistry or metabolite type but in changing both at the same time. These variable superfamilies account for nearly half of all known reactions. The most frequently occurring metabolites provide a helping hand for such changes because they can be accommodated by many enzyme superfamilies. Thus, a picture is emerging in which new pathways are evolving from central metabolites by preference, thereby keeping the overall topology of the metabolic network.
Collapse
Affiliation(s)
- Steffen Schmidt
- European Molecular Biology Laboratory Heidelberg, Postfach 102209, Germany
| | | | | | | |
Collapse
|
114
|
Joerger AC, Mayer S, Fersht AR. Mimicking natural evolution in vitro: an N-acetylneuraminate lyase mutant with an increased dihydrodipicolinate synthase activity. Proc Natl Acad Sci U S A 2003; 100:5694-9. [PMID: 12711733 PMCID: PMC156263 DOI: 10.1073/pnas.0531477100] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/13/2003] [Indexed: 11/18/2022] Open
Abstract
N-acetylneuraminate lyase (NAL) and dihydrodipicolinate synthase (DHDPS) belong to the NAL subfamily of (betaalpha)(8)-barrels. They share a common catalytic step but catalyze reactions in different biological pathways. By rational design, we have introduced various mutations into the NAL scaffold from Escherichia coli to switch the activity toward DHDPS. These mutants were tested with respect to their catalytic properties in vivo and in vitro as well as their stability. One point mutation (L142R) was sufficient to create an enzyme that could complement a bacterial auxotroph lacking the gene for DHDPS as efficiently as DHDPS itself. In vitro, this mutant had an increased DHDPS activity of up to 19-fold as defined by the specificity constant k(cat)K(M) for the new substrate l-aspartate-beta-semialdehyde when compared with the residual activity of NAL wild-type, mainly because of an increased turnover rate. At the same time, mutant L142R maintained much of its original NAL activity. We have solved the crystal structure of mutant L142R at 1.8 A resolution in complex with the inhibitor beta-hydroxypyruvate. This structure reveals that the conformations of neighboring active site residues are left virtually unchanged by the mutation. The high flexibility of R142 may favor its role in assisting in catalysis. Perhaps, nature has exploited the catalytic promiscuity of many enzymes to evolve novel enzymes or biological pathways during the course of evolution.
Collapse
Affiliation(s)
- Andreas C Joerger
- Cambridge University Chemical Laboratory and Cambridge Centre for Protein Engineering, Medical Research Council Centre, Hills Road, Cambridge CB2 2QH, United Kingdom
| | | | | |
Collapse
|
115
|
Copley RR, Ponting CP, Schultz J, Bork P. Sequence analysis of multidomain proteins: past perspectives and future directions. ADVANCES IN PROTEIN CHEMISTRY 2003; 61:75-98. [PMID: 12461821 DOI: 10.1016/s0065-3233(02)61002-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
|
116
|
Nájera H, Costas M, Fernández-Velasco DA. Thermodynamic characterization of yeast triosephosphate isomerase refolding: insights into the interplay between function and stability as reasons for the oligomeric nature of the enzyme. Biochem J 2003; 370:785-92. [PMID: 12472469 PMCID: PMC1223230 DOI: 10.1042/bj20021439] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2002] [Revised: 12/02/2002] [Accepted: 12/10/2002] [Indexed: 11/17/2022]
Abstract
The reasons underlying the oligomeric nature of some proteins such as triosephosphate isomerase (TIM) are unclear. It has been proposed that this enzyme is an oligomer, mainly because of its stability rather than for functional reasons. To address this issue, the reversible denaturation and renaturation of the homodimeric TIM from baker's yeast ( Saccharomyces cerevisiae ) induced by guanidinium chloride and urea have been characterized by spectroscopic, functional and hydrodynamic techniques. The unfolding and refolding of this enzyme are not coincident after 'conventional' equilibrium times. Unfolding experiments did not reach equilibrium, owing to a very slow dissociation and/or unfolding process. By contrast, equilibrium was reached in the refolding direction. The simplest equilibrium pathway compatible with the obtained data was found to be a three-state process involving an inactive and expanded monomer. The Gibbs energy changes for monomer folding (delta G (0)(fold) = -16.6+/-0.7 kJ x mol(-1)) and monomer association (delta G (0)(assoc) = -70.3+/-1.1 kJ x mol(-1)) were calculated from data obtained in the two denaturants. From an analysis of the present data and data from the literature on the stability of TIM from different species and for other beta/alpha barrels, and model simulations on the effect of stability in the catalytic activity of the enzyme, it is concluded that the low stability of the monomers is neither the only, nor the main, cause for the dimeric nature of TIM. There is interplay between function and stability.
Collapse
Affiliation(s)
- Hugo Nájera
- Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, Ciudad Universitaria. México, D.F. 04510, México
| | | | | |
Collapse
|
117
|
Barona-Gómez F, Hodgson DA. Occurrence of a putative ancient-like isomerase involved in histidine and tryptophan biosynthesis. EMBO Rep 2003; 4:296-300. [PMID: 12634849 PMCID: PMC1315899 DOI: 10.1038/sj.embor.embor771] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2002] [Revised: 01/13/2003] [Accepted: 01/15/2003] [Indexed: 11/09/2022] Open
Abstract
We report the occurrence of an isomerase with a putative (betaalpha)(8)-barrel structure involved in both histidine and trypto-phan biosynthesis in Streptomyces coelicolor A3(2) and Mycobacterium tuberculosis HR37Rv. Deletion of a hisA homologue (SCO2050) putatively encoding N'-[(5'-phosphoribosyl)-formimino]-5 amino-imidazole-4-carboxamide ribonucleotide isomerase from the chromosome of S. coelicolor A3(2) generated a double auxotrophic mutant for histidine and tryptophan. The bifunctional gene SCO2050 and its orthologue Rv1603 from M. tuberculosis complemented both hisA and trpF mutants of Escherichia coli. Expression of the E. coli trpF gene in the S. coelicolor mutant only complemented the tryptophan auxo-trophy, and the hisA gene only complemented the histidine auxotrophy. The discovery of this enzyme, which has a broad-substrate specificity, has implications for the evolution of metabolic pathways and may prove to be important for understanding the evolution of the (betaalpha)(8)-barrels.
Collapse
Affiliation(s)
- Francisco Barona-Gómez
- Department of Biological Sciences, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, UK.
| | | |
Collapse
|
118
|
Anantharaman V, Aravind L, Koonin EV. Emergence of diverse biochemical activities in evolutionarily conserved structural scaffolds of proteins. Curr Opin Chem Biol 2003; 7:12-20. [PMID: 12547421 DOI: 10.1016/s1367-5931(02)00018-2] [Citation(s) in RCA: 119] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Comparative analysis of numerous protein structures that have become available in the past few years, combined with genome comparison, has yielded new insights into the evolution of enzymes and their functions. In addition to the well-known diversification of substrate specificities, enzymes with several widespread catalytic folds, particularly the TIM barrel, the RRM-like domain and the double-stranded beta-helix (cupin) domain, have been extensively explored in 'reaction space', resulting in the evolution of numerous, diverse catalytic activities supported by the same structural scaffold. Common protein folds differ widely in the diversity of catalyzed reactions. The biochemical plasticity of a fold seems to hinge on the presence of a generic, symmetrical substrate-binding pocket as opposed to highly specialized binding sites.
Collapse
Affiliation(s)
- Vivek Anantharaman
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | |
Collapse
|
119
|
Smith CV, Huang CC, Miczak A, Russell DG, Sacchettini JC, Höner zu Bentrup K. Biochemical and structural studies of malate synthase from Mycobacterium tuberculosis. J Biol Chem 2003; 278:1735-43. [PMID: 12393860 DOI: 10.1074/jbc.m209248200] [Citation(s) in RCA: 108] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Establishment or maintenance of a persistent infection by Mycobacterium tuberculosis requires the glyoxylate pathway. This is a bypass of the tricarboxylic acid cycle in which isocitrate lyase and malate synthase (GlcB) catalyze the net incorporation of carbon during growth of microorganisms on acetate or fatty acids as the primary carbon source. The glcB gene from M. tuberculosis, which encodes malate synthase, was cloned, and GlcB was expressed in Escherichia coli. The influence of media conditions on expression in M. tuberculosis indicated that this enzyme is regulated differentially to isocitrate lyase. Purified GlcB had K(m) values of 57 and 30 microm for its substrates glyoxylate and acetyl coenzyme A, respectively, and was inhibited by bromopyruvate, oxalate, and phosphoenolpyruvate. The GlcB structure was solved to 2.1-A resolution in the presence of glyoxylate and magnesium. We also report the structure of GlcB in complex with the products of the reaction, coenzyme A and malate, solved to 2.7-A resolution. Coenzyme A binds in a bent conformation, and the details of its interactions are described, together with implications on the enzyme mechanism.
Collapse
Affiliation(s)
- Clare V Smith
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas 77843-2128, USA
| | | | | | | | | | | |
Collapse
|
120
|
Blouin C, Boucher Y, Roger AJ. Inferring functional constraints and divergence in protein families using 3D mapping of phylogenetic information. Nucleic Acids Res 2003; 31:790-7. [PMID: 12527789 PMCID: PMC140515 DOI: 10.1093/nar/gkg151] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Comparative sequence analysis has been used to study specific questions about the structure and function of proteins for many years. Here we propose a knowledge-based framework in which the maximum likelihood rate of evolution is used to quantify the level of constraint on the identity of a site. We demonstrate that site-rate mapping on 3D structures using datasets of rhodopsin-like G-protein receptors and alpha- and beta-tubulins provides an excellent tool for pinpointing the functional features shared between orthologous and paralogous proteins. In addition, functional divergence within protein families can be inferred by examining the differences in the site rates, the differences in the chemical properties of the side chains or amino acid usage between aligned sites. Two novel analytical methods are introduced to characterize rate- independent functional divergence. These are tested using a dataset of two classes of HMG-CoA reductases for which only one class can perform both the forward and reverse reaction. We show that functionally divergent sites occur in a cluster of sites interacting with the catalytic residues and that this information should facilitate the design of experimental strategies to directly test functional properties of residues.
Collapse
Affiliation(s)
- Christian Blouin
- Canadian Institute for Advanced Research, Program in Evolutionary Biology, Genome Atlantic, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS B3H 4H7, Canada.
| | | | | |
Collapse
|
121
|
Rodríguez-Romero A, Hernández-Santoyo A, del Pozo Yauner L, Kornhauser A, Fernández-Velasco DA. Structure and inactivation of triosephosphate isomerase from Entamoeba histolytica. J Mol Biol 2002; 322:669-75. [PMID: 12270704 DOI: 10.1016/s0022-2836(02)00809-4] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Triosephosphate isomerase (TIM) has been proposed as a target for drug design. TIMs from several parasites have a cysteine residue at the dimer interface, whose derivatization with thiol-specific reagents induces enzyme inactivation and aggregation. TIMs lacking this residue, such as human TIM, are less affected. TIM from Entamoeba histolytica (EhTIM) has the interface cysteine residue and presents more than ten insertions when compared with the enzyme from other pathogens. To gain further insight into the role that interface residues play in the stability and reactivity of these enzymes, we determined the high-resolution structure and characterized the effect of methylmethane thiosulfonate (MMTS) on the activity and conformational properties of EhTIM. The structure of this enzyme was determined at 1.5A resolution using molecular replacement, observing that the dimer is not symmetric. EhTIM is completely inactivated by MMTS, and dissociated into stable monomers that possess considerable secondary structure. Structural and spectroscopic analysis of EhTIM and comparison with TIMs from other pathogens reveal that conformational rearrangements of the interface after dissociation, as well as intramonomeric contacts formed by the inserted residues, may contribute to the unusual stability of the derivatized EhTIM monomer.
Collapse
Affiliation(s)
- Adela Rodríguez-Romero
- Laboratorio Universitario de Estructura de Proteínas and Departamento de Bioquímica, Instituto de Química, Universidad Nacional Autónoma de México, DF, Mexico.
| | | | | | | | | |
Collapse
|
122
|
Dean AM, Neuhauser C, Grenier E, Golding GB. The pattern of amino acid replacements in alpha/beta-barrels. Mol Biol Evol 2002; 19:1846-64. [PMID: 12411594 DOI: 10.1093/oxfordjournals.molbev.a004009] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The determinants of site-to-site variability in the rate of amino acid replacement in alpha/beta-barrel enzyme structures are investigated. Of 125 available alpha/beta-barrel structures, only 25 meet a variety of phylogenetic and statistical criteria necessary to ensure sufficient data for reliable analysis. These 25 enzyme structures (from a wide variety of taxa with diverse lifestyles in diverse habitats) differ greatly in size, number, and topology of domains in addition to the alpha/beta-barrel, quaternary structure, metabolic role, reaction catalyzed, presence of prosthetic groups, regulatory mechanisms, use of cofactors, and catalytic mechanisms. Yet, with the exception of ribulose-1,5-bisphosphate carboxylase, all structures have similar frequency distributions of amino acid replacement rates. Hence, site-specific variability in rates of evolution is largely independent of differences in biology, biochemistry, and molecular structure. A correlation between site-specific rate variation and (1) distance from the active site, (2) solvent accessibility, and (3) treating glycines in unusual main-chain conformations as a separate class, explains approximately half the causal variation. Secondary structure exerts little influence on the pattern and distribution of replacements. Additional domains and subunits, side-chain hydrogen bonds, unusual side-chain rotamers, nonplanar peptide bonds, strained main-chain conformations, and buried hydrophilic-charged residues contribute little to variability among sites because they are rare. Nonlinear models do not improve the fits. In several enzymes, deviations from the typical pattern of replacements suggest the possible action of natural selection. A statistical analysis shows that, in all cases, much of the remaining unexplained variation is not attributable to chance and that other, as yet unidentified, causal relations must exist.
Collapse
Affiliation(s)
- Antony M Dean
- The Biological Process Technology Institute, University of Minnesota, St. Paul, 55108, USA.
| | | | | | | |
Collapse
|
123
|
Henn-Sax M, Thoma R, Schmidt S, Hennig M, Kirschner K, Sterner R. Two (betaalpha)(8)-barrel enzymes of histidine and tryptophan biosynthesis have similar reaction mechanisms and common strategies for protecting their labile substrates. Biochemistry 2002; 41:12032-42. [PMID: 12356303 DOI: 10.1021/bi026092h] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The enzymes N'-[(5'-phosphoribosyl)formimino]-5-aminoimidazole-4-carboxamide ribonucleotide isomerase (HisA) and phosphoribosylanthranilate isomerase (TrpF) are sugar isomerases that are involved in histidine and tryptophan biosynthesis, respectively. Both enzymes have the (betaalpha)(8)-barrel fold and catalyze Amadori rearrangements of a thermolabile aminoaldose into the corresponding aminoketose. To identify those amino acids that are essential for catalysis, conserved residues at the active sites of both HisA and TrpF from the hyperthermophile Thermotoga maritima were replaced by site-directed mutagenesis, and the purified variants were investigated by steady-state enzyme kinetics. Aspartate 8, aspartate 127, and threonine 164 appeared to be important for the HisA reaction, whereas cysteine 7 and aspartate 126 appeared to be important for the TrpF reaction. On the basis of these results and the X-ray structure of a complex between TrpF and a bound product analogue, a reaction mechanism involving general acid-base catalysis and a Schiff base intermediate is proposed for both enzymes. A comparison of the HisA and TrpF enzymes from T. maritima and Escherichia coli showed that, at the physiological temperatures of 80 and 37 degrees C, respectively, the enzymes from the hyperthermophile have significantly higher catalytic efficiencies than the corresponding enzymes from mesophiles. These results suggest that HisA and TrpF have similar chemical reaction mechanisms and use the same strategy to prevent the loss of their thermolabile substrates.
Collapse
Affiliation(s)
- Martina Henn-Sax
- Institut für Biochemie, Universität zu Köln, Otto-Fischer-Strasse 12-14, D-50674 Köln, Germany
| | | | | | | | | | | |
Collapse
|
124
|
Williams A, Westhead DR. Sequence relationships in the legume lectin fold and other jelly rolls. Protein Eng Des Sel 2002; 15:771-4. [PMID: 12468709 DOI: 10.1093/protein/15.10.771] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Distant sequence relationships in proteins containing the beta jelly-roll fold were investigated using sensitive sequence comparison methods, including PSI-BLAST and Hidden Markov Models. A relationship was identified between the rmlC-like and phosphomannose isomerase SCOP (version 1.53) superfamilies, which were merged in the most recent SCOP release. No other distant sequence relationships linking jelly roll superfamilies were found.
Collapse
Affiliation(s)
- A Williams
- School of Biochemistry and Molecular Biology, University of Leeds, Leeds LS2 9JT, UK
| | | |
Collapse
|
125
|
Nagano N, Orengo CA, Thornton JM. One fold with many functions: the evolutionary relationships between TIM barrel families based on their sequences, structures and functions. J Mol Biol 2002; 321:741-65. [PMID: 12206759 DOI: 10.1016/s0022-2836(02)00649-6] [Citation(s) in RCA: 466] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The eightfold (betaalpha) barrel structure, first observed in triose-phosphate isomerase, occurs ubiquitously in nature. It is nearly always an enzyme and most often involved in molecular or energy metabolism within the cell. In this review we bring together data on the sequence, structure and function of the proteins known to adopt this fold. We highlight the sequence and functional diversity in the 21 homologous superfamilies, which include 76 different sequence families. In many structures, the barrels are "mixed and matched" with other domains generating additional variety. Global and local structure-based alignments are used to explore the distribution of the associated functional residues on this common structural scaffold. Many of the substrates/co-factors include a phosphate moiety, which is usually but not always bound towards the C-terminal end of the sequence. Some, but not all, of these structures, exhibit a structurally conserved "phosphate binding motif". In contrast metal-ligating residues and catalytic residues are distributed along the sequence. However, we also found striking structural superposition of some of these residues. Lastly we consider the possible evolutionary relationships between these proteins, whose sequences are so diverse that even the most powerful approaches find few relationships, yet whose active sites all cluster at one end of the barrel. This extreme example of the "one fold-many functions" paradigm illustrates the difficulty of assigning function through a structural genomics approach for some folds.
Collapse
Affiliation(s)
- Nozomi Nagano
- Biomolecular Structure and Modelling Group, Department of Biochemistry and Molecular Biology, University College London, Gower Street, UK.
| | | | | |
Collapse
|
126
|
Ponting CP, Russell RR. The natural history of protein domains. ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE 2002; 31:45-71. [PMID: 11988462 DOI: 10.1146/annurev.biophys.31.082901.134314] [Citation(s) in RCA: 193] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Genome sequencing and structural genomics projects are providing new insights into the evolutionary history ofprote in domains. As methods for sequence and structure comparison improve, more distantly related domains are shown to be homologous. Thus there is a need for domain families to be classified within a hierarchy similar to Linnaeus' Systema Naturae, the classification of species. With such a hierarchy in mind, we discuss the evolution of domains, their combination into proteins, and evidence as to the likely origin of protein domains. We also discuss when and how analysis of domains can be used to understand details of protein function. Unconventional features of domain evolution such as intragenomic competition, domain insertion, horizontal gene transfer, and convergent evolution are seen as analogs of organismal evolutionary events. These parallels illustrate how the concept of domains can be applied to provide insights into evolutionary biology.
Collapse
Affiliation(s)
- Chris P Ponting
- Department of Human Anatomy and Genetics, University of Oxford, MRC Functional Genetics Unit, South Parks Road, Oxford OX1 3QX, UK.
| | | |
Collapse
|
127
|
Forsyth WR, Matthews CR. Folding mechanism of indole-3-glycerol phosphate synthase from Sulfolobus solfataricus: a test of the conservation of folding mechanisms hypothesis in (beta(alpha))(8) barrels. J Mol Biol 2002; 320:1119-33. [PMID: 12126630 DOI: 10.1016/s0022-2836(02)00557-0] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
As a test of the hypothesis that folding mechanisms are better conserved than sequences in TIM barrels, the equilibrium and kinetic folding mechanisms of indole-3-glycerol phosphate synthase (sIGPS) from the thermoacidophilic archaebacterium Sulfolobus solfataricus were compared to the well-characterized models of the alpha subunit of tryptophan synthase (alphaTS) from Escherichia coli. A multifaceted approach combining urea denaturation and far-UV circular dichroism, tyrosine fluorescence total intensity, and tyrosine fluorescence anisotropy was employed. Despite a sequence identity of only 13%, a stable intermediate (I) in sIGPS was found to be similar to a stable intermediate in alphaTS in terms of its thermodynamic properties and secondary structure. Kinetic experiments revealed that the fastest detectable folding event for sIGPS involves a burst-phase (<5ms) reaction that leads directly to the stable intermediate. The slower of two subsequent phases reflects the formation/disruption of an off-pathway dimeric form of I. The faster phase reflects the conversion of I to the native state and is limited by folding under marginally stable conditions and by isomerization or rearrangement under strongly folding conditions. By contrast, alphaTS is thought to fold via an off-pathway burst-phase intermediate whose unfolding controls access to a set of four on-pathway intermediates that comprise the stable equilibrium intermediate. At least three proline isomerization reactions are known to limit their interconversions and lead to a parallel channel mechanism. The simple sequential mechanism deduced for sIGPS reflects the dominance of the on-pathway burst-phase intermediate and the absence of prolyl residues that partition the stable intermediate into kinetically distinguishable species. Comparison of the results for sIGPS and alphaTS demonstrates that the thermodynamic properties and the final steps of the folding reaction are better conserved than the early events. The initial events in folding appear to be more sensitive to the sequence differences between the two TIM barrel proteins.
Collapse
Affiliation(s)
- William R Forsyth
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | | |
Collapse
|
128
|
Abstract
Several models have been proposed to explain the origin and evolution of enzymes in metabolic pathways. Initially, the retro-evolution model proposed that, as enzymes at the end of pathways depleted their substrates in the primordial soup, there was a pressure for earlier enzymes in pathways to be created, using the later ones as initial template, in order to replenish the pools of depleted metabolites. Later, the recruitment model proposed that initial templates from other pathways could be used as long as those enzymes were similar in chemistry or substrate specificity. These two models have dominated recent studies of enzyme evolution. These studies are constrained by either the small scale of the study or the artificial restrictions imposed by pathway definitions. Here, a network approach is used to study enzyme evolution in fully sequenced genomes, thus removing both constraints. We find that homologous pairs of enzymes are roughly twice as likely to have evolved from enzymes that are less than three steps away from each other in the reaction network than pairs of non-homologous enzymes. These results, together with the conservation of the type of chemical reaction catalyzed by evolutionarily related enzymes, suggest that functional blocks of similar chemistry have evolved within metabolic networks. One possible explanation for these observations is that this local evolution phenomenon is likely to cause less global physiological disruptions in metabolism than evolution of enzymes from other enzymes that are distant from them in the metabolic network.
Collapse
Affiliation(s)
- Rui Alves
- Department of Biological Sciences, Structural Bioinformatics Group, Biochemistry Building, Imperial College of Science, Technology and Medicine, London SW7 2AZ, UK
| | | | | |
Collapse
|
129
|
Kinch LN, Grishin NV. Expanding the nitrogen regulatory protein superfamily: Homology detection at below random sequence identity. Proteins 2002; 48:75-84. [PMID: 12012339 DOI: 10.1002/prot.10110] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Nitrogen regulatory (PII) proteins are signal transduction molecules involved in controlling nitrogen metabolism in prokaryots. PII proteins integrate the signals of intracellular nitrogen and carbon status into the control of enzymes involved in nitrogen assimilation. Using elaborate sequence similarity detection schemes, we show that five clusters of orthologs (COGs) and several small divergent protein groups belong to the PII superfamily and predict their structure to be a (betaalphabeta)(2) ferredoxin-like fold. Proteins from the newly emerged PII superfamily are present in all major phylogenetic lineages. The PII homologs are quite diverse, with below random (as low as 1%) pairwise sequence identities between some members of distant groups. Despite this sequence diversity, evidence suggests that the different subfamilies retain the PII trimeric structure important for ligand-binding site formation and maintain a conservation of conservations at residue positions important for PII function. Because most of the orthologous groups within the PII superfamily are composed entirely of hypothetical proteins, our remote homology-based structure prediction provides the only information about them. Analogous to structural genomics efforts, such prediction gives clues to the biological roles of these proteins and allows us to hypothesize about locations of functional sites on model structures or rationalize about available experimental information. For instance, conserved residues in one of the families map in close proximity to each other on PII structure, allowing for a possible metal-binding site in the proteins coded by the locus known to affect sensitivity to divalent metal ions. Presented analysis pushes the limits of sequence similarity searches and exemplifies one of the extreme cases of reliable sequence-based structure prediction. In conjunction with structural genomics efforts to shed light on protein function, our strategies make it possible to detect homology between highly diverse sequences and are aimed at understanding the most remote evolutionary connections in the protein world.
Collapse
Affiliation(s)
- Lisa N Kinch
- Howard Hughes Medical Institute, and Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | | |
Collapse
|
130
|
Abstract
Small-molecule metabolism forms the core of the metabolic processes of all living organisms. As early as 1945, possible mechanisms for the evolution of such a complex metabolic system were considered. The problem is to explain the appearance and development of a highly regulated complex network of interacting proteins and substrates from a limited structural and functional repertoire. By permitting the co-analysis of phylogeny and metabolism, the combined exploitation of pathway and structural databases, as well as the use of multiple-sequence alignment search algorithms, sheds light on this problem. Much of the current research suggests a chemistry-driven 'patchwork' model of pathway evolution, but other mechanisms may play a role. In the future, as metabolic structure and sequence space are further explored, it should become easier to trace the finer details of pathway development and understand how complexity has evolved.
Collapse
Affiliation(s)
- Stuart C G Rison
- Department of Biochemistry and Molecular Biology, University College London, Darwin Building, Gower Street, London WC1E 6BT, UK
| | | |
Collapse
|
131
|
Rison SCG, Teichmann SA, Thornton JM. Homology, pathway distance and chromosomal localization of the small molecule metabolism enzymes in Escherichia coli. J Mol Biol 2002; 318:911-32. [PMID: 12054833 DOI: 10.1016/s0022-2836(02)00140-7] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Here, we analyse Escherichia coli enzymes involved in small molecule metabolism (SMM). We introduce the concept of pathway distance as a measure of the number of distinct metabolic steps separating two SMM enzymes, and we consider protein homology (as determined by assigning enzymes to structural and sequence families) and gene interval (the number of genes separating two genes on the E. coli chromosome). The relationships between these three contexts (pathway distance, homology and chromosomal localisation) is investigated extensively. We make use of these relationships to suggest possible SMM evolution mechanisms. Homology between enzyme pairs close in the SMM was higher than expected by chance but was still rare. When observed, homologues usually conserved their reaction mechanism and/or co-factor binding rather than shared substrate binding. The correlation between pathway distance and gene intervals was clear. Enzymes catalysing nearby SMM reactions were usually encoded by genes close by on the E. coli chromosome. We found many co-regulated blocks of three to four genes (usually non-homologous) encoding enzymes occurring within four metabolic steps of one another; nearly all of these blocks formed part of known or predicted operons. The "inline reuse" of enzymes (i.e. the use of the same enzyme to catalyse two or more different steps of a metabolic pathway) is also discussed: of these enzymes, four were multifunctional (i.e. catalysed a different reaction in each instance), nine had multiple substrate specificity (i.e. catalysed the same reaction on different substrates in each instance) and one catalysed the same reaction on the same substrate but as part of two different complexes. We also identified 59 sets of isozymic proteins most commonly duplicated to function under different conditions, or with a different preferred substrate or minor substrate. In addition to transcriptional units, isozymes and inline reuse of enzymes provide mechanisms for controlling the SMM network. Our data suggest that several pathway evolution mechanisms may occur in concert, although chemistry-driven duplication/recruitment is favoured. SMM exploits regulatory strategies involving chromosomal location, isozymes and the reuse of enzymes.
Collapse
Affiliation(s)
- Stuart C G Rison
- Department of Biochemistry and Molecular Biology, University College London, Darwin Building, Gower Street, London WC1E 6BT, UK
| | | | | |
Collapse
|
132
|
Németh A, Kamondi S, Szilágyi A, Magyar C, Kovári Z, Závodszky P. Increasing the thermal stability of cellulase C using rules learned from thermophilic proteins: a pilot study. Biophys Chem 2002; 96:229-41. [PMID: 12034443 DOI: 10.1016/s0301-4622(02)00027-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Some structural features underlying the increased thermostability of enzymes from thermophilic organisms relative to their homologues from mesophiles are known from earlier studies. We used cellulase C from Clostridium thermocellum to test whether thermostability can be increased by mutations designed using rules learned from thermophilic proteins. Cellulase C has a TIM barrel fold with an additional helical subdomain. We designed and produced a number of mutants with the aim to increase its thermostability. Five mutants were designed to create new electrostatic interactions. They all retained catalytic activity but exhibited decreased thermostability relative to the wild-type enzyme. Here, the stabilizing contributions are obviously smaller than the destabilization caused by the introduction of the new side chains. In another mutant, the small helical subdomain was deleted. This mutant lost activity but its melting point was only 3 degrees C lower than that of the wild-type enzyme, which suggests that the subdomain is an independent folding unit and is important for catalytic function. A double mutant was designed to introduce a new disulfide bridge into the enzyme. This mutant is active and has an increased stability (deltaT(m)=3 degrees C, delta(deltaG(u))=1.73 kcal/mol) relative to the wild-type enzyme. Reduction of the disulfide bridge results in destabilization and an altered thermal denaturation behavior. We conclude that rules learned from thermophilic proteins cannot be used in a straightforward way to increase the thermostability of a protein. Creating a crosslink such as a disulfide bond is a relatively sure-fire method but the stabilization may be smaller than calculated due to coupled destabilizing effects.
Collapse
Affiliation(s)
- Attila Németh
- Institute of Enzymology, Hungarian Academy of Sciences, Budapest, Hungary
| | | | | | | | | | | |
Collapse
|
133
|
Aloy P, Oliva B, Querol E, Aviles FX, Russell RB. Structural similarity to link sequence space: new potential superfamilies and implications for structural genomics. Protein Sci 2002; 11:1101-16. [PMID: 11967367 PMCID: PMC2373547 DOI: 10.1110/ps.3950102] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The current pace of structural biology now means that protein three-dimensional structure can be known before protein function, making methods for assigning homology via structure comparison of growing importance. Previous research has suggested that sequence similarity after structure-based alignment is one of the best discriminators of homology and often functional similarity. Here, we exploit this observation, together with a merger of protein structure and sequence databases, to predict distant homologous relationships. We use the Structural Classification of Proteins (SCOP) database to link sequence alignments from the SMART and Pfam databases. We thus provide new alignments that could not be constructed easily in the absence of known three-dimensional structures. We then extend the method of Murzin (1993b) to assign statistical significance to sequence identities found after structural alignment and thus suggest the best link between diverse sequence families. We find that several distantly related protein sequence families can be linked with confidence, showing the approach to be a means for inferring homologous relationships and thus possible functions when proteins are of known structure but of unknown function. The analysis also finds several new potential superfamilies, where inspection of the associated alignments and superimpositions reveals conservation of unusual structural features or co-location of conserved amino acids and bound substrates. We discuss implications for Structural Genomics initiatives and for improvements to sequence comparison methods.
Collapse
Affiliation(s)
- Patrick Aloy
- EMBL, Biocomputing, Meyerhofstrasse 1, D-69117 Heidelberg, Germany
| | | | | | | | | |
Collapse
|
134
|
Anantharaman V, Koonin EV, Aravind L. Comparative genomics and evolution of proteins involved in RNA metabolism. Nucleic Acids Res 2002; 30:1427-64. [PMID: 11917006 PMCID: PMC101826 DOI: 10.1093/nar/30.7.1427] [Citation(s) in RCA: 381] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
RNA metabolism, broadly defined as the compendium of all processes that involve RNA, including transcription, processing and modification of transcripts, translation, RNA degradation and its regulation, is the central and most evolutionarily conserved part of cell physiology. A comprehensive, genome-wide census of all enzymatic and non-enzymatic protein domains involved in RNA metabolism was conducted by using sequence profile analysis and structural comparisons. Proteins related to RNA metabolism comprise from 3 to 11% of the complete protein repertoire in bacteria, archaea and eukaryotes, with the greatest fraction seen in parasitic bacteria with small genomes. Approximately one-half of protein domains involved in RNA metabolism are present in most, if not all, species from all three primary kingdoms and are traceable to the last universal common ancestor (LUCA). The principal features of LUCA's RNA metabolism system were reconstructed by parsimony-based evolutionary analysis of all relevant groups of orthologous proteins. This reconstruction shows that LUCA possessed not only the basal translation system, but also the principal forms of RNA modification, such as methylation, pseudouridylation and thiouridylation, as well as simple mechanisms for polyadenylation and RNA degradation. Some of these ancient domains form paralogous groups whose evolution can be traced back in time beyond LUCA, towards low-specificity proteins, which probably functioned as cofactors for ribozymes within the RNA world framework. The main lineage-specific innovations of RNA metabolism systems were identified. The most notable phase of innovation in RNA metabolism coincides with the advent of eukaryotes and was brought about by the merge of the archaeal and bacterial systems via mitochondrial endosymbiosis, but also involved emergence of several new, eukaryote-specific RNA-binding domains. Subsequent, vast expansions of these domains mark the origin of alternative splicing in animals and probably in plants. In addition to the reconstruction of the evolutionary history of RNA metabolism, this analysis produced numerous functional predictions, e.g. of previously undetected enzymes of RNA modification.
Collapse
Affiliation(s)
- Vivek Anantharaman
- National Center for Biotechnology Information, National Library of Medicine, 8600 Rockville Pike, Building 389, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | |
Collapse
|
135
|
Wise E, Yew WS, Babbitt PC, Gerlt JA, Rayment I. Homologous (beta/alpha)8-barrel enzymes that catalyze unrelated reactions: orotidine 5'-monophosphate decarboxylase and 3-keto-L-gulonate 6-phosphate decarboxylase. Biochemistry 2002; 41:3861-9. [PMID: 11900527 DOI: 10.1021/bi012174e] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The 3-keto-L-gulonate 6-phosphate decarboxylase (KGPDC) encoded by the ulaD gene in the Escherichia coli genome [Yew, W. S., and Gerlt, J. A. (2002) J. Bacteriol. 184, 302-306] and orotidine 5'-monophosphate decarboxylase (OMPDC) are homologous (derived from a common ancestor) but catalyze different reactions. The metal-independent decarboxylation reaction catalyzed by OMPDC avoids the formation of a vinyl anion intermediate; the Mg2+-dependent decarboxylation reaction catalyzed by KGPDC involves the formation of an enediolate anion intermediate. Based on the available structures of OMPDC, a sequence alignment allows the predictions that (1) KGPDC is a dimer of (beta/alpha)8-barrels, with the active sites located at the dimer interface; (2) KGPDC and OMPDC share an aspartate residue at the end of the first beta-strand and an Asp-x-Lys-x-x-Asp motif at the end of the third beta-strand with OMPDC; but (3) KGPDC has a Glu instead of a Lys at the end of the second beta-strand. The structure of KGPDC has been determined in the presence of Mg2+ and the substrate analogue L-gulonate 6-phosphate and confirms these predictions. The carboxylate functional groups at the ends of the first, second, and third beta-strands in KGPDC are ligands of the Mg2+; in OMPDC, the homologues of these residues participate in a hydrogen-bonded network that facilitates the decarboxylation reaction. The 3-OH group of the substrate analogue is coordinated to the Mg2+, supporting the hypothesis that the mechanism of the decarboxylation catalyzed by KGPDC involves stabilization of an enediolate anion intermediate. These structural studies establish the existence of the OMPDC "suprafamily," in which members catalyze reactions that occur in different metabolic pathways and share no mechanistic relationship. The existence of this suprafamily demonstrates that divergent evolution can be opportunistic, conscripting active site features of a progenitor to catalyze unrelated functions. Accordingly, sequence or structure homology alone cannot be used to infer the functions of new proteins discovered in genome projects.
Collapse
Affiliation(s)
- Eric Wise
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin 53706, USA
| | | | | | | | | |
Collapse
|
136
|
Knöchel T, Pappenberger A, Jansonius JN, Kirschner K. The crystal structure of indoleglycerol-phosphate synthase from Thermotoga maritima. Kinetic stabilization by salt bridges. J Biol Chem 2002; 277:8626-34. [PMID: 11741953 DOI: 10.1074/jbc.m109517200] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The crystal structure of the thermostable indoleglycerol-phosphate synthase from Thermotoga maritima (tIGPS) was determined at 2.5 A resolution. It was compared with the structures of the thermostable sIGPS from Sulfolobus solfataricus and of the thermolabile eIGPS from Escherichia coli. The main chains of the three (beta alpha)(8)-barrel proteins superimpose closely, and the packing of side chains in the beta-barrel cores, as well as the architecture of surface loops, is very similar. Both thermostable proteins have, however, 17 strong salt bridges, compared with only 10 in eIGPS. The number of additional salt bridges in tIGPS and sIGPS correlates well with their reduced rate of irreversible thermal inactivation at 90 degrees C. Only 3 of 17 salt bridges in tIGPS and sIGPS are topologically conserved. The major difference between the two proteins is the preference for interhelical salt bridges in sIGPS and intrahelical ones in tIGPS. The different implementation of salt bridges in the closely related proteins suggests that the stabilizing effect of salt bridges depends rather on the sum of their individual contributions than on their location. This observation is consistent with a protein unfolding mechanism where the simultaneous breakdown of all salt bridges is the rate-determining step.
Collapse
Affiliation(s)
- Thorsten Knöchel
- Division of Structural Biology, Biozentrum, University Basel, Klingelbergstrasse 70, CH-4056 Basel, Switzerland.
| | | | | | | |
Collapse
|
137
|
Douangamath A, Walker M, Beismann-Driemeyer S, Vega-Fernandez MC, Sterner R, Wilmanns M. Structural evidence for ammonia tunneling across the (beta alpha)(8) barrel of the imidazole glycerol phosphate synthase bienzyme complex. Structure 2002; 10:185-93. [PMID: 11839304 DOI: 10.1016/s0969-2126(02)00702-5] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Since reactive ammonia is not available under physiological conditions, glutamine is used as a source for the incorporation of nitrogen in a number of metabolic pathway intermediates. The heterodimeric ImGP synthase that links histidine and purine biosynthesis belongs to the family of glutamine amidotransferases in which the glutaminase activity is coupled with a subsequent synthase activity specific for each member of the enzyme family. Its X-ray structure from the hyperthermophile Thermotoga maritima shows that the glutaminase subunit is associated with the N-terminal face of the (beta alpha)(8) barrel cyclase subunit. The complex reveals a putative tunnel for the transfer of ammonia over a distance of 25 A. Although ammonia tunneling has been reported for glutamine amidotransferases, the ImGP synthase has evolved a novel mechanism, which extends the known functional properties of the versatile (beta alpha)(8) barrel fold.
Collapse
Affiliation(s)
- Alice Douangamath
- EMBL Hamburg Outstation c/o DESY, Notkestrasse 85, D-22603 Hamburg, Germany
| | | | | | | | | | | |
Collapse
|
138
|
Pearl FMG, Lee D, Bray JE, Buchan DWA, Shepherd AJ, Orengo CA. The CATH extended protein-family database: providing structural annotations for genome sequences. Protein Sci 2002; 11:233-44. [PMID: 11790833 PMCID: PMC2373435 DOI: 10.1110/ps.16802] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
An automatic sequence search and analysis protocol (DomainFinder) based on PSI-BLAST and IMPALA, and using conservative thresholds, has been developed for reliably integrating gene sequences from GenBank into their respective structural families within the CATH domain database (http://www.biochem.ucl.ac.uk/bsm/cath_new). DomainFinder assigns a new gene sequence to a CATH homologous superfamily provided that PSI-BLAST identifies a clear relationship to at least one other Protein Data Bank sequence within that superfamily. This has resulted in an expansion of the CATH protein family database (CATH-PFDB v1.6) from 19,563 domain structures to 176,597 domain sequences. A further 50,000 putative homologous relationships can be identified using less stringent cut-offs and these relationships are maintained within neighbour tables in the CATH Oracle database, pending further evidence of their suggested evolutionary relationship. Analysis of the CATH-PFDB has shown that only 15% of the sequence families are close enough to a known structure for reliable homology modeling. IMPALA/PSI-BLAST profiles have been generated for each of the sequence families in the expanded CATH-PFDB and a web server has been provided so that new sequences may be scanned against the profile library and be assigned to a structure and homologous superfamily.
Collapse
Affiliation(s)
- Frances M G Pearl
- Department of Biochemistry and Molecular Biology, University College London, University of London, London WC1E 6BT, UK.
| | | | | | | | | | | |
Collapse
|
139
|
Abstract
The (beta alpha)(8)-barrel is the most frequent and most versatile fold among enzymes [Höcker et al., Curr. Opin. Biotechnol. 12 (2001) 376-381; Wierenga, FEBS Lett. 492 (2001) 193-198]. Structural and functional evidence suggests that (beta alpha)(8)-barrels evolved from an ancestral half-barrel, which consisted of four (beta alpha) units stabilized by dimerization [Lang et al., Science 289 (2000) 1546-550; Höcker et al., Nat. Struct. Biol. 8 (2001) 32-36; Gerlt and Babbitt, Nat. Struct. Biol. 8 (2001) 5-7]. Here, by performing a comprehensive database search, we detect a striking and unexpected structural and amino acid sequence similarity between (beta alpha)(4) half-barrels and members of the (beta alpha)(5) flavodoxin-like fold. These findings provoke the hypothesis that a large fraction of the modern-day enzymes evolved from a basic structural building block, which can be identified by a combination of sequence and structural analyses.
Collapse
Affiliation(s)
- Birte Höcker
- Universität zu Köln, Institut für Biochimie, Cologne, Germany
| | | | | |
Collapse
|
140
|
Wrabl JO, Grishin NV. Homology between O-linked GlcNAc transferases and proteins of the glycogen phosphorylase superfamily. J Mol Biol 2001; 314:365-74. [PMID: 11846551 DOI: 10.1006/jmbi.2001.5151] [Citation(s) in RCA: 99] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The O-linked GlcNAc transferases (OGTs) are a recently characterized group of largely eukaryotic enzymes that add a single beta-N-acetylglucosamine moiety to specific serine or threonine hydroxyls. In humans, this process may be part of a sugar regulation mechanism or cellular signaling pathway that is involved in many important diseases, such as diabetes, cancer, and neurodegeneration. However, no structural information about the human OGT exists, except for the identification of tetratricopeptide repeats (TPR) at the N terminus. The locations of substrate binding sites are unknown and the structural basis for this enzyme's function is not clear. Here, remote homology is reported between the OGTs and a large group of diverse sugar processing enzymes, including proteins with known structure such as glycogen phosphorylase, UDP-GlcNAc 2-epimerase, and the glycosyl transferase MurG. This relationship, in conjunction with amino acid similarity spanning the entire length of the sequence, implies that the fold of the human OGT consists of two Rossmann-like domains C-terminal to the TPR region. A conserved motif in the second Rossmann domain points to the UDP-GlcNAc donor binding site. This conclusion is supported by a combination of statistically significant PSI-BLAST hits, consensus secondary structure predictions, and a fold recognition hit to MurG. Additionally, iterative PSI-BLAST database searches reveal that proteins homologous to the OGTs form a large and diverse superfamily that is termed GPGTF (glycogen phosphorylase/glycosyl transferase). Up to one-third of the 51 functional families in the CAZY database, a glycosyl transferase classification scheme based on catalytic residue and sequence homology considerations, can be unified through this common predicted fold. GPGTF homologs constitute a substantial fraction of known proteins: 0.4% of all non-redundant sequences and about 1% of proteins in the Escherichia coli genome are found to belong to the GPGTF superfamily.
Collapse
Affiliation(s)
- J O Wrabl
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas TX 75390-9050, USA
| | | |
Collapse
|
141
|
Schmidt-Dannert C. Directed evolution of single proteins, metabolic pathways, and viruses. Biochemistry 2001; 40:13125-36. [PMID: 11683620 DOI: 10.1021/bi011310c] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- C Schmidt-Dannert
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 1479 Gortner Avenue, St. Paul, Minnesota 55126, USA.
| |
Collapse
|
142
|
Gough J, Karplus K, Hughey R, Chothia C. Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. J Mol Biol 2001; 313:903-19. [PMID: 11697912 DOI: 10.1006/jmbi.2001.5080] [Citation(s) in RCA: 870] [Impact Index Per Article: 37.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Of the sequence comparison methods, profile-based methods perform with greater selectively than those that use pairwise comparisons. Of the profile methods, hidden Markov models (HMMs) are apparently the best. The first part of this paper describes calculations that (i) improve the performance of HMMs and (ii) determine a good procedure for creating HMMs for sequences of proteins of known structure. For a family of related proteins, more homologues are detected using multiple models built from diverse single seed sequences than from one model built from a good alignment of those sequences. A new procedure is described for detecting and correcting those errors that arise at the model-building stage of the procedure. These two improvements greatly increase selectivity and coverage. The second part of the paper describes the construction of a library of HMMs, called SUPERFAMILY, that represent essentially all proteins of known structure. The sequences of the domains in proteins of known structure, that have identities less than 95 %, are used as seeds to build the models. Using the current data, this gives a library with 4894 models. The third part of the paper describes the use of the SUPERFAMILY model library to annotate the sequences of over 50 genomes. The models match twice as many target sequences as are matched by pairwise sequence comparison methods. For each genome, close to half of the sequences are matched in all or in part and, overall, the matches cover 35 % of eukaryotic genomes and 45 % of bacterial genomes. On average roughly 15% of genome sequences are labelled as being hypothetical yet homologous to proteins of known structure. The annotations derived from these matches are available from a public web server at: http://stash.mrc-lmb.cam.ac.uk/SUPERFAMILY. This server also enables users to match their own sequences against the SUPERFAMILY model library.
Collapse
Affiliation(s)
- J Gough
- Laboratory of Molecular Biology, MRC, Hills Road, Cambridge, CB2 2QH, UK.
| | | | | | | |
Collapse
|
143
|
Jain D, Nair DT, Swaminathan GJ, Abraham EG, Nagaraju J, Salunke DM. Structure of the induced antibacterial protein from tasar silkworm, Antheraea mylitta. Implications to molecular evolution. J Biol Chem 2001; 276:41377-82. [PMID: 11522783 DOI: 10.1074/jbc.m104674200] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The crystal structure of an antibacterial protein of immune origin (TSWAB), purified from tasar silkworm (Antheraea mylitta) larvae after induction by Escherichia coli infection, has been determined. This is the first insect lysozyme structure and represents induced lysozymes of innate immunity. The core structure of TSWAB is similar to c-type lysozymes and alpha-lactalbumins. However, TSWAB shows significant differences with respect to the other two proteins in the exposed loop regions. The catalytic residues in TSWAB are conserved with respect to the chicken lysozyme, indicating a common mechanism of action. However, differences in the noncatalytic residues in the substrate binding groove imply subtle differences in the specificity and the level of activity. Thus, conformational differences between TSWAB and chicken lysozyme exist, whereas functional mechanisms appear to be similar. On the other hand, alpha-lactalbumins and c-type lysozymes exhibit drastically different functions with conserved molecular conformation. It is evident that a common molecular scaffold is exploited in the three enzymes for apparently different physiological roles. It can be inferred on the basis of the structure-function comparison of these three proteins having common phylogenetic origin that the conformational changes in a protein are minimal during rapid evolution as compared with those in the normal course of evolution.
Collapse
Affiliation(s)
- D Jain
- National Institute of Immunology, New Delhi 110 067, India
| | | | | | | | | | | |
Collapse
|
144
|
Henn-Sax M, Höcker B, Wilmanns M, Sterner R. Divergent evolution of (betaalpha)8-barrel enzymes. Biol Chem 2001; 382:1315-20. [PMID: 11688714 DOI: 10.1515/bc.2001.163] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The (betaalpha)8-barrel is the most versatile and most frequently encountered fold among enzymes. It is an interesting question how the contemporary (betaalpha)8-barrels are evolutionarily related and by which mechanisms they evolved from more simple precursors. Comprehensive comparisons of amino acid sequences and three-dimensional structures suggest that a large fraction of the known (betaalpha)8-barrels have divergently evolved from a common ancestor. The mutational interconversion of enzymatic activities of several (betaalpha)8-barrels further supports their common evolutionary origin. Moreover, the high structural similarity between the N- and C-terminal (betaalpha)4 units of two (betaalpha)8-barrel enzymes from histidine biosynthesis indicates that the contemporary proteins evolved by tandem duplication and fusion of the gene of an ancestral 'half-barrel' precursor. In support of this hypothesis, recombinantly produced 'half-barrels' were shown to be folded, dimeric proteins.
Collapse
Affiliation(s)
- M Henn-Sax
- Institut für Biochemie, Universität zu Köln, Germany
| | | | | | | |
Collapse
|
145
|
Tsoka S, Ouzounis CA. Functional versatility and molecular diversity of the metabolic map of Escherichia coli. Genome Res 2001; 11:1503-10. [PMID: 11544193 PMCID: PMC311099 DOI: 10.1101/gr.187501] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We have analyzed the known metabolic enzymes of Escherichia coli in relation to their biochemical reaction properties and their involvement in biochemical pathways. All enzymes involved in small-molecule metabolism and their corresponding protein sequences have been extracted from the EcoCyc database. These 548 metabolic enzymes are clustered into 405 protein families according to sequence similarity. In this study, we examine the functional versatility within enzyme families in terms of their reaction capabilities and pathway participation. In addition, we examine the molecular diversity of reactions and pathways according to their presence across enzyme families. These complex, many-to-many relationships between protein sequence and biochemical function reveal a significant degree of correlation between enzyme families and reactions. Pathways, however, appear to require more than one enzyme type to perform their complex biochemical transformations. Finally, the distribution of enzyme family members across different pathways provides support for the "recruitment" hypothesis of biochemical pathway evolution.
Collapse
Affiliation(s)
- S Tsoka
- Computational Genomics Group, Research Programme, The European Bioinformatics Institute, European Molecular Biology Laboratory, Cambridge Outstation, Cambridge CB10 1SD, UK
| | | |
Collapse
|
146
|
Teichmann SA, Rison SC, Thornton JM, Riley M, Gough J, Chothia C. The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli. J Mol Biol 2001; 311:693-708. [PMID: 11518524 DOI: 10.1006/jmbi.2001.4912] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The 106 small molecule metabolic (SMM) pathways in Escherichia coli are formed by the protein products of 581 genes. We can define 722 domains, nearly all of which are homologous to proteins of known structure, that form all or part of 510 of these proteins. This information allows us to answer general questions on the structural anatomy of the SMM pathway proteins and to trace family relationships and recruitment events within and across pathways. Half the gene products contain a single domain and half are formed by combinations of between two and six domains. The 722 domains belong to one of 213 families that have between one and 51 members. Family members usually conserve their catalytic or cofactor binding properties; substrate recognition is rarely conserved. Of the 213 families, members of only a quarter occur in isolation, i.e. they form single-domain proteins. Most members of the other families combine with domains from just one or two other families and a few more versatile families can combine with several different partners. Excluding isoenzymes, more than twice as many homologues are distributed across pathways as within pathways. However, serial recruitment, with two consecutive enzymes both being recruited to another pathway, is rare and recruitment of three consecutive enzymes is not observed. Only eight of the 106 pathways have a high number of homologues. Homology between consecutive pairs of enzymes with conservation of the main substrate-binding site but change in catalytic mechanism (which would support a simple model of retrograde pathway evolution) occurs only six times in the whole set of enzymes. Most of the domains that form SMM pathways have homologues in non-SMM pathways. Taken together, these results imply a pervasive "mosaic" model for the formation of protein repertoires and pathways.
Collapse
Affiliation(s)
- S A Teichmann
- Department of Biochemistry and Molecular Biology, University College London, Darwin Building, Gower Street, London, WC1E 6BT, UK.
| | | | | | | | | | | |
Collapse
|
147
|
Siebers B, Brinkmann H, Dörr C, Tjaden B, Lilie H, van der Oost J, Verhees CH. Archaeal fructose-1,6-bisphosphate aldolases constitute a new family of archaeal type class I aldolase. J Biol Chem 2001; 276:28710-8. [PMID: 11387336 DOI: 10.1074/jbc.m103447200] [Citation(s) in RCA: 84] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Fructose-1,6-bisphosphate (FBP) aldolase activity has been detected previously in several Archaea. However, no obvious orthologs of the bacterial and eucaryal Class I and II FBP aldolases have yet been identified in sequenced archaeal genomes. Based on a recently described novel type of bacterial aldolase, we report on the identification and molecular characterization of the first archaeal FBP aldolases. We have analyzed the FBP aldolases of two hyperthermophilic Archaea, the facultatively heterotrophic Crenarchaeon Thermoproteus tenax and the obligately heterotrophic Euryarchaeon Pyrococcus furiosus. For enzymatic studies the fba genes of T. tenax and P. furiosus were expressed in Escherichia coli. The recombinant FBP aldolases show preferred substrate specificity for FBP in the catabolic direction and exhibit metal-independent Class I FBP aldolase activity via a Schiff-base mechanism. Transcript analyses reveal that the expression of both archaeal genes is induced during sugar fermentation. Remarkably, the fbp gene of T. tenax is co-transcribed with the pfp gene that codes for the reversible PP(i)-dependent phosphofructokinase. As revealed by phylogenetic analyses, orthologs of the T. tenax and P. furiosus enzyme appear to be present in almost all sequenced archaeal genomes, as well as in some bacterial genomes, strongly suggesting that this new enzyme family represents the typical archaeal FBP aldolase. Because this new family shows no significant sequence similarity to classical Class I and II enzymes, a new name is proposed, archaeal type Class I FBP aldolases (FBP aldolase Class IA).
Collapse
Affiliation(s)
- B Siebers
- Department of Microbiology, Universität Essen, 45117 Essen, Germany.
| | | | | | | | | | | | | |
Collapse
|
148
|
Höcker B, Jürgens C, Wilmanns M, Sterner R. Stability, catalytic versatility and evolution of the (beta alpha)(8)-barrel fold. Curr Opin Biotechnol 2001; 12:376-81. [PMID: 11551466 DOI: 10.1016/s0958-1669(00)00230-5] [Citation(s) in RCA: 76] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
The (beta alpha)(8)-barrel is a versatile single-domain protein fold that is adopted by a large number of enzymes. The (beta alpha)(8)-barrel fold has been used as a model to elucidate the structural basis of protein thermostability and in studies to interconvert catalytic activities or substrate specificities by rational design or directed evolution. Recently, the (beta alpha)(4)-half-barrel was identified as a possible structural subdomain.
Collapse
Affiliation(s)
- B Höcker
- Universität zu Köln, Institut für Biochemie, Otto-Fischer-Strasse 12-14, D-50674 Köln, Germany
| | | | | | | |
Collapse
|
149
|
|
150
|
|