151
|
Marques HM, Brown KL. Molecular mechanics and molecular dynamics simulations of porphyrins, metalloporphyrins, heme proteins and cobalt corrinoids. Coord Chem Rev 2002. [DOI: 10.1016/s0010-8545(01)00411-8] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
152
|
Swain MT, Kemp GJ. Modelling protein side-chain conformations using constraint logic programming. COMPUTERS & CHEMISTRY 2001; 26:85-95. [PMID: 11765856 DOI: 10.1016/s0097-8485(01)00103-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Side-chain placement is an important sub-task in protein modelling. Selecting conformations for side-chains is a difficult problem because of the large search space to be explored. This problem can be addressed using constraint logic programming (CLP), which is an artificial intelligence technique developed to solve large combinatorial search problems. The side-chain placement problem can be expressed as a CLP program in which rotamer conformations are used as values for finite domain variables, and bad steric contacts involving rotamers are represented as constraints. This paper introduces the concept of null rotamers, and shows how these can be used in implementing a novel iterative approach. We present results that compare the accuracy of models constructed using different rotamer libraries and different domain variable enumeration heuristics. The results obtained using this CLP-based approach compare favourably with those obtained by other methods.
Collapse
Affiliation(s)
- M T Swain
- Department of Computing Science, King's College, University of Aberdeen, Aberdeen, Scotland AB24 3UE, UK.
| | | |
Collapse
|
153
|
Bill RM, Hedfalk K, Karlgren S, Mullins JG, Rydström J, Hohmann S. Analysis of the pore of the unusual major intrinsic protein channel, yeast Fps1p. J Biol Chem 2001; 276:36543-9. [PMID: 11445582 DOI: 10.1074/jbc.m105045200] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Fps1p is a glycerol efflux channel from Saccharomyces cerevisiae. In this atypical major intrinsic protein neither of the signature NPA motifs of the family, which are part of the pore, is preserved. To understand the functional consequences of this feature, we analyzed the pseudo-NPA motifs of Fps1p by site-directed mutagenesis and assayed the resultant mutant proteins in vivo. In addition, we took advantage of the fact that the closest bacterial homolog of Fps1p, Escherichia coli GlpF, can be functionally expressed in yeast, thus enabling the analysis in yeast cells of mutations that make this typical major intrinsic protein more similar to Fps1p. We observed that mutations made in Fps1p to "restore" the signature NPA motifs did not substantially affect channel function. In contrast, when GlpF was mutated to resemble Fps1p, all mutants had reduced activity compared with wild type. We rationalized these data by constructing models of one GlpF mutant and of the transmembrane core of Fps1p. Our model predicts that the pore of Fps1p is more flexible than that of GlpF. We discuss the fact that this may accommodate the divergent NPA motifs of Fps1p and that the different pore structures of Fps1p and GlpF may reflect the physiological roles of the two glycerol facilitators.
Collapse
Affiliation(s)
- R M Bill
- Department of Cell and Molecular Biology/Microbiology, Göteborg University, S-40530 Göteborg, Sweden
| | | | | | | | | | | |
Collapse
|
154
|
Pang YP, Zheng XE, Weinshilboum RM. Theoretical 3D model of histamine N-methyltransferase: insights into the effects of a genetic polymorphism on enzymatic activity and thermal stability. Biochem Biophys Res Commun 2001; 287:204-8. [PMID: 11549275 DOI: 10.1006/bbrc.2001.5570] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Histamine N-methyltransferase (HNMT) catalyzes the N-methylation of histamine in mammals. The experimentally determined HNMT three-dimensional (3D) structure is not available. However, there is a common genetic polymorphism for human HNMT (Thr105Ile) that reduces enzymatic activity and is a risk factor for asthma. To obtain insights into mechanisms responsible for the effects of that polymorphism on enzymatic activity and thermal stability, we predicted the 3D structure of HNMT using the threading method and molecular dynamics simulations in water. Herein, we report a theoretical 3D model of human HNMT which reveals that polymorphic residue Thr105Ile is located in the turn between a beta strand and an alpha helix on the protein surface away from the active site of HNMT. Ile105 energetically destabilizes folded HNMT because of its low Chou-Fasman score for forming a turn conformation and the exposure of its hydrophobic side chain to aqueous solution. It thus promotes the formation of misfolded proteins that are prone to the clearance by proteasomes. This information explains, for the first time, how genetic polymorphisms can cause enhanced protein degradation and why the thermal stability of allozyme Ile105 is lower than that of Thr105. It also supports the hypothesis that the experimental observation of a significantly lower level of HNMT enzymatic activity for allozyme Ile105 than that with Thr105 is due to a decreased concentration of allozyme Ile105, but not an alternation of the active-site topology of HNMT caused by the difference at residue 105.
Collapse
Affiliation(s)
- Y P Pang
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Foundation for Medical Education and Research, 200 First Street SW, Rochester, MN 55905, USA.
| | | | | |
Collapse
|
155
|
Abstract
Current techniques for the prediction of side-chain conformations on a fixed backbone have an accuracy limit of about 1.0-1.5 A rmsd for core residues. We have carried out a detailed and systematic analysis of the factors that influence the prediction of side-chain conformation and, on this basis, have succeeded in extending the limits of side-chain prediction for core residues to about 0.7 A rmsd from native, and 94 % and 89 % of chi(1) and chi(1+2 ) dihedral angles correctly predicted to within 20 degrees of native, respectively. These results are obtained using a force-field that accounts for only van der Waals interactions and torsional potentials. Prediction accuracy is strongly dependent on the rotamer library used. That is, a complete and detailed rotamer library is essential. The greatest accuracy was obtained with an extensive rotamer library, containing over 7560 members, in which bond lengths and bond angles were taken from the database rather than simply assuming idealized values. Perhaps the most surprising finding is that the combinatorial problem normally associated with the prediction of the side-chain conformation does not appear to be important. This conclusion is based on the fact that the prediction of the conformation of a single side-chain with all others fixed in their native conformations is only slightly more accurate than the simultaneous prediction of all side-chain dihedral angles.
Collapse
Affiliation(s)
- Z Xiang
- Department of Biochemistry and Molecular Biophysics BB221, Columbia University, New York, NY 10032, USA
| | | |
Collapse
|
156
|
Jyothi S, Joshi RR. Protein structure determination by non-parametric regression and knowledge-based constraints. COMPUTERS & CHEMISTRY 2001; 25:283-99. [PMID: 11339411 DOI: 10.1016/s0097-8485(00)00104-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
We have devised a non-parametric regression-based approach for the estimation of small- and medium-range inter-residual three-dimensional (3d) distances in a protein using only the primary sequence as input. A multivariate analysis of variance technique is used to identify the attributes of the primary sequence that is most effective in determining the tertiary structure. Certain compactness and hydrophobic core building heuristics are used along with the estimated distances in a distance geometry program to predict the 3d-structure (tertiary fold). Our method is found to predict correctly the native topologies of small proteins having up to 150 residues. The sensitivity of the structures to long-range distance constraints is studied by incorporating a small number of NMR distance restraints. In terms of modularity, precision, accuracy and computational efficiency our method is found to be better in comparison with current computational methods like X-PLOR and DRAGON on the sample that was reported in the literature for the comparison of these two methods.
Collapse
Affiliation(s)
- S Jyothi
- Department of Mathematics, Indian Institute of Technology Bombay, Powai, Mumbai, India
| | | |
Collapse
|
157
|
Martí-Renom MA, Stuart AC, Fiser A, Sánchez R, Melo F, Sali A. Comparative protein structure modeling of genes and genomes. ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE 2001; 29:291-325. [PMID: 10940251 DOI: 10.1146/annurev.biophys.29.1.291] [Citation(s) in RCA: 2352] [Impact Index Per Article: 102.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Comparative modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. The number of protein sequences that can be modeled and the accuracy of the predictions are increasing steadily because of the growth in the number of known protein structures and because of the improvements in the modeling software. Further advances are necessary in recognizing weak sequence-structure similarities, aligning sequences with structures, modeling of rigid body shifts, distortions, loops and side chains, as well as detecting errors in a model. Despite these problems, it is currently possible to model with useful accuracy significant parts of approximately one third of all known protein sequences. The use of individual comparative models in biology is already rewarding and increasingly widespread. A major new challenge for comparative modeling is the integration of it with the torrents of data from genome sequencing projects as well as from functional and structural genomics. In particular, there is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes. Such large-scale modeling is likely to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.
Collapse
Affiliation(s)
- M A Martí-Renom
- Laboratories of Molecular Biophysics, Pels Family Center for Biochemistry and Structural Biology, Rockefeller University, New York, NY 10021, USA
| | | | | | | | | | | |
Collapse
|
158
|
Ebright RH. RNA polymerase: structural similarities between bacterial RNA polymerase and eukaryotic RNA polymerase II. J Mol Biol 2000; 304:687-98. [PMID: 11124018 DOI: 10.1006/jmbi.2000.4309] [Citation(s) in RCA: 203] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Bacterial RNA polymerase and eukaryotic RNA polymerase II exhibit striking structural similarities, including similarities in overall structure, relative positions of subunits, relative positions of functional determinants, and structures and folding topologies of subunits. These structural similarities are paralleled by similarities in mechanisms of interaction with DNA.
Collapse
Affiliation(s)
- R H Ebright
- Howard Hughes Medical Institute, Waksman Institute and Department of Chemistry Rutgers University, Piscataway, NJ 08854, USA.
| |
Collapse
|
159
|
van Hooft PA, Höltje HD. Construction of a full three-dimensional model of the transpeptidase domain of Streptococcus pneumoniae PBP2x starting from its Calpha-atom coordinates. J Comput Aided Mol Des 2000; 14:719-30. [PMID: 11131966 DOI: 10.1023/a:1008164914993] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
A new method is described for generating all-atom protein structures from Calpha-atom information. The method, which combines both local structural trace alignments and comparative side chain modeling with ab initio side chain modeling, makes use of both the virtual-bond and the dipole-path methods. Provided that 3D structures of structurally and functionally related proteins exist, the method presented here is highly suitable for generating all-atom coordinates of partly solved, low-resolution crystal structures. Particularly the active site region can be modeled accurately with this procedure, which enables investigation of the binding modes of different classes of ligands with molecular dynamics simulations. The method is applied to the trace of Streptococcus pneumoniae, in order to construct an all-atom structure of the transpeptidase domain. Since after generation of full coordinates of the transpeptidase domain the structure had been solved to 2.4 A resolution, new X-ray coordinates for the worst modeled loop (residues T370 to M386; 17 out of a total number of 351 residues constituting the transpeptidase domain) were incorporated, as kindly provided by Dr. Dideberg. The structure was relaxed with molecular dynamics simulations and simulated annealing methods. The RMS deviation between the 144 aligned Calpha-atoms and the corresponding ones in the originally solved 3.5 A resolution crystal structure was 0.98. The 351 Calpha-atoms of the whole transpeptidase domain of the final model showed an RMS deviation of 1.58. The Ramachandran plot showed that 79.3% of the residues are in the most favored regions, with only 1.0% occurring in disallowed regions. The model presented here can be used to investigate the three-dimensional influences of mutations around the active site of PBP2x.
Collapse
Affiliation(s)
- P A van Hooft
- Institut für Pharmazeutische Chemie, Heinrich-Heine Universität Düsseldorf, Germany
| | | |
Collapse
|
160
|
Fong S, Machajewski TD, Mak CC, Wong C. Directed evolution of D-2-keto-3-deoxy-6-phosphogluconate aldolase to new variants for the efficient synthesis of D- and L-sugars. CHEMISTRY & BIOLOGY 2000; 7:873-83. [PMID: 11094340 DOI: 10.1016/s1074-5521(00)00035-1] [Citation(s) in RCA: 122] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
BACKGROUND Exploitation and improvement of enzymes as catalysts for organic synthesis is of current interest in biocatalysis. A representative enzyme for investigation is the Escherichia coli D-2-keto-3-deoxy-6-phosphogluconate (KDPG) aldolase, which catalyzes the highly specific reversible aldol reaction using the D-configurated KDPG as substrate. RESULTS Using in vitro evolution, the aldolase has been converted into aldolases with improved catalytic efficiency, altered substrate specificity and stereoselectivity. In particular, some evolved aldolases capable of accepting both D- and L- glyceraldehyde in the non-phosphorylated form as substrates for reversible aldol reaction have been obtained, providing a new direction to the enzymatic synthesis of both D- and L-sugars. CONCLUSIONS This research has demonstrated the effectiveness of using in vitro evolution to rapidly alter the properties of an aldolase to improve its utility in asymmetric synthesis. The evolved aldolases, differing from the native enzyme which is highly phosphate- and D-sugar-dependent, catalyze the efficient synthesis of both D- and L-sugars from non-phosphorylated aldehydes and pyruvate. The principles and strategies described in this study should be applicable to other aldolases to further expand the scope of their synthetic utility.
Collapse
Affiliation(s)
- S Fong
- Department of Chemistry and the Skaggs Institute for Chemical Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | | | | | | |
Collapse
|
161
|
Feig M, Rotkiewicz P, Kolinski A, Skolnick J, Brooks CL. Accurate reconstruction of all-atom protein representations from side-chain-based low-resolution models. Proteins 2000; 41:86-97. [PMID: 10944396 DOI: 10.1002/1097-0134(20001001)41:1<86::aid-prot110>3.0.co;2-y] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
A procedure for the reconstruction of all-atom protein structures from side-chain center-based low-resolution models is introduced and applied to a set of test proteins with high-resolution X-ray structures. The accuracy of the rebuilt all-atom models is measured by root mean square deviations to the corresponding X-ray structures and percentages of correct chi(1) and chi(2) side-chain dihedrals. The benefit of including C(alpha) positions in the low-resolution model is examined, and the effect of lattice-based models on the reconstruction accuracy is discussed. Programs and scripts implementing the reconstruction procedure are made available through the NIH research resource for Multiscale Modeling Tools in Structural Biology (http://mmtsb.scripps.edu).
Collapse
Affiliation(s)
- M Feig
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | | | |
Collapse
|
162
|
|
163
|
Marrec-Fairley M, Piette A, Gallet X, Brasseur R, Hara H, Fraipont C, Ghuysen JM, Nguyen-Distèche M. Differential functionalities of amphiphilic peptide segments of the cell-septation penicillin-binding protein 3 of Escherichia coli. Mol Microbiol 2000; 37:1019-31. [PMID: 10972821 DOI: 10.1046/j.1365-2958.2000.02054.x] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The class B M1-V577 penicillin-binding protein (PBP) 3 of Escherichia coli consists of a M1-L39 membrane anchor (bearing a cytosolic tail) that is linked via a G40-S70 intervening peptide to an R71-I236 non-catalytic module (containing the conserved motifs 1-3) itself linked via motif 4 to a D237-V577 catalytic module (containing the conserved motifs 5-7 of the penicilloyl serine transferases superfamily). It has been proposed that during cell septation the peptidoglycan crosslinking activity of the acyl transferase module of PBP3 is regulated by the associated M1-I236 polypeptide itself in interaction with other components of the divisome. The fold adopted by the R71-V577 polypeptide of PBP3 has been modelled by reference to the corresponding R76-S634 polypeptide of the class B Streptococcus pneumoniae PBP2x. Based on these data and the results of site-directed mutagenesis of motifs 1-3 and of peptide segments of high amphiphilicity (identified from hydrophobic moment plots), the M1-I236 polypeptide of PBP3 appears to be precisely designed to work in the way proposed. The membrane anchor and the G40-S70 sequence (containing the G57-Q66 peptide segment) upstream from the non-catalytic module have the information ensuring that PBP3 undergoes proper insertion within the divisome at the cell septation site. Motif 1 and the I74-L82 overlapping peptide segment, motif 2 and the H160-G172 overlapping peptide segment, and the G188-D197 motif 3 are located at or close to the intermodule junction. They contain the information ensuring that PBP3 folds correctly and the acyl transferase catalytic centre adopts the active configuration. The E206-V217 peptide segment is exposed at the surface of the non-catalytic module. It has the information ensuring that PBP3 fulfils its cell septation activity within the fully complemented divisome.
Collapse
Affiliation(s)
- M Marrec-Fairley
- Centre d'Ingénierie des Protéines, Université de Liège, Institut de Chimie, B6, Sart Tilman, B-4000 Liège, Belgium.
| | | | | | | | | | | | | | | |
Collapse
|
164
|
Ogata K, Umeyama H. An automatic homology modeling method consisting of database searches and simulated annealing. J Mol Graph Model 2000; 18:258-72, 305-6. [PMID: 11021542 DOI: 10.1016/s1093-3263(00)00037-1] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We introduce a method of homology modeling consisting of database searches and simulated annealing. All processes involving searches for homologous proteins, alignment, the construction of Calpha atoms, construction of main-chain atoms, and the construction of side-chain atoms are performed automatically. In this method, main-chain conformations are generated from the weighted average of mainchain coordinates in reference proteins. The weight is defined by the local space homology representing the similarity of environmental residues at topologically equivalent positions in reference proteins. Side-chain conformations are generated for constructed main-chain atoms by database searches, and main-chain atoms are optimized for the fixed side-chain conformations. These two processes, i.e., the side-chain generation and main-chain optimization, are repeated several times. This type of construction provides a structure similar to the x-ray structure, in particular, for main-chain and side-chain atoms in the residues belonging to structurally conserved regions (SCRs). The accuracy of our method was evaluated for 14 proteins whose structures are known. The average root mean square deviation between models and x-ray structures was 2.29 A for all atoms, and the percentage of chi1 angles within 30 degrees was 72.6% for SCRs residues. Some models were in good agreement with their respective x-ray structures. Our method, which has the advantage of being automated, gives results similar to, or better than, published results for three widely used test proteins. Our software, FAMS, is available on the World Wide Web.
Collapse
Affiliation(s)
- K Ogata
- School of Pharmaceutical Sciences, Kitasato University, Tokyo, Japan
| | | |
Collapse
|
165
|
|
166
|
Chang YT, Loew G. Homology modeling, molecular dynamics simulations, and analysis of CYP119, a P450 enzyme from extreme acidothermophilic archaeon Sulfolobus solfataricus. Biochemistry 2000; 39:2484-98. [PMID: 10704198 DOI: 10.1021/bi991966u] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The recent characterization of a thermophilic and barophilic CYP119 from Sulfolobus solfataricus offers a new opportunity to identify the origin of its stability by comparing it with mesophilic P450s with known structures. Since the three-dimensional structure of CYP119 is not yet available, homology modeling techniques were used to build model structures for this enzyme. The overall quality and stability of the models were assessed using three protein analysis programs and by monitoring structural stability during 1 ns of molecular dynamics simulations at 300 and 390 K. The results show the CYP119 models to be of good quality. Possible origins of the thermo- and barostability of CYP119 were then investigated by examining the amino acid compositions and the three-dimensional structure of CYP119 compared with the five mesophilic templates. Three possible factors were identified that could contribute to the enhanced stability of CYP119. The first was the higher relative population of salt bridges and the presence of a few unique salt bridges found in CYP119 that were absent in all five template CYP450s. The second factor was a decreased population of Ala and an increased population of Ile found in the interior of CYP119, which are likely to improve packing in CYP119. The third factor was a more extensive aromatic cluster seen in CYP119 which was not found in all five template P450s. In addition, the model CYP119 three-dimensional structures were also used to determine key properties related to its function. Specifically, binding site residues and surface residues for redox partner interactions were identified. These residues identified together with those residues found that might contribute to the increased stability are suggested for future mutagenesis studies. The results obtained from these experimental studies shall then provide further validation of the suggested origins of stability and the structure-function relationships derived from the model structures of this enzyme.
Collapse
Affiliation(s)
- Y T Chang
- Molecular Research Institute, 2495 Old Middlefield Way, Mountain View, California 94043-2316, USA.
| | | |
Collapse
|
167
|
Jiang X, Farid H, Pistor E, Farid RS. A new approach to the design of uniquely folded thermally stable proteins. Protein Sci 2000; 9:403-16. [PMID: 10716193 PMCID: PMC2144549 DOI: 10.1110/ps.9.2.403] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
A new computer program (CORE) is described that predicts core hydrophobic sequences of predetermined target protein structures. A novel scoring function is employed, which for the first time incorporates parameters directly correlated to free energies of unfolding (deltaGu), melting temperatures (Tm), and cooperativity. Metropolis-driven simulated annealing and low-temperature Monte Carlo sampling are used to optimize this score, generating sequences predicted to yield uniquely folded, stable proteins with cooperative unfolding transitions. The hydrophobic core residues of four natural proteins were predicted using CORE with the backbone structure and solvent exposed residues as input. In the two smaller proteins tested (Gbeta1, 11 core amino acids; 434 cro, 10 core amino acids), the native sequence was regenerated as well as the sequence of known thermally stable variants that exhibit cooperative denaturation transitions. Previously designed sequences of variants with lower thermal stability and weaker cooperativity were not predicted. In the two larger proteins tested (myoglobin, 32 core amino acids; methionine aminopeptidase, 63 core amino acids), sequences with corresponding side-chain conformations remarkably similar to that of native were predicted.
Collapse
Affiliation(s)
- X Jiang
- Department of Chemistry, Rutgers, The State University of New Jersey, Newark 07102-1811, USA
| | | | | | | |
Collapse
|
168
|
Abstract
The CYP2C subfamily of human liver P450 isozymes is of major importance in drug metabolism. The most abundant 2C isozyme, CYP2C9, regioselectively hydroxylates a wide variety of substrates. A major obstacle to understanding this specificity in human CYP2C9 is the absence of a 3D structure. A 3D model of CYP2C9 was built, assessed, and used to characterize explicit enzyme-substrate complexes using methods previously developed in our laboratory. The 3D model was assessed by determining its stability to unconstrained molecular dynamics and by comparison of specific properties with those of known protein structures. The CYP2C9 model was then used to characterize explicit enzyme complexes with three structurally and chemically diverse substrates: (S)-naproxen, phenytoin, and progesterone. Each substrate was found to bind to the enzyme with a favorable interaction energy and to remain in the binding site during unconstrained molecular dynamics. Moreover, the mode of binding of each substrate led to calculated preferred hydroxylation sites consistent with experiment. Binding-site residues identified for the models included Arg 105 and Arg97 as key cationic residues, as well as Asn 202, Asp 293, Pro 101, Leu 102, Gly 296, and Phe 476. Site-specific mutations are proposed for further integrated computational and experimental study.
Collapse
Affiliation(s)
- V A Payne
- Molecular Research Institute, Mountain View, California 94043-2316, USA.
| | | | | |
Collapse
|
169
|
|
170
|
Abstract
The precision and accuracy of protein structures determined by nuclear magnetic resonance (NMR) spectroscopy depend on the completeness of input experimental data set. Typically, rather than a single structure, an ensemble of up to 20 equally representative conformers is generated and routinely deposited in the Protein Database. There are substantially more experimentally derived restraints available to define the main-chain coordinates than those of the side chains. Consequently, the side-chain conformations among the conformers are more variable and less well defined than those of the backbone. Even when a side chain is determined with high precision and is found to adopt very similar orientations among all the conformers in the ensemble, it is possible that its orientation might still be incorrect. Thus, it would be helpful if there were a method to assess independently the side-chain orientations determined by NMR. Recently, homology modeling by side-chain packing algorithms has been shown to be successful in predicting the side-chain conformations of the buried residues for a protein when the main-chain coordinates and sequence information are given. Since the main-chain coordinates determined by NMR are consistently more reliable than those of the side-chains, we have applied the side-chain packing algorithms to predict side-chain conformations that are compatible with the NMR-derived backbone. Using four test cases where the NMR solution structures and the X-ray crystal structure of the same protein are available, we demonstrate that the side-chain packing method can provide independent validation for the side-chain conformations of NMR structures. Comparison of the side-chain conformations derived by side-chain packing prediction and by NMR spectroscopy demonstrates that when there is agreement between the NMR model and the predicted model, on average 78% of the time the X-ray structure also concurs. While the side-chain packing method can confirm the reliable residue conformations in NMR models, more importantly, it can also identify the questionable residue conformations with an accuracy of 60%. This validation method can serve to increase the confidence level for potential users of structural models determined by NMR.
Collapse
Affiliation(s)
- S Y Chung
- Department of Biochemistry, Uniformed Services University of the Health Sciences, Bethesda, Maryland 20814, USA.
| | | |
Collapse
|
171
|
|
172
|
Abstract
BACKGROUND Atomic level rotamer libraries for sidechains in proteins have been proposed by several groups. Conformations of side groups in coarse-grained models, on the other hand, have not yet been analyzed, although low resolution approaches are the only efficient way to explore global structural features. RESULTS A residue-specific backbone-dependent library for sidechain isomers, compatible with a coarse-grained model, is proposed. The isomeric states are utilized in packing sidechains of known backbone structures. Sidechain positions are predicted with a root-mean-square deviation (r.m.s.d.) of 2.40 A with respect to crystal structure for 50 test proteins. The rmsd for core residues is 1.60 A and decreases to 1.35 A when conformational correlations and directional effects in inter-residue couplings are considered. CONCLUSIONS An automated method for assigning sidechain positions in coarse-grained model proteins is proposed and made available on the internet; the method accounts satisfactorily for sidechain packing, particularly in the core.
Collapse
Affiliation(s)
- O Keskin
- Chemical Engineering Department and Polymer Research Center, Bogazici University, Istanbul, Turkey
| | | |
Collapse
|
173
|
|
174
|
Simons KT, Ruczinski I, Kooperberg C, Fox BA, Bystroff C, Baker D. Improved recognition of native-like protein structures using a combination of sequence-dependent and sequence-independent features of proteins. Proteins 1999; 34:82-95. [PMID: 10336385 DOI: 10.1002/(sici)1097-0134(19990101)34:1<82::aid-prot7>3.0.co;2-a] [Citation(s) in RCA: 350] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
We describe the development of a scoring function based on the decomposition P(structure/sequence) proportional to P(sequence/structure) *P(structure), which outperforms previous scoring functions in correctly identifying native-like protein structures in large ensembles of compact decoys. The first term captures sequence-dependent features of protein structures, such as the burial of hydrophobic residues in the core, the second term, universal sequence-independent features, such as the assembly of beta-strands into beta-sheets. The efficacies of a wide variety of sequence-dependent and sequence-independent features of protein structures for recognizing native-like structures were systematically evaluated using ensembles of approximately 30,000 compact conformations with fixed secondary structure for each of 17 small protein domains. The best results were obtained using a core scoring function with P(sequence/structure) parameterized similarly to our previous work (Simons et al., J Mol Biol 1997;268:209-225] and P(structure) focused on secondary structure packing preferences; while several additional features had some discriminatory power on their own, they did not provide any additional discriminatory power when combined with the core scoring function. Our results, on both the training set and the independent decoy set of Park and Levitt (J Mol Biol 1996;258:367-392), suggest that this scoring function should contribute to the prediction of tertiary structure from knowledge of sequence and secondary structure.
Collapse
Affiliation(s)
- K T Simons
- Department of Biochemistry, University of Washington, Seattle 98195, USA
| | | | | | | | | | | |
Collapse
|
175
|
Ogata K, Umeyama H. The role played by environmental residues on sidechain torsional angles within homologous families of proteins: A new method of sidechain modeling. Proteins 1998. [DOI: 10.1002/(sici)1097-0134(19980601)31:4<355::aid-prot3>3.0.co;2-h] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
176
|
Wang Y, Huq HI, de la Cruz XF, Lee B. A new procedure for constructing peptides into a given Calpha chain. FOLDING & DESIGN 1998; 3:1-10. [PMID: 9502315 DOI: 10.1016/s1359-0278(98)00003-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
BACKGROUND In ab initio protein folding studies, it is often advantageous to build the Calpha chain first and then to construct the full structure by filling in the peptide groups and the sidechains. Many algorithms have been reported for constructing peptide groups on the Calpha chain, but most are unsuitable for use in such studies; some are too slow for screening a large number of trial Calpha chains and others use only the local geometry and ignore the effects of specific non-neighbor interactions, which can be crucial for proper folding. We needed a fast procedure for constructing the peptide groups that does not ignore the effects of long-range, specific interactions. RESULTS We first found rich correlations between the peptide orientation angle and both the local Calpha-chain geometry and the type of the flanking amino acid residues. These correlations can be used to greatly limit the range of possible peptide orientation angles. We devised a simple peptide construction procedure in which all orientations within this reduced range are systematically examined and the orientation is selected that minimizes a suitable energy function that includes long-range, specific interactions. When tested on known structures, the method is found to be among the fastest of known methods and attains an accuracy comparable with or better than most methods. CONCLUSIONS The new method is fast and takes into account both the local and non-local specific interactions. It therefore appears to be suitable for use in ab initio protein folding studies, wherein a large number of Calpha chains are screened.
Collapse
Affiliation(s)
- Y Wang
- Laboratory of Molecular Biology, Division of Basic Sciences, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-4255, USA
| | | | | | | |
Collapse
|
177
|
Karpusas M, Nolte M, Benton CB, Meier W, Lipscomb WN, Goelz S. The crystal structure of human interferon beta at 2.2-A resolution. Proc Natl Acad Sci U S A 1997; 94:11813-8. [PMID: 9342320 PMCID: PMC23607 DOI: 10.1073/pnas.94.22.11813] [Citation(s) in RCA: 161] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Type I interferons (IFNs) are helical cytokines that have diverse biological activities despite the fact that they appear to interact with the same receptor system. To achieve a better understanding of the structural basis for the different activities of alpha and beta IFNs, we have determined the crystal structure of glycosylated human IFN-beta at 2.2-A resolution by molecular replacement. The molecule adopts a fold similar to that of the previously determined structures of murine IFN-beta and human IFN-alpha2b but displays several distinct structural features. Like human IFN-alpha2b, human IFN-beta contains a zinc-binding site at the interface of the two molecules in the asymmetric unit, raising the question of functional relevance for IFN-beta dimers. However, unlike the human IFN-alpha2b dimer, in which homologous surfaces form the interface, human IFN-beta dimerizes with contact surfaces from opposite sides of the molecule. The relevance of the structure to the effects of point mutations in IFN-beta at specific exposed residues is discussed. A potential role of ligand-ligand interactions in the conformational assembly of IFN receptor components is discussed.
Collapse
Affiliation(s)
- M Karpusas
- Biogen, Inc., 12 Cambridge Center, Cambridge, MA 02142, USA.
| | | | | | | | | | | |
Collapse
|
178
|
Abstract
We have previously reported the development and evaluation of a computational program to assist in the design of hydrophobic cores of proteins. In an effort to investigate the role of core packing in protein structure, we have used this program, referred to as Repacking of Cores (ROC), to design several variants of the protein ubiquitin. Nine ubiquitin variants containing from three to eight hydrophobic core mutations were constructed, purified, and characterized in terms of their stability and their ability to adopt a uniquely folded native-like conformation. In general, designed ubiquitin variants are more stable than control variants in which the hydrophobic core was chosen randomly. However, in contrast to previous results with 434 cro, all designs are destabilized relative to the wild-type (WT) protein. This raises the possibility that beta-sheet structures have more stringent packing requirements than alpha-helical proteins. A more striking observation is that all variants, including random controls, adopt fairly well-defined conformations, regardless of their stability. This result supports conclusions from the cro studies that non-core residues contribute significantly to the conformational uniqueness of these proteins while core packing largely affects protein stability and has less impact on the nature or uniqueness of the fold. Concurrent with the above work, we used stability data on the nine ubiquitin variants to evaluate and improve the predictive ability of our core packing algorithm. Additional versions of the program were generated that differ in potential function parameters and sampling of side chain conformers. Reasonable correlations between experimental and predicted stabilities suggest the program will be useful in future studies to design variants with stabilities closer to that of the native protein. Taken together, the present study provides further clarification of the role of specific packing interactions in protein structure and stability, and demonstrates the benefit of using systematic computational methods to predict core packing arrangements for the design of proteins.
Collapse
Affiliation(s)
- G A Lazar
- Department of Molecular and Cell Biology, University of California at Berkeley 94720, USA
| | | | | |
Collapse
|
179
|
Plückthun A, Pack P. New protein engineering approaches to multivalent and bispecific antibody fragments. IMMUNOTECHNOLOGY : AN INTERNATIONAL JOURNAL OF IMMUNOLOGICAL ENGINEERING 1997; 3:83-105. [PMID: 9237094 DOI: 10.1016/s1380-2933(97)00067-5] [Citation(s) in RCA: 235] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Multivalency is one of the hallmarks of antibodies, by which enormous gains in functional affinity, and thereby improved performance in vivo and in a variety of in vitro assays are achieved. Improved in vivo targeting and more selective localization are another consequence of multivalency. We summarize recent progress in engineering multivalency from recombinant antibody fragments by using miniantibodies (scFv fragments linked with hinges and oligomerization domains), spontaneous scFv dimers with short linkers (diabodies), or chemically crosslinked antibody fragments. Directly related to this are efforts of bringing different binding sites together to create bispecific antibodies. For this purpose, chemically linked fragments, diabodies, scFv-scFv tandems and bispecific miniantibodies have been investigated. Progress in E. coli expression technology makes the amounts necessary for clinical studies now available for suitably engineered fragments. We foresee therapeutic advances from a modular, systematic approach to optimizing pharmacokinetics, stability and functional affinity, which should prove possible with the new recombinant molecular designs.
Collapse
Affiliation(s)
- A Plückthun
- Biochemisches Institut der Universität Zürich, Switzerland
| | | |
Collapse
|
180
|
Gan K, Alexander P, Coxon JM, McKinnon AJ, Worth GH. The reconstruction of a protein backbone from Cα coordinates. Biopolymers 1997. [DOI: 10.1002/(sici)1097-0282(19970405)41:4<381::aid-bip3>3.0.co;2-i] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
181
|
Gan K, Alexander P, Coxon JM, McKinnon AJ, Worth GH. Assignment of secondary structure from Cα coordinates. Biopolymers 1997. [DOI: 10.1002/(sici)1097-0282(19970405)41:4<367::aid-bip2>3.0.co;2-k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
182
|
|
183
|
Abstract
We present a new approach at the atomic level for the development of knowledge-based mean force potentials (MFPs) that can be used in fold recognition, ab initio structure prediction, comparative modelling and molecular recognition. Our method is based on atom-type definitions, raising the total frequency of the pairwise distributions and leading to very accurate and specific distance-dependent energy functions. Forty different heavy atom types were defined depending on their bond connectivity, chemical nature and location level (side-chain or backbone). Using this approach it has been possible to obtain average frequencies of pairwise contacts about 15 times higher than the ones obtained using the classic way of one heavy atom definition for each amino acid (i.e. alpha-carbon, beta-carbon, virtual centroid or virtual beta-carbon co-ordinates). In this paper we use this approach to develop a MFP that can be used in fold recognition and we compare it with a classic MFP at the amino acid level compiled from the alpha-carbon distances between the different amino acid pairs. Both potentials involve all the pairwise contacts extracted from a non-redundant folds database of 180 protein chains with a sequence identity threshold of 25%. The pairwise energy functions of the MFP at the atomic level have a deep and very well defined minimum for each pairwise interaction, in contrast to the same curves obtained from the MFP developed at the amino acid level, which generally have multiple minima with similar depth. Our results also show that this MFP is able to produce very similar energy profiles for couples of proteins that share a very low sequence identity but are closely related at the structural level. When these profiles are plotted considering the structure-structure alignment, they are mostly superimposed, showing a correlation with the structure-structure similarity. In the same test, the MFP at the amino acid level fails to produce similar profiles. We suggest that using this MFP at the atomic level in the last stages of fold recognition or threading, when some candidates are available, can improve the sequence-structure alignments and, therefore, the final models. We also discuss the possibility of using this approach in the development of new MFPs to be used in ab initio structure prediction, comparative modelling and molecular recognition procedures.
Collapse
Affiliation(s)
- F Melo
- Facultés Universitaires Notre-Dame de la Paix, Structural Molecular Biology, Namur, Belgium
| | | |
Collapse
|
184
|
|
185
|
De Maeyer M, Desmet J, Lasters I. All in one: a highly detailed rotamer library improves both accuracy and speed in the modelling of sidechains by dead-end elimination. FOLDING & DESIGN 1997; 2:53-66. [PMID: 9080199 DOI: 10.1016/s1359-0278(97)00006-0] [Citation(s) in RCA: 97] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
BACKGROUND About a decade ago, the concept of rotamer libraries was introduced to model sidechains given known mainchain coordinates. Since then, several groups have developed methods to handle the challenging combinatorial problem that is faced when searching rotamer libraries. To avoid a combinatorial explosion, the dead-end elimination method detects and eliminates rotamers that cannot be members of the global minimum energy conformation (GMEC). Several groups have applied and further developed this method in the fields of homology modelling and protein design. RESULTS This work addresses at the same time increased prediction accuracy and calculation speed improvements. The proposed enhancements allow the elimination of more than one-third of the possible rotameric states before applying the dead-end elimination method. This is achieved by using a highly detailed rotamer library allowing the safe application of an energy-based rejection criterion without risking the elimination of a GMEC rotamer. As a result, we gain both in modelling accuracy and in computational speed. Being completely automated, the current implementation of the dead-end elimination prediction of protein sidechains can be applied to the modelling of sidechains of proteins of any size on the high-end computer systems currently used in molecular modelling. The improved accuracy is highlighted in a comparative study on a collection of proteins of varying size for which score results have previously been published by multiple groups. Furthermore, we propose a new validation method for the scoring of the modelled structure versus the experimental data based upon the volume overlap of the predicted and observed sidechains. This overlap criterion is discussed in relation to the classic RMSD and the frequently used +/- 40 degrees window in comparing chi 1 and chi 2 angles. CONCLUSIONS We have shown that a very detailed library allows the introduction of a safe energy threshold rejection criterion, thereby increasing both the execution speed and the accuracy of the modelling program. We speculate that the current method will allow the sidechain prediction of medium-sized proteins and complex protein interfaces involving up to 150 residues on low-end desktop computers.
Collapse
Affiliation(s)
- M De Maeyer
- Center for Transgene Technology and Gene Therapy, Flanders Interuniversity Institute for Biotechnology, KU Leuven, Belgium
| | | | | |
Collapse
|
186
|
Abstract
The last stage of protein folding, the "endgame," involves the ordering of amino acid side-chains into a well defined and closely packed configuration. We review a number of topics related to this process. We first describe how the observed packing in protein crystal structures is measured. Such measurements show that the protein interior is packed exceptionally tightly, more so than the protein surface or surrounding solvent and even more efficiently than crystals of simple organic molecules. In vitro protein folding experiments also show that the protein is close-packed in solution and that the tight packing and intercalation of side-chains is a final and essential step in the folding pathway. These experimental observations, in turn, suggest that a folded protein structure can be described as a kind of three-dimensional jigsaw puzzle and that predicting side-chain packing is possible in the sense of solving this puzzle. The major difficulty that must be overcome in predicting side-chain packing is a combinatorial "explosion" in the number of possible configurations. There has been much recent progress towards overcoming this problem, and we survey a variety of the approaches. These approaches differ principally in whether they use ab initio (physical) or more knowledge-based methods, how they divide up and search conformational space, and how they evaluate candidate configurations (using scoring functions). The accuracy of side-chain prediction depends crucially on the (assumed) positioning of the main-chain. Methods for predicting main-chain conformation are, in a sense, not as developed as that for side-chains. We conclude by surveying these methods. As with side-chain prediction, there are a great variety of approaches, which differ in how they divide up and search space and in how they score candidate conformations.
Collapse
Affiliation(s)
- M Levitt
- Department of Structural Biology, Stanford University School of Medicine, California 94305, USA
| | | | | | | | | |
Collapse
|
187
|
Finzel BC. [12] LORE: Exploiting database of known structures. Methods Enzymol 1997; 277:230-42. [DOI: 10.1016/s0076-6879(97)77014-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2023]
|
188
|
Schneider R, de Daruvar A, Sander C. The HSSP database of protein structure-sequence alignments. Nucleic Acids Res 1997; 25:226-30. [PMID: 9016541 PMCID: PMC146419 DOI: 10.1093/nar/25.1.226] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
HSSP is a derived database merging structural (3-D) and sequence (1-D) information. For each protein of known 3-D structure from the Protein Data Bank (PDB), the database has a multiple sequence alignment of all available homologues and a sequence profile characteristic of the family. The list of homologues is the result of a database search in SwissProt using a position-weighted dynamic programming method for sequence profile alignment (MaxHom). The database is updated frequently. The listed homologues are very likely to have the same 3-D structure as the PDB protein to which they have been aligned. As a result, the database is not only a database of aligned sequence families, but also a database of implied secondary and tertiary structures covering 29% of all SwissProt-stored sequences.
Collapse
Affiliation(s)
- R Schneider
- Protein Design Group, European Molecular Biology Labratory, D-69012 Heidelberg, Germany.
| | | | | |
Collapse
|
189
|
Abstract
A common approach to protein modeling is to propose a backbone structure based on homology or threading and then to attempt to build side chains onto this backbone. A fast algorithm using the simple criteria of atomic overlap and overall rotamer probability is proposed for this purpose. The method was first tested in the context of exhaustive searches of side chain configuration space in protein cores and was then applied to all side chains in 49 proteins of known structure, using simulated annealing to sample space. The latter procedure obtains the correct rotamer for 57% and the correct chi 1 value for 74% of the 6751 residues in the sample. When low-temperature Monte-Carlo simulations are initiated from the results of the simulated-annealing processes, consensus configurations are obtained which exhibit slightly more accurate predictions. The Monte-Carlo procedure also allows converged side chain entropies to be calculated for all residues. These prove to be accurate indicators of prediction reliability. For example, the correct rotamer is obtained for 79% and the correct chi 1 value is obtained for 84% of the half of the sample residues exhibiting the lowest entropies. Side chain entropy and predictability are nearly completely uncorrelated with solvent-accessible area. Some precedents for and implications of this observation are discussed.
Collapse
Affiliation(s)
- P S Shenkin
- Department of Chemistry, Columbia University, New York 10027, USA
| | | | | |
Collapse
|
190
|
Abstract
Homology modeling of protein structures as a function of sequence breaks down at the twilight zone limit of sequence identity between the template and target proteins. Our results suggest that protein sequences that have diverged from a common ancestor beyond the twilight zone may adopt side-chain interactions that are very different from those endowed by the ancestral sequence.
Collapse
Affiliation(s)
- S Y Chung
- Department of Biochemistry, Uniformed Services University of the Health Sciences, Bethesda, MD 20814, USA.
| | | |
Collapse
|
191
|
Zhao D, Gilfoyle DJ, Smith AT, Loew GH. Refinement of 3D models of horseradish peroxidase isoenzyme C: predictions of 2D NMR assignments and substrate binding sites. Proteins 1996; 26:204-16. [PMID: 8916228 DOI: 10.1002/(sici)1097-0134(199610)26:2<204::aid-prot10>3.0.co;2-t] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
In this study, two alternative three-dimensional (3D) models of horseradish peroxidase (HRP-C)-differing mainly in the structure of a long untemplated insertion-were refined, systematically assessed, and used to make predictions that can both guide and be tested by future experimental studies. A key first step in the model-building process was a procedure for multiple sequence alignment based on structurally conserved regions and key conserved residues, including those side chains providing ligands to the two Ca2+ binding sites. The model refinements reported here include (1) optimization of side-chain conformations; (3) addition of structural waters using a template-independent procedure; (2) structural refinement of the untemplated 34 amino acid insertion located between the F and G helices, using both energy criteria and NMR data; (4) unconstrained energy optimization of the refined models. Using these procedures, two refined structures of HRP-C were obtained, differing mainly in the conformation of this long insertion. The presence of residues in this insertion that could potentially interact with bound substrates suggests a functional role that may be related to the general ability of class III peroxidases to form stable 1:1 complexes with a variety of substrates. The structural validity of the models was systematically assessed by a variety of criteria. Most notably, the ProsaII z scores and Profiles 3D scores of the two HRP-C models indicated that they are significantly better than would be obtained by simple amino acid replacement, using any of the known structures as a template. These two 3D HRP-C models, were then used to predict candidate residues for the assignment of NOESY cross-peaks previously noted in 2D-NMR studies. Specifically, the residues known as Ile X, Phe A, Phe B, aliphatic residue Q, and Ile T. Candidate substrate binding sites were also identified and compared with experimentally based predictions. This work is timely because new X-ray structures are anticipated that will facilitate the validation of these procedures.
Collapse
Affiliation(s)
- D Zhao
- Molecular Research Institute, Palo Alto, California 94304, USA
| | | | | | | |
Collapse
|
192
|
Hooft RW, Vriend G. Improved coordinate reconstruction from stereo diagrams. JOURNAL OF MOLECULAR GRAPHICS 1996; 14:168-72. [PMID: 8901644 DOI: 10.1016/s0263-7855(96)00055-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
A program has been written that reconstructs three-dimensional coordinates for a protein structure given a stereo C alpha diagram. Initial three-dimensional coordinates are determined using an algorithm similar to the one used by Rossmann in the program STEREO. Thereafter the coordinates are refined such that the stereo image based on the reconstructed three-dimensional coordinates optimally fits the scanned stereo image while normal C alpha stereochemistry is enforced.
Collapse
|
193
|
Comparing theoretical and experimental backbone-dependent sidechain conformational preferences for linear, branched, aromatic and polar residues. Chem Phys 1996. [DOI: 10.1016/0301-0104(95)00361-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
194
|
Abstract
Over the past few years, a number of methods for the calculation of side-chain conformations in proteins have been described. More recent studies have considered the effect of combinatorial packing, derivations from idealized rotameric structures and, to a limited extent, backbone flexibility on the quality and efficiency of calculations of protein side-chain conformation. Although further work is needed to address the issue of backbone displacements, the recent progress solves the packing problem to a significant degree. This opens the way for fruitful incorporation of these methods into general procedures for homology modeling and studies of ligand-protein interactions.
Collapse
Affiliation(s)
- M Vásquez
- Protein Design Labs Inc, Mountain View, CA 94043, USA.
| |
Collapse
|
195
|
|
196
|
Abstract
In the past years, much effort has been put on the development of new methodologies and algorithms for the prediction of protein secondary and tertiary structures from (sequence) data; this is reviewed in detail. New approaches for these predictions such as neural network methods, genetic algorithms, machine learning, and graph theoretical methods are discussed. Secondary structure prediction algorithms were improved mostly by considering families of related proteins; however, for the reliable tertiary structure modeling of proteins, knowledge-based techniques are still preferred. Methods and examples with more or less successful results are described. Also, programs and parameterizations for energy minimisations, molecular dynamics, and electrostatic interactions have been improved, especially with respect to their former limits of applicability. Other topics discussed in this review include the use of traditional and on-line databases, the docking problem and surface properties of biomolecules, packing of protein cores, de novo design and protein engineering, prediction of membrane protein structures, the verification and reliability of model structures, and progress made with currently available software and computer hardware. In summary, the prediction of the structure, function, and other properties of a protein is still possible only within limits, but these limits continue to be moved.
Collapse
Affiliation(s)
- G Böhm
- Institut für Biotechnologie, Martin-Luther-Universität Halle-Wittenberg, Germany
| |
Collapse
|
197
|
Abstract
More than 60 years after the analyses by Linderstrom-Lang and Kirkwood of their hypothetical 'protein' structures, we have now a plethora of experimental evidence and computational estimates of the electrostatic forces in proteins, with very many protein 3D structures at atomic resolution. In the mean time, there were in the beginning, many arguments and suggestions about the roles of electrostatics, mainly from empirical findings and tendencies. A few experimental results indicated that the electrostatic contribution is of the order of several kcal/mol, which was theoretically difficult to reproduce correctly, because a large opposing reaction field should be subtracted from a large, direct Coulombic field. Although the importance of the reaction field was recognized even 70 years ago, appropriate applications to protein molecules were made only in this decade, with the development of numerical computation. Now, an electrostatic molecular surface is one of the most popular pictures in journals of structural biology, indicating that the electrostatic force is one of the important components contributing to molecular recognition, which is a major focus of current biology and biochemistry. The development of NMR techniques has made it possible to observe the individual ionizations of ionizable groups in a protein, in addition to the determination of the 3D structure. Since it does not require any additional probe, each charge state can report the very local and heterogeneous electrostatic potentials working in the protein, without disturbing the original field. From the pKa values, the contributions of electrostatic interactions, ion pairs, charge-dipole interactions, and hydrogen bonds to protein stability have been correctly evaluated. Protein engineering also provides much more information than that obtainable from the native proteins, as the residues concerned can now be easily substituted with other amino acid residues having electrostatically different characteristics. Those experimental results have revealed smaller contributions than previously expected, probably because we underestimated the reaction field effects. Especially, a single ion pair stabilizes a protein only slightly, although a cooperative salt-bridge network can contribute significantly to protein stability. Marginal stabilities of proteins arise from small difference between many factors with driving and opposing forces. In spite of the small contribution of each single electrostatic interaction to the protein stability, the sum of their actions works to maintain the specific 3D structure of the protein. The 'negative' roles of electrostatics, which might destabilize protein conformation, should be pointed out. Unpaired buried charges are energetically too expensive to exit in the hydrophobic core. Isolated hydrogen bond donors and acceptors also exert negative effects, but they are not as expensive as the unpaired buried charges, with costs of a few kcal/mol. Therefore, statistical analyses of protein 3D structures reveal only rare instances of isolated hydrogen bond donors and acceptors. This must be the main reason why alpha-helices and beta-sheets are only observed in protein cores as the backbone structures. Such secondary structures do not leave any backbone hydrogen donors or acceptors unpaired, because of their intrinsically regular packing. Otherwise, it might be very difficult to construct a backbone structure, in which all the backbone amide and carbonyl groups had their own hydrogen bond partners in the protein core. There are two theoretical approaches to protein electrostatics, the macroscopic or continuum model, and the microscopic or molecular model. As described in this article, the macroscopic model has inherent problems because the protein-solvent system is very hetergeneous from the physical point of view...
Collapse
Affiliation(s)
- H Nakamura
- Protein Engineering Research Institute, Osaka, Japan
| |
Collapse
|
198
|
Lee C. Testing homology modeling on mutant proteins: predicting structural and thermodynamic effects in the Ala98-->Val mutants of T4 lysozyme. FOLDING & DESIGN 1996; 1:1-12. [PMID: 9079358 DOI: 10.1016/s1359-0278(96)00006-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
BACKGROUND Current approaches to homology modeling predict how amino acid substitutions will alter a protein's structure, primarily by modeling sidechain conformations upon essentially immobile backbone frameworks. However, recent crystal structures of T4 lysozyme mutants reveal significant shifts of the mainchain and other potentially serious problems for sidechain rotamer-based modeling. This paper evaluates the accuracy of structural and thermodynamic predictions from two common sidechain modeling approaches to measure errors caused by the fixed-backbone approximation. RESULTS Tested on a series of T4 lysozyme mutants, this sidechain rotamer library approach did not handle mainchain shifts well, correctly predicting the sidechain conformations of only two of six mutants. By contrast, allowing sidechains to move more flexibly appeared to compensate for the rigidity of the mainchain and gave reasonably accurate coordinate predictions (rms errors of 0.5-1.0 A for each mutated sidechain), better on average than 90% of possible conformations. The calculated packing energies correlated well with experimental stabilities (r2 = 0.81) and correctly captured the cooperative interactions of several neighboring mutations. CONCLUSIONS Mutant modeling can be relatively accurate despite the fixed-backbone approximation. Mainchain shifts (0.2-0.5 A) cause increased sidechain coordinate errors of 0.1-0.8 A, torsional errors of 10-30 degrees, and exaggerated strain energy for overpacked mutants, compared with the same calculations performed with the correct mutant backbones.
Collapse
Affiliation(s)
- C Lee
- Department of Chemistry, University Stanford, California, USA.
| |
Collapse
|
199
|
Abstract
The understanding of structure-function relationship of enzymes requires detailed information of their three-dimensional structure. Protein structure determination by X-ray and NMR methods, the two most frequently used experimental procedures, are often difficult and time-consuming. Thus computer modeling of protein structures has become an increasingly active and attractive option for obtaining predictive models of three-dimensional protein structures. Specifically, for the ubiquitous metabolizing heme proteins, the cytochrome P450s, the X-ray structures of four isozymes of bacterial origin, P450cam, P450terp, P450BM-3 and P450eryF have now been determined. However, attempts to obtain the structure of mammalian forms by experimental means have thus far not been successful. Thus, there have been numerous attempts to construct models of mammalian P450s using homology modeling methods in which the known structures have been used to various extents and in various strategies to build models of P450 isozymes. In this paper, we review these efforts and then describe a strategy for structure building and assessment of 3D models of P450s recently developed in our laboratory that corrects many of the weaknesses in the previous procedures. The results are 3D models that for the first time are stable to unconstrained molecular dynamics simulations. The use of this method is demonstrated by the construction and validation of a 3D model for rabbit liver microsomal P450 isozyme 2B4, responsible for the oxidative metabolism of diverse xenobiotics including widely used inhalation anesthetics. Using this 2B4 model, the substrate access channel, substrate binding site and plausible surface regions for binding with P450 redox partners were identified.
Collapse
Affiliation(s)
- Y T Chang
- Molecular Research Institute, Palo Alto, CA 94304, USA
| | | | | |
Collapse
|
200
|
Abstract
HSSP is a derived database merging structural three dimensional (3-D) and sequence one dimensional(1-D) information. For each protein of known 3-D structure from the Protein Data Bank (PDB), the database has a multiple sequence alignment of all available homologues and a sequence profile characteristic of the family. The list of homologues is the result of a database search in Swissprot using a position-weighted dynamic programming method for sequence profile alignment (MaxHom). The database is updated frequently. The listed homologues are very likely to have the same 3-D structure as the PDB protein to which they have been aligned. As a result, the database is not only a database of aligned sequence families, but also a database of implied secondary and tertiary structures covering 27% of all Swissprot-stored sequences.
Collapse
Affiliation(s)
- R Schneider
- Protein Design Group, European Molecular Biology Laboratory, Heidelberg, Germany
| | | |
Collapse
|