1
|
Lu WW, Huang RB, Wei YT, Meng JZ, Du LQ, Du QS. Statistical energy potential: reduced representation of Dehouck–Gilis–Rooman function by selecting against decoy datasets. Amino Acids 2012; 42:2353-61. [DOI: 10.1007/s00726-011-0977-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2010] [Accepted: 07/06/2011] [Indexed: 11/24/2022]
|
2
|
Goldstein M, Fredj E, Gerber RB. A new hybrid algorithm for finding the lowest minima of potential surfaces: Approach and application to peptides. J Comput Chem 2011; 32:1785-800. [DOI: 10.1002/jcc.21755] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2010] [Revised: 11/13/2010] [Accepted: 12/18/2010] [Indexed: 11/11/2022]
|
3
|
Tramontano A. Comparative modelling techniques: where are we? Comp Funct Genomics 2010; 4:402-5. [PMID: 18629085 PMCID: PMC2447371 DOI: 10.1002/cfg.306] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2003] [Revised: 06/02/2003] [Accepted: 06/03/2003] [Indexed: 11/21/2022] Open
Abstract
The enormous increase in data availability brought about by genomic projects is paralleled by an equally unprecedented increase in the expectations for new medical,
pharmacological, environmental and biotechnological discoveries. Whether or not we
will be able to meet (at least partially) these expectations will depend on how well
we will be able to interpret the data and translate the mono-dimensional information
encrypted in genomes into a detailed understanding of its biological meaning at the
phenotypic level. The process is far from being trivial, and the obstacles along the
road are formidable: even the problem of identifying coding regions in eukaryotic
genomes is not completely solved. Far more complex is identification of the function of
the encoded proteins, and this will probably represent the most challenging problem
for the next generations of scientists.
Collapse
Affiliation(s)
- Anna Tramontano
- Department of Biochemical Sciences A. Rossi Fanelli, University of Rome La Sapienza, Rome 00185, Italy.
| |
Collapse
|
4
|
Lobanov MY, Finkel’shtein AV. Analogy-based protein structure prediction: III. Optimizing the combination of the substitution matrix and pseudopotentials used to align protein sequences with spatial structures. Mol Biol 2010. [DOI: 10.1134/s0026893310010140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
5
|
Pal A, Bahadur RP, Ray PS, Chakrabarti P. Accessibility and partner number of protein residues, their relationship and a webserver, ContPlot for their display. BMC Bioinformatics 2009; 10:103. [PMID: 19356223 PMCID: PMC2680847 DOI: 10.1186/1471-2105-10-103] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2008] [Accepted: 04/08/2009] [Indexed: 11/29/2022] Open
Abstract
Background Depending on chemical features residues have preferred locations – interior or exterior – in protein structures, which also determine how many other residues are found around them. The close packing of residues is the hallmark of protein interior and protein-protein interaction sites. Results The average values of accessible surface area (ASA) and partner number (PN, the number of other residues within a distance of 4.5 Å from any atom of a given residue) of different residues have been determined and a webserver, ContPlot has been designed to display these values (relative to the average values) along the protein sequence. This would be useful to visually identify residues that are densely packed, or those involved in protein-protein interactions. The skewness observed in the distribution of PNs is indicative of the hydrophobic or hydrophilic nature of the residue. The variation of ASA with PN can be analytically expressed in terms of a cubic equation. These equations (one for each residue) can be used to estimate the ASA of a polypeptide chain using the PNs of the individual residues in the structure. Conclusion The atom-based PNs (obtained by counting surrounding atoms) are highly correlated to the residue-based PN, indicating that the latter can adequately capture the atomic details of packing. The average values of ASA and PN associated with each residue should be useful in protein structure prediction or fold-recognition algorithm. ContPlot would provide a handy tool to assess the importance of a residue in the protein structure or interaction site.
Collapse
Affiliation(s)
- Arumay Pal
- Department of Biochemistry, Bose Institute, P-1/12 CIT Scheme VIIM, Kolkata, India.
| | | | | | | |
Collapse
|
6
|
Vassura M, Margara L, Fariselli P, Casadio R. A graph theoretic approach to protein structure selection. Artif Intell Med 2009; 45:229-37. [DOI: 10.1016/j.artmed.2008.07.016] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2007] [Revised: 07/25/2008] [Accepted: 07/26/2008] [Indexed: 11/28/2022]
|
7
|
Hoppe C, Schomburg D. Prediction of protein thermostability with a direction- and distance-dependent knowledge-based potential. Protein Sci 2005; 14:2682-92. [PMID: 16155198 PMCID: PMC2253293 DOI: 10.1110/ps.04940705] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The increasing use of enzymes in industrial processes and the importance of understanding protein folding and stability have led to several attempts to predict and quantify the effect of every possible amino acid exchange (mutation) on the thermostability of proteins. In this article we describe a knowledge-based discrimination function that acts as a fast and reliable guide in protein engineering and optimization. The function used consists of two parts, a pairwise energy function based on a distance- and direction-dependent atomic description of the amino acid environment, and a torsion angle energy function. In a first step a training set of 11 proteins including 646 mutant proteins with experimentally determined thermostability was used to optimize the knowledge-based energy functions. The resulting potential function was then tested using a test mutant database consisting of 918 various point mutations introduced in 27 proteins. The best correlation coefficient obtained for the experimental data and the predicted thermostability for the training set is r = 0.81 (561 data points). A total of 76% of the mutations could be predicted correctly as being either stabilizing or destabilizing. The results for the test set are r = 0.74 (747 data points) and 72%, respectively. The global correlation over the combined data (1308 mutants) obtained is 0.78.
Collapse
Affiliation(s)
- Christian Hoppe
- Institut für Biochemie, Zülpicher Strasse 47, 50674 Köln, Germany
| | | |
Collapse
|
8
|
Tramontano A, Morea V. Exploiting evolutionary relationships for predicting protein structures. Biotechnol Bioeng 2004; 84:756-62. [PMID: 14708116 DOI: 10.1002/bit.10850] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
In the last few years there have been many developments in computational biology, particularly with regard to novel, imaginative exploitation of genomic data. Disappointingly, there has been a lack of progress in the methodology for prediction of protein structures. In the last several years, however, promising new methods have finally begun to emerge. These methods are increasing the power and scope of the methodology, but, most importantly, they are generating new areas of investigation that we believe will accelerate progress in the field. In this review we describe recent developments and highlight the implications of their success as well as areas where efforts should be focused.
Collapse
Affiliation(s)
- Anna Tramontano
- Department of Biochemical Sciences A. Rossi Fanelli, University La Sapienza, P. le Aldo Moro 5, 00185 Rome, Italy.
| | | |
Collapse
|
9
|
|
10
|
Abstract
Here we present a simplified form of threading that uses only a 20 x 20 two-body residue-based potential and restricted number of gaps. Despite its simplicity and transparency the Monte Carlo-based threading algorithm performs very well in a rigorous test of fold recognition. The results suggest that by simplifying and constraining the decoy space, one can achieve better fold recognition. Fold recognition results are compared with and supplemented by a PSI-BLAST search. The statistical significance of threading results is rigorously evaluated from statistics of extremes by comparison with optimal alignments of a large set of randomly shuffled sequences. The statistical theory, based on the Random Energy Model, yields a cumulative statistical parameter, epsilon, that attests to the likelihood of correct fold recognition. A large epsilon indicates a significant energy gap between the optimal alignment and decoy alignments and, consequently, a high probability that the fold is correctly recognized. For a particular number of gaps, the epsilon parameter reaches its maximal value, and the fold is recognized. As the number of gaps further increases, the likelihood of correct fold recognition drops off. This is because the decoy space is small when gaps are restricted to a small number, but the native alignment is still well approximated, whereas unrestricted increase of the number of gaps leads to rapid growth of the number of decoys and their statistical dominance over the correct alignment. It is shown that best results are obtained when a combination of one-, two-, and three-gap threading is used. To this end, use of the epsilon parameter is crucial for rigorous comparison of results across the different decoy spaces belonging to a different number of gaps.
Collapse
Affiliation(s)
- William Chen
- Department of Biophysics, Harvard University, Boston, Massachusetts, USA
| | | | | |
Collapse
|
11
|
Angelov B, Sadoc JF, Jullien R, Soyer A, Mornon JP, Chomilier J. Nonatomic solvent-driven Voronoi tessellation of proteins: an open tool to analyze protein folds. Proteins 2002; 49:446-56. [PMID: 12402355 DOI: 10.1002/prot.10220] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
A three-dimensional Voronoi tessellation of folded proteins is used to analyze geometrical and topological properties of a set of proteins. To each amino acid is associated a central point surrounded by a Voronoi cell. Voronoi cells describe the packing of the amino acids. Special attention is given to reproduction of the protein surface. Once the Voronoi cells are built, a lot of tools from geometrical analysis can be applied to investigate the protein structure; volume of cells, number of faces per cell, and number of sides per face are the usual signatures of the protein structure. A distinct difference between faces related to primary, secondary, and tertiary structures has been observed. Faces threaded by the main-chain have on average more than six edges, whereas those related to helical packing of the amino acid chain have less than five edges. The faces on the protein surface have on average five edges within 1% error. The average number of faces on the protein surface for a given type of amino acid brings a new point of view in the characterization of the exposition to the solvent and the classification of amino acid as hydrophilic or hydrophobic. It may be a convenient tool for model validation.
Collapse
Affiliation(s)
- Borislav Angelov
- Laboratoire de Physique des Solides, Université Paris 11, Orsay, France
| | | | | | | | | | | |
Collapse
|
12
|
Abstract
The level of sequence similarity that implies similarity in protein structure is well established. Recently, many groups proposed thresholds for similarity in sequence implying similarity in enzymatic function. All previous results suggest the strong conservation of enzymatic function above levels of 50% pairwise sequence identity. Here, I argue that all groups substantially overestimated the conservation of enzyme function because their data sets were either too biased, or too small. An unbiased analysis suggested that less than 30% of the pair fragments above 50% sequence identity have entirely identical EC numbers. Another surprising finding was that even BLAST E-values below 10(-50) did not suffice to automatically transfer enzyme function without errors. As expected, most misclassifications originated from similarities in relatively short regions and/or from transferring annotations for different domains. Both problems cannot be corrected easily by adjusting the thresholds for automatic transfer of genome annotations. A score relating sequence identity to alignment length (distance from HSSP-threshold) outperformed statistical BLAST scores for high sequence similarity. In particular, the distance score allowed error-free transfer of enzyme function for the 10% most similar enzyme pairs. The results illustrated how difficult it is to assess the conservation of protein function and to guarantee error-free genome annotations, in general: sets with millions of pair comparisons might not suffice to arrive at statistically significant conclusions. In practice, the revised detailed estimates for the sequence conservation of enzyme function may provide important benchmarks for everyday sequence analysis and for more cautious automatic genome annotations.
Collapse
Affiliation(s)
- Burkhard Rost
- CUBIC, Department of Biochemistry and Molecular Biophysics, Columbia University, 650 West 168th Street BB217, New York, NY 10032, USA.
| |
Collapse
|
13
|
Getz G, Vendruscolo M, Sachs D, Domany E. Automated assignment of SCOP and CATH protein structure classifications from FSSP scores. Proteins 2002; 46:405-15. [PMID: 11835515 DOI: 10.1002/prot.1176] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
We present an automated procedure to assign CATH and SCOP classifications to proteins whose FSSP score is available. CATH classification is assigned down to the topology level, and SCOP classification is assigned to the fold level. Because the FSSP database is updated weekly, this method makes it possible to update also CATH and SCOP with the same frequency. Our predictions have a nearly perfect success rate when ambiguous cases are discarded. These ambiguous cases are intrinsic in any protein structure classification that relies on structural information alone. Hence, we introduce the "twilight zone for structure classification." We further suggest that to resolve these ambiguous cases, other criteria of classification, based also on information about sequence and function, must be used.
Collapse
Affiliation(s)
- Gad Getz
- Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot, Israel
| | | | | | | |
Collapse
|
14
|
de la Cruz X, Sillitoe I, Orengo C. Use of structure comparison methods for the refinement of protein structure predictions. I. Identifying the structural family of a protein from low-resolution models. Proteins 2002; 46:72-84. [PMID: 11746704 DOI: 10.1002/prot.10002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Predicting the three-dimensional structure of proteins is still one of the most challenging problems in molecular biology. Despite its difficulty, several investigators have started to produce consistently low-resolution predictions for small proteins. However, in most of these cases, the prediction accuracy is still too low to make them useful. In the present article, we address the problem of obtaining better-quality predictions, starting from low-resolution models. To this end, we have devised a new procedure that uses these models, together with structure comparison methods, to identify the structural family of the target protein. This would allow, in a second step not described in the present work, to refine the predictions using conserved features of the identified family. In our approach, the structure database is investigated using predictions, at different accuracy levels, for a given protein. As query structures, we used both low-resolution versions of the native structures, as well as different sets of low accuracy predictions. In general, we found that for predictions with a resolution of > or =5-7 A, structure comparison methods were able to identify the fold of a protein in the top positions.
Collapse
Affiliation(s)
- Xavier de la Cruz
- Departmento de Bioquímica y Biología Molecular Facultad de Químicas; Universidad de Barcelona, Barcelona, Spain.
| | | | | |
Collapse
|
15
|
Li QZ, Lu ZQ. The prediction of the structural class of protein: application of the measure of diversity. J Theor Biol 2001; 213:493-502. [PMID: 11735294 DOI: 10.1006/jtbi.2001.2441] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Based on the concept that the structural class of a protein is mainly determined by its secondary structure sequence, a new algorithm for prediction of the structural class of a protein is proposed. By use of the number of alpha -helices, beta -strands, and betaalphabeta fragments, the structural class of a protein can be predicted by an algorithm based on the increment of diversity (ID), in which the sole prediction parameter-the increment of diversity is used as the index of prediction of structural class of a protein. The results indicate that the high rates of correct prediction are obtained for complete set (standard set) from Brookhaven Protein Data Bank-CD ROM (PDB) published in October 1995 and the test set newly released from Brookhaven Protein Data Bank-CD ROM (PDB) before July 1998, respectively.
Collapse
Affiliation(s)
- Q Z Li
- Laboratory of Theoretical Biophysics, Inner Mongolia University, Hohhot, 010021, China.
| | | |
Collapse
|
16
|
Pasquier C, Promponas VJ, Hamodrakas SJ. PRED-CLASS: cascading neural networks for generalized protein classification and genome-wide applications. Proteins 2001; 44:361-9. [PMID: 11455609 DOI: 10.1002/prot.1101] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
A cascading system of hierarchical, artificial neural networks (named PRED-CLASS) is presented for the generalized classification of proteins into four distinct classes-transmembrane, fibrous, globular, and mixed-from information solely encoded in their amino acid sequences. The architecture of the individual component networks is kept very simple, reducing the number of free parameters (network synaptic weights) for faster training, improved generalization, and the avoidance of data overfitting. Capturing information from as few as 50 protein sequences spread among the four target classes (6 transmembrane, 10 fibrous, 13 globular, and 17 mixed), PRED-CLASS was able to obtain 371 correct predictions out of a set of 387 proteins (success rate approximately 96%) unambiguously assigned into one of the target classes. The application of PRED-CLASS to several test sets and complete proteomes of several organisms demonstrates that such a method could serve as a valuable tool in the annotation of genomic open reading frames with no functional assignment or as a preliminary step in fold recognition and ab initio structure prediction methods. Detailed results obtained for various data sets and completed genomes, along with a web sever running the PRED-CLASS algorithm, can be accessed over the World Wide Web at http://o2.biol.uoa.gr/PRED-CLASS.
Collapse
Affiliation(s)
- C Pasquier
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens, Greece
| | | | | |
Collapse
|
17
|
|
18
|
Abstract
A homology-based structure prediction method ideally gives both a correct fold assignment and an accurate query-template alignment. In this article we show that the combination of two existing methods, PSI-BLAST and threading, leads to significant enhancement in the success rate of fold recognition. The combined approach, termed COBLATH, also yields much higher alignment accuracy than found in previous studies. It consists of two-way searches both by PSI-BLAST and by threading. In the PSI-BLAST portion, a query is used to search for hits in a library of potential templates and, conversely, each potential template is used to search for hits in a library of queries. In the threading portion, the scoring function is the sum of a sequence profile and a 6x6 substitution matrix between predicted query and known template secondary structure and solvent exposure. "Two-way" in threading means that the query's sequence profile is used to match the sequences of all potential templates and the sequence profiles of all potential templates are used to match the query's sequence. When tested on a set of 533 nonhomologous proteins, COBLATH was able to assign folds for 390 (73%). Among these 390 queries, 265 (68%) had root-mean-square deviations (RMSDs) of less than 8 A between predicted and actual structures. Such high success rate and accuracy make COBLATH an ideal tool for structural genomics.
Collapse
Affiliation(s)
- Y Shan
- Department of Physics, Drexel University, Philadelphia, Pennsylvania 19104, USA
| | | | | |
Collapse
|
19
|
Rykunov D, Lobanov M, Finkelstein A. Search for the most stable folds of protein chains: III. Improvement in fold recognition by averaging over homologous sequences and 3D structures. Proteins 2000. [DOI: 10.1002/1097-0134(20000815)40:3<494::aid-prot160>3.0.co;2-#] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- D.S. Rykunov
- Institute of Theoretical & Experimental Biophysics, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
| | - M.Yu. Lobanov
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
| | - A.V. Finkelstein
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
| |
Collapse
|
20
|
|
21
|
|
22
|
Abstract
The study of the plant oncogene rolA has been hampered by a lack of structural information. Here we show that, despite a lack of significant sequence similarity to proteins of known structure, the rolA sequence adopts a known fold; that of the papillomavirus E2 DNA-binding domain. This fold is reliably identified by modern threading programs, which consider predicted secondary structure, but not by others. Although the rolA sequence is only around 16% identical to those of the available template structures, a structural model could be built that performed well against protein structure verification programs. The adopted strategy involved alignment corrections, justified by multiple model building and evaluation, with particular attention paid to the hydrophobic core residues. We find that rolA protein is predicted to resemble the template proteins in two key aspects; existence as a dimer and ability to bind DNA. rolA protein has recently been shown experimentally to possess DNA binding ability. This model predicts Lys 24 and Arg 27 to be involved in sequence-specific interactions and eight other residues to hydrogen-bond phosphate groups of the DNA.
Collapse
Affiliation(s)
- D J Rigden
- National Centre of Genetic Resources and Biotechnology, Cenargen/Embrapa, Brasilia, Brazil.
| | | |
Collapse
|
23
|
Abstract
We studied the possibility to approximate a Lennard-Jones interaction by a pairwise contact potential. First we used a Lennard-Jones potential to design off-lattice, protein-like heteropolymer sequences, whose lowest energy (native) conformations were then identified by molecular dynamics. Then we turned to investigate whether one can find a pairwise contact potential, whose ground states are the contact maps associated with these native conformations. We show that such a requirement cannot be satisfied exactly, i.e., no such contact parameters exist. Nevertheless, we found that one can find contact energy parameters for which an energy minimization procedure, acting in the space of contact maps, yields maps whose corresponding structures are close to the native ones. Finally, we show that when these structures are used as the initial point of a molecular dynamics energy minimization process, the correct native folds are recovered with high probability.
Collapse
Affiliation(s)
- C Clementi
- International School for Advanced Studies (SISSA) and Istituto Nazionale di Fiscia della Materia, Trieste, Italy.
| | | | | | | |
Collapse
|
24
|
Ali SA, Joao HC, Hammerschmid F, Eder J, Steinkasserer A. An antigenic HIV-1 peptide sequence engineered into the surface structure of transferrin does not elicit an antibody response. FEBS Lett 1999; 459:230-2. [PMID: 10518025 DOI: 10.1016/s0014-5793(99)01242-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
One novel approach for the biological delivery of peptide drugs is to incorporate the sequence of the peptide into the structure of a natural transport protein such as human serum transferrin (HST). However, a potential drawback is that the HST may increase the immunoreactivity of the peptide, in the same way that carrier proteins can be used to generate highly immunogenic peptide hapten conjugates. In this study we have generated a recombinant HST carrier protein that contains a peptide substrate of HIV-1 protease (VSQNYPIVL). The protein retained native HST function, and the peptide was surface exposed since it was immunoreactive in native dot blots, and was cleaved by HIV-1 protease. Immunisation of rabbits with the recombinant protein elicited only a very poor anti-peptide immune response. In contrast, strong anti-peptide immune responses were raised against both the peptide alone, and a chemical conjugate of the peptide with HST. These data demonstrate that it is possible to attenuate the immune response normally directed against an immunogenic peptide sequence by engineering into a surface exposed loop of HST. These findings may have an important impact on the future design of peptide delivery systems.
Collapse
Affiliation(s)
- S A Ali
- Novartis Research Institute, Brunnerstrasse 59, A-1230, Vienna, Austria
| | | | | | | | | |
Collapse
|
25
|
Boland MV, Murphy RF. After sequencing: quantitative analysis of protein localization. IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE : THE QUARTERLY MAGAZINE OF THE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY 1999; 18:115-9. [PMID: 10497746 DOI: 10.1109/51.790995] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
26
|
Ali SA, Joao HC, Hammerschmid F, Eder J, Steinkasserer A. Transferrin trojan horses as a rational approach for the biological delivery of therapeutic peptide domains. J Biol Chem 1999; 274:24066-73. [PMID: 10446177 DOI: 10.1074/jbc.274.34.24066] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
One novel approach for the biological delivery of peptide drugs is to incorporate the sequence of the peptide into the structure of a natural transport protein, such as human serum transferrin. To examine whether this is feasible, a peptide sequence cleavable by the human immunodeficiency virus type 1 protease (VSQNYPIVL) was inserted into various regions of human serum transferrin, and the resultant proteins were tested for function. Experimentally, molecular modeling was used to identify five candidate insertion sites in surface exposed loops of human serum transferrin that were distant from biologically active domains. These insertions were cloned using polymerase chain reaction mutagenesis, and the proteins were expressed using a baculovirus expression vector system. Analysis of the mutant proteins provided a number of important findings: (a) they retained native human serum transferrin function, (b) the inserted peptide sequence was surface exposed, and most importantly, (c) two of these mutants could be cleaved by human immunodeficiency virus-1 protease. In conclusion, this investigation has validated the use of human serum transferrin as a carrier protein for functional peptide domains introduced into its structure using protein engineering. These findings will be useful for developing a novel class of therapeutic agents for a broad spectrum of diseases.
Collapse
Affiliation(s)
- S A Ali
- Novartis Research Institute, Brunnerstrasse 59, A-1230 Vienna, Austria
| | | | | | | | | |
Collapse
|
27
|
|
28
|
Lazaridis T, Karplus M. Discrimination of the native from misfolded protein models with an energy function including implicit solvation. J Mol Biol 1999; 288:477-87. [PMID: 10329155 DOI: 10.1006/jmbi.1999.2685] [Citation(s) in RCA: 226] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
An essential requirement for theoretical protein structure prediction is an energy function that can discriminate the native from non-native protein conformations. To date most of the energy functions used for this purpose have been extracted from a statistical analysis of the protein structure database, without explicit reference to the physical interactions responsible for protein stability. The use of the statistical functions has been supported by the widespread belief that they are superior for such discrimination to physics-based energy functions. An effective energy function which combined the CHARMM vacuum potential with a Gaussian model for the solvation free energy is tested for its ability to discriminate the native structure of a protein from misfolded conformations; the results are compared with those obtained with the vacuum CHARMM potential. The test is performed on several sets of misfolded structures prepared by others, including sets of about 650 good decoys for six proteins, as well as on misfolded structures of chymotrypsin inhibitor 2. The vacuum CHARMM potential is successful in most cases when energy minimized conformations are considered, but fails when applied to structures relaxed by molecular dynamics. With the effective energy function the native state is always more stable than grossly misfolded conformations both in energy minimized and molecular dynamics-relaxed structures. The present results suggest that molecular mechanics (physics-based) energy functions, complemented by a simple model for the solvation free energy, should be tested for use in the inverse folding problem, and supports their use in studies of the effective energy surface of proteins in solution. Moreover, the study suggests that the belief in the superiority of statistical functions for these purposes may be ill founded.
Collapse
Affiliation(s)
- T Lazaridis
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford St, Cambridge, MA, 02138, USA
| | | |
Collapse
|
29
|
Pereira de Araujo AF, Pochapsky TC, Joughin B. Thermodynamics of interactions between amino acid side chains: experimental differentiation of aromatic-aromatic, aromatic-aliphatic, and aliphatic-aliphatic side-chain interactions in water. Biophys J 1999; 76:2319-28. [PMID: 10233051 PMCID: PMC1300206 DOI: 10.1016/s0006-3495(99)77389-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
A stationary phase for high-pressure liquid chromatography has been prepared by derivatizing microparticulate silica gel with functionality mimicking the side chain of isoleucine. The chromatographic retentions of a series of hydrophobic and amphiphilic amino acid analytes on this stationary phase (Ile MSP) using an aqueous mobile phase were measured as a function of temperature from 273 K to 323 K. Observed temperature dependencies are consistent with a constant change in heat capacity, DeltaC degrees P, upon binding of the analyte to the stationary phase. The curvatures of plots of retention data versus temperature (related to the magnitude of DeltaC degrees P) are distinctly different for retention of aromatic and aliphatic analytes, with retention of aliphatic analytes Val, Ile, and Leu exhibiting the characteristic signature of the hydrophobic effect, i.e., a large negative DeltaC degrees P upon desolvation from water and a maximum of retention around room temperature. Retention of aromatic analytes (Trp, Phe, and Tyr) involves smaller heat capacity changes and pronounced negative enthalpies of interaction with the stationary phase. Estimates of DeltaC degrees P for the interactions of analyte side chains with the Ile side chain were obtained by fitting the temperature dependence of retention to an expression derived from thermodynamic considerations and chromatographic theory. Similar estimates were made for interactions with the Phe side chain, using previously published data for a phenylalanine mimic stationary phase (Phe MSP) (. Protein Sci. 1:786-795). As with the Ile MSP, the retentions of aliphatic analytes show temperature dependencies markedly different from those of aromatic analytes. Data from both phases indicate that a realistic differentiation can be made between the interactions of various types of amino acid side chains tested (i.e., aliphatic/aliphatic, aliphatic/aromatic, and aromatic/aromatic) by comparison of the corresponding thermodynamic functions for pairwise interactions. The retention of leucine on the Phe MSP and that of phenylalanine on the Ile MSP showed similar DeltaC degrees P values, suggesting that the aromatic-aliphatic interaction is reasonably independent of the residue attached to the stationary phase. This result is consistent with a one-to-one interaction and suggests a simple way to estimate the column-dependent phase factor, making it possible to compare entropies and free energies of interaction obtained using different MSPs. The possibilities for using MSP-derived interaction potentials in folding simulations are discussed.
Collapse
|
30
|
Jermutus L, Guez V, Bedouelle H. Disordered C-terminal domain of tyrosyl-tRNA synthetase: secondary structure prediction. Biochimie 1999; 81:235-44. [PMID: 10385005 DOI: 10.1016/s0300-9084(99)80057-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The C-terminal domain (residues 320-419) of tyrosyl-tRNA synthetase (TyrRS) from Bacillus stearothermophilus is disordered in the crystal structure and involved in the binding of the anticodon arm of tRNA(Tyr). The sequences of 11 TyrRSs of prokaryotic or mitochondrial origins were aligned and the alignment showed the existence of conserved residues in the sequences of the C-terminal domains. A consensus could be deduced from the application of five programs of secondary structure prediction to the 11 sequences of the query set. These results suggested that the sequences of the C-terminal domains determined a precise and conserved secondary structure. They predicted that the C-terminal domain would have a mixed fold (alpha/beta or alpha+beta), with the alpha-helices in the first half of the sequence and the beta-strands mainly in its second half. Several programs of fold recognition from sequence alone, by threading onto known structures, were applied but none of them identified a type of fold that would be common to the different sequences of the query set. Therefore, the fold of the C-terminal, anticodon binding domain might be novel.
Collapse
Affiliation(s)
- L Jermutus
- Groupe d'Ingénierie des Protéines (CNRS URA 1129), Unité de Biochimie Cellulaire, Institut Pasteur, Paris, France
| | | | | |
Collapse
|
31
|
Morea V, Leplae R, Tramontano A. Protein structure prediction and design. BIOTECHNOLOGY ANNUAL REVIEW 1999; 4:177-214. [PMID: 9890141 DOI: 10.1016/s1387-2656(08)70070-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Proteins have a unique native conformation, which can be proven in many instances to be determined by the amino acid sequence alone. The folding problem, that is the understanding of how the amino acid sequence directs folding, is still unsolved, despite more than 30 years of effort. However, many new methods have appeared in the past few years. This chapter describes the different principles underlying them and tries to give an overview of their successes and pitfalls.
Collapse
Affiliation(s)
- V Morea
- IRBM P. Angeletti, Pomezia, Rome, Italy
| | | | | |
Collapse
|
32
|
Vendruscolo M, Domany E. Pairwise contact potentials are unsuitable for protein folding. J Chem Phys 1998. [DOI: 10.1063/1.477748] [Citation(s) in RCA: 115] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
33
|
Mirny LA, Shakhnovich EI. Protein structure prediction by threading. Why it works and why it does not. J Mol Biol 1998; 283:507-26. [PMID: 9769221 DOI: 10.1006/jmbi.1998.2092] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We developed a novel Monte Carlo threading algorithm which allows gaps and insertions both in the template structure and threaded sequence. The algorithm is able to find the optimal sequence-structure alignment and sample suboptimal alignments. Using our algorithm we performed sequence-structure alignments for a number of examples for three protein folds (ubiquitin, immunoglobulin and globin) using both "ideal" set of potentials (optimized to provide the best Z-score for a given protein) and more realistic knowledge-based potentials. Two physically different scenarios emerged. If a template structure is similar to the native one (within 2 A RMS), then (i) the optimal threading alignment is correct and robust with respect to deviations of the potential from the "ideal" one; (ii) suboptimal alignments are very similar to the optimal one; (iii) as Monte Carlo temperature decreases a sharp cooperative transition to the optimal alignment is observed. In contrast, if the template structure is only moderately close to the native structure (RMS greater than 3.5 A), then (i) the optimal alignment changes dramatically when an "ideal" potential is substituted by the real one; (ii) the structures of suboptimal alignments are very different from the optimal one, reducing the reliability of the alignment; (iii) the transition to the apparently optimal alignment is non-cooperative. In the intermediate cases when the RMS between the template and the native conformations is in the range between 2 A and 3.5 A, the success of threading alignment may depend on the quality of potentials used. These results are rationalized in terms of a threading free energy landscape. Possible ways to overcome the fundamental limitations of threading are discussed briefly.
Collapse
Affiliation(s)
- L A Mirny
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, 02138, USA
| | | |
Collapse
|
34
|
Finkelstein AV. How homologs can help to predict protein folds even though they cannot be predicted for individual sequences. J Comput Biol 1998; 5:369-76. [PMID: 9773338 DOI: 10.1089/cmb.1998.5.369] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
At present, one cannot predict the 3D structure of a protein directly from its sequence alone mainly because of errors in the energy estimates. However, a recently developed simple analytical theory (Finkelstein, 1998) shows that using a set of homologs (i.e., chains with numerous amino acid mutations but with equal 3D folds) one can average the interaction energies over the homologs and predict their common 3D fold even when predictions for individual sequences are wrong because the energy parameters are known only approximately. In this work we verify this theoretical conclusion by computer simulations performed with simplified models of protein chains.
Collapse
|
35
|
|
36
|
Sunyaev SR, Eisenhaber F, Argos P, Kuznetsov EN, Tumanyan VG. Are knowledge-based potentials derived from protein structure sets discriminative with respect to amino acid types? Proteins 1998. [DOI: 10.1002/(sici)1097-0134(19980515)31:3<225::aid-prot1>3.0.co;2-i] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
37
|
Abstract
The possible existence in eukaryotic cells of an internal, non-chromatin nuclear structural framework that facilitates gene readout as a set of spatially concerted reactions has become a popular but controversial theater of investigation. This article endeavors to present a circumspect review of the nuclear matrix concept as we presently know it, framed around two contrasting hypotheses: (1) that an internal nuclear framework actively enhances gene expression (in much the same way the cytoskeleton mediates cell locomotion, mitosis and intracellular vesicular traffic) versus (2) that the interphase chromosomes have fixed, inherited positions and that the DNA replication, transcripton and RNA processing machinery diffusionally arrives at sites of gene readout, with some aspects of nuclear structure thus being more a result than a cause of gene expression. On balance, the available information suggests that interactions among various gene expression machines may contribute to isolated nuclear matrix preparations. Some components of isolated nuclear matrix preparations may also reflect induced or reconfigured protein-protein associations. The protein characterization and ultrastructural analysis of the isolated nuclear matrix has advanced significantly in recent years, although controversies remain. Important new clues are now coming in from promising contemporary lines of research that report on nuclear structure in living cells.
Collapse
Affiliation(s)
- T Pederson
- Worcester Foundation for Biomedical Research, Shrewsbury, MA 01545, USA.
| |
Collapse
|
38
|
Chung MS, Neuwald AF, Wilbur WJ. A free energy analysis by unfolding applied to 125-mers on a cubic lattice. FOLDING & DESIGN 1998; 3:51-65. [PMID: 9502320 DOI: 10.1016/s1359-0278(98)00008-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
BACKGROUND A common approach to the protein folding problem involves computer simulation of folding using lattice models of amino acid sequences. Key factors for good performance in such models are the correct choice of the temperature and the average interaction energy between residues. In order to push the lattice approach to its limit it is important to have a method to adjust these parameters for optimal folding that is not limited by our ability to successfully simulate folding in a reasonable time. RESULTS In this study, we adopt a simple cubic-lattice model and present a method for calculating the free energy of a chain as a function of the number of native contacts. This does not require that we are able to fold the sequence by simulation and it provides a method of estimating the folding transition temperature. For a given set of parameters, the free energy analysis also allows an estimate of foldability. By applying the method to sequences with 27 and 125 residues, we show that optimal folding occurs near the folding transition temperature and at either zero or small negative average interaction energy. We find ourselves able to fold only 125-mers that have significant short-range native contacts. CONCLUSIONS A free energy analysis during unfolding is a useful tool for the study of foldability and should be applicable to a variety of folding models. In this way we are able to fold some 125-mer designed sequences and our results confirm the finding that short-range contacts contribute to foldability.
Collapse
Affiliation(s)
- M S Chung
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | |
Collapse
|
39
|
Li A, Fenselau C, Kaltashov IA. Stability of secondary structural elements in a solvent-free environment. II: The β-pleated sheets. Proteins 1998. [DOI: 10.1002/(sici)1097-0134(1998)33:2+<22::aid-prot4>3.0.co;2-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
40
|
Abstract
Prediction of protein structure by fold recognition, or threading, was recently put to the test in a 'blind' structure prediction experiment, CASP2. Thirty-two teams from around the world participated, preparing predictions for 22 different 'target' proteins whose structures were soon to be determined. As experimental structures became available, we, as organizers of the threading competition, computed objective measures of fold-recognition specificity and model accuracy, to identify and characterize successful predictions. Here, we present a brief summary of these prediction evaluations, a tally of 'correct' predictions and a discussion of factors associated with correct predictions. We find that threading produced specific recognition and accurate models whenever the structural database contained a template spanning a large fraction of target sequence. Presence of conserved sequence motifs was helpful, but not required, and it would appear that threading can succeed whenever similarity to a known structure is sufficiently extensive.
Collapse
Affiliation(s)
- A Marchler-Bauer
- Computational Biology Branch, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA
| | | |
Collapse
|
41
|
Jones DT. Successful ab initio prediction of the tertiary structure of NK-lysin using multiple sequences and recognized supersecondary structural motifs. Proteins 1997. [DOI: 10.1002/(sici)1097-0134(1997)1+<185::aid-prot24>3.0.co;2-j] [Citation(s) in RCA: 66] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
42
|
|