1
|
Mani H, Chang CC, Hsu HJ, Yang CH, Yen JH, Liou JW. Comparison, Analysis, and Molecular Dynamics Simulations of Structures of a Viral Protein Modeled Using Various Computational Tools. Bioengineering (Basel) 2023; 10:1004. [PMID: 37760106 PMCID: PMC10525864 DOI: 10.3390/bioengineering10091004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 08/16/2023] [Accepted: 08/22/2023] [Indexed: 09/29/2023] Open
Abstract
The structural analysis of proteins is a major domain of biomedical research. Such analysis requires resolved three-dimensional structures of proteins. Advancements in computer technology have led to progress in biomedical research. In silico prediction and modeling approaches have facilitated the construction of protein structures, with or without structural templates. In this study, we used three neural network-based de novo modeling approaches-AlphaFold2 (AF2), Robetta-RoseTTAFold (Robetta), and transform-restrained Rosetta (trRosetta)-and two template-based tools-the Molecular Operating Environment (MOE) and iterative threading assembly refinement (I-TASSER)-to construct the structure of a viral capsid protein, hepatitis C virus core protein (HCVcp), whose structure have not been fully resolved by laboratory techniques. Templates with sufficient sequence identity for the homology modeling of complete HCVcp are currently unavailable. Therefore, we performed domain-based homology modeling for MOE simulations. The templates for each domain were obtained through sequence-based searches on NCBI and the Protein Data Bank. Then, the modeled domains were assembled to construct the complete structure of HCVcp. The full-length structure and two truncated forms modeled using various computational tools were compared. Molecular dynamics (MD) simulations were performed to refine the structures. The root mean square deviation of backbone atoms, root mean square fluctuation of Cα atoms, and radius of gyration were calculated to monitor structural changes and convergence in the simulations. The model quality was evaluated through ERRAT and phi-psi plot analysis. In terms of the initial prediction for protein modeling, Robetta and trRosetta outperformed AF2. Regarding template-based tools, MOE outperformed I-TASSER. MD simulations resulted in compactly folded protein structures, which were of good quality and theoretically accurate. Thus, the predicted structures of certain proteins must be refined to obtain reliable structural models. MD simulation is a promising tool for this purpose.
Collapse
Affiliation(s)
- Hemalatha Mani
- Institute of Medical Sciences, Tzu Chi University, Hualien 97004, Taiwan
| | - Chun-Chun Chang
- Department of Laboratory Medicine, Hualien Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Hualien 97004, Taiwan
- Department of Laboratory Medicine and Biotechnology, Tzu Chi University, Hualien 97004, Taiwan
| | - Hao-Jen Hsu
- Department of Biomedical Sciences and Engineering, Tzu Chi University, Hualien 97004, Taiwan
| | - Chin-Hao Yang
- Department of Biochemistry, School of Medicine, Tzu Chi University, Hualien 97004, Taiwan
| | - Jui-Hung Yen
- Department of Molecular Biology and Human Genetics, Tzu Chi University, Hualien 97004, Taiwan
| | - Je-Wen Liou
- Institute of Medical Sciences, Tzu Chi University, Hualien 97004, Taiwan
- Department of Laboratory Medicine and Biotechnology, Tzu Chi University, Hualien 97004, Taiwan
- Department of Biochemistry, School of Medicine, Tzu Chi University, Hualien 97004, Taiwan
| |
Collapse
|
2
|
Gogoi CR, Rahman A, Saikia B, Baruah A. Protein Dihedral Angle Prediction: The State of the Art. ChemistrySelect 2023. [DOI: 10.1002/slct.202203427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Affiliation(s)
| | - Aziza Rahman
- Department of Chemistry Dibrugarh University Dibrugarh Assam India
| | - Bondeepa Saikia
- Department of Chemistry Dibrugarh University Dibrugarh Assam India
| | - Anupaul Baruah
- Department of Chemistry Dibrugarh University Dibrugarh Assam India
| |
Collapse
|
3
|
Gao Y, Wang S, Deng M, Xu J. RaptorX-Angle: real-value prediction of protein backbone dihedral angles through a hybrid method of clustering and deep learning. BMC Bioinformatics 2018; 19:100. [PMID: 29745828 PMCID: PMC5998898 DOI: 10.1186/s12859-018-2065-x] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Background Protein dihedral angles provide a detailed description of protein local conformation. Predicted dihedral angles can be used to narrow down the conformational space of the whole polypeptide chain significantly, thus aiding protein tertiary structure prediction. However, direct angle prediction from sequence alone is challenging. Results In this article, we present a novel method (named RaptorX-Angle) to predict real-valued angles by combining clustering and deep learning. Tested on a subset of PDB25 and the targets in the latest two Critical Assessment of protein Structure Prediction (CASP), our method outperforms the existing state-of-art method SPIDER2 in terms of Pearson Correlation Coefficient (PCC) and Mean Absolute Error (MAE). Our result also shows approximately linear relationship between the real prediction errors and our estimated bounds. That is, the real prediction error can be well approximated by our estimated bounds. Conclusions Our study provides an alternative and more accurate prediction of dihedral angles, which may facilitate protein structure prediction and functional study. Electronic supplementary material The online version of this article (10.1186/s12859-018-2065-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yujuan Gao
- Center for Quantitative Biology, Peking University, Beijing, China.,Toyota Technological Institute at Chicago, 6045 S Kenwood Ave., Chicago, USA
| | - Sheng Wang
- Toyota Technological Institute at Chicago, 6045 S Kenwood Ave., Chicago, USA
| | - Minghua Deng
- Center for Quantitative Biology, Peking University, Beijing, China. .,School of Mathematical Sciences, Beijing, China. .,Center for Statistical Sciences, Beijing, China.
| | - Jinbo Xu
- Toyota Technological Institute at Chicago, 6045 S Kenwood Ave., Chicago, USA.
| |
Collapse
|
4
|
Li J, Cai J, Su H, Du H, Zhang J, Ding S, Liu G, Tang Y, Li W. Effects of protein flexibility and active site water molecules on the prediction of sites of metabolism for cytochrome P450 2C19 substrates. MOLECULAR BIOSYSTEMS 2016; 12:868-78. [DOI: 10.1039/c5mb00784d] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Structure-based prediction of sites of metabolism (SOMs) mediated by cytochrome P450s (CYPs) is of great interest in drug discovery and development.
Collapse
Affiliation(s)
- Junhao Li
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Jinya Cai
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Haixia Su
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Hanwen Du
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Juan Zhang
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Shihui Ding
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| |
Collapse
|
5
|
Therrien E, Weill N, Tomberg A, Corbeil CR, Lee D, Moitessier N. Docking Ligands into Flexible and Solvated Macromolecules. 7. Impact of Protein Flexibility and Water Molecules on Docking-Based Virtual Screening Accuracy. J Chem Inf Model 2014; 54:3198-210. [DOI: 10.1021/ci500299h] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Eric Therrien
- Department of Chemistry, McGill University, 801 Sherbrooke Street W., Montréal, Québec, Canada H3A 0B8
| | - Nathanael Weill
- Department of Chemistry, McGill University, 801 Sherbrooke Street W., Montréal, Québec, Canada H3A 0B8
| | - Anna Tomberg
- Department of Chemistry, McGill University, 801 Sherbrooke Street W., Montréal, Québec, Canada H3A 0B8
| | - Christopher R. Corbeil
- Department of Chemistry, McGill University, 801 Sherbrooke Street W., Montréal, Québec, Canada H3A 0B8
| | - Devin Lee
- Department of Chemistry, McGill University, 801 Sherbrooke Street W., Montréal, Québec, Canada H3A 0B8
| | - Nicolas Moitessier
- Department of Chemistry, McGill University, 801 Sherbrooke Street W., Montréal, Québec, Canada H3A 0B8
| |
Collapse
|
6
|
Cao R, Wang Z, Wang Y, Cheng J. SMOQ: a tool for predicting the absolute residue-specific quality of a single protein model with support vector machines. BMC Bioinformatics 2014; 15:120. [PMID: 24776231 PMCID: PMC4013430 DOI: 10.1186/1471-2105-15-120] [Citation(s) in RCA: 87] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2013] [Accepted: 04/15/2014] [Indexed: 01/19/2023] Open
Abstract
Background It is important to predict the quality of a protein structural model before its native structure is known. The method that can predict the absolute local quality of individual residues in a single protein model is rare, yet particularly needed for using, ranking and refining protein models. Results We developed a machine learning tool (SMOQ) that can predict the distance deviation of each residue in a single protein model. SMOQ uses support vector machines (SVM) with protein sequence and structural features (i.e. basic feature set), including amino acid sequence, secondary structures, solvent accessibilities, and residue-residue contacts to make predictions. We also trained a SVM model with two new additional features (profiles and SOV scores) on 20 CASP8 targets and found that including them can only improve the performance when real deviations between native and model are higher than 5Å. The SMOQ tool finally released uses the basic feature set trained on 85 CASP8 targets. Moreover, SMOQ implemented a way to convert predicted local quality scores into a global quality score. SMOQ was tested on the 84 CASP9 single-domain targets. The average difference between the residue-specific distance deviation predicted by our method and the actual distance deviation on the test data is 2.637Å. The global quality prediction accuracy of the tool is comparable to other good tools on the same benchmark. Conclusion SMOQ is a useful tool for protein single model quality assessment. Its source code and executable are available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/.
Collapse
Affiliation(s)
| | | | | | - Jianlin Cheng
- Department of Computer Science, Informatics Institute, Christopher S, Bond Life Science Center, University of Missouri, Columbia, MO 65211, USA.
| |
Collapse
|
7
|
Carugo O, Djinović-Carugo K. Half a century of Ramachandran plots. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:1333-41. [DOI: 10.1107/s090744491301158x] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Accepted: 04/27/2013] [Indexed: 11/11/2022]
|
8
|
Haddadian EJ, Gong H, Jha AK, Yang X, Debartolo J, Hinshaw JR, Rice PA, Sosnick TR, Freed KF. Automated real-space refinement of protein structures using a realistic backbone move set. Biophys J 2011; 101:899-909. [PMID: 21843481 DOI: 10.1016/j.bpj.2011.06.063] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2011] [Revised: 06/23/2011] [Accepted: 06/28/2011] [Indexed: 11/26/2022] Open
Abstract
Crystals of many important biological macromolecules diffract to limited resolution, rendering accurate model building and refinement difficult and time-consuming. We present a torsional optimization protocol that is applicable to many such situations and combines Protein Data Bank-based torsional optimization with real-space refinement against the electron density derived from crystallography or cryo-electron microscopy. Our method converts moderate- to low-resolution structures at initial (e.g., backbone trace only) or late stages of refinement to structures with increased numbers of hydrogen bonds, improved crystallographic R-factors, and superior backbone geometry. This automated method is applicable to DNA-binding and membrane proteins of any size and will aid studies of structural biology by improving model quality and saving considerable effort. The method can be extended to improve NMR and other structures. Our backbone score and its sequence profile provide an additional standard tool for evaluating structural quality.
Collapse
Affiliation(s)
- Esmael J Haddadian
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Abstract
Around half of all protein structures solved nowadays using solution-state nuclear magnetic resonance (NMR) spectroscopy have been because of automated data analysis. The pervasiveness of computational approaches in general hides, however, a more nuanced view in which the full variety and richness of the field appears. This review is structured around a comparison of methods associated with three NMR observables: classical nuclear Overhauser effect (NOE) constraint gathering in contrast with more recent chemical shift and residual dipole coupling (RDC) based protocols. In each case, the emphasis is placed on the latest research, covering mainly the past 5 years. By describing both general concepts and representative programs, the objective is to map out a field in which--through the very profusion of approaches--it is all too easy to lose one's bearings.
Collapse
|
10
|
Investigation of the Bcl-2 multimerisation process: structural and functional implications. BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH 2011; 1813:850-7. [PMID: 21320534 DOI: 10.1016/j.bbamcr.2011.02.006] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2010] [Revised: 01/15/2011] [Accepted: 02/07/2011] [Indexed: 11/22/2022]
Abstract
Bcl-2 plays a prominent role in regulating the function of mitochondria during respiration and in determining the threshold of apoptotic sensitivity. Despite its relevance, the mechanism through which these processes are achieved is still unknown. Using surface plasmon resonance technology to monitor Bcl-2 multimerisation we discovered that a simple dimeric model does not fit with experimental data. A molecular model of the experimentally observed Bcl-2 homomeric complex has been developed. Accordingly, using a panel of mutants we identified in the loop a critical region for the process of Bcl-2 multimerisation. Our results indicate that the Bcl-2 loop posttranscriptional changes can modulate its ability to make homo and hetero-complexes, ultimately leading to functional modulation, suggesting an intriguing relationship between the ability of Bcl-2 to form multimeric complexes and its multi-functional role as a membrane channel. This article is part of a Special Issue entitled: 11th European Symposium on Calcium.
Collapse
|
11
|
Zhang J, Wang Q, Barz B, He Z, Kosztin I, Shang Y, Xu D. MUFOLD: A new solution for protein 3D structure prediction. Proteins 2010; 78:1137-52. [PMID: 19927325 DOI: 10.1002/prot.22634] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
There have been steady improvements in protein structure prediction during the past 2 decades. However, current methods are still far from consistently predicting structural models accurately with computing power accessible to common users. Toward achieving more accurate and efficient structure prediction, we developed a number of novel methods and integrated them into a software package, MUFOLD. First, a systematic protocol was developed to identify useful templates and fragments from Protein Data Bank for a given target protein. Then, an efficient process was applied for iterative coarse-grain model generation and evaluation at the Calpha or backbone level. In this process, we construct models using interresidue spatial restraints derived from alignments by multidimensional scaling, evaluate and select models through clustering and static scoring functions, and iteratively improve the selected models by integrating spatial restraints and previous models. Finally, the full-atom models were evaluated using molecular dynamics simulations based on structural changes under simulated heating. We have continuously improved the performance of MUFOLD by using a benchmark of 200 proteins from the Astral database, where no template with >25% sequence identity to any target protein is included. The average root-mean-square deviation of the best models from the native structures is 4.28 A, which shows significant and systematic improvement over our previous methods. The computing time of MUFOLD is much shorter than many other tools, such as Rosetta. MUFOLD demonstrated some success in the 2008 community-wide experiment for protein structure prediction CASP8.
Collapse
Affiliation(s)
- Jingfen Zhang
- Department of Computer Science, University of Missouri, Columbia, Missouri 65211, USA
| | | | | | | | | | | | | |
Collapse
|
12
|
Maupetit J, Tuffery P, Derreumaux P. A coarse-grained protein force field for folding and structure prediction. Proteins 2009; 69:394-408. [PMID: 17600832 DOI: 10.1002/prot.21505] [Citation(s) in RCA: 164] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
We have revisited the protein coarse-grained optimized potential for efficient structure prediction (OPEP). The training and validation sets consist of 13 and 16 protein targets. Because optimization depends on details of how the ensemble of decoys is sampled, trial conformations are generated by molecular dynamics, threading, greedy, and Monte Carlo simulations, or taken from publicly available databases. The OPEP parameters are varied by a genetic algorithm using a scoring function which requires that the native structure has the lowest energy, and the native-like structures have energy higher than the native structure but lower than the remote conformations. Overall, we find that OPEP correctly identifies 24 native or native-like states for 29 targets and has very similar capability to the all-atom discrete optimized protein energy model (DOPE), found recently to outperform five currently used energy models.
Collapse
Affiliation(s)
- Julien Maupetit
- Equipe de Bioinformatique Génomique et Moléculaire, INSERM E0346, Université Paris 7, Tour 53-54, 2 place Jussieu, 75251 Paris, Cedex 05, France
| | | | | |
Collapse
|
13
|
Frickenhaus S, Kannan S, Zacharias M. Efficient evaluation of sampling quality of molecular dynamics simulations by clustering of dihedral torsion angles and Sammon mapping. J Comput Chem 2009; 30:479-92. [DOI: 10.1002/jcc.21076] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
14
|
Li SC, Bu D, Xu J, Li M. Fragment-HMM: a new approach to protein structure prediction. Protein Sci 2008; 17:1925-34. [PMID: 18723665 DOI: 10.1110/ps.036442.108] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
We designed a simple position-specific hidden Markov model to predict protein structure. Our new framework naturally repeats itself to converge to a final target, conglomerating fragment assembly, clustering, target selection, refinement, and consensus, all in one process. Our initial implementation of this theory converges to within 6 A of the native structures for 100% of decoys on all six standard benchmark proteins used in ROSETTA (discussed by Simons and colleagues in a recent paper), which achieved only 14%-94% for the same data. The qualities of the best decoys and the final decoys our theory converges to are also notably better.
Collapse
Affiliation(s)
- Shuai Cheng Li
- David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, Ontario N2L3G1, Canada
| | | | | | | |
Collapse
|
15
|
Amir EAD, Kalisman N, Keasar C. Differentiable, multi-dimensional, knowledge-based energy terms for torsion angle probabilities and propensities. Proteins 2008; 72:62-73. [PMID: 18186478 DOI: 10.1002/prot.21896] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Rotatable torsion angles are the major degrees of freedom in proteins. Adjacent angles are highly correlated and energy terms that rely on these correlations are intensively used in molecular modeling. However, the utility of torsion based terms is not yet fully exploited. Many of these terms do not capture the full scale of the correlations. Other terms, which rely on lookup tables, cannot be used in the context of force-driven algorithms because they are not fully differentiable. This study aims to extend the usability of torsion terms by presenting a set of high-dimensional and fully-differentiable energy terms that are derived from high-resolution structures. The set includes terms that describe backbone conformational probabilities and propensities, side-chain rotamer probabilities, and an elaborate term that couples all the torsion angles within the same residue. The terms are constructed by cubic spline interpolation with periodic boundary conditions that enable full differentiability and high computational efficiency. We show that the spline implementation does not compromise the accuracy of the original database statistics. We further show that the side-chain relevant terms are compatible with established rotamer probabilities. Despite their very local characteristics, the new terms are often able to identify native and native-like structures within decoy sets. Finally, force-based minimization of NMR structures with the new terms improves their torsion angle statistics with minor structural distortion (0.5 A RMSD on average). The new terms are freely available in the MESHI molecular modeling package. The spline coefficients are also available as a documented MATLAB file.
Collapse
Affiliation(s)
- El-Ad David Amir
- Department of Computer Science, Ben-Gurion University of the Negev, Israel
| | | | | |
Collapse
|
16
|
Abstract
MOTIVATION The 3D structure of a protein sequence can be assembled from the substructures corresponding to small segments of this sequence. For each small sequence segment, there are only a few more likely substructures. We call them the 'structural alphabet' for this segment. Classical approaches such as ROSETTA used sequence profile and secondary structure information, to predict structural fragments. In contrast, we utilize more structural information, such as solvent accessibility and contact capacity, for finding structural fragments. RESULTS Integer linear programming technique is applied to derive the best combination of these sequence and structural information items. This approach generates significantly more accurate and succinct structural alphabets with more than 50% improvement over the previous accuracies. With these novel structural alphabets, we are able to construct more accurate protein structures than the state-of-art ab initio protein structure prediction programs such as ROSETTA. We are also able to reduce the Kolodny's library size by a factor of 8, at the same accuracy. AVAILABILITY The online FRazor server is under construction.
Collapse
Affiliation(s)
- Shuai Cheng Li
- David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, Ont. N2L 3G1, Canada.
| | | | | | | | | |
Collapse
|
17
|
Saccenti E, Rosato A. The war of tools: how can NMR spectroscopists detect errors in their structures? JOURNAL OF BIOMOLECULAR NMR 2008; 40:251-261. [PMID: 18320330 DOI: 10.1007/s10858-008-9228-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2007] [Revised: 02/08/2008] [Accepted: 02/13/2008] [Indexed: 05/26/2023]
Abstract
Protein structure determination by NMR methods has started in the mid-eighties and has been growing steadily since then. Ca. 14% of the protein structures deposited in the PDB have been solved by NMR. The evaluation of the quality of NMR structures however is still lacking a well-established practice. In this work, we examined various tools for the assessment of structural quality to ascertain the extent to which these tools could be applied to detect flaws in NMR structures. In particular, we investigated the variation in the scores assigned by these programs as a function of the deviation of the structures induced by errors in assignments or in the upper distance limits used. These perturbations did not distort radically the protein fold, but resulted in backbone RMS deviations up to 3 A, which is in line with errors highlighted in the available literature. We found that it is quite difficult to discriminate the structures perturbed because of misassignments from the original ones, also because the spread in score over the conformers of the original bundle is relatively large. varphi-psi distributions and normality scores related to the backbone conformation and to the distribution of side-chain dihedral angles are the most sensitive indicators of flaws.
Collapse
Affiliation(s)
- Edoardo Saccenti
- Magnetic Resonance Center, University of Florence, Via L. Sacconi 6, 50019, Sesto Fiorentino, Italy
| | | |
Collapse
|
18
|
Tosatto SCE, Battistutta R. TAP score: torsion angle propensity normalization applied to local protein structure evaluation. BMC Bioinformatics 2007; 8:155. [PMID: 17504537 PMCID: PMC1878508 DOI: 10.1186/1471-2105-8-155] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2007] [Accepted: 05/15/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Experimentally determined protein structures may contain errors and require validation. Conformational criteria based on the Ramachandran plot are mainly used to distinguish between distorted and adequately refined models. While the readily available criteria are sufficient to detect totally wrong structures, establishing the more subtle differences between plausible structures remains more challenging. RESULTS A new criterion, called TAP score, measuring local sequence to structure fitness based on torsion angle propensities normalized against the global minimum and maximum is introduced. It is shown to be more accurate than previous methods at estimating the validity of a protein model in terms of commonly used experimental quality parameters on two test sets representing the full PDB database and a subset of obsolete PDB structures. Highly selective TAP thresholds are derived to recognize over 90% of the top experimental structures in the absence of experimental information. Both a web server and an executable version of the TAP score are available at http://protein.cribi.unipd.it/tap/. CONCLUSION A novel procedure for energy normalization (TAP) has significantly improved the possibility to recognize the best experimental structures. It will allow the user to more reliably isolate problematic structures in the context of automated experimental structure determination.
Collapse
Affiliation(s)
- Silvio CE Tosatto
- Dept. of Biology and CRIBI Biotechnology Centre, University of Padova, Italy
| | - Roberto Battistutta
- Dept. of Chemical Sciences and Venetian Institute of Molecular Medicine (VIMM), University of Padova, Italy
| |
Collapse
|
19
|
Evaluation of the structural quality of modeled proteins by using globularity criteria. BMC STRUCTURAL BIOLOGY 2007; 7:9. [PMID: 17346357 PMCID: PMC1828058 DOI: 10.1186/1472-6807-7-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2006] [Accepted: 03/09/2007] [Indexed: 11/10/2022]
Abstract
Background The knowledge of the three-dimensional structure of globular proteins is fundamental for a detailed investigation of their functional properties. Experimental methods are too slow for structure investigation on a large scale, while computational prediction methods offer alternatives that are continuously being improved. The international Comparative Assessment of Structure Prediction (CASP), an "a posteriori" evaluation of the quality of theoretical models when the experimental structure becomes available, demonstrates that predictions can be successful as well as unsuccessful, and this suggests the necessity for evaluations able to discard "a priori" the wrong models. Results We analyzed different structural properties of globular proteins for experimentally solved proteins belonging to the four different structural classes: "mainly alpha", "mainly beta", "alpha/beta" and "alpha+beta". The properties were found to be linearly correlated to protein molecular weight, but with some differences among the four classes. These results were applied to develop an evaluation test of theoretical models based on the expected globular properties of proteins. To verify the success of our test, we applied it to several protein models submitted to the sixth edition of CASP. The best theoretical models, as judged by CASP assessors, were in agreement with the expected properties, while most of the low-quality models had not passed our evaluations. Conclusion This study supports the need for careful checks to avoid the diffusion of incorrect structural models. Our test allows the evaluation of models in the absence of experimental reference structures, thereby preventing the diffusion of incorrect structural models and the formulation of incorrect functional hypotheses. It can be used to check the globularity of predicted models, and to supplement other methods already used to evaluate their quality.
Collapse
|