1
|
da Cruz EC, Silva MJA, Gama GCB, Pinheiro AHG, Gonçalves EC, Siqueira AS. Virtual screening and repurposing of approved drugs targeting homoserine dehydrogenase from Paracoccidioides brasiliensis. J Mol Model 2022; 28:374. [DOI: 10.1007/s00894-022-05335-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 09/28/2022] [Indexed: 11/05/2022]
|
2
|
Monroe L, Kihara D. Using steered molecular dynamic tension for assessing quality of computational protein structure models. J Comput Chem 2022; 43:1140-1150. [PMID: 35475517 PMCID: PMC9133218 DOI: 10.1002/jcc.26876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 02/16/2022] [Accepted: 04/15/2022] [Indexed: 11/12/2022]
Abstract
The native structures of proteins, except for notable exceptions of intrinsically disordered proteins, in general take their most stable conformation in the physiological condition to maintain their structural framework so that their biological function can be properly carried out. Experimentally, the stability of a protein can be measured by several means, among which the pulling experiment using the atomic force microscope (AFM) stands as a unique method. AFM directly measures the resistance from unfolding, which can be quantified from the observed force-extension profile. It has been shown that key features observed in an AFM pulling experiment can be well reproduced by computational molecular dynamics simulations. Here, we applied computational pulling for estimating the accuracy of computational protein structure models under the hypothesis that the structural stability would positively correlated with the accuracy, i.e. the closeness to the native, of a model. We used in total 4929 structure models for 24 target proteins from the Critical Assessment of Techniques of Structure Prediction (CASP) and investigated if the magnitude of the break force, that is, the force required to rearrange the model's structure, from the force profile was sufficient information for selecting near-native models. We found that near-native models can be successfully selected by examining their break forces suggesting that high break force indeed indicates high stability of models. On the other hand, there were also near-native models that had relatively low peak forces. The mechanisms of the stability exhibited by the break forces were explored and discussed.
Collapse
Affiliation(s)
- Lyman Monroe
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
- Purdue University Center for Cancer Research, West Lafayette, IN, 47907, USA
| |
Collapse
|
3
|
Köhler L, Patzschke M, Schmidt M, Stumpf T, März J. How 5 f Electron Polarisability Drives Covalency and Selectivity in Actinide N-Donor Complexes. Chemistry 2021; 27:18058-18065. [PMID: 34747538 PMCID: PMC9299701 DOI: 10.1002/chem.202102849] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Indexed: 01/12/2023]
Abstract
We report a series of isostructural tetravalent actinide (Th, U−Pu) complexes with the N‐donor ligand N,N’‐ethylene‐bis((pyrrole‐2‐yl)methanimine) (H2L, H2pyren). Structural data from SC‐XRD analysis reveal [An(pyren)2] complexes with different An−Nimine versus An−Npyrrolide bond lengths. Quantum chemical calculations elucidated the bonding situation, including differences in the covalent character of the coordinative bonds. A comparison to the intensely studied analogous N,N′‐ethylene‐bis(salicylideneimine) (H2salen)‐based complexes [An(salen)2] displays, on average, almost equal electron sharing of pyren or salen with the AnIV, pointing to a potential ligand‐cage‐driven complex stabilisation. This is shown in the fixed ligand arrangement of pyren and salen in the respective AnIV complexes. The overall bond strength of the pure N‐donor ligand pyren to AnIV (An=Th, U, Np, Pu) is slightly weaker than to salen, with the exception of the PaIV complex, which exhibits extraordinarily high electron sharing of pyren with PaIV. Such an altered ligand preference within the early AnIV series points to a specificity of the 5f1 configuration, which can be explained by polarisation effects of the 5 f electrons, allowing the strongest f electron backbonding from PaIV (5f1) to the N donors of pyren.
Collapse
Affiliation(s)
- Luisa Köhler
- Helmholtz-Zentrum Dresden-Rossendorf (HZDR), Institute of Resource Ecology, Bautzner Landstraße 400, 01328, Dresden, Germany
| | - Michael Patzschke
- Helmholtz-Zentrum Dresden-Rossendorf (HZDR), Institute of Resource Ecology, Bautzner Landstraße 400, 01328, Dresden, Germany
| | - Moritz Schmidt
- Helmholtz-Zentrum Dresden-Rossendorf (HZDR), Institute of Resource Ecology, Bautzner Landstraße 400, 01328, Dresden, Germany
| | - Thorsten Stumpf
- Helmholtz-Zentrum Dresden-Rossendorf (HZDR), Institute of Resource Ecology, Bautzner Landstraße 400, 01328, Dresden, Germany
| | - Juliane März
- Helmholtz-Zentrum Dresden-Rossendorf (HZDR), Institute of Resource Ecology, Bautzner Landstraße 400, 01328, Dresden, Germany
| |
Collapse
|
4
|
Yusof NA, Charles J, Wan Mahadi WNS, Abdul Murad AM, Mahadi NM. Characterization of Inducible HSP70 Genes in an Antarctic Yeast, Glaciozyma antarctica PI12, in Response to Thermal Stress. Microorganisms 2021; 9:microorganisms9102069. [PMID: 34683390 PMCID: PMC8540855 DOI: 10.3390/microorganisms9102069] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 09/01/2021] [Accepted: 09/06/2021] [Indexed: 11/17/2022] Open
Abstract
The induction of highly conserved heat shock protein 70 (HSP70) is often related to a cellular response due to harmful stress or adverse life conditions. In this study, we determined the expression of Hsp70 genes in the Antarctic yeast, Glaciozyma antarctica, under different several thermal treatments for several exposure periods. The main aims of the present study were (1) to determine if stress-induced Hsp70 could be used to monitor the exposure of the yeast species G. antarctica to various types of thermal stress; (2) to analyze the structures of the G. antarctica HSP70 proteins using comparative modeling; and (3) to evaluate the relationship between the function and structure of HSP70 in G. antarctica. In this study, we managed to amplify and clone 2 Hsp70 genes from G. antarctica named GaHsp70-1 and GaHsp70-2. The cells of G. antarctica expressed significantly inducible Hsp70 genes after the heat and cold shock treatments. Interestingly, GaHsp70-1 showed 2–6-fold higher expression than GaHsp70-2 after the heat and cold exposure. ATP hydrolysis analysis on both G. antarctica HSP70s proved that these psychrophilic chaperones can perform activities in a wide range of temperatures, such as at 37, 25, 15, and 4 °C. The 3D structures of both HSP70s revealed several interesting findings, such as the substitution of a β-sheet to loop in the N-terminal ATPase binding domain and some modest residue substitutions, which gave the proteins the flexibility to function at low temperatures and retain their functional activity at ambient temperatures. In conclusion, both analyzed HSP70s played important roles in the physiological adaptation of G. antarctica.
Collapse
Affiliation(s)
- Nur Athirah Yusof
- Biotechnology Research Institute, Universiti Malaysia Sabah, Kota Kinabalu 88400, Sabah, Malaysia; (J.C.); (W.N.S.W.M.)
- Correspondence: ; Tel.: +60-19-605-1219
| | - Jennifer Charles
- Biotechnology Research Institute, Universiti Malaysia Sabah, Kota Kinabalu 88400, Sabah, Malaysia; (J.C.); (W.N.S.W.M.)
| | - Wan Nur Shuhaida Wan Mahadi
- Biotechnology Research Institute, Universiti Malaysia Sabah, Kota Kinabalu 88400, Sabah, Malaysia; (J.C.); (W.N.S.W.M.)
| | - Abdul Munir Abdul Murad
- Faculty of Science and Technology, School of Biosciences and Biotechnology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia;
| | | |
Collapse
|
5
|
Prediction of Local Quality of Protein Structure Models Considering Spatial Neighbors in Graphical Models. Sci Rep 2017; 7:40629. [PMID: 28074879 PMCID: PMC5225430 DOI: 10.1038/srep40629] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2016] [Accepted: 12/08/2016] [Indexed: 12/31/2022] Open
Abstract
Protein tertiary structure prediction methods have matured in recent years. However, some proteins defy accurate prediction due to factors such as inadequate template structures. While existing model quality assessment methods predict global model quality relatively well, there is substantial room for improvement in local quality assessment, i.e. assessment of the error at each residue position in a model. Local quality is a very important information for practical applications of structure models such as interpreting/designing site-directed mutagenesis of proteins. We have developed a novel local quality assessment method for protein tertiary structure models. The method, named Graph-based Model Quality assessment method (GMQ), explicitly considers the predicted quality of spatially neighboring residues using a graph representation of a query protein structure model. GMQ uses conditional random field as its core of the algorithm, and performs a binary prediction of the quality of each residue in a model, indicating if a residue position is likely to be within an error cutoff or not. The accuracy of GMQ was improved by considering larger graphs to include quality information of more surrounding residues. Moreover, we found that using different edge weights in graphs reflecting different secondary structures further improves the accuracy. GMQ showed competitive performance on a benchmark for quality assessment of structure models from the Critical Assessment of Techniques for Protein Structure Prediction (CASP).
Collapse
|
6
|
Esquivel-Rodríguez J, Kihara D. Computational methods for constructing protein structure models from 3D electron microscopy maps. J Struct Biol 2013; 184:93-102. [PMID: 23796504 DOI: 10.1016/j.jsb.2013.06.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 06/11/2013] [Accepted: 06/13/2013] [Indexed: 12/31/2022]
Abstract
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided.
Collapse
Affiliation(s)
- Juan Esquivel-Rodríguez
- Department of Computer Science, College of Science, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
7
|
Three-Dimensional Molecular Modeling of a Diverse Range of SC Clan Serine Proteases. Mol Biol Int 2012; 2012:580965. [PMID: 23213528 PMCID: PMC3507156 DOI: 10.1155/2012/580965] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Revised: 10/17/2012] [Accepted: 10/17/2012] [Indexed: 01/09/2023] Open
Abstract
Serine proteases are involved in a variety of biological processes and are classified into clans sharing structural homology. Although various three-dimensional structures of SC clan proteases have been experimentally determined, they are mostly bacterial and animal proteases, with some from archaea, plants, and fungi, and as yet no structures have been determined for protozoa. To bridge this gap, we have used molecular modeling techniques to investigate the structural properties of different SC clan serine proteases from a diverse range of taxa. Either SWISS-MODEL was used for homology-based structure prediction or the LOOPP server was used for threading-based structure prediction. The predicted models were refined using Insight II and SCRWL and validated against experimental structures. Investigation of secondary structures and electrostatic surface potential was performed using MOLMOL. The structural geometry of the catalytic core shows clear deviations between taxa, but the relative positions of the catalytic triad residues were conserved. Evolutionary divergence was also exhibited by large variation in secondary structure features outside the core, differences in overall amino acid distribution, and unique surface electrostatic potential patterns between species. Encompassing a wide range of taxa, our structural analysis provides an evolutionary perspective on SC clan serine proteases.
Collapse
|
8
|
Laskar A, Rodger EJ, Chatterjee A, Mandal C. Modeling and structural analysis of PA clan serine proteases. BMC Res Notes 2012; 5:256. [PMID: 22624962 PMCID: PMC3434108 DOI: 10.1186/1756-0500-5-256] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2011] [Accepted: 05/11/2012] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND Serine proteases account for over a third of all known proteolytic enzymes; they are involved in a variety of physiological processes and are classified into clans sharing structural homology. The PA clan of endopeptidases is the most abundant and over two thirds of this clan is comprised of the S1 family of serine proteases, which bear the archetypal trypsin fold and have a catalytic triad in the order Histidine, Aspartate, Serine. These proteases have been studied in depth and many three dimensional structures have been experimentally determined. However, these structures mostly consist of bacterial and animal proteases, with a small number of plant and fungal proteases and as yet no structures have been determined for protozoa or archaea. The core structure and active site geometry of these proteases is of interest for many applications. This study investigated the structural properties of different S1 family serine proteases from a diverse range of taxa using molecular modeling techniques. RESULTS Our predicted models from protozoa, archaea, fungi and plants were combined with the experimentally determined structures of 16 S1 family members and used for analysis of the catalytic core. Amino acid sequences were submitted to SWISS-MODEL for homology-based structure prediction or the LOOPP server for threading-based structure prediction. Predicted models were refined using INSIGHT II and SCRWL and validated against experimental structures. Investigation of secondary structures and electrostatic surface potential was performed using MOLMOL. The structural geometry of the catalytic core shows clear deviations between taxa, but the relative positions of the catalytic triad residues were conserved. Some highly conserved residues potentially contributing to the stability of the structural core were identified. Evolutionary divergence was also exhibited by large variation in secondary structure features outside the core, differences in overall amino acid distribution, and unique surface electrostatic potential patterns between species. CONCLUSIONS Encompassing a wide range of taxa, our structural analysis provides an evolutionary perspective on S1 family serine proteases. Focusing on the common core containing the catalytic site of the enzyme, this analysis is beneficial for future molecular modeling strategies and structural analysis of serine protease models.
Collapse
Affiliation(s)
- Aparna Laskar
- Indian Institute of Chemical Biology (CSIR Unit, Government of India), Kolkata, West Bengal, India.
| | | | | | | |
Collapse
|
9
|
Bagaria A, Jaravine V, Huang YJ, Montelione GT, Güntert P. Protein structure validation by generalized linear model root-mean-square deviation prediction. Protein Sci 2012; 21:229-38. [PMID: 22113924 DOI: 10.1002/pro.2007] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2011] [Revised: 11/17/2011] [Accepted: 11/19/2011] [Indexed: 01/03/2023]
Abstract
Large-scale initiatives for obtaining spatial protein structures by experimental or computational means have accentuated the need for the critical assessment of protein structure determination and prediction methods. These include blind test projects such as the critical assessment of protein structure prediction (CASP) and the critical assessment of protein structure determination by nuclear magnetic resonance (CASD-NMR). An important aim is to establish structure validation criteria that can reliably assess the accuracy of a new protein structure. Various quality measures derived from the coordinates have been proposed. A universal structural quality assessment method should combine multiple individual scores in a meaningful way, which is challenging because of their different measurement units. Here, we present a method based on a generalized linear model (GLM) that combines diverse protein structure quality scores into a single quantity with intuitive meaning, namely the predicted coordinate root-mean-square deviation (RMSD) value between the present structure and the (unavailable) "true" structure (GLM-RMSD). For two sets of structural models from the CASD-NMR and CASP projects, this GLM-RMSD value was compared with the actual accuracy given by the RMSD value to the corresponding, experimentally determined reference structure from the Protein Data Bank (PDB). The correlation coefficients between actual (model vs. reference from PDB) and predicted (model vs. "true") heavy-atom RMSDs were 0.69 and 0.76, for the two datasets from CASD-NMR and CASP, respectively, which is considerably higher than those for the individual scores (-0.24 to 0.68). The GLM-RMSD can thus predict the accuracy of protein structures more reliably than individual coordinate-based quality scores.
Collapse
Affiliation(s)
- Anurag Bagaria
- Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance, and Frankfurt Institute of Advanced Studies, Goethe University Frankfurt, Frankfurt am Main, Germany
| | | | | | | | | |
Collapse
|
10
|
Kryshtafovych A, Fidelis K, Tramontano A. Evaluation of model quality predictions in CASP9. Proteins 2011; 79 Suppl 10:91-106. [PMID: 21997462 DOI: 10.1002/prot.23180] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2011] [Revised: 08/22/2011] [Accepted: 08/24/2011] [Indexed: 12/14/2022]
Abstract
CASP has been assessing the state of the art in the a priori estimation of accuracy of protein structure prediction since 2006. The inclusion of model quality assessment category in CASP contributed to a rapid development of methods in this area. In the last experiment, 46 quality assessment groups tested their approaches to estimate the accuracy of protein models as a whole and/or on a per-residue basis. We assessed the performance of these methods predominantly on the basis of the correlation between the predicted and observed quality of the models on both global and local scales. The ability of the methods to identify the models closest to the best one, to differentiate between good and bad models, and to identify well modeled regions was also analyzed. Our evaluations demonstrate that even though global quality assessment methods seem to approach perfection point (weighted average per-target Pearson's correlation coefficients are as high as 0.97 for the best groups), there is still room for improvement. First, all top-performing methods use consensus approaches to generate quality estimates, and this strategy has its own limitations. Second, the methods that are based on the analysis of individual models lag far behind clustering techniques and need a boost in performance. The methods for estimating per-residue accuracy of models are less accurate than global quality assessment methods, with an average weighted per-model correlation coefficient in the range of 0.63-0.72 for the best 10 groups.
Collapse
Affiliation(s)
- Andriy Kryshtafovych
- Genome Center, University of California-Davis, 451 Health Sciences Drive, Davis, CA 95616, USA.
| | | | | |
Collapse
|
11
|
Saraiva LA, Veloso MP, Camps I, da Silveira NJF. Structural Bioinformatics Approach of Cyclin-Dependent Kinases 1 and 3 Complexed with Inhibitors. Mol Inform 2011; 30:219-31. [PMID: 27466775 DOI: 10.1002/minf.201000143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2010] [Accepted: 01/17/2011] [Indexed: 11/06/2022]
Abstract
The cyclin-dependent kinases or CDKs participate in the regulation of both the cell progression cycle and the RNA polymerase-II transcription cycle. In several human tumours deregulation of CDK-related mechanisms have been detected, e.g., overexpression of cyclins or deletion of genes encoding for CKIs. Regarding these observations, CDKs came up to be interesting targets for elaboration of novel antitumour drugs. Based on the importance of the CDKs, this research aimed to describe, to characterize and to compare the molecular models of CDK1 and CDK3. Since the structures of human CDK1 and CDK3 are unavailable in the Protein Data Bank -PDB, homology models were created based on the CDK2 as the template, once they share a substantial identity. The structural studies of the CDK1 and CDK3 biding sites were conducted by molecular docking with 15 different CDK inhibitors previously identified to CDK2. This study allowed the understanding of the structure of the complexes between CDK1/ CDK3 with inhibitors. The knowledge of their structural features mainly the biding sites might be useful to discovery and rationalization of drug design process.
Collapse
Affiliation(s)
- Lucas A Saraiva
- Faculty of Pharmaceutical Science, Rua Gabriel Monteiro da Silva, 700 Centro - Alfenas/MG Postal Code: 37130000, Brazil. tel: +553133322556.
| | - Marcia P Veloso
- Faculty of Pharmaceutical Science, Rua Gabriel Monteiro da Silva, 700 Centro - Alfenas/MG Postal Code: 37130000, Brazil. tel: +553133322556
| | - I Camps
- Institute of Exacts Science, Rua Gabriel Monteiro da Silva, 700 Centro - Alfenas/MG Postal Code: 37130000, Brazil
| | - Nelson J F da Silveira
- Institute of Exacts Science, Rua Gabriel Monteiro da Silva, 700 Centro - Alfenas/MG Postal Code: 37130000, Brazil
| |
Collapse
|
12
|
Liu PF, Kihara D, Park C. Energetics-based discovery of protein-ligand interactions on a proteomic scale. J Mol Biol 2011; 408:147-62. [PMID: 21338610 DOI: 10.1016/j.jmb.2011.02.026] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2010] [Revised: 01/28/2011] [Accepted: 02/04/2011] [Indexed: 01/09/2023]
Abstract
Biochemical functions of proteins in cells frequently involve interactions with various ligands. Proteomic methods for the identification of proteins that interact with specific ligands such as metabolites, signaling molecules, and drugs are valuable in investigating the regulatory mechanisms of cellular metabolism, annotating proteins with unknown functions, and elucidating pharmacological mechanisms. Here we report an energetics-based target identification method in which target proteins in a cell lysate are identified by exploiting the effect of ligand binding on their stabilities. Urea-induced unfolding of proteins in cell lysates is probed by a short pulse of proteolysis, and the effect of a ligand on the amount of folded protein remaining is monitored on a proteomic scale. As proof of principle, we identified proteins that interact with ATP in the Escherichia coli proteome. Literature and database mining confirmed that a majority of the identified proteins are indeed ATP-binding proteins. Four identified proteins that were previously not known to interact with ATP were cloned and expressed to validate the result. Except for one protein, the effects of ATP on urea-induced unfolding were confirmed. Analyses of the protein sequences and structure models were also employed to predict potential ATP binding sites in the identified proteins. Our results demonstrate that this energetics-based target identification approach is a facile method to identify proteins that interact with specific ligands on a proteomic scale.
Collapse
Affiliation(s)
- Pei-Fen Liu
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, IN 47907, USA
| | | | | |
Collapse
|
13
|
Chen H, Kihara D. Effect of using suboptimal alignments in template-based protein structure prediction. Proteins 2011; 79:315-34. [PMID: 21058297 PMCID: PMC3058269 DOI: 10.1002/prot.22885] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Computational protein structure prediction remains a challenging task in protein bioinformatics. In the recent years, the importance of template-based structure prediction is increasing because of the growing number of protein structures solved by the structural genomics projects. To capitalize the significant efforts and investments paid on the structural genomics projects, it is urgent to establish effective ways to use the solved structures as templates by developing methods for exploiting remotely related proteins that cannot be simply identified by homology. In this work, we examine the effect of using suboptimal alignments in template-based protein structure prediction. We showed that suboptimal alignments are often more accurate than the optimal one, and such accurate suboptimal alignments can occur even at a very low rank of the alignment score. Suboptimal alignments contain a significant number of correct amino acid residue contacts. Moreover, suboptimal alignments can improve template-based models when used as input to Modeller. Finally, we use suboptimal alignments for handling a contact potential in a probabilistic way in a threading program, SUPRB. The probabilistic contacts strategy outperforms the partly thawed approach, which only uses the optimal alignment in defining residue contacts, and also the re-ranking strategy, which uses the contact potential in re-ranking alignments. The comparison with existing methods in the template-recognition test shows that SUPRB is very competitive and outperforms existing methods.
Collapse
Affiliation(s)
- Hao Chen
- Department of Biological Sciences College of Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Daisuke Kihara
- Department of Biological Sciences College of Science, Purdue University, West Lafayette, IN, 47907, USA
- Department of Computer Science College of Science, Purdue University, West Lafayette, IN, 47907, USA
- Markey Center for Structural Biology College of Science, Purdue University, West Lafayette, IN, 47907, USA
| |
Collapse
|