Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Basu S, Bhattacharyya D, Banerjee R. Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs. BMC Bioinformatics 2011;12:195. [PMID: 21605466 DOI: 10.1186/1471-2105-12-195] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Received: 10/20/2010] [Accepted: 05/24/2011] [Indexed: 11/24/2022] Open

Cited by Other Article(s)

Biswas G, Mukherjee D, Dutta N, Ghosh P, Basu S. EnCPdock: a web-interface for direct conjoint comparative analyses of complementarity and binding energetics in inter-protein associations. J Mol Model 2023;29:239. [PMID: 37423912 DOI: 10.1007/s00894-023-05626-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/23/2023] [Accepted: 06/20/2023] [Indexed: 07/11/2023]

Abstract

CONTEXT

Protein-protein interaction (PPI) is a key component linked to virtually all cellular processes. Be it an enzyme catalysis ('classic type functions' of proteins) or a signal transduction ('non-classic'), proteins generally function involving stable or quasi-stable multi-protein associations. The physical basis for such associations is inherent in the combined effect of shape and electrostatic complementarities (Sc, EC) of the interacting protein partners at their interface, which provides indirect probabilistic estimates of the stability and affinity of the interaction. While Sc is a necessary criterion for inter-protein associations, EC can be favorable as well as disfavored (e.g., in transient interactions). Estimating equilibrium thermodynamic parameters (∆G_binding, K_d) by experimental means is costly and time consuming, thereby opening windows for computational structural interventions. Attempts to empirically probe ∆G_binding from coarse-grain structural descriptors (primarily, surface area based terms) have lately been overtaken by physics-based, knowledge-based and their hybrid approaches (MM/PBSA, FoldX, etc.) that directly compute ∆G_binding without involving intermediate structural descriptors.

METHODS

Here, we present EnCPdock ( https://www.scinetmol.in/EnCPdock/ ), a user-friendly web-interface for the direct conjoint comparative analyses of complementarity and binding energetics in proteins. EnCPdock returns an AI-predicted ∆G_binding computed by combining complementarity (Sc, EC) and other high-level structural descriptors (input feature vectors), and renders a prediction accuracy comparable to the state-of-the-art. EnCPdock further locates a PPI complex in terms of its {Sc, EC} values (taken as an ordered pair) in the two-dimensional complementarity plot (CP). In addition, it also generates mobile molecular graphics of the interfacial atomic contact network for further analyses. EnCPdock also furnishes individual feature trends along with the relative probability estimates (Pr_fmax) of the obtained feature-scores with respect to the events of their highest observed frequencies. Together, these functionalities are of real practical use for structural tinkering and intervention as might be relevant in the design of targeted protein-interfaces. Combining all its features and applications, EnCPdock presents a unique online tool that should be beneficial to structural biologists and researchers across related fraternities.

Basu S, Chakravarty D, Hou Q, Uversky VN. Editorial: From the hydrophobic core to the globular-disorder interface: New challenges and insights into protein design. Front Mol Biosci 2023;10:1151676. [PMID: 36814642 PMCID: PMC9939879 DOI: 10.3389/fmolb.2023.1151676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/26/2023] [Accepted: 01/30/2023] [Indexed: 02/09/2023] Open

Jung Y, Geng C, Bonvin AMJJ, Xue LC, Honavar VG. MetaScore: A Novel Machine-Learning-Based Approach to Improve Traditional Scoring Functions for Scoring Protein-Protein Docking Conformations. Biomolecules 2023;13:121. [PMID: 36671507 PMCID: PMC9855734 DOI: 10.3390/biom13010121] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 12/01/2022] [Revised: 12/22/2022] [Accepted: 12/26/2022] [Indexed: 01/11/2023] Open

Abstract

Protein-protein interactions play a ubiquitous role in biological function. Knowledge of the three-dimensional (3D) structures of the complexes they form is essential for understanding the structural basis of those interactions and how they orchestrate key cellular processes. Computational docking has become an indispensable alternative to the expensive and time-consuming experimental approaches for determining the 3D structures of protein complexes. Despite recent progress, identifying near-native models from a large set of conformations sampled by docking (the so-called scoring problem) still has considerable room for improvement. We present MetaScore, a new machine-learning-based approach to improve the scoring of docked conformations. MetaScore utilizes a random forest (RF) classifier trained to distinguish near-native from non-native conformations using their protein-protein interfacial features. The features include physicochemical properties, energy terms, interaction-propensity-based features, geometric properties, interface topology features, evolutionary conservation, and also scores produced by traditional scoring functions (SFs). MetaScore scores docked conformations by simply averaging the score produced by the RF classifier with that produced by any traditional SF. We demonstrate that (i) MetaScore consistently outperforms each of the nine traditional SFs included in this work in terms of success rate and hit rate evaluated over conformations ranked among the top 10; (ii) an ensemble method, MetaScore-Ensemble, that combines 10 variants of MetaScore obtained by combining the RF score with each of the traditional SFs outperforms each of the MetaScore variants. We conclude that the performance of traditional SFs can be improved upon by using machine learning to judiciously leverage protein-protein interfacial features and by using ensemble methods to combine multiple scoring functions.
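
The averaging step described in this abstract can be sketched in a few lines of Python. This is only an illustration, not the authors' implementation: the function names, the min-max rescaling of the traditional score, and the assumption that lower traditional scores are better are hypothetical choices made here so that the two scores sit on a comparable scale before they are averaged.

```python
def min_max(values):
    """Rescale a list of scores to the range [0, 1] (assumed normalisation)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.5 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def meta_scores(rf_probs, sf_scores, lower_is_better=True):
    """Average an RF near-native probability with a rescaled traditional score."""
    scaled = min_max(sf_scores)
    if lower_is_better:
        # Flip energy-like scores so that a higher value always means a better model.
        scaled = [1.0 - s for s in scaled]
    return [(p + s) / 2.0 for p, s in zip(rf_probs, scaled)]


def rank_top_k(scores, k=10):
    """Return the indices of the k best-scoring conformations, best first."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:k]


if __name__ == "__main__":
    rf = [0.9, 0.2, 0.6]        # hypothetical RF probabilities for three models
    sf = [-10.0, -2.0, -6.0]    # hypothetical energy-like traditional scores
    combined = meta_scores(rf, sf)
    print(combined, rank_top_k(combined, k=2))
```

Under the same assumptions, an ensemble in the spirit of MetaScore-Ensemble would repeat `meta_scores` once per traditional scoring function and combine the resulting lists; the abstract does not say how that combination is done.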

Biswas G, Ghosh S, Basu S, Bhattacharyya D, Datta AK, Banerjee R. Can the jigsaw puzzle model of protein folding re-assemble a hydrophobic core? Proteins 2022;90:1390-1412. [DOI: 10.1002/prot.26321] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 08/30/2021] [Revised: 01/11/2022] [Accepted: 01/28/2022] [Indexed: 12/30/2022]

Capturing a Crucial ‘Disorder-to-Order Transition’ at the Heart of the Coronavirus Molecular Pathology—Triggered by Highly Persistent, Interchangeable Salt-Bridges. Vaccines (Basel) 2022;10:vaccines10020301. [PMID: 35214759 PMCID: PMC8875383 DOI: 10.3390/vaccines10020301] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 12/29/2021] [Revised: 01/27/2022] [Accepted: 02/05/2022] [Indexed: 02/05/2023] Open

Abstract

The COVID-19 origin debate has greatly been influenced by genome comparison studies of late, revealing the emergence of the Furin-like cleavage site at the S1/S2 junction of the SARS-CoV-2 Spike (FLCS_Spike) containing its 681PRRAR685 motif, absent in other related respiratory viruses. Being the rate-limiting (i.e., the slowest) step, the host Furin cleavage is instrumental in the abrupt increase in transmissibility in COVID-19, compared to earlier onsets of respiratory viral diseases. In such a context, the current paper entraps a ‘disorder-to-order transition’ of the FLCS_Spike (concomitant to an entropy arrest) upon binding to Furin. The interaction clearly seems to be optimized for a more efficient proteolytic cleavage in SARS-CoV-2. The study further shows the formation of dynamically interchangeable and persistent networks of salt-bridges at the Spike–Furin interface in SARS-CoV-2 involving the three arginines (R682, R683, R685) of the FLCS_Spike with several anionic residues (E230, E236, D259, D264, D306) coming from Furin, strategically distributed around its catalytic triad. Multiplicity and structural degeneracy of plausible salt-bridge network archetypes seem to be the other key characteristic features of the Spike–Furin binding in SARS-CoV-2, allowing the system to breathe, a trademark of protein disorder transitions. Interestingly, with respect to the homologous interaction in SARS-CoV (2002/2003) taken as a baseline, the Spike–Furin binding events, generally, in the coronavirus lineage, seem to have a preference for ionic bond formation, even with a lesser number of cationic residues at their potentially polybasic FLCS_Spike patches. The interaction energies are suggestive of characteristic metastabilities attributed to Spike–Furin interactions, generally to the coronavirus lineage, which appear to be favorable for proteolytic cleavages targeted at flexible protein loops. The current findings not only offer novel mechanistic insights into the coronavirus molecular pathology and evolution, but also add substantially to the existing theories of proteolytic cleavages.

Liu M, Liu Y, Qian W, Wang Y. DeepSeed Local Graph Matching for Densely Packed Cells Tracking. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2021;18:1060-1069. [PMID: 31443049 DOI: 10.1109/tcbb.2019.2936851] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 06/10/2023]

Solihah B, Azhari A, Musdholifah A. Enhancement of conformational B-cell epitope prediction using CluSMOTE. PeerJ Comput Sci 2020;6:e275. [PMID: 33816926 PMCID: PMC7924438 DOI: 10.7717/peerj-cs.275] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 05/12/2019] [Accepted: 04/15/2020] [Indexed: 06/12/2023]

Abstract

BACKGROUND

A conformational B-cell epitope is one of the main components of vaccine design. It contains separate segments in its sequence, which are spatially close in the antigen chain. The availability of Ag-Ab complex data on the Protein Data Bank allows for the development of predictive methods. Several epitope prediction models have also been developed, including learning-based methods. However, the performance of these models is still not optimal. The main problem in learning-based prediction models is class imbalance.

METHODS

This study proposes CluSMOTE, which is a combination of a cluster-based undersampling method and the Synthetic Minority Oversampling Technique. The approach is used to generate additional sample data to ensure that the dataset of the conformational epitope is balanced. The Hierarchical DBSCAN algorithm is performed to identify the clusters in the majority class. Some randomly selected data are taken from each cluster, considering the oversampling degree, and combined with the minority class data. The balanced data are utilized as the training dataset to develop a conformational epitope prediction model. Furthermore, two binary classification methods, Support Vector Machine and Decision Tree, are separately used to develop prediction models and to evaluate the performance of CluSMOTE in predicting conformational B-cell epitopes. The experiment is focused on determining the best parameters for optimal CluSMOTE. Two independent datasets are used to compare the proposed prediction model with state-of-the-art methods. The first and the second datasets represent the general protein and the glycoprotein antigens, respectively.
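
The two resampling ideas named above, cluster-based undersampling of the majority class and SMOTE-style oversampling of the minority class, can be illustrated with a small standard-library sketch. It is not the CluSMOTE code: the function names, the `fraction` and `k` parameters, and the order of the steps are assumptions, and the cluster labels are taken as given rather than computed with Hierarchical DBSCAN.

```python
import random


def undersample_by_cluster(majority, cluster_ids, fraction, rng):
    """Keep a random fraction of the majority-class points from every cluster."""
    clusters = {}
    for point, cid in zip(majority, cluster_ids):
        clusters.setdefault(cid, []).append(point)
    kept = []
    for cid in sorted(clusters):
        members = clusters[cid]
        n_keep = min(len(members), max(1, round(len(members) * fraction)))
        kept.extend(rng.sample(members, n_keep))
    return kept


def smote(minority, n_new, k, rng):
    """Create n_new synthetic minority points by interpolating towards a near neighbour."""
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest other minority points (squared Euclidean distance).
        others = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: sum((a - b) ** 2 for a, b in zip(minority[j], base)),
        )[:k]
        mate = minority[rng.choice(others)]
        u = rng.random()
        synthetic.append(tuple(a + u * (b - a) for a, b in zip(base, mate)))
    return synthetic


if __name__ == "__main__":
    rng = random.Random(0)
    majority = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0), (5.1, 5.0)]
    labels = [0, 0, 0, 0, 1, 1]   # hypothetical cluster labels for the majority class
    minority = [(0.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
    kept = undersample_by_cluster(majority, labels, 0.5, rng)
    balanced_minority = minority + smote(minority, len(kept) - len(minority), 2, rng)
    print(len(kept), len(balanced_minority))
```

Sampling within each cluster, rather than from the majority class as a whole, keeps every region of the majority distribution represented after undersampling.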

RESULT

The experimental result shows that CluSMOTE Decision Tree outperformed the Support Vector Machine in terms of AUC and Gmean as performance measurements. The mean AUC of CluSMOTE Decision Tree in the Kringelum and the SEPPA 3 test sets are 0.83 and 0.766, respectively. This shows that CluSMOTE Decision Tree is better than other methods in the general protein antigen, though comparable with SEPPA 3 in the glycoprotein antigen.
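
For reference, the G-mean used here as a performance measure is the geometric mean of sensitivity and specificity, which penalises a classifier that does well on only one class. A minimal sketch follows; the function name and the 0/1 label encoding are assumptions.

```python
import math


def g_mean(y_true, y_pred):
    """Geometric mean of sensitivity and specificity for 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(sensitivity * specificity)


if __name__ == "__main__":
    # Two epitope residues (1) and four non-epitope residues (0), hypothetical labels.
    print(g_mean([1, 1, 0, 0, 0, 0], [1, 0, 0, 0, 0, 1]))
```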

Bandyopadhyay A, Dhar AK, Basu S. Graph coloring: a novel heuristic based on trailing path—properties, perspective and applications in structured networks. Soft Comput 2020. [DOI: 10.1007/s00500-019-04278-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Indexed: 11/30/2022]

Peng Y, Alexov E, Basu S. Structural Perspective on Revealing and Altering Molecular Functions of Genetic Variants Linked with Diseases. Int J Mol Sci 2019;20:ijms20030548. [PMID: 30696058 PMCID: PMC6386852 DOI: 10.3390/ijms20030548] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 12/22/2018] [Revised: 01/25/2019] [Accepted: 01/26/2019] [Indexed: 12/25/2022] Open

Inner-View of Nanomaterial Incited Protein Conformational Changes: Insights into Designable Interaction. Research 2018;2018:9712832. [PMID: 31549040 PMCID: PMC6750102 DOI: 10.1155/2018/9712832] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Received: 06/16/2018] [Accepted: 08/16/2018] [Indexed: 12/19/2022]

Basu S, Biswas P. Salt-bridge dynamics in intrinsically disordered proteins: A trade-off between electrostatic interactions and structural flexibility. Biochimica et Biophysica Acta - Proteins and Proteomics 2018;1866:624-641. [DOI: 10.1016/j.bbapap.2018.03.002] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Received: 12/19/2017] [Revised: 02/13/2018] [Accepted: 03/07/2018] [Indexed: 12/29/2022]

Borges NM, Sartori GR, Ribeiro JFR, Rocha JR, Martins JBL, Montanari CA, Gargano R. Similarity search combined with docking and molecular dynamics for novel hAChE inhibitor scaffolds. J Mol Model 2018;24:41. [PMID: 29332299 DOI: 10.1007/s00894-017-3548-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Received: 06/21/2017] [Accepted: 11/27/2017] [Indexed: 12/31/2022]

Basu S. CP_dock: the complementarity plot for docking of proteins: implementing multi-dielectric continuum electrostatics. J Mol Model 2017;24:8. [PMID: 29218430 DOI: 10.1007/s00894-017-3546-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Received: 08/19/2017] [Accepted: 11/24/2017] [Indexed: 01/18/2023]

Abstract

The complementarity plot (CP) is an established validation tool for protein structures, applicable to both globular proteins (folding) as well as protein-protein complexes (binding). It computes the shape and electrostatic complementarities (S_m, E_m) for amino acid side-chains buried within the protein interior or interface and plots them in a two-dimensional plot having knowledge-based probabilistic quality estimates for the residues as well as for the whole structure. The current report essentially presents an upgraded version of the plot with the implementation of the advanced multi-dielectric functionality (as in Delphi version 6.2 or higher) in the computation of electrostatic complementarity to make the validation tool physico-chemically more realistic. The two methods (single- and multi-dielectric) agree decently in their resultant E_m values, and hence, provisions for both methods have been kept in the software suite. So to speak, the global electrostatic balance within a well-folded protein and/or a well-packed interface seems only marginally perturbed by the choice of different internal dielectric values. However, both from theoretical as well as practical grounds, the more advanced multi-dielectric version of the plot is certainly recommended for potentially producing more reliable results. The report also presents a new methodology and a variant plot, namely CP_dock, based on the same principles of complementarity specifically designed to be used in the docking of proteins. The efficacy of the method to discriminate between good and bad docked protein complexes has been tested on a recent state-of-the-art docking benchmark. The results unambiguously indicate that CP_dock can indeed be effective in the initial screening phase of a docking scoring pipeline before going into more sophisticated and computationally expensive scoring functions. CP_dock has been made available at https://github.com/nemo8130/CPdock . 
Graphical Abstract: An example showing the efficacy of CP_dock to be used in the initial screening phase of a protein-protein docking scoring pipeline.

Basu S, Wallner B. Finding correct protein-protein docking models using ProQDock. Bioinformatics 2017;32:i262-i270. [PMID: 27307625 PMCID: PMC4908341 DOI: 10.1093/bioinformatics/btw257] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Indexed: 01/20/2023] Open

Abstract

Motivation: Protein–protein interactions are key to virtually all biological processes. For a detailed understanding of the biological processes, the structure of the protein complex is essential. Given the current experimental techniques for structure determination, the vast majority of all protein complexes will never be solved by experimental techniques. In the absence of experimental data, computational docking methods can be used to predict the structure of the protein complex. A common strategy is to generate many alternative docking solutions (atomic models) and then use a scoring function to select the best. The success of the computational docking technique is, to a large degree, dependent on the ability of the scoring function to accurately rank and score the many alternative docking models.

Results: Here, we present ProQDock, a scoring function that predicts the absolute quality of docking model measured by a novel protein docking quality score (DockQ). ProQDock uses support vector machines trained to predict the quality of protein docking models using features that can be calculated from the docking model itself. By combining different types of features describing both the protein–protein interface and the overall physical chemistry, it was possible to improve the correlation with DockQ from 0.25 for the best individual feature (electrostatic complementarity) to 0.49 for the final version of ProQDock. ProQDock performed better than the state-of-the-art methods ZRANK and ZRANK2 in terms of correlations, ranking and finding correct models on an independent test set. Finally, we also demonstrate that it is possible to combine ProQDock with ZRANK and ZRANK2 to improve performance even further.

Availability and implementation: http://bioinfo.ifm.liu.se/ProQDock

Contact: bjornw@ifm.liu.se

Supplementary information: Supplementary data are available at Bioinformatics online.

Salt-bridge networks within globular and disordered proteins: characterizing trends for designable interactions. J Mol Model 2017. [PMID: 28626846 DOI: 10.1007/s00894-017-3376-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Indexed: 10/19/2022]

Abstract

There has been considerable debate about the contribution of salt bridges to the stabilization of protein folds, in spite of their participation in crucial protein functions. Salt bridges appear to contribute to the activity-stability trade-off within proteins by bringing high-entropy charged amino acids into close contacts during the course of their functions. The current study analyzes the modes of association of salt bridges (in terms of networks) within globular proteins and at protein-protein interfaces. While the most common and trivial type of salt bridge is the isolated salt bridge, bifurcated salt bridge appears to be a distinct salt-bridge motif having a special topology and geometry. Bifurcated salt bridges are found ubiquitously in proteins and interprotein complexes. Interesting and attractive examples presenting different modes of interaction are highlighted. Bifurcated salt bridges appear to function as molecular clips that are used to stitch together large surface contours at interacting protein interfaces. The present work also emphasizes the key role of salt-bridge-mediated interactions in the partial folding of proteins containing long stretches of disordered regions. Salt-bridge-mediated interactions seem to be pivotal to the promotion of "disorder-to-order" transitions in small disordered protein fragments and their stabilization upon binding. The results obtained in this work should help to guide efforts to elucidate the modus operandi of these partially disordered proteins, and to conceptualize how these proteins manage to maintain the required amount of disorder even in their bound forms. This work could also potentially facilitate explorations of geometrically specific designable salt bridges through the characterization of composite salt-bridge networks.

O'Rourke KF, Gorman SD, Boehr DD. Biophysical and computational methods to analyze amino acid interaction networks in proteins. Comput Struct Biotechnol J 2016;14:245-51. [PMID: 27441044 PMCID: PMC4939391 DOI: 10.1016/j.csbj.2016.06.002] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Received: 03/30/2016] [Revised: 06/04/2016] [Accepted: 06/13/2016] [Indexed: 12/20/2022] Open

Roy S, Basu S, Dasgupta D, Bhattacharyya D, Banerjee R. The Unfolding MD Simulations of Cyclophilin: Analyzed by Surface Contact Networks and Their Associated Metrics. PLoS One 2015;10:e0142173. [PMID: 26545107 PMCID: PMC4636149 DOI: 10.1371/journal.pone.0142173] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 12/10/2014] [Accepted: 10/18/2015] [Indexed: 11/19/2022] Open

Joo H, Chavan AG, Fraga KJ, Tsai J. An amino acid code for irregular and mixed protein packing. Proteins 2015;83:2147-61. [PMID: 26370334 DOI: 10.1002/prot.24929] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 05/04/2015] [Revised: 09/01/2015] [Accepted: 09/02/2015] [Indexed: 11/10/2022]

Guilloux A, Caudron B, Jestin JL. A method to predict edge strands in beta-sheets from protein sequences. Comput Struct Biotechnol J 2013;7:e201305001. [PMID: 24688737 PMCID: PMC3962219 DOI: 10.5936/csbj.201305001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 02/22/2013] [Revised: 05/27/2013] [Accepted: 05/30/2013] [Indexed: 12/15/2022] Open

Basu S, Bhattacharyya D, Banerjee R. Self-complementarity within proteins: bridging the gap between binding and folding. Biophys J 2012;102:2605-14. [PMID: 22713576 DOI: 10.1016/j.bpj.2012.04.029] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Received: 12/08/2011] [Revised: 03/30/2012] [Accepted: 04/17/2012] [Indexed: 01/09/2023] Open

Abstract

Complementarity, in terms of both shape and electrostatic potential, has been quantitatively estimated at protein-protein interfaces and used extensively to predict the specific geometry of association between interacting proteins. In this work, we attempted to place both binding and folding on a common conceptual platform based on complementarity. To that end, we estimated (for the first time to our knowledge) electrostatic complementarity (Em) for residues buried within proteins. Em measures the correlation of surface electrostatic potential at protein interiors. The results show fairly uniform and significant values for all amino acids. Interestingly, hydrophobic side chains also attain appreciable complementarity primarily due to the trajectory of the main chain. Previous work from our laboratory characterized the surface (or shape) complementarity (Sm) of interior residues, and both of these measures have now been combined to derive two scoring functions to identify the native fold amid a set of decoys. These scoring functions are somewhat similar to functions that discriminate among multiple solutions in a protein-protein docking exercise. The performances of both of these functions on state-of-the-art databases were comparable if not better than most currently available scoring functions. Thus, analogously to interfacial residues of protein chains associated (docked) with specific geometry, amino acids found in the native interior have to satisfy fairly stringent constraints in terms of both Sm and Em. The functions were also found to be useful for correctly identifying the same fold for two sequences with low sequence identity. Finally, inspired by the Ramachandran plot, we developed a plot of Sm versus Em (referred to as the complementarity plot) that identifies residues with suboptimal packing and electrostatics which appear to be correlated to coordinate errors.
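
The abstract describes Em only as a correlation of surface electrostatic potentials. A minimal sketch of such a measure follows, under assumptions that go beyond the abstract: two potentials are sampled at the same surface points of a buried residue (one from the residue's own atoms, one from the rest of the protein), and the Pearson correlation is sign-flipped so that opposite potentials score positively, as is usual for electrostatic complementarity at interfaces. The function names and input values are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def electrostatic_complementarity(phi_own, phi_rest):
    """Negative correlation of two potentials sampled on the same surface points."""
    # Assumed sign convention: anti-correlated potentials give a positive score.
    return -pearson(phi_own, phi_rest)


if __name__ == "__main__":
    phi_own = [1.0, 2.0, 3.0, 4.0]        # hypothetical potentials at four surface points
    phi_rest = [-2.0, -4.0, -6.0, -8.0]   # perfectly anti-correlated counterpart
    print(electrostatic_complementarity(phi_own, phi_rest))
```

A residue's point in the complementarity plot would then pair a value of this kind with its shape complementarity Sm, whose calculation is not described in this abstract.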