1
|
Harihar B, Saravanan KM, Gromiha MM, Selvaraj S. Importance of Inter-residue Contacts for Understanding Protein Folding and Unfolding Rates, Remote Homology, and Drug Design. Mol Biotechnol 2024:10.1007/s12033-024-01119-4. [PMID: 38498284 DOI: 10.1007/s12033-024-01119-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 02/10/2024] [Indexed: 03/20/2024]
Abstract
Inter-residue interactions in protein structures provide valuable insights into protein folding and stability. Understanding these interactions can be helpful in many crucial applications, including rational design of therapeutic small molecules and biologics, locating functional protein sites, and predicting protein-protein and protein-ligand interactions. The process of developing machine learning models incorporating inter-residue interactions has been improved recently. This review highlights the theoretical models incorporating inter-residue interactions in predicting folding and unfolding rates of proteins. Utilizing contact maps to depict inter-residue interactions aids researchers in developing computer models for detecting remote homologs and interface residues within protein-protein complexes which, in turn, enhances our knowledge of the relationship between sequence and structure of proteins. Further, the application of contact maps derived from inter-residue interactions is highlighted in the field of drug discovery. Overall, this review presents an extensive assessment of the significant models that use inter-residue interactions to investigate folding rates, unfolding rates, remote homology, and drug development, providing potential future advancements in constructing efficient computational models in structural biology.
Collapse
Affiliation(s)
- Balasubramanian Harihar
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
| | - Konda Mani Saravanan
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India
- Department of Biotechnology, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, 600073, India
| | - Michael M Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
| | - Samuel Selvaraj
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India.
| |
Collapse
|
2
|
PFDB: A standardized protein folding database with temperature correction. Sci Rep 2019; 9:1588. [PMID: 30733462 PMCID: PMC6367381 DOI: 10.1038/s41598-018-36992-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Accepted: 11/22/2018] [Indexed: 11/23/2022] Open
Abstract
We constructed a standardized protein folding kinetics database (PFDB) in which the logarithmic rate constants of all listed proteins are calculated at the standard temperature (25 °C). A temperature correction based on the Eyring–Kramers equation was introduced for proteins whose folding kinetics were originally measured at temperatures other than 25 °C. We verified the temperature correction by comparing the logarithmic rate constants predicted and experimentally observed at 25 °C for 14 different proteins, and the results demonstrated improvement of the quality of the database. PFDB consists of 141 (89 two-state and 52 non-two-state) single-domain globular proteins, which has the largest number among the currently available databases of protein folding kinetics. PFDB is thus intended to be used as a standard for developing and testing future predictive and theoretical studies of protein folding. PFDB can be accessed from the following link: http://lee.kias.re.kr/~bala/PFDB.
Collapse
|
3
|
Prediction of change in protein unfolding rates upon point mutations in two state proteins. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2016; 1864:1104-1109. [DOI: 10.1016/j.bbapap.2016.06.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2016] [Revised: 05/05/2016] [Accepted: 06/01/2016] [Indexed: 11/23/2022]
|
4
|
Roy S, Basu S, Dasgupta D, Bhattacharyya D, Banerjee R. The Unfolding MD Simulations of Cyclophilin: Analyzed by Surface Contact Networks and Their Associated Metrics. PLoS One 2015; 10:e0142173. [PMID: 26545107 PMCID: PMC4636149 DOI: 10.1371/journal.pone.0142173] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2014] [Accepted: 10/18/2015] [Indexed: 11/19/2022] Open
Abstract
Currently, considerable interest exists with regard to the dissociation of close packed aminoacids within proteins, in the course of unfolding, which could result in either wet or dry moltenglobules. The progressive disjuncture of residues constituting the hydrophobic core ofcyclophilin from L. donovani (LdCyp) has been studied during the thermal unfolding of the molecule, by molecular dynamics simulations. LdCyp has been represented as a surface contactnetwork (SCN) based on the surface complementarity (Sm) of interacting residues within themolecular interior. The application of Sm to side chain packing within proteins make it a very sensitive indicator of subtle perturbations in packing, in the thermal unfolding of the protein. Network based metrics have been defined to track the sequential changes in the disintegration ofthe SCN spanning the hydrophobic core of LdCyp and these metrics prove to be highly sensitive compared to traditional metrics in indicating the increased conformational (and dynamical) flexibility in the network. These metrics have been applied to suggest criteria distinguishing DMG, WMG and transition state ensembles and to identify key residues involved in crucial conformational/topological events during the unfolding process.
Collapse
Affiliation(s)
- Sourav Roy
- Saha Institute of Nuclear Physics, Sector 1, Block AF, Bidhannagar, Kolkata, 700064 India
| | - Sankar Basu
- Saha Institute of Nuclear Physics, Sector 1, Block AF, Bidhannagar, Kolkata, 700064 India
| | - Dipak Dasgupta
- Saha Institute of Nuclear Physics, Sector 1, Block AF, Bidhannagar, Kolkata, 700064 India
| | - Dhananjay Bhattacharyya
- Saha Institute of Nuclear Physics, Sector 1, Block AF, Bidhannagar, Kolkata, 700064 India
- * E-mail: (DB); (RB)
| | - Rahul Banerjee
- Saha Institute of Nuclear Physics, Sector 1, Block AF, Bidhannagar, Kolkata, 700064 India
- * E-mail: (DB); (RB)
| |
Collapse
|
5
|
Broom A, Gosavi S, Meiering EM. Protein unfolding rates correlate as strongly as folding rates with native structure. Protein Sci 2014; 24:580-7. [PMID: 25422093 DOI: 10.1002/pro.2606] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Revised: 11/03/2014] [Accepted: 11/04/2014] [Indexed: 01/19/2023]
Abstract
Although the folding rates of proteins have been studied extensively, both experimentally and theoretically, and many native state topological parameters have been proposed to correlate with or predict these rates, unfolding rates have received much less attention. Moreover, unfolding rates have generally been thought either to not relate to native topology in the same manner as folding rates, perhaps depending on different topological parameters, or to be more difficult to predict. Using a dataset of 108 proteins including two-state and multistate folders, we find that both unfolding and folding rates correlate strongly, and comparably well, with well-established measures of native topology, the absolute contact order and the long range order, with correlation coefficient values of 0.75 or higher. In addition, compared to folding rates, the absolute values of unfolding rates vary more strongly with native topology, have a larger range of values, and correlate better with thermodynamic stability. Similar trends are observed for subsets of different protein structural classes. Taken together, these results suggest that choosing a scaffold for protein engineering may require a compromise between a simple topology that will fold sufficiently quickly but also unfold quickly, and a complex topology that will unfold slowly and hence have kinetic stability, but fold slowly. These observations, together with the established role of kinetic stability in determining resistance to thermal and chemical denaturation as well as proteases, have important implications for understanding fundamental aspects of protein unfolding and folding and for protein engineering and design.
Collapse
Affiliation(s)
- Aron Broom
- Department of Chemistry, Guelph-Waterloo Centre for Graduate Studies in Chemistry and Biochemistry, University of Waterloo, Waterloo, Ontario, Canada, N2L 1W2
| | | | | |
Collapse
|
6
|
Wagaman AS, Coburn A, Brand-Thomas I, Dash B, Jaswal SS. A comprehensive database of verified experimental data on protein folding kinetics. Protein Sci 2014; 23:1808-12. [PMID: 25229122 DOI: 10.1002/pro.2551] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Accepted: 09/15/2014] [Indexed: 11/11/2022]
Abstract
Insights into protein folding rely increasingly on the synergy between experimental and theoretical approaches. Developing successful computational models requires access to experimental data of sufficient quantity and high quality. We compiled folding rate constants for what initially appeared to be 184 proteins from 15 published collections/web databases. To generate the highest confidence in the dataset, we verified the reported lnkf value and exact experimental construct and conditions from the original experimental report(s). The resulting comprehensive database of 126 verified entries, ACPro, will serve as a freely accessible resource (https://www.ats.amherst.edu/protein/) for the protein folding community to enable confident testing of predictive models. In addition, we provide a streamlined submission form for researchers to add new folding kinetics results, requiring specification of all the relevant experimental information according to the standards proposed in 2005 by the protein folding consortium organized by Plaxco. As the number and diversity of proteins whose folding kinetics are studied expands, our curated database will enable efficient and confident incorporation of new experimental results into a standardized collection. This database will support a more robust symbiosis between experiment and theory, leading ultimately to more rapid and accurate insights into protein folding, stability, and dynamics.
Collapse
Affiliation(s)
- Amy S Wagaman
- Department of Mathematics and Statistics, Amherst College, Amherst, Massachusetts
| | | | | | | | | |
Collapse
|
7
|
Yan W, Zhou J, Sun M, Chen J, Hu G, Shen B. The construction of an amino acid network for understanding protein structure and function. Amino Acids 2014; 46:1419-39. [PMID: 24623120 DOI: 10.1007/s00726-014-1710-6] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Accepted: 02/21/2014] [Indexed: 01/08/2023]
Abstract
Amino acid networks (AANs) are undirected networks consisting of amino acid residues and their interactions in three-dimensional protein structures. The analysis of AANs provides novel insight into protein science, and several common amino acid network properties have revealed diverse classes of proteins. In this review, we first summarize methods for the construction and characterization of AANs. We then compare software tools for the construction and analysis of AANs. Finally, we review the application of AANs for understanding protein structure and function, including the identification of functional residues, the prediction of protein folding, analyzing protein stability and protein-protein interactions, and for understanding communication within and between proteins.
Collapse
Affiliation(s)
- Wenying Yan
- Center for Systems Biology, Soochow University, Suzhou, 215006, Jiangsu, China
| | | | | | | | | | | |
Collapse
|
8
|
Wagaman A. Graph construction and random graph generation for modeling protein structures. Stat Anal Data Min 2013. [DOI: 10.1002/sam.11203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
9
|
Buck PM, Kumar S, Wang X, Agrawal NJ, Trout BL, Singh SK. Computational methods to predict therapeutic protein aggregation. Methods Mol Biol 2012; 899:425-451. [PMID: 22735968 DOI: 10.1007/978-1-61779-921-1_26] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Protein based biotherapeutics have emerged as a successful class of pharmaceuticals. However, these macromolecules endure a variety of physicochemical degradations during manufacturing, shipping, and storage, which may adversely impact the drug product quality. Of these degradations, the irreversible self-association of therapeutic proteins to form aggregates is a major challenge in the formulation of these molecules. Tools to predict and mitigate protein aggregation are, therefore, of great interest to biopharmaceutical research and development. In this chapter, a number of such computational tools developed to understand and predict the various steps involved in protein aggregation are described. These tools can be grouped into three general classes: unfolding kinetics and native state thermal stability, colloidal stability, and sequence/structure based aggregation liabilities. Chapter sections introduce each class by discussing how these predictive tools provide insight into the molecular events leading to protein aggregation. The computational methods are then explained in detail along with their advantages and limitations.
Collapse
Affiliation(s)
- Patrick M Buck
- Biotherapeutics Pharmaceutical Research and Development, Pfizer, Inc, St. Louis, MO, USA
| | | | | | | | | | | |
Collapse
|
10
|
Harihar B, Selvaraj S. Application of long-range order to predict unfolding rates of two-state proteins. Proteins 2010; 79:880-7. [DOI: 10.1002/prot.22925] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2010] [Revised: 10/07/2010] [Accepted: 10/24/2010] [Indexed: 01/09/2023]
|
11
|
Topological Quantities Determining the Folding/Unfolding Rate of Two-state Folding Proteins. J SOLUTION CHEM 2010. [DOI: 10.1007/s10953-010-9556-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
12
|
Li J, Wang J, Wang W. Identifying folding nucleus based on residue contact networks of proteins. Proteins 2008; 71:1899-907. [DOI: 10.1002/prot.21891] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
13
|
Bagler G, Sinha S. Assortative mixing in Protein Contact Networks and protein folding kinetics. Bioinformatics 2007; 23:1760-7. [PMID: 17519248 DOI: 10.1093/bioinformatics/btm257] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Starting from linear chains of amino acids, the spontaneous folding of proteins into their elaborate 3D structures is one of the remarkable examples of biological self-organization. We investigated native state structures of 30 single-domain, two-state proteins, from complex networks perspective, to understand the role of topological parameters in proteins' folding kinetics, at two length scales--as 'Protein Contact Networks (PCNs)' and their corresponding 'Long-range Interaction Networks (LINs)' constructed by ignoring the short-range interactions. RESULTS Our results show that, both PCNs and LINs exhibit the exceptional topological property of 'assortative mixing' that is absent in all other biological and technological networks studied so far. We show that the degree distribution of these contact networks is partly responsible for the observed assortativity. The coefficient of assortativity also shows a positive correlation with the rate of protein folding at both short- and long-contact scale, whereas, the clustering coefficients of only the LINs exhibit a negative correlation. The results indicate that the general topological parameters of these naturally evolved protein networks can effectively represent the structural and functional properties required for fast information transfer among the residues facilitating biochemical/kinetic functions, such as, allostery, stability and the rate of folding. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ganesh Bagler
- Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad 500007, India
| | | |
Collapse
|
14
|
Xu G, Narayan M, Kurinov I, Ripoll DR, Welker E, Khalili M, Ealick SE, Scheraga HA. A localized specific interaction alters the unfolding pathways of structural homologues. J Am Chem Soc 2006; 128:1204-13. [PMID: 16433537 PMCID: PMC2529162 DOI: 10.1021/ja055313e] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Reductive unfolding studies of proteins are designed to provide information about intramolecular interactions that govern the formation (and stabilization) of the native state and about folding/unfolding pathways. By mutating Tyr92 to G, A, or L in the model protein, bovine pancreatic ribonuclease A, and through analysis of temperature factors and molecular dynamics simulations of the crystal structures of these mutants, it is demonstrated that the markedly different reductive unfolding rates and pathways of ribonuclease A and its structural homologue onconase can be attributed to a single, localized, ring-stacking interaction between Tyr92 and Pro93 in the bovine variant. The fortuitous location of this specific stabilizing interaction in a disulfide-bond-containing loop region of ribonuclease A results in the localized modulation of protein dynamics that, in turn, enhances the susceptibility of the disulfide bond to reduction leading to an alteration in the reductive unfolding behavior of the homologues. These results have important implications for folding studies involving topological determinants to obtain folding/unfolding rates and pathways, for protein structure-function prediction through fold recognition, and for predicting proteolytic cleavage sites.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Harold A. Scheraga
- *To whom correspondence should be addressed: Tel: 607 255-4034; Fax: 607 254-4700; E-mail:
| |
Collapse
|