1
|
Fu M. Evolutionary analysis of major histocompatibility complex variants in chytrid-resistant and susceptible amphibians. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2024; 118:105544. [PMID: 38216106 DOI: 10.1016/j.meegid.2023.105544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 10/09/2023] [Accepted: 12/17/2023] [Indexed: 01/14/2024]
Abstract
An amphibian emerging infectious disease (EID), chytridiomycosis, caused by Batrachochytrium dendrobatidis (Bd), originated in Asia but primarily led to declines and extinctions in amphibian populations outside of Asia. Host major histocompatibility complex (MHC) molecules exhibit high polymorphism, and the evolution of MHC can be influenced by recombination and pathogens. Previous studies have indicated that host MHC class II is associated with Bd resistance. In this study, I conducted recombination and selection tests on functional MHC IIß1 alleles from an Asian Bd-resistant anuran species (Bufo gargarizans) and an Australasian Bd-susceptible species (Litoria caerulea). Recombination at the same site was identified in both species, supporting the hypothesis that recombination contributes to MHC IIß1 diversity in amphibians. Positive selection was observed in MHC IIß1 alleles in both species. In L. caerulea, at least four amino acid sites were identified under significant positive selection in the MHC IIß1, whereas these sites were either negatively selected or conserved in B. gargarizans. This suggests these sites might be selected for Bd resistance. Hydrophobicity was detected in certain amino acid sites relating to Bd resistance, suggesting this physicochemical property may be a factor selected to counteract Bd infection. These findings of this study provide an evolutionary basis for understanding how amphibian MHC IIß1 may undergo selection in response to chytrid infection.
Collapse
Affiliation(s)
- Minjie Fu
- School of Biological Sciences, College of Natural Sciences, Seoul National University, Seoul 08826, Republic of Korea; Research Institute of Basic Sciences, Seoul National University, Seoul 08826, Republic of Korea.
| |
Collapse
|
2
|
Yan Z, Wang J. Evolution shapes interaction patterns for epistasis and specific protein binding in a two-component signaling system. Commun Chem 2024; 7:13. [PMID: 38233668 PMCID: PMC10794238 DOI: 10.1038/s42004-024-01098-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 01/05/2024] [Indexed: 01/19/2024] Open
Abstract
The elegant design of protein sequence/structure/function relationships arises from the interaction patterns between amino acid positions. A central question is how evolutionary forces shape the interaction patterns that encode long-range epistasis and binding specificity. Here, we combined family-wide evolutionary analysis of natural homologous sequences and structure-oriented evolution simulation for two-component signaling (TCS) system. The magnitude-frequency relationship of coupling conservation between positions manifests a power-law-like distribution and the positions with highly coupling conservation are sparse but distributed intensely on the binding surfaces and hydrophobic core. The structure-specific interaction pattern involves further optimization of local frustrations at or near the binding surface to adapt the binding partner. The construction of family-wide conserved interaction patterns and structure-specific ones demonstrates that binding specificity is modulated by both direct intermolecular interactions and long-range epistasis across the binding complex. Evolution sculpts the interaction patterns via sequence variations at both family-wide and structure-specific levels for TCS system.
Collapse
Affiliation(s)
- Zhiqiang Yan
- Center for Theoretical Interdisciplinary Sciences, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, 325001, PR China
| | - Jin Wang
- Department of Chemistry and Physics, State University of New York at Stony Brook, Stony Brook, NY, 11790, USA.
| |
Collapse
|
3
|
Yang YX, Huang JY, Wang P, Zhu BT. AREA-AFFINITY: A Web Server for Machine Learning-Based Prediction of Protein-Protein and Antibody-Protein Antigen Binding Affinities. J Chem Inf Model 2023. [PMID: 37235532 DOI: 10.1021/acs.jcim.2c01499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Protein-Protein binding affinity reflects the binding strength between the binding partners. The prediction of protein-protein binding affinity is important for elucidating protein functions and also for designing protein-based therapeutics. The geometric characteristics such as area (both interface and surface areas) in the structure of a protein-protein complex play an important role in determining protein-protein interactions and their binding affinity. Here, we present a free web server for academic use, AREA-AFFINITY, for prediction of protein-protein or antibody-protein antigen binding affinity based on interface and surface areas in the structure of a protein-protein complex. AREA-AFFINITY implements 60 effective area-based protein-protein affinity predictive models and 37 effective area-based models specific for antibody-protein antigen binding affinity prediction developed in our recent studies. These models take into consideration the roles of interface and surface areas in binding affinity by using areas classified according to different amino acid types with different biophysical nature. The models with the best performances integrate machine learning methods such as neural network or random forest. These newly developed models have superior or comparable performance compared to the commonly used existing methods. AREA-AFFINITY is available for free at: https://affinity.cuhk.edu.cn/.
Collapse
Affiliation(s)
- Yong Xiao Yang
- Shenzhen Key Laboratory of Steroid Drug Discovery and Development, School of Medicine, The Chinese University of Hong Kong, Shenzhen, Guangdong 518172, China
| | - Jin Yan Huang
- Shenzhen Key Laboratory of Steroid Drug Discovery and Development, School of Medicine, The Chinese University of Hong Kong, Shenzhen, Guangdong 518172, China
| | - Pan Wang
- Shenzhen Key Laboratory of Steroid Drug Discovery and Development, School of Medicine, The Chinese University of Hong Kong, Shenzhen, Guangdong 518172, China
| | - Bao Ting Zhu
- Shenzhen Key Laboratory of Steroid Drug Discovery and Development, School of Medicine, The Chinese University of Hong Kong, Shenzhen, Guangdong 518172, China
- Shenzhen Bay Laboratory, Shenzhen, 518055, China
| |
Collapse
|
4
|
Perspectives on the landscape and flux theory for describing emergent behaviors of the biological systems. J Biol Phys 2022; 48:1-36. [PMID: 34822073 PMCID: PMC8866630 DOI: 10.1007/s10867-021-09586-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 09/07/2021] [Indexed: 10/19/2022] Open
Abstract
We give a review on the landscape theory of the equilibrium biological systems and landscape-flux theory of the nonequilibrium biological systems as the global driving force. The emergences of the behaviors, the associated thermodynamics in terms of the entropy and free energy and dynamics in terms of the rate and paths have been quantitatively demonstrated. The hierarchical organization structures have been discussed. The biological applications ranging from protein folding, biomolecular recognition, specificity, biomolecular evolution and design for equilibrium systems as well as cell cycle, differentiation and development, cancer, neural networks and brain function, and evolution for nonequilibrium systems, cross-scale studies of genome structural dynamics and experimental quantifications/verifications of the landscape and flux are illustrated. Together, this gives an overall global physical and quantitative picture in terms of the landscape and flux for the behaviors, dynamics and functions of biological systems.
Collapse
|
5
|
Yang YX, Wang P, Zhu BT. Relative importance of interface and surface areas in protein-protein binding affinity prediction: A machine learning analysis based on linear regression and artificial neural network. Biophys Chem 2022; 283:106762. [DOI: 10.1016/j.bpc.2022.106762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Revised: 01/11/2022] [Accepted: 01/14/2022] [Indexed: 11/02/2022]
|
6
|
Zhang J, Liu Z, Zhao W, Yin X, Zheng X, Liu C, Wang J, Wang E. Discovery of Small Molecule NSC290956 as a Therapeutic Agent for KRas Mutant Non-Small-Cell Lung Cancer. Front Pharmacol 2022; 12:797821. [PMID: 35069209 PMCID: PMC8766838 DOI: 10.3389/fphar.2021.797821] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 11/22/2021] [Indexed: 11/13/2022] Open
Abstract
HRas-GTP has a transient intermediate state with a “non-signaling open conformation” in GTP hydrolysis and nucleotide exchange. Due to the same hydrolysis process and the structural homology, it can be speculated that the active KRas adopts the same characteristics with the “open conformation.” This implies that agents locking this “open conformation” may theoretically block KRas-dependent signaling. Applying our specificity-affinity drug screening approach, NSC290956 was chosen by high affinity and specificity interaction with the “open conformation” structure HRasG60A-GppNp. In mutant KRas-driven non-small-cell lung cancer (NSCLC) model system, NSC290956 effectively suppresses the KRas-GTP state and gives pharmacological KRas inhibition with concomitant blockages of both the MAPK-ERK and AKT-mTOR pathways. The dual inhibitory effects lead to the metabolic phenotype switching from glycolysis to mitochondrial metabolism, which promotes the cancer cell death. In the xenograft model, NSC290956 significantly reduces H358 tumor growth in nude mice by mechanisms similar to those observed in the cells. Our work indicates that NSC290956 can be a promising agent for the mutant KRas-driven NSCLC therapy.
Collapse
Affiliation(s)
- Jiaxin Zhang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, China.,Department of Chemistry, University of Science and Technology of China, Hefei, China
| | - Zuojia Liu
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, China
| | - Wenjing Zhao
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, China
| | - Xunzhe Yin
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, China
| | - Xiliang Zheng
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, China
| | - Chuanbo Liu
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, China
| | - Jin Wang
- Department of Chemistry and Physics, State University of New York, Stony Brook, NY, United States
| | - Erkang Wang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, China.,Department of Chemistry, University of Science and Technology of China, Hefei, China
| |
Collapse
|
7
|
Chu WT, Yan Z, Chu X, Zheng X, Liu Z, Xu L, Zhang K, Wang J. Physics of biomolecular recognition and conformational dynamics. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2021; 84:126601. [PMID: 34753115 DOI: 10.1088/1361-6633/ac3800] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 11/09/2021] [Indexed: 06/13/2023]
Abstract
Biomolecular recognition usually leads to the formation of binding complexes, often accompanied by large-scale conformational changes. This process is fundamental to biological functions at the molecular and cellular levels. Uncovering the physical mechanisms of biomolecular recognition and quantifying the key biomolecular interactions are vital to understand these functions. The recently developed energy landscape theory has been successful in quantifying recognition processes and revealing the underlying mechanisms. Recent studies have shown that in addition to affinity, specificity is also crucial for biomolecular recognition. The proposed physical concept of intrinsic specificity based on the underlying energy landscape theory provides a practical way to quantify the specificity. Optimization of affinity and specificity can be adopted as a principle to guide the evolution and design of molecular recognition. This approach can also be used in practice for drug discovery using multidimensional screening to identify lead compounds. The energy landscape topography of molecular recognition is important for revealing the underlying flexible binding or binding-folding mechanisms. In this review, we first introduce the energy landscape theory for molecular recognition and then address four critical issues related to biomolecular recognition and conformational dynamics: (1) specificity quantification of molecular recognition; (2) evolution and design in molecular recognition; (3) flexible molecular recognition; (4) chromosome structural dynamics. The results described here and the discussions of the insights gained from the energy landscape topography can provide valuable guidance for further computational and experimental investigations of biomolecular recognition and conformational dynamics.
Collapse
Affiliation(s)
- Wen-Ting Chu
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, People's Republic of China
| | - Zhiqiang Yan
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, People's Republic of China
| | - Xiakun Chu
- Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, NY 11794, United States of America
| | - Xiliang Zheng
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, People's Republic of China
| | - Zuojia Liu
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, People's Republic of China
| | - Li Xu
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, People's Republic of China
| | - Kun Zhang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, People's Republic of China
| | - Jin Wang
- Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, NY 11794, United States of America
| |
Collapse
|
8
|
Frutiger A, Tanno A, Hwu S, Tiefenauer RF, Vörös J, Nakatsuka N. Nonspecific Binding-Fundamental Concepts and Consequences for Biosensing Applications. Chem Rev 2021; 121:8095-8160. [PMID: 34105942 DOI: 10.1021/acs.chemrev.1c00044] [Citation(s) in RCA: 89] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Nature achieves differentiation of specific and nonspecific binding in molecular interactions through precise control of biomolecules in space and time. Artificial systems such as biosensors that rely on distinguishing specific molecular binding events in a sea of nonspecific interactions have struggled to overcome this issue. Despite the numerous technological advancements in biosensor technologies, nonspecific binding has remained a critical bottleneck due to the lack of a fundamental understanding of the phenomenon. To date, the identity, cause, and influence of nonspecific binding remain topics of debate within the scientific community. In this review, we discuss the evolution of the concept of nonspecific binding over the past five decades based upon the thermodynamic, intermolecular, and structural perspectives to provide classification frameworks for biomolecular interactions. Further, we introduce various theoretical models that predict the expected behavior of biosensors in physiologically relevant environments to calculate the theoretical detection limit and to optimize sensor performance. We conclude by discussing existing practical approaches to tackle the nonspecific binding challenge in vitro for biosensing platforms and how we can both address and harness nonspecific interactions for in vivo systems.
Collapse
Affiliation(s)
- Andreas Frutiger
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - Alexander Tanno
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - Stephanie Hwu
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - Raphael F Tiefenauer
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - János Vörös
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - Nako Nakatsuka
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| |
Collapse
|
9
|
Chen F, Liu H, Sun H, Pan P, Li Y, Li D, Hou T. Assessing the performance of the MM/PBSA and MM/GBSA methods. 6. Capability to predict protein-protein binding free energies and re-rank binding poses generated by protein-protein docking. Phys Chem Chem Phys 2018; 18:22129-39. [PMID: 27444142 DOI: 10.1039/c6cp03670h] [Citation(s) in RCA: 309] [Impact Index Per Article: 51.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Understanding protein-protein interactions (PPIs) is quite important to elucidate crucial biological processes and even design compounds that interfere with PPIs with pharmaceutical significance. Protein-protein docking can afford the atomic structural details of protein-protein complexes, but the accurate prediction of the three-dimensional structures for protein-protein systems is still notoriously difficult due in part to the lack of an ideal scoring function for protein-protein docking. Compared with most scoring functions used in protein-protein docking, the Molecular Mechanics/Generalized Born Surface Area (MM/GBSA) and Molecular Mechanics/Poisson Boltzmann Surface Area (MM/PBSA) methodologies are more theoretically rigorous, but their overall performance for the predictions of binding affinities and binding poses for protein-protein systems has not been systematically evaluated. In this study, we first evaluated the performance of MM/PBSA and MM/GBSA to predict the binding affinities for 46 protein-protein complexes. On the whole, different force fields, solvation models, and interior dielectric constants have obvious impacts on the prediction accuracy of MM/GBSA and MM/PBSA. The MM/GBSA calculations based on the ff02 force field, the GB model developed by Onufriev et al. and a low interior dielectric constant (εin = 1) yield the best correlation between the predicted binding affinities and the experimental data (rp = -0.647), which is better than MM/PBSA (rp = -0.523) and a number of empirical scoring functions used in protein-protein docking (rp = -0.141 to -0.529). Then, we examined the capability of MM/GBSA to identify the possible near-native binding structures from the decoys generated by ZDOCK for 43 protein-protein systems. The results illustrate that the MM/GBSA rescoring has better capability to distinguish the correct binding structures from the decoys than the ZDOCK scoring. Besides, the optimal interior dielectric constant of MM/GBSA for re-ranking docking poses may be determined by analyzing the characteristics of protein-protein binding interfaces. Considering the relatively high prediction accuracy and low computational cost, MM/GBSA may be a good choice for predicting the binding affinities and identifying correct binding structures for protein-protein systems.
Collapse
Affiliation(s)
- Fu Chen
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China.
| | - Hui Liu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China.
| | - Huiyong Sun
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China.
| | - Peichen Pan
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China.
| | - Youyong Li
- Institute of Functional Nano & Soft Materials (FUNSOM), Soochow University, Suzhou, Jiangsu 215123, China
| | - Dan Li
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China.
| | - Tingjun Hou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China. and State Key Lab of CAD&CG, Zhejiang University, Hangzhou, Zhejiang 310058, P. R. China
| |
Collapse
|
10
|
Yan Z, Wang J. SPA-LN: a scoring function of ligand-nucleic acid interactions via optimizing both specificity and affinity. Nucleic Acids Res 2017; 45:e110. [PMID: 28431169 PMCID: PMC5499587 DOI: 10.1093/nar/gkx255] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Accepted: 04/05/2017] [Indexed: 01/10/2023] Open
Abstract
Nucleic acids have been widely recognized as potential targets in drug discovery and aptamer selection. Quantifying the interactions between small molecules and nucleic acids is critical to discover lead compounds and design novel aptamers. Scoring function is normally employed to quantify the interactions in structure-based virtual screening. However, the predictive power of nucleic acid–ligand scoring functions is still a challenge compared to other types of biomolecular recognition. With the rapid growth of experimentally determined nucleic acid–ligand complex structures, in this work, we develop a knowledge-based scoring function of nucleic acid–ligand interactions, namely SPA-LN. SPA-LN is optimized by maximizing both the affinity and specificity of native complex structures. The development strategy is different from those of previous nucleic acid–ligand scoring functions which focus on the affinity only in the optimization. The native conformation is stabilized while non-native conformations are destabilized by our optimization, making the funnel-like binding energy landscape more biased toward the native state. The performance of SPA-LN validates the development strategy and provides a relatively more accurate way to score the nucleic acid–ligand interactions.
Collapse
Affiliation(s)
- Zhiqiang Yan
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin 130022, China
| | - Jin Wang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin 130022, China.,Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, NY 11794-3400, USA
| |
Collapse
|
11
|
Improving the accuracy of high-throughput protein-protein affinity prediction may require better training data. BMC Bioinformatics 2017; 18:102. [PMID: 28361672 PMCID: PMC5374557 DOI: 10.1186/s12859-017-1533-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Background One goal of structural biology is to understand how a protein’s 3-dimensional conformation determines its capacity to interact with potential ligands. In the case of small chemical ligands, deconstructing a static protein-ligand complex into its constituent atom-atom interactions is typically sufficient to rapidly predict ligand affinity with high accuracy (>70% correlation between predicted and experimentally-determined affinity), a fact that is exploited to support structure-based drug design. We recently found that protein-DNA/RNA affinity can also be predicted with high accuracy using extensions of existing techniques, but protein-protein affinity could not be predicted with >60% correlation, even when the protein-protein complex was available. Methods X-ray and NMR structures of protein-protein complexes, their associated binding affinities and experimental conditions were obtained from different binding affinity and structural databases. Statistical models were implemented using a generalized linear model framework, including the experimental conditions as new model features. We evaluated the potential for new features to improve affinity prediction models by calculating the Pearson correlation between predicted and experimental binding affinities on the training and test data after model fitting and after cross-validation. Differences in accuracy were assessed using two-sample t test and nonparametric Mann–Whitney U test. Results Here we evaluate a range of potential factors that may interfere with accurate protein-protein affinity prediction. We find that X-ray crystal resolution has the strongest single effect on protein-protein affinity prediction. Limiting our analyses to only high-resolution complexes (≤2.5 Å) increased the correlation between predicted and experimental affinity from 54 to 68% (p = 4.32x10−3). In addition, incorporating information on the experimental conditions under which affinities were measured (pH, temperature and binding assay) had significant effects on prediction accuracy. We also highlight a number of potential errors in large structure-affinity databases, which could affect both model training and accuracy assessment. Conclusions The results suggest that the accuracy of statistical models for protein-protein affinity prediction may be limited by the information present in databases used to train new models. Improving our capacity to integrate large-scale structural and functional information may be required to substantively advance our understanding of the general principles by which a protein’s structure determines its function. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1533-z) contains supplementary material, which is available to authorized users.
Collapse
|
12
|
Yan Z, Wang J. Incorporating specificity into optimization: evaluation of SPA using CSAR 2014 and CASF 2013 benchmarks. J Comput Aided Mol Des 2016; 30:219-27. [DOI: 10.1007/s10822-016-9897-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Accepted: 01/28/2016] [Indexed: 01/04/2023]
|
13
|
Marillet S, Boudinot P, Cazals F. High-resolution crystal structures leverage protein binding affinity predictions. Proteins 2015; 84:9-20. [DOI: 10.1002/prot.24946] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Revised: 09/25/2015] [Accepted: 10/12/2015] [Indexed: 12/14/2022]
Affiliation(s)
- Simon Marillet
- Virologie Et Immunologie Moléculaires; INRA; Jouy-en-Josas France
- INRIA Sophia-Antipolis-Méditerraée, Algorithms-Biology-Structure; Sophia-Antipolis France
| | - Pierre Boudinot
- Virologie Et Immunologie Moléculaires; INRA; Jouy-en-Josas France
| | - Frédéric Cazals
- INRIA Sophia-Antipolis-Méditerraée, Algorithms-Biology-Structure; Sophia-Antipolis France
| |
Collapse
|
14
|
Choi JM, Serohijos AWR, Murphy S, Lucarelli D, Lofranco LL, Feldman A, Shakhnovich EI. Minimalistic predictor of protein binding energy: contribution of solvation factor to protein binding. Biophys J 2015; 108:795-798. [PMID: 25692584 DOI: 10.1016/j.bpj.2015.01.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2014] [Revised: 12/28/2014] [Accepted: 01/05/2015] [Indexed: 01/20/2023] Open
Abstract
It has long been known that solvation plays an important role in protein-protein interactions. Here, we use a minimalistic solvation-based model for predicting protein binding energy to estimate quantitatively the contribution of the solvation factor in protein binding. The factor is described by a simple linear combination of buried surface areas according to amino-acid types. Even without structural optimization, our minimalistic model demonstrates a predictive power comparable to more complex methods, making the proposed approach the basis for high throughput applications. Application of the model to a proteomic database shows that receptor-substrate complexes involved in signaling have lower affinities than enzyme-inhibitor and antibody-antigen complexes, and they differ by chemical compositions on interfaces. Also, we found that protein complexes with components that come from the same genes generally have lower affinities than complexes formed by proteins from different genes, but in this case the difference originates from different interface areas. The model was implemented in the software PYTHON, and the source code can be found on the Shakhnovich group webpage: http://faculty.chemistry.harvard.edu/shakhnovich/software.
Collapse
Affiliation(s)
- Jeong-Mo Choi
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Adrian W R Serohijos
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Sean Murphy
- Johns Hopkins University Applied Physics Laboratory, Laurel, Maryland
| | - Dennis Lucarelli
- Johns Hopkins University Applied Physics Laboratory, Laurel, Maryland
| | - Leo L Lofranco
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Andrew Feldman
- Johns Hopkins University Applied Physics Laboratory, Laurel, Maryland
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
15
|
Yan Z, Wang J. Optimizing the affinity and specificity of ligand binding with the inclusion of solvation effect. Proteins 2015; 83:1632-42. [PMID: 26111900 DOI: 10.1002/prot.24848] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Revised: 06/03/2015] [Accepted: 06/21/2015] [Indexed: 01/08/2023]
Abstract
Solvation effect is an important factor for protein-ligand binding in aqueous water. Previous scoring function of protein-ligand interactions rarely incorporates the solvation model into the quantification of protein-ligand interactions, mainly due to the immense computational cost, especially in the structure-based virtual screening, and nontransferable application of independently optimized atomic solvation parameters. In order to overcome these barriers, we effectively combine knowledge-based atom-pair potentials and the atomic solvation energy of charge-independent implicit solvent model in the optimization of binding affinity and specificity. The resulting scoring functions with optimized atomic solvation parameters is named as specificity and affinity with solvation effect (SPA-SE). The performance of SPA-SE is evaluated and compared to 20 other scoring functions, as well as SPA. The comparative results show that SPA-SE outperforms all other scoring functions in binding affinity prediction and "native" pose identification. Our optimization validates that solvation effect is an important regulator to the stability and specificity of protein-ligand binding. The development strategy of SPA-SE sets an example for other scoring function to account for the solvation effect in biomolecular recognitions.
Collapse
Affiliation(s)
- Zhiqiang Yan
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun, Jilin, 130022, China
| | - Jin Wang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun, Jilin, 130022, China.,Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, New York, 11794-3400, USA
| |
Collapse
|
16
|
Surfing the Protein-Protein Interaction Surface Using Docking Methods: Application to the Design of PPI Inhibitors. Molecules 2015; 20:11569-603. [PMID: 26111183 PMCID: PMC6272567 DOI: 10.3390/molecules200611569] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Revised: 06/02/2015] [Accepted: 06/15/2015] [Indexed: 02/06/2023] Open
Abstract
Blocking protein-protein interactions (PPI) using small molecules or peptides modulates biochemical pathways and has therapeutic significance. PPI inhibition for designing drug-like molecules is a new area that has been explored extensively during the last decade. Considering the number of available PPI inhibitor databases and the limited number of 3D structures available for proteins, docking and scoring methods play a major role in designing PPI inhibitors as well as stabilizers. Docking methods are used in the design of PPI inhibitors at several stages of finding a lead compound, including modeling the protein complex, screening for hot spots on the protein-protein interaction interface and screening small molecules or peptides that bind to the PPI interface. There are three major challenges to the use of docking on the relatively flat surfaces of PPI. In this review we will provide some examples of the use of docking in PPI inhibitor design as well as its limitations. The combination of experimental and docking methods with improved scoring function has thus far resulted in few success stories of PPI inhibitors for therapeutic purposes. Docking algorithms used for PPI are in the early stages, however, and as more data are available docking will become a highly promising area in the design of PPI inhibitors or stabilizers.
Collapse
|
17
|
The universal statistical distributions of the affinity, equilibrium constants, kinetics and specificity in biomolecular recognition. PLoS Comput Biol 2015; 11:e1004212. [PMID: 25885453 PMCID: PMC4401658 DOI: 10.1371/journal.pcbi.1004212] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Accepted: 02/24/2015] [Indexed: 01/01/2023] Open
Abstract
We uncovered the universal statistical laws for the biomolecular recognition/binding process. We quantified the statistical energy landscapes for binding, from which we can characterize the distributions of the binding free energy (affinity), the equilibrium constants, the kinetics and the specificity by exploring the different ligands binding with a particular receptor. The results of the analytical studies are confirmed by the microscopic flexible docking simulations. The distribution of binding affinity is Gaussian around the mean and becomes exponential near the tail. The equilibrium constants of the binding follow a log-normal distribution around the mean and a power law distribution in the tail. The intrinsic specificity for biomolecular recognition measures the degree of discrimination of native versus non-native binding and the optimization of which becomes the maximization of the ratio of the free energy gap between the native state and the average of non-native states versus the roughness measured by the variance of the free energy landscape around its mean. The intrinsic specificity obeys a Gaussian distribution near the mean and an exponential distribution near the tail. Furthermore, the kinetics of binding follows a log-normal distribution near the mean and a power law distribution at the tail. Our study provides new insights into the statistical nature of thermodynamics, kinetics and function from different ligands binding with a specific receptor or equivalently specific ligand binding with different receptors. The elucidation of distributions of the kinetics and free energy has guiding roles in studying biomolecular recognition and function through small-molecule evolution and chemical genetics. Uncovering the principles and underlying mechanisms of biomolecular recognition and molecular binding process is crucial for understanding the function and evolution, yet challenging. We meet the challenge by quantifying the statistical natures of the relevant physical variables of biomolecular recognition using the analytical model combined with microscopic flexible docking simulation methods. We uncovered the universal statistical laws obeyed by the affinity, equilibrium constant, intrinsic specificity and kinetics for biomolecular recognition. The general statistical laws based on energy landscape theory can serve as a conceptual framework for molecular recognition in biological repertoires. They can be applied to molecular selection, in vitro evolution process, high throughput screening and virtual screening for drug discovery. The statistical laws in combinations with experiments provide quantitative signatures of a specific ligand binding to a specific receptor, these resultant laws as a guideline will contribute to drug design against a specific target. Our developed statistical methodology is general and applicable for all other biomolecular recognitions.
Collapse
|
18
|
Janin J. A minimal model of protein-protein binding affinities. Protein Sci 2014; 23:1813-7. [PMID: 25270898 DOI: 10.1002/pro.2560] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2014] [Accepted: 09/25/2014] [Indexed: 11/10/2022]
Abstract
A minimal model of protein-protein binding affinity that takes into account only two structural features of the complex, the size of its interface, and the amplitude of the conformation change between the free and bound subunits, is tested on the 144 complexes of a structure-affinity benchmark. It yields Kd values that are within two orders of magnitude of the experiment for 67% of the complexes, within three orders for 88%, and fails on 12%, which display either large conformation changes, or a very high or a low affinity. The minimal model lacks the specificity and accuracy needed to make useful affinity predictions, but it should help in assessing the added value of parameters used by more elaborate models, and set a baseline for evaluating their performances.
Collapse
Affiliation(s)
- Joël Janin
- IBBMC, CNRS UMR 8619, Université Paris-Sud 11, Orsay, France
| |
Collapse
|
19
|
Kepsutlu B, Kizilel R, Kizilel S. Quantification of interactions among circadian clock proteins via surface plasmon resonance. J Mol Recognit 2014; 27:458-69. [PMID: 24895278 DOI: 10.1002/jmr.2367] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2013] [Revised: 12/01/2013] [Accepted: 01/29/2014] [Indexed: 11/11/2022]
Abstract
Circadian clock is an internal time keeping system recurring 24 h daily rhythm in physiology and behavior of organisms. Circadian clock contains transcription and translation feedback loop involving CLOCK/NPAS2, BMAL1, Cry1/2, and Per1/2. In common, heterodimer of CLOCK/NPAS2 and BMAL1 binds to EBOX element in the promoter of Per and Cry genes in order to activate their transcription. CRY and PER making heterodimeric complexes enter the nucleus in order to inhibit their own BMAL1-CLOCK-activated transcription. The aim of this study was to investigate and quantify real-time binding affinities of clock proteins among each other on and off DNA modes using surface plasmon resonance. The pairwise interaction coefficients among clock proteins, as well as interaction of PER2, CRY2, and PER2 : CRY2 proteins with BMAL1 : CLOCK complex in the presence and absence of EBOX motif have been investigated via analysis of surface plasmon resonance data with pseudo first-order reaction kinetics approximation and via nonlinear regression curve fitting. The results indicated that CRY2 and PER2, BMAL1, and CLOCK proteins form complexes in vitro and that PER2, CRY2 and PER2 : CRY2 complex have similar affinities toward BMAL1 : CLOCK complex. CRY2 protein had the highest affinity toward EBOX complex, whereas PER2 and CRY2 : PER2 complexes displayed low affinity toward EBOX complex. The quantification of the interaction between clock proteins is critical to understand the operation mechanism of the biological clock and to address the behavioral and physiological disorders, and it will be useful for the design of new drugs toward clock-related diseases.
Collapse
Affiliation(s)
- Burcu Kepsutlu
- Chemical and Biological Engineering, Koc University, Sariyer, Istanbul, 34450, Turkey
| | | | | |
Collapse
|
20
|
Pires DEV, Ascher DB, Blundell TL. mCSM: predicting the effects of mutations in proteins using graph-based signatures. ACTA ACUST UNITED AC 2013; 30:335-42. [PMID: 24281696 PMCID: PMC3904523 DOI: 10.1093/bioinformatics/btt691] [Citation(s) in RCA: 641] [Impact Index Per Article: 58.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Motivation: Mutations play fundamental roles in evolution by introducing diversity into genomes. Missense mutations in structural genes may become either selectively advantageous or disadvantageous to the organism by affecting protein stability and/or interfering with interactions between partners. Thus, the ability to predict the impact of mutations on protein stability and interactions is of significant value, particularly in understanding the effects of Mendelian and somatic mutations on the progression of disease. Here, we propose a novel approach to the study of missense mutations, called mCSM, which relies on graph-based signatures. These encode distance patterns between atoms and are used to represent the protein residue environment and to train predictive models. To understand the roles of mutations in disease, we have evaluated their impacts not only on protein stability but also on protein–protein and protein–nucleic acid interactions. Results: We show that mCSM performs as well as or better than other methods that are used widely. The mCSM signatures were successfully used in different tasks demonstrating that the impact of a mutation can be correlated with the atomic-distance patterns surrounding an amino acid residue. We showed that mCSM can predict stability changes of a wide range of mutations occurring in the tumour suppressor protein p53, demonstrating the applicability of the proposed method in a challenging disease scenario. Availability and implementation: A web server is available at http://structure.bioc.cam.ac.uk/mcsm. Contact:dpires@dcc.ufmg.br; tom@cryst.bioc.cam.ac.uk Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Douglas E V Pires
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK and ACRF Rational Drug Discovery Centre and Biota Structural Biology Laboratory, St Vincents Institute of Medical Research, Fitzroy, VIC, 3065, Australia
| | | | | |
Collapse
|
21
|
Yan Z, Wang J. Optimizing scoring function of protein-nucleic acid interactions with both affinity and specificity. PLoS One 2013; 8:e74443. [PMID: 24098651 PMCID: PMC3787031 DOI: 10.1371/journal.pone.0074443] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2013] [Accepted: 08/02/2013] [Indexed: 12/14/2022] Open
Abstract
Protein-nucleic acid (protein-DNA and protein-RNA) recognition is fundamental to the regulation of gene expression. Determination of the structures of the protein-nucleic acid recognition and insight into their interactions at molecular level are vital to understanding the regulation function. Recently, quantitative computational approach has been becoming an alternative of experimental technique for predicting the structures and interactions of biomolecular recognition. However, the progress of protein-nucleic acid structure prediction, especially protein-RNA, is far behind that of the protein-ligand and protein-protein structure predictions due to the lack of reliable and accurate scoring function for quantifying the protein-nucleic acid interactions. In this work, we developed an accurate scoring function (named as SPA-PN, SPecificity and Affinity of the Protein-Nucleic acid interactions) for protein-nucleic acid interactions by incorporating both the specificity and affinity into the optimization strategy. Specificity and affinity are two requirements of highly efficient and specific biomolecular recognition. Previous quantitative descriptions of the biomolecular interactions considered the affinity, but often ignored the specificity owing to the challenge of specificity quantification. We applied our concept of intrinsic specificity to connect the conventional specificity, which circumvents the challenge of specificity quantification. In addition to the affinity optimization, we incorporated the quantified intrinsic specificity into the optimization strategy of SPA-PN. The testing results and comparisons with other scoring functions validated that SPA-PN performs well on both the prediction of binding affinity and identification of native conformation. In terms of its performance, SPA-PN can be widely used to predict the protein-nucleic acid structures and quantify their interactions.
Collapse
Affiliation(s)
- Zhiqiang Yan
- Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, New York, United States of America
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, China
| | - Jin Wang
- Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, New York, United States of America
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, China
| |
Collapse
|