1
|
Zheng F, Jiang X, Wen Y, Yang Y, Li M. Systematic investigation of machine learning on limited data: A study on predicting protein-protein binding strength. Comput Struct Biotechnol J 2024; 23:460-472. [PMID: 38235359 PMCID: PMC10792694 DOI: 10.1016/j.csbj.2023.12.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 12/14/2023] [Accepted: 12/16/2023] [Indexed: 01/19/2024] Open
Abstract
The application of machine learning techniques in biological research, especially when dealing with limited data availability, poses significant challenges. In this study, we leveraged advancements in method development for predicting protein-protein binding strength to conduct a systematic investigation into the application of machine learning on limited data. The binding strength, quantitatively measured as binding affinity, is vital for understanding the processes of recognition, association, and dysfunction that occur within protein complexes. By incorporating transfer learning, integrating domain knowledge, and employing both deep learning and traditional machine learning algorithms, we mitigated the impact of data limitations and made significant advancements in predicting protein-protein binding affinity. In particular, we developed over 20 models, ultimately selecting three representative best-performing ones that belong to distinct categories. The first model is structure-based, consisting of a random forest regression and thirteen handcrafted features. The second model is sequence-based, employing an architecture that combines transferred embedding features with a multilayer perceptron. Finally, we created an ensemble model by averaging the predictions of the two aforementioned models. The comparison with other predictors on three independent datasets confirms the significant improvements achieved by our models in predicting protein-protein binding affinity. The programs for running these three models are available at https://github.com/minghuilab/BindPPI.
Collapse
Affiliation(s)
- Feifan Zheng
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| | - Xin Jiang
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| | - Yuhao Wen
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| | - Yan Yang
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| | - Minghui Li
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| |
Collapse
|
2
|
Li D, Pucci F, Rooman M. Prediction of Paratope-Epitope Pairs Using Convolutional Neural Networks. Int J Mol Sci 2024; 25:5434. [PMID: 38791470 PMCID: PMC11121317 DOI: 10.3390/ijms25105434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 05/06/2024] [Accepted: 05/13/2024] [Indexed: 05/26/2024] Open
Abstract
Antibodies play a central role in the adaptive immune response of vertebrates through the specific recognition of exogenous or endogenous antigens. The rational design of antibodies has a wide range of biotechnological and medical applications, such as in disease diagnosis and treatment. However, there are currently no reliable methods for predicting the antibodies that recognize a specific antigen region (or epitope) and, conversely, epitopes that recognize the binding region of a given antibody (or paratope). To fill this gap, we developed ImaPEp, a machine learning-based tool for predicting the binding probability of paratope-epitope pairs, where the epitope and paratope patches were simplified into interacting two-dimensional patches, which were colored according to the values of selected features, and pixelated. The specific recognition of an epitope image by a paratope image was achieved by using a convolutional neural network-based model, which was trained on a set of two-dimensional paratope-epitope images derived from experimental structures of antibody-antigen complexes. Our method achieves good performances in terms of cross-validation with a balanced accuracy of 0.8. Finally, we showcase examples of application of ImaPep, including extensive screening of large libraries to identify paratope candidates that bind to a selected epitope, and rescoring and refining antibody-antigen docking poses.
Collapse
Affiliation(s)
- Dong Li
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, 1050 Brussels, Belgium; (D.L.); (F.P.)
- Interuniversity Institute of Bioinformatics in Brussels, 1050 Brussels, Belgium
| | - Fabrizio Pucci
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, 1050 Brussels, Belgium; (D.L.); (F.P.)
- Interuniversity Institute of Bioinformatics in Brussels, 1050 Brussels, Belgium
| | - Marianne Rooman
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, 1050 Brussels, Belgium; (D.L.); (F.P.)
- Interuniversity Institute of Bioinformatics in Brussels, 1050 Brussels, Belgium
| |
Collapse
|
3
|
Yuan Y, Chen Q, Mao J, Li G, Pan X. DG-Affinity: predicting antigen-antibody affinity with language models from sequences. BMC Bioinformatics 2023; 24:430. [PMID: 37957563 PMCID: PMC10644518 DOI: 10.1186/s12859-023-05562-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 11/06/2023] [Indexed: 11/15/2023] Open
Abstract
BACKGROUND Antibody-mediated immune responses play a crucial role in the immune defense of human body. The evolution of bioengineering has led the progress of antibody-derived drugs, showing promising efficacy in cancer and autoimmune disease therapy. A critical step of this development process is obtaining the affinity between antibodies and their binding antigens. RESULTS In this study, we introduce a novel sequence-based antigen-antibody affinity prediction method, named DG-Affinity. DG-Affinity uses deep neural networks to efficiently and accurately predict the affinity between antibodies and antigens from sequences, without the need for structural information. The sequences of both the antigen and the antibody are first transformed into embedding vectors by two pre-trained language models, then these embeddings are concatenated into an ConvNeXt framework with a regression task. The results demonstrate the superiority of DG-Affinity over the existing structure-based prediction methods and the sequence-based tools, achieving a Pearson's correlation of over 0.65 on an independent test dataset. CONCLUSIONS Compared to the baseline methods, DG-Affinity achieves the best performance and can advance the development of antibody design. It is freely available as an easy-to-use web server at https://www.digitalgeneai.tech/solution/affinity .
Collapse
Affiliation(s)
- Ye Yuan
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, 200240, China.
| | | | - Jun Mao
- DigitalGene, Ltd, Shanghai, 200240, China
| | - Guipeng Li
- DigitalGene, Ltd, Shanghai, 200240, China
| | - Xiaoyong Pan
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, 200240, China.
| |
Collapse
|
4
|
Rozano L, Hane JK, Mancera RL. The Molecular Docking of MAX Fungal Effectors with Plant HMA Domain-Binding Proteins. Int J Mol Sci 2023; 24:15239. [PMID: 37894919 PMCID: PMC10607590 DOI: 10.3390/ijms242015239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 10/11/2023] [Accepted: 10/13/2023] [Indexed: 10/29/2023] Open
Abstract
Fungal effector proteins are important in mediating disease infections in agriculturally important crops. These secreted small proteins are known to interact with their respective host receptor binding partners in the host, either inside the cells or in the apoplastic space, depending on the localisation of the effector proteins. Consequently, it is important to understand the interactions between fungal effector proteins and their target host receptor binding partners, particularly since this can be used for the selection of potential plant resistance or susceptibility-related proteins that can be applied to the breeding of new cultivars with disease resistance. In this study, molecular docking simulations were used to characterise protein-protein interactions between effector and plant receptors. Benchmarking was undertaken using available experimental structures of effector-host receptor complexes to optimise simulation parameters, which were then used to predict the structures and mediating interactions of effector proteins with host receptor binding partners that have not yet been characterised experimentally. Rigid docking was applied for both the so-called bound and unbound docking of MAX effectors with plant HMA domain protein partners. All bound complexes used for benchmarking were correctly predicted, with 84% being ranked as the top docking pose using the ZDOCK scoring function. In the case of unbound complexes, a minimum of 95% of known residues were predicted to be part of the interacting interface on the host receptor binding partner, and at least 87% of known residues were predicted to be part of the interacting interface on the effector protein. Hydrophobic interactions were found to dominate the formation of effector-plant protein complexes. An optimised set of docking parameters based on the use of ZDOCK and ZRANK scoring functions were established to enable the prediction of near-native docking poses involving different binding interfaces on plant HMA domain proteins. Whilst this study was limited by the availability of the experimentally determined complexed structures of effectors and host receptor binding partners, we demonstrated the potential of molecular docking simulations to predict the likely interactions between effectors and their respective host receptor binding partners. This computational approach may accelerate the process of the discovery of putative interacting plant partners of effector proteins and contribute to effector-assisted marker discovery, thereby supporting the breeding of disease-resistant crops.
Collapse
Affiliation(s)
- Lina Rozano
- Curtin Medical School, Curtin Health Innovation Research Institute, GPO Box U1987, Perth, WA 6845, Australia
- Curtin Institute for Data Science, Curtin University, GPO Box U1987, Perth, WA 6845, Australia
| | - James K. Hane
- Curtin Institute for Data Science, Curtin University, GPO Box U1987, Perth, WA 6845, Australia
- Centre for Crop and Disease Management, School of Molecular and Life Sciences, Curtin University, GPO Box U1987, Perth, WA 6845, Australia
| | - Ricardo L. Mancera
- Curtin Medical School, Curtin Health Innovation Research Institute, GPO Box U1987, Perth, WA 6845, Australia
- Curtin Institute for Data Science, Curtin University, GPO Box U1987, Perth, WA 6845, Australia
| |
Collapse
|
5
|
Schweke H, Xu Q, Tauriello G, Pantolini L, Schwede T, Cazals F, Lhéritier A, Fernandez-Recio J, Rodríguez-Lumbreras LÁ, Schueler-Furman O, Varga JK, Jiménez-García B, Réau MF, Bonvin A, Savojardo C, Martelli PL, Casadio R, Tubiana J, Wolfson H, Oliva R, Barradas-Bautista D, Ricciardelli T, Cavallo L, Venclovas Č, Olechnovič K, Guerois R, Andreani J, Martin J, Wang X, Kihara D, Marchand A, Correia B, Zou X, Dey S, Dunbrack R, Levy E, Wodak S. Discriminating physiological from non-physiological interfaces in structures of protein complexes: A community-wide study. Proteomics 2023; 23:e2200323. [PMID: 37365936 PMCID: PMC10937251 DOI: 10.1002/pmic.202200323] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 05/11/2023] [Accepted: 05/11/2023] [Indexed: 06/28/2023]
Abstract
Reliably scoring and ranking candidate models of protein complexes and assigning their oligomeric state from the structure of the crystal lattice represent outstanding challenges. A community-wide effort was launched to tackle these challenges. The latest resources on protein complexes and interfaces were exploited to derive a benchmark dataset consisting of 1677 homodimer protein crystal structures, including a balanced mix of physiological and non-physiological complexes. The non-physiological complexes in the benchmark were selected to bury a similar or larger interface area than their physiological counterparts, making it more difficult for scoring functions to differentiate between them. Next, 252 functions for scoring protein-protein interfaces previously developed by 13 groups were collected and evaluated for their ability to discriminate between physiological and non-physiological complexes. A simple consensus score generated using the best performing score of each of the 13 groups, and a cross-validated Random Forest (RF) classifier were created. Both approaches showed excellent performance, with an area under the Receiver Operating Characteristic (ROC) curve of 0.93 and 0.94, respectively, outperforming individual scores developed by different groups. Additionally, AlphaFold2 engines recalled the physiological dimers with significantly higher accuracy than the non-physiological set, lending support to the reliability of our benchmark dataset annotations. Optimizing the combined power of interface scoring functions and evaluating it on challenging benchmark datasets appears to be a promising strategy.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Julia K. Varga
- Hebrew University of Jerusalem Institute for Medical Research Israel-Canada
| | | | | | | | | | | | | | - Jérôme Tubiana
- Tel Aviv University Blavatnik School of Computer Science
| | - Haim Wolfson
- Tel Aviv University Blavatnik School of Computer Science
| | | | | | | | | | | | | | | | | | | | | | | | | | | | - Xiaoqin Zou
- Dalton Cardiovascular Research Center, Institute for Data Science and Informatics, University of Missouri
| | | | | | | | | |
Collapse
|
6
|
de Oliveira Matos A, Vilela Rodrigues TC, Tiwari S, Dos Santos Dantas PH, Sartori GR, de Carvalho Azevedo VA, Martins Da Silva JH, de Castro Soares S, Silva-Sales M, Sales-Campos H. Immunoinformatics-guided design of a multi-valent vaccine against Rotavirus and Norovirus (ChRNV22). Comput Biol Med 2023; 159:106941. [PMID: 37105111 DOI: 10.1016/j.compbiomed.2023.106941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 03/17/2023] [Accepted: 04/14/2023] [Indexed: 04/29/2023]
Abstract
Rotavirus (RV) and Norovirus (NV) are the main viral etiologic agents of acute gastroenteritis (AG), a serious pediatric condition associated with significant death rates and long-term complications. Anti-RV vaccination has been proved efficient in the reduction of severe AG worldwide, however, the available vaccines are all attenuated and have suboptimal efficiencies in developing countries, where AG leads to substantial disease burden. On the other hand, no NV vaccine has been licensed so far. Therefore, we used immunoinformatics tools to develop a multi-epitope vaccine (ChRNV22) to prevent severe AG by RV and NV. Epitopes were predicted against 17 prevalent genotypes of four structural proteins (NV's VP1, RV's VP4, VP6 and VP7), and then assembled in a chimeric protein, with two small adjuvant sequences (tetanus toxin P2 epitope and a conserved sequence of RV's enterotoxin, NSP4). Simulations of the immune response and interactions with immune receptors indicated the immunogenic properties of ChRNV22, including a Th1-biased response. In silico search for putative host-homologous, allergenic and toxic regions also indicated the vaccine safety. In summary, we developed a multi-epitope vaccine against different NV and RV genotypes that seems promising for the prevention of severe AG, which will be further assessed by in vivo tests.
Collapse
Affiliation(s)
- Amanda de Oliveira Matos
- Laboratory of Mucosal Immunology and Immunoinformatics (LIM), Institute of Tropical Pathology and Public Health, Federal University of Goiás (UFG), Goiânia, 746050-050, Brazil
| | - Thaís Cristina Vilela Rodrigues
- Laboratory of Cellular and Molecular Genetics (LGCM), Institute of Biological Sciences, Federal University of Minas Gerais (UFMG), Belo Horizonte, 31270-901, Brazil
| | - Sandeep Tiwari
- Institute of Biology, Federal University of Bahia (UFBA), Salvador, 40170-115, Brazil; Institute of Health Sciences, Federal University of Bahia (UFBA), Salvador, 40231-300, Brazil
| | - Pedro Henrique Dos Santos Dantas
- Laboratory of Mucosal Immunology and Immunoinformatics (LIM), Institute of Tropical Pathology and Public Health, Federal University of Goiás (UFG), Goiânia, 746050-050, Brazil
| | | | - Vasco Ariston de Carvalho Azevedo
- Laboratory of Cellular and Molecular Genetics (LGCM), Institute of Biological Sciences, Federal University of Minas Gerais (UFMG), Belo Horizonte, 31270-901, Brazil
| | | | - Siomar de Castro Soares
- Department of Immunology, Microbiology, Immunology and Parasitology, Institute of Biological and Natural Sciences, Federal University of Triângulo Mineiro (UFTM), Uberaba, 38025-180, Brazil
| | - Marcelle Silva-Sales
- Laboratory of Virology and Cellular Culture (LABVICC), Institute of Tropical Pathology and Public Health, Federal University of Goiás (UFG), Goiânia, 746050-050, Brazil
| | - Helioswilton Sales-Campos
- Laboratory of Mucosal Immunology and Immunoinformatics (LIM), Institute of Tropical Pathology and Public Health, Federal University of Goiás (UFG), Goiânia, 746050-050, Brazil.
| |
Collapse
|
7
|
Barradas-Bautista D, Almajed A, Oliva R, Kalnis P, Cavallo L. Improving classification of correct and incorrect protein-protein docking models by augmenting the training set. BIOINFORMATICS ADVANCES 2023; 3:vbad012. [PMID: 36789292 PMCID: PMC9923443 DOI: 10.1093/bioadv/vbad012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Revised: 01/20/2023] [Accepted: 02/01/2023] [Indexed: 02/04/2023]
Abstract
Motivation Protein-protein interactions drive many relevant biological events, such as infection, replication and recognition. To control or engineer such events, we need to access the molecular details of the interaction provided by experimental 3D structures. However, such experiments take time and are expensive; moreover, the current technology cannot keep up with the high discovery rate of new interactions. Computational modeling, like protein-protein docking, can help to fill this gap by generating docking poses. Protein-protein docking generally consists of two parts, sampling and scoring. The sampling is an exhaustive search of the tridimensional space. The caveat of the sampling is that it generates a large number of incorrect poses, producing a highly unbalanced dataset. This limits the utility of the data to train machine learning classifiers. Results Using weak supervision, we developed a data augmentation method that we named hAIkal. Using hAIkal, we increased the labeled training data to train several algorithms. We trained and obtained different classifiers; the best classifier has 81% accuracy and 0.51 Matthews' correlation coefficient on the test set, surpassing the state-of-the-art scoring functions. Availability and implementation Docking models from Benchmark 5 are available at https://doi.org/10.5281/zenodo.4012018. Processed tabular data are available at https://repository.kaust.edu.sa/handle/10754/666961. Google colab is available at https://colab.research.google.com/drive/1vbVrJcQSf6\_C3jOAmZzgQbTpuJ5zC1RP?usp=sharing. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
| | - Ali Almajed
- Computer, Electrical and Mathematical Science and Engineering Division, Kaust Extreme Computing Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Romina Oliva
- Department of Sciences and Technologies, University of Naples “Parthenope”, I-80143 Naples, Italy
| | - Panos Kalnis
- Computer, Electrical and Mathematical Science and Engineering Division, Kaust Extreme Computing Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Luigi Cavallo
- Physical Sciences and Engineering Division, Kaust Catalysis Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| |
Collapse
|
8
|
Wang E. Prediction of antibody binding to SARS-CoV-2 RBDs. BIOINFORMATICS ADVANCES 2023; 3:vbac103. [PMID: 36698760 PMCID: PMC9868522 DOI: 10.1093/bioadv/vbac103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 12/18/2022] [Accepted: 12/31/2022] [Indexed: 01/03/2023]
Abstract
Summary The ability to predict antibody-antigen binding is essential for computational models of antibody affinity maturation and protein design. While most models aim to predict binding for arbitrary antigens and antibodies, the global impact of SARS-CoV-2 on public health and the availability of associated data suggest that a SARS-CoV-2-specific model would be highly beneficial. In this work, we present a neural network model, trained on ∼315 000 datapoints from deep mutational scanning experiments, that predicts escape fractions of SARS-CoV-2 RBDs binding to arbitrary antibodies. The antibody embeddings within the model constitute an effective sequence space, which correlates with the Hamming distance, suggesting that these embeddings may be useful for downstream tasks such as binding prediction. Indeed, the model achieves Spearman correlation coefficients of 0.46 and 0.52 on two held-out test sets. By comparison, correlation coefficients calculated using existing structure and sequence-based models do not exceed 0.28. The correlation coefficient against dissociation constants of antibodies binding to SARS-CoV-2 RBD variants is 0.46. Additionally, the residue-level escapes are highest in the antibody epitope, correlating well with experimentally measured escapes. We further study the effect of antibody chain use, embedding dimension size and feed-forward and convolutional architectures on the model results. Lastly, we find that the inference time of our model is significantly faster than previous models, suggesting that it could be a useful tool for the accurate and rapid prediction of antibodies binding to SARS-CoV-2 RBDs. Availability and implementation The model and associated code are available for download at https://github.com/ericzwang/RBD_AB. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Eric Wang
- To whom correspondence should be addressed.
| |
Collapse
|
9
|
de Oliveira Matos A, dos Santos Dantas PH, Colmenares MTC, Sartori GR, Silva-Sales M, Da Silva JHM, Neves BJ, Andrade CH, Sales-Campos H. The CDR3 region as the major driver of TREM-1 interaction with its ligands, an in silico characterization. Comput Struct Biotechnol J 2023; 21:2579-2590. [PMID: 37122631 PMCID: PMC10130352 DOI: 10.1016/j.csbj.2023.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 04/07/2023] [Accepted: 04/12/2023] [Indexed: 05/02/2023] Open
Abstract
The triggering receptor expressed on myeloid cells-1 (TREM-1) is a pattern recognition receptor heavily investigated in infectious and non-infectious diseases. Because of its role in amplifying inflammation, TREM-1 has been explored as a diagnostic/prognostic biomarker. Further, as the receptor has been implicated in the pathophysiology of several diseases, therapies aiming at modulating its activity represent a promising strategy to constrain uncontrolled inflammatory or infectious diseases. Despite this, several aspects concerning its interaction with ligands and activation process, remain unclear. Although many molecules have been suggested as TREM-1 ligands, only five have been confirmed to interact with the receptor: actin, eCIRP, HMGB1, Hsp70 and PGLYRP1. However, the domains involved in the interaction between the receptor and these proteins are not clarified yet. Therefore, here we used in silico approaches to investigate the putative binding domains in the receptor, using hot spots analysis, molecular docking and molecular dynamics simulations between TREM-1 and its five known ligands. Our results indicated the complementarity-determining regions (CDRs) of the receptor as the main mediators of antigen recognition, especially the CDR3 loop. We believe that our study could be used as structural basis for the elucidation of TREM-1's recognition process, and may be useful for prospective in silico and biological investigations exploring the receptor in different contexts.
Collapse
Affiliation(s)
| | | | | | | | - Marcelle Silva-Sales
- Instituto de Patologia Tropical e Saúde Pública, Universidade Federal de Goiás, Goiânia, Brazil
| | | | - Bruno Junior Neves
- LabMol – Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Universidade Federal de Goiás, Goiânia, Brazil
| | - Carolina Horta Andrade
- LabMol – Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Universidade Federal de Goiás, Goiânia, Brazil
| | - Helioswilton Sales-Campos
- Instituto de Patologia Tropical e Saúde Pública, Universidade Federal de Goiás, Goiânia, Brazil
- Correspondence to: Universidade Federal de Goiás – UFG, Instituto de Patologia Tropical e Saúde Pública – IPTSP, Rua 235, S/N, sala 332, Setor Leste Universitário, Goiânia, Goiás 746050-05, Brazil.
| |
Collapse
|
10
|
Panday S, Alexov E. Protein-Protein Binding Free Energy Predictions with the MM/PBSA Approach Complemented with the Gaussian-Based Method for Entropy Estimation. ACS OMEGA 2022; 7:11057-11067. [PMID: 35415339 PMCID: PMC8991903 DOI: 10.1021/acsomega.1c07037] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Accepted: 03/10/2022] [Indexed: 06/14/2023]
Abstract
Here, we present a Gaussian-based method for estimation of protein-protein binding entropy to augment the molecular mechanics Poisson-Boltzmann surface area (MM/PBSA) method for computational prediction of binding free energy (ΔG). The method is termed f5-MM/PBSA/E, where "E" stands for entropy and f5 for five adjustable parameters. The enthalpy components of ΔG (molecular mechanics, polar and non-polar solvation energies) are computed from a single implicit solvent generalized Born (GB) energy minimized structure of a protein-protein complex, while the binding entropy is computed using independently GB energy minimized unbound and bound structures. It should be emphasized that the f5-MM/PBSA/E method does not use snapshots, just energy minimized structures, and is thus very fast and computationally efficient. The method is trained and benchmarked in 5-fold validation test over a data set consisting of 46 protein-protein binding cases with experimentally determined dissociation constant K d values. This data set has been used for benchmarking in recently published protein-protein binding studies that apply conventional MM/PBSA and MM/PBSA with an enhanced sampling method. The f5-MM/PBSA/E tested on the same data set achieves similar or better performance than these computationally demanding approaches, making it an excellent choice for high throughput protein-protein binding affinity prediction studies.
Collapse
|
11
|
Barradas-Bautista D, Cao Z, Vangone A, Oliva R, Cavallo L. A random forest classifier for protein-protein docking models. BIOINFORMATICS ADVANCES 2021; 2:vbab042. [PMID: 36699405 PMCID: PMC9710594 DOI: 10.1093/bioadv/vbab042] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 11/11/2021] [Accepted: 12/06/2021] [Indexed: 01/28/2023]
Abstract
Herein, we present the results of a machine learning approach we developed to single out correct 3D docking models of protein-protein complexes obtained by popular docking software. To this aim, we generated 3 × 10 4 docking models for each of the 230 complexes in the protein-protein benchmark, version 5, using three different docking programs (HADDOCK, FTDock and ZDOCK), for a cumulative set of ≈ 7 × 10 6 docking models. Three different machine learning approaches (Random Forest, Supported Vector Machine and Perceptron) were used to train classifiers with 158 different scoring functions (features). The Random Forest algorithm outperformed the other two algorithms and was selected for further optimization. Using a features selection algorithm, and optimizing the random forest hyperparameters, allowed us to train and validate a random forest classifier, named COnservation Driven Expert System (CoDES). Testing of CoDES on independent datasets, as well as results of its comparative performance with machine learning methods recently developed in the field for the scoring of docking decoys, confirm its state-of-the-art ability to discriminate correct from incorrect decoys both in terms of global parameters and in terms of decoys ranked at the top positions. Supplementary information Supplementary data are available at Bioinformatics Advances online. Software and data availability statement The docking models are available at https://doi.org/10.5281/zenodo.4012018. The programs underlying this article will be shared on request to the corresponding authors.
Collapse
Affiliation(s)
- Didier Barradas-Bautista
- Kaust Catalysis Center, Physical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), 23955-6900 Thuwal, Saudi Arabia,To whom correspondence should be addressed. or or
| | - Zhen Cao
- Kaust Catalysis Center, Physical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), 23955-6900 Thuwal, Saudi Arabia
| | - Anna Vangone
- Pharma Research and Early Development, Therapeutic Modalities, Roche Innovation Center Munich Large Molecule Research, 82377 Penzberg, Germany
| | - Romina Oliva
- Department of Sciences and Technologies, University Parthenope of Naples, Centro Direzionale Isola C4, I-80143 Naples, Italy,To whom correspondence should be addressed. or or
| | - Luigi Cavallo
- Kaust Catalysis Center, Physical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), 23955-6900 Thuwal, Saudi Arabia,To whom correspondence should be addressed. or or
| |
Collapse
|
12
|
Vračko M, Basak SC, Sen D, Nandy A. Clustering of Zika Viruses Originating from Different Geographical Regions using Computational Sequence Descriptors. Curr Comput Aided Drug Des 2021; 17:314-322. [PMID: 31878862 DOI: 10.2174/1573409916666191226110936] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Revised: 11/18/2019] [Accepted: 12/09/2019] [Indexed: 11/22/2022]
Abstract
BACKGROUND In this report, we consider a data set, which consists of 310 Zika virus genome sequences taken from different continents, Africa, Asia and South America. The sequences, which were compiled from GenBank, were derived from the host cells of different mammalian species (Simiiformes, Aedes opok, Aedes africanus, Aedes luteocephalus, Aedes dalzieli, Aedes aegypti, and Homo sapiens). METHODS For chemometrical treatment, the sequences have been represented by sequence descriptors derived from their graphs or neighborhood matrices. The set was analyzed with three chemometrical methods: Mahalanobis distances, principal component analysis (PCA) and self organizing maps (SOM). A good separation of samples with respect to the region of origin was observed using these three methods. RESULTS Study of 310 Zika virus genome sequences from different continents. To characterize and compare Zika virus sequences from around the world using alignment-free sequence comparison and chemometrical methods. CONCLUSION Mahalanobis distance analysis, self organizing maps, principal components were used to carry out the chemometrical analyses of the Zika sequence data. Genome sequences are clustered with respect to the region of origin (continent, country). Africa samples are well separated from Asian and South American ones.
Collapse
Affiliation(s)
- Marjan Vračko
- National Institute of Chemistry, Hajdrihova 19, 1000 Ljubljana, Slovenia
| | - Subhash C Basak
- Department of Chemistry and Biochemistry, University of Minnesota, Duluth, United States
| | - Dwaipayan Sen
- Centre for Interdisciplinary Research and Education, Kolkata, India
| | - Ashesh Nandy
- Centre for Interdisciplinary Research and Education, Kolkata, India
| |
Collapse
|
13
|
Alizadeh AA, Dastmalchi S. Designing Novel Teduglutide Analogues with Improved Binding Affinity: An In Silico Peptide Engineering Approach. Curr Comput Aided Drug Des 2021; 17:225-234. [PMID: 32065094 DOI: 10.2174/1573409916666200217091456] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2019] [Revised: 12/05/2019] [Accepted: 01/17/2020] [Indexed: 11/22/2022]
Abstract
INTRODUCTION Short bowel syndrome (SBS) is a disabling condition that occurs following the loss of substantial portions of the intestine, leading to inadequate absorption of nutrients and fluids. Teduglutide is the only drug that has been FDA-approved for long-term treatment of SBS. This medicine exerts its biological effects through binding to the GLP-2 receptor. METHODS The current study aimed to use computational mutagenesis approaches to design novel potent analogues of teduglutide. To this end, the constructed teduglutide-GLP2R 3D model was subjected to the alanine scanning mutagenesis where ARG20, PHE22, ILE23, LEU26, ILE27 and LYS30 were identified as the key amino acids involved in ligand-receptor interaction. In order to design potent teduglutide analogues, using MAESTROweb machine learning method, the residues of teduglutide were virtually mutated into all naturally occurring amino acids and the affinity improving mutations were selected for further analysis using PDBePISA methodology which interactively investigates the interactions established at the interfaces of macromolecules. RESULTS The calculations resulted in D15I, D15L, D15M and N24M mutations, which can improve the binding ability of the ligand to the receptor. The final evaluation of identified mutations was performed by molecular dynamics simulations, indicating that D15I and D15M are the most reliable mutations to increase teduglutide affinity towards its receptor. CONCLUSION The findings in the current study may facilitate designing more potent teduglutide analogues leading to the development of novel treatments in short bowel syndrome.
Collapse
Affiliation(s)
- Ali A Alizadeh
- Biotechnology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Siavoush Dastmalchi
- Biotechnology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| |
Collapse
|
14
|
Guest JD, Vreven T, Zhou J, Moal I, Jeliazkov JR, Gray JJ, Weng Z, Pierce BG. An expanded benchmark for antibody-antigen docking and affinity prediction reveals insights into antibody recognition determinants. Structure 2021; 29:606-621.e5. [PMID: 33539768 DOI: 10.1016/j.str.2021.01.005] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Revised: 11/15/2020] [Accepted: 01/11/2021] [Indexed: 01/04/2023]
Abstract
Accurate predictive modeling of antibody-antigen complex structures and structure-based antibody design remain major challenges in computational biology, with implications for biotherapeutics, immunity, and vaccines. Through a systematic search for high-resolution structures of antibody-antigen complexes and unbound antibody and antigen structures, in conjunction with identification of experimentally determined binding affinities, we have assembled a non-redundant set of test cases for antibody-antigen docking and affinity prediction. This benchmark more than doubles the number of antibody-antigen complexes and corresponding affinities available in our previous benchmarks, providing an unprecedented view of the determinants of antibody recognition and insights into molecular flexibility. Initial assessments of docking and affinity prediction tools highlight the challenges posed by this diverse set of cases, which includes camelid nanobodies, therapeutic monoclonal antibodies, and broadly neutralizing antibodies targeting viral glycoproteins. This dataset will enable development of advanced predictive modeling and design methods for this therapeutically relevant class of protein-protein interactions.
Collapse
Affiliation(s)
- Johnathan D Guest
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA; Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
| | - Thom Vreven
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Jing Zhou
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Iain Moal
- Computational Sciences, GlaxoSmithKline Research and Development, Stevenage SG1 2NY, UK
| | - Jeliazko R Jeliazkov
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA; Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA; Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
15
|
Das S, Chakrabarti S. Classification and prediction of protein-protein interaction interface using machine learning algorithm. Sci Rep 2021; 11:1761. [PMID: 33469042 PMCID: PMC7815773 DOI: 10.1038/s41598-020-80900-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 12/15/2020] [Indexed: 01/29/2023] Open
Abstract
Structural insight of the protein-protein interaction (PPI) interface can provide knowledge about the kinetics, thermodynamics and molecular functions of the complex while elucidating its role in diseases and further enabling it as a potential therapeutic target. However, owing to experimental lag in solving protein-protein complex structures, three-dimensional (3D) knowledge of the PPI interfaces can be gained via computational approaches like molecular docking and post-docking analyses. Despite development of numerous docking tools and techniques, success in identification of native like interfaces based on docking score functions is limited. Hence, we employed an in-depth investigation of the structural features of the interface that might successfully delineate native complexes from non-native ones. We identify interface properties, which show statistically significant difference between native and non-native interfaces belonging to homo and hetero, protein-protein complexes. Utilizing these properties, a support vector machine (SVM) based classification scheme has been implemented to differentiate native and non-native like complexes generated using docking decoys. Benchmarking and comparative analyses suggest very good performance of our SVM classifiers. Further, protein interactions, which are proven via experimental findings but not resolved structurally, were subjected to this approach where 3D-models of the complexes were generated and most likely interfaces were predicted. A web server called Protein Complex Prediction by Interface Properties (PCPIP) is developed to predict whether interface of a given protein-protein dimer complex resembles known protein interfaces. The server is freely available at http://www.hpppi.iicb.res.in/pcpip/ .
Collapse
Affiliation(s)
- Subhrangshu Das
- grid.417635.20000 0001 2216 5074Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, WB India
| | - Saikat Chakrabarti
- grid.417635.20000 0001 2216 5074Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, WB India
| |
Collapse
|
16
|
Agostino M. Comprehensive analysis of carbohydrate-protein recognition in the Protein Data Bank. Carbohydr Res 2020; 498:108180. [PMID: 33096507 DOI: 10.1016/j.carres.2020.108180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 10/23/2022]
Abstract
Carbohydrate-protein interactions underpin wide-ranging aspects of biology. However, such interactions remain relatively unexplored in pharmaceutical and biotechnological applications, in part due to the challenges associated with their structural characterisation, both experimentally and computationally. Knowledge-based approaches have shown great success in the prediction of drug-protein and protein-protein interactions, although have not been comprehensively investigated for carbohydrate-protein interactions. In this work, carbohydrate-protein complexes from the Protein Data Bank were comprehensively obtained and analysed to identify patterns in how carbohydrate-protein interactions are mediated.
Collapse
Affiliation(s)
- Mark Agostino
- School of Pharmacy and Biomedical Sciences, Curtin Health Innovation Research Institute and Curtin Institute for Computation, Bentley, Australia.
| |
Collapse
|
17
|
Meseguer A, Dominguez L, Bota PM, Aguirre‐Plans J, Bonet J, Fernandez‐Fuentes N, Oliva B. Using collections of structural models to predict changes of binding affinity caused by mutations in protein-protein interactions. Protein Sci 2020; 29:2112-2130. [PMID: 32797645 PMCID: PMC7513729 DOI: 10.1002/pro.3930] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 08/04/2020] [Accepted: 08/05/2020] [Indexed: 12/24/2022]
Abstract
Protein-protein interactions (PPIs) in all the molecular aspects that take place both inside and outside cells. However, determining experimentally the structure and affinity of PPIs is expensive and time consuming. Therefore, the development of computational tools, as a complement to experimental methods, is fundamental. Here, we present a computational suite: MODPIN, to model and predict the changes of binding affinity of PPIs. In this approach we use homology modeling to derive the structures of PPIs and score them using state-of-the-art scoring functions. We explore the conformational space of PPIs by generating not a single structural model but a collection of structural models with different conformations based on several templates. We apply the approach to predict the changes in free energy upon mutations and splicing variants of large datasets of PPIs to statistically quantify the quality and accuracy of the predictions. As an example, we use MODPIN to study the effect of mutations in the interaction between colicin endonuclease 9 and colicin endonuclease 2 immune protein from Escherichia coli. Finally, we have compared our results with other state-of-art methods.
Collapse
Affiliation(s)
- Alberto Meseguer
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Lluis Dominguez
- Integrative Biomedical Informatics Group (GRIB‐IMIM). Department of Experimental and Life SciencesUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Patricia M. Bota
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
- Department of BiosciencesUniversitat de Vic‐Universitat Central de CatalunyaVicCataloniaSpain
| | - Joaquim Aguirre‐Plans
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Jaume Bonet
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Narcis Fernandez‐Fuentes
- Department of BiosciencesUniversitat de Vic‐Universitat Central de CatalunyaVicCataloniaSpain
- Institute of Biological, Environmental and Rural SciencesAberystwyth UniversityAberystwythUK
| | - Baldo Oliva
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| |
Collapse
|
18
|
Silva ON, Franco OL, Neves BJ, Morais ÁCB, De Oliveira Neto JR, da Cunha LC, Naves LM, Pedrino GR, Costa EA, Fajemiroye JO. Involvement of the gabaergic, serotonergic and glucocorticoid mechanism in the anxiolytic-like effect of mastoparan-L. Neuropeptides 2020; 81:102027. [PMID: 32059939 DOI: 10.1016/j.npep.2020.102027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 02/03/2020] [Accepted: 02/04/2020] [Indexed: 10/25/2022]
Abstract
Mastoparan-L (mast-L) is a cell-penetrating tetradecapeptide and stimulator of monoamine exocytosis. In the present study, we evaluated the anxiolytic-like effect of mast-L. Preliminary pharmacological tests were conducted to determine the most appropriate route of administration, extrapolate dose and detect potential toxic effects of this peptide. Oral and intracerebroventricular administration of mast-L (0.1, 0.3 or 0.9 mg.kg-1), diazepam (1 or 5 mg.kg-1), buspirone (10 mg.kg-1) or vehicle 10 mL.kg-1 was carried out prior to the exposure of mice to the anxiety models: open field, light-dark box and elevated plus-maze. To characterize the mechanism underlying the antianxiety-like effect of mast-L, pharmacological antagonism, blood plasma analysis, molecular docking, and receptor binding assays were performed. The absence of a neurotoxic sign, animal's death as well as lack of significant changes in the relative organ weight, hematological and biochemical parameters suggest that mast-L is relatively safe. The anxiolytic-like effect of mast-L was attenuated by flumazenil (antagonist of benzodiazepine binding site) and WAY100635 (selective antagonist of 5-HT1A receptors) pretreatments. Mast-L reduced plasma corticosterone and lowered the scoring function at GABAA -18.48 kcal/mol (Ki = 155 nM), 5-HT1A -22.39 kcal/mol (Ki = 130 nM), corticotropin-releasing factor receptor subtype 1 (CRF1) -11.95 kcal/mol (Ki = 299 nM) and glucocorticoid receptors (GR) -14.69 kcal/mol (Ki = 3552 nM). These data fit the binding affinity (Ki) and demonstrate the involvement of gabaergic, serotonergic and glucocorticoid mechanisms in the anxiolytic-like property of mast-L.
Collapse
Affiliation(s)
- Osmar N Silva
- Centro de Análises Proteômicas e Bioquímicas, Universidade Católica de Brasília, Brasília, DF, Brazil
| | - Octavio L Franco
- Centro de Análises Proteômicas e Bioquímicas, Universidade Católica de Brasília, Brasília, DF, Brazil
| | - Bruno J Neves
- Centro Universitário de Anápolis, UniEvangélica, Av. Universitária Km 3,5 Cidade Universitária Anápolis/GO 75083-515, Brazil
| | - Álice Cristina B Morais
- Centro Universitário de Anápolis, UniEvangélica, Av. Universitária Km 3,5 Cidade Universitária Anápolis/GO 75083-515, Brazil
| | - Jeronimo R De Oliveira Neto
- Núcleo de Estudos e Pesquisas Tóxico-Farmacológicas, Faculdade de Farmácia, Universidade Federal de Goiás, PMB 131, CEP 74001-970, Goiânia, Brazil
| | - Luiz Carlos da Cunha
- Núcleo de Estudos e Pesquisas Tóxico-Farmacológicas, Faculdade de Farmácia, Universidade Federal de Goiás, PMB 131, CEP 74001-970, Goiânia, Brazil
| | - Lara M Naves
- Instituto de Ciências Biológicas, Universidade Federal de Goiás, 74001-970, Goiânia, GO, Brazil
| | - Gustavo R Pedrino
- Instituto de Ciências Biológicas, Universidade Federal de Goiás, 74001-970, Goiânia, GO, Brazil
| | - Elson A Costa
- Instituto de Ciências Biológicas, Universidade Federal de Goiás, 74001-970, Goiânia, GO, Brazil
| | - James O Fajemiroye
- Centro Universitário de Anápolis, UniEvangélica, Av. Universitária Km 3,5 Cidade Universitária Anápolis/GO 75083-515, Brazil; Instituto de Ciências Biológicas, Universidade Federal de Goiás, 74001-970, Goiânia, GO, Brazil.
| |
Collapse
|
19
|
da Silva FCV, Pessoa Costa E, Moreira Gomes V, de Oliveira Carvalho A. Inhibition mechanism of human salivary α-amylase by lipid transfer protein from Vigna unguiculata. Comput Biol Chem 2020; 85:107193. [DOI: 10.1016/j.compbiolchem.2019.107193] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Revised: 12/06/2019] [Accepted: 12/11/2019] [Indexed: 01/09/2023]
|
20
|
Abstract
Many of the biological functions of the cell are driven by protein-protein interactions. However, determining which proteins interact and exactly how they do so to enable their functions, remain major research questions. Functional interactions are dependent on a number of complicated factors; therefore, modeling the three-dimensional structure of protein-protein complexes is still considered a complex endeavor. Nevertheless, the rewards for modeling protein interactions to atomic level detail are substantial, and there are numerous examples of how models can provide useful information for drug design, protein engineering, systems biology, and understanding of the immune system. Here, we provide practical guidelines for docking proteins using the web-server, SwarmDock, a flexible protein-protein docking method. Moreover, we provide an overview of the factors that need to be considered when deciding whether docking is likely to be successful.
Collapse
Affiliation(s)
- Iain H Moal
- European Bioinformatics Institute, Hinxton, UK
| | | | | | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK.
| |
Collapse
|
21
|
Hadi-Alijanvand H. Complex Stability is Encoded in Binding Patch Softness: a Monomer-Based Approach to Predict Inter-Subunit Affinity of Protein Dimers. J Proteome Res 2019; 19:409-423. [PMID: 31795635 DOI: 10.1021/acs.jproteome.9b00594] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Knowledge about the structure and stability of protein-protein interactions is vital to decipher the behavior of protein systems. Prediction of protein complexes' stability is an interesting topic in the field of structural biology. There are some promising published computational approaches that predict the affinity between subunits of protein dimers using 3D structures of both subunits. In the current study, we classify protein complexes with experimentally measured affinities into distinct classes with different mean affinities. By predicting the mechanical stiffness of the protein binding patch (PBP) region on a single subunit, we successfully predict the assigned affinity class of the PBP in the classification step. Now to predict the experimentally measured affinity between protein monomers in solution, we just need the 3D structure of the suggested PBP on one subunit of the proposed dimer. We designed the SEPAS software and have made the software freely available for academic non-commercial research purposes at " http://biophysics.ir/affinity ". SEPAS predicts the stability of the intended dimer in a classwise manner by utilizing the computed mechanical stiffness of the introduced binding site on one subunit with the minimum accuracy of 0.72.
Collapse
Affiliation(s)
- Hamid Hadi-Alijanvand
- Department of Biological Sciences , Institute for Advanced Studies in Basic Sciences (IASBS) , Zanjan 45137-66731 , Iran
| |
Collapse
|
22
|
Jankauskaite J, Jiménez-García B, Dapkunas J, Fernández-Recio J, Moal IH. SKEMPI 2.0: an updated benchmark of changes in protein-protein binding energy, kinetics and thermodynamics upon mutation. Bioinformatics 2019; 35:462-469. [PMID: 30020414 PMCID: PMC6361233 DOI: 10.1093/bioinformatics/bty635] [Citation(s) in RCA: 141] [Impact Index Per Article: 28.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Accepted: 07/17/2018] [Indexed: 11/18/2022] Open
Abstract
Motivation Understanding the relationship between the sequence, structure, binding energy, binding kinetics and binding thermodynamics of protein–protein interactions is crucial to understanding cellular signaling, the assembly and regulation of molecular complexes, the mechanisms through which mutations lead to disease, and protein engineering. Results We present SKEMPI 2.0, a major update to our database of binding free energy changes upon mutation for structurally resolved protein–protein interactions. This version now contains manually curated binding data for 7085 mutations, an increase of 133%, including changes in kinetics for 1844 mutations, enthalpy and entropy changes for 443 mutations, and 440 mutations, which abolish detectable binding. Availability and implementation The database is available as supplementary data and at https://life.bsc.es/pid/skempi2/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Justina Jankauskaite
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
| | - Brian Jiménez-García
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Bijvoet Center for Biomolecular Research, Faculty of Science, Utrecht University, Utrecht, the Netherlands
| | - Justas Dapkunas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
| | - Juan Fernández-Recio
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Institut de Biologia Molecular de Barcelona (IBMB), CSIC, Barcelona, Spain
| | - Iain H Moal
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, UK
| |
Collapse
|
23
|
Tobias-Santos V, Guerra-Almeida D, Mury F, Ribeiro L, Berni M, Araujo H, Logullo C, Feitosa NM, de Souza-Menezes J, Pessoa Costa E, Nunes-da-Fonseca R. Multiple Roles of the Polycistronic Gene Tarsal-less/Mille-Pattes/Polished-Rice During Embryogenesis of the Kissing Bug Rhodnius prolixus. Front Ecol Evol 2019. [DOI: 10.3389/fevo.2019.00379] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
24
|
Wnt Binding Affinity Prediction for Putative Frizzled-Type Cysteine-Rich Domains. Int J Mol Sci 2019; 20:ijms20174168. [PMID: 31454915 PMCID: PMC6747125 DOI: 10.3390/ijms20174168] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Revised: 08/22/2019] [Accepted: 08/22/2019] [Indexed: 12/25/2022] Open
Abstract
Several proteins other than the frizzled receptors (Fzd) and the secreted Frizzled-related proteins (sFRP) contain Fzd-type cysteine-rich domains (CRD). We have termed these domains “putative Fzd-type CRDs”, as the relevance of Wnt signalling in the majority of these is unknown; the RORs, an exception to this, are well known for mediating non-canonical Wnt signalling. In this study, we have predicted the likely binding affinity of all Wnts for all putative Fzd-type CRDs. We applied both our previously determined Wnt‒Fzd CRD binding affinity prediction model, as well as a newly devised model wherein the lipid term was forced to contribute favourably to the predicted binding energy. The results obtained from our new model indicate that certain putative Fzd CRDs are much more likely to bind Wnts, in some cases exhibiting selectivity for specific Wnts. The results of this study inform the investigation of Wnt signalling modulation beyond Fzds and sFRPs.
Collapse
|
25
|
Porter KA, Desta I, Kozakov D, Vajda S. What method to use for protein-protein docking? Curr Opin Struct Biol 2019; 55:1-7. [PMID: 30711743 PMCID: PMC6669123 DOI: 10.1016/j.sbi.2018.12.010] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Accepted: 12/22/2018] [Indexed: 10/27/2022]
Abstract
A number of well-established servers perform 'free' docking of proteins of known structures. In contrast, template-based docking can start from sequences if structures are available for complexes that are homologous to the target. On the basis of the results of the CAPRI-CASP structure prediction experiments, template-based methods yield more accurate predictions if good templates can be found, but generally fail without such templates. However, free global docking, or focused docking around even poor quality template-based models, can still generate acceptable docked structures in these cases. In accordance with the analysis of a benchmark set, free docking of heterodimers yields acceptable or better predictions in the top 10 models for around 40% of structures. However, it is likely that a combination of template-based and free docking methods can perform better for targets that have template structures available. Another way of improving the reliability of predictions is adding experimental information as restraints, an option built into several docking servers.
Collapse
Affiliation(s)
- Kathryn A Porter
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Israel Desta
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, NY, USA.
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA; Department of Chemistry, Boston University, Boston, MA 02215, USA.
| |
Collapse
|
26
|
Soong JX, Chan SK, Lim TS, Choong YS. Optimisation of human V H domain antibodies specific to Mycobacterium tuberculosis heat shock protein (HSP16.3). J Comput Aided Mol Des 2019; 33:375-385. [PMID: 30689080 DOI: 10.1007/s10822-019-00186-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 01/22/2019] [Indexed: 11/29/2022]
Abstract
Mycobacterium tuberculosis (Mtb) 16.3 kDa heat shock protein 16.3 (HSP16.3) is a latency-associated antigen that can be targeted for latent tuberculosis (TB) diagnostic and therapeutic development. We have previously developed human VH domain antibodies (dAbs; clone E3 and F1) specific against HSP16.3. In this work, we applied computational methods to optimise and design the antibodies in order to improve the binding affinity with HSP16.3. The VH domain antibodies were first docked to the dimer form of HSP16.3 and further sampled using molecular dynamics simulation. The calculated binding free energy of the HSP16.3-dAb complexes showed non-polar interactions were responsible for the antigen-antibody association. Per-residue free energy decomposition and computational alanine scanning have identified one hotspot residue for E3 (Y391) and 4 hotspot residues for F1 (M394, Y396, R397 and M398). These hotspot residues were then mutated and evaluated by binding free energy calculations. Phage ELISA assay was carried out on the potential mutants (E3Y391W, F1M394E, F1R397N and F1M398Y). The experimental assay showed improved binding affinities of E3Y391W and F1M394E against HSP16.3 compared with the wild type E3 and F1. This case study has thus showed in silico methods are able to assist in optimisation or improvement of antibody-antigen binding.
Collapse
Affiliation(s)
- Jia Xin Soong
- Institute for Research in Molecular Medicine (INFORMM), Universiti Sains Malaysia, Minden, 11800, Penang, Malaysia
| | - Soo Khim Chan
- Institute for Research in Molecular Medicine (INFORMM), Universiti Sains Malaysia, Minden, 11800, Penang, Malaysia
| | - Theam Soon Lim
- Institute for Research in Molecular Medicine (INFORMM), Universiti Sains Malaysia, Minden, 11800, Penang, Malaysia.,Analytical Biochemistry Research Centre, Universiti Sains Malaysia, Minden, 11800, Penang, Malaysia
| | - Yee Siew Choong
- Institute for Research in Molecular Medicine (INFORMM), Universiti Sains Malaysia, Minden, 11800, Penang, Malaysia.
| |
Collapse
|
27
|
Marín-López MA, Planas-Iglesias J, Aguirre-Plans J, Bonet J, Garcia-Garcia J, Fernandez-Fuentes N, Oliva B. On the mechanisms of protein interactions: predicting their affinity from unbound tertiary structures. Bioinformatics 2018; 34:592-598. [PMID: 29028891 PMCID: PMC5860604 DOI: 10.1093/bioinformatics/btx616] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Accepted: 09/26/2017] [Indexed: 12/12/2022] Open
Abstract
Motivation The characterization of the protein–protein association mechanisms is crucial to understanding how biological processes occur. It has been previously shown that the early formation of non-specific encounters enhances the realization of the stereospecific (i.e. native) complex by reducing the dimensionality of the search process. The association rate for the formation of such complex plays a crucial role in the cell biology and depends on how the partners diffuse to be close to each other. Predicting the binding free energy of proteins provides new opportunities to modulate and control protein–protein interactions. However, existing methods require the 3D structure of the complex to predict its affinity, severely limiting their application to interactions with known structures. Results We present a new approach that relies on the unbound protein structures and protein docking to predict protein–protein binding affinities. Through the study of the docking space (i.e. decoys), the method predicts the binding affinity of the query proteins when the actual structure of the complex itself is unknown. We tested our approach on a set of globular and soluble proteins of the newest affinity benchmark, obtaining accuracy values comparable to other state-of-art methods: a 0.4 correlation coefficient between the experimental and predicted values of ΔG and an error < 3 Kcal/mol. Availability and implementation The binding affinity predictor is implemented and available at http://sbi.upf.edu/BADock and https://github.com/badocksbi/BADock. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Manuel Alejandro Marín-López
- Structural Bioinformatics Lab, Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Joan Planas-Iglesias
- Division of Metabolic and Vascular Health, University of Warwick, Coventry CV4?7AL, UK
| | - Joaquim Aguirre-Plans
- Structural Bioinformatics Lab, Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Jaume Bonet
- Laboratory of Protein Design and Immunoenginneering, School of Engineering, Ecole Polytechnique Federale de Lausanne, Lausanne 1015, Switzerland
| | - Javier Garcia-Garcia
- Structural Bioinformatics Lab, Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Narcis Fernandez-Fuentes
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth SY23?3DA, UK
| | - Baldo Oliva
- Structural Bioinformatics Lab, Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona 08003, Spain
| |
Collapse
|
28
|
Pfeiffenberger E, Bates PA. Predicting improved protein conformations with a temporal deep recurrent neural network. PLoS One 2018; 13:e0202652. [PMID: 30180164 PMCID: PMC6122789 DOI: 10.1371/journal.pone.0202652] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Accepted: 08/07/2018] [Indexed: 02/03/2023] Open
Abstract
Accurate protein structure prediction from amino acid sequence is still an unsolved problem. The most reliable methods centre on template based modelling. However, the accuracy of these models entirely depends on the availability of experimentally resolved homologous template structures. In order to generate more accurate models, extensive physics based molecular dynamics (MD) refinement simulations are performed to sample many different conformations to find improved conformational states. In this study, we propose a deep recurrent network model, called DeepTrajectory, that is able to identify these improved conformational states, with high precision, from a variety of different MD based sampling protocols. The proposed model learns the temporal patterns of features computed from MD trajectory data in order to classify whether each recorded simulation snapshot is an improved quality conformational state, decreased quality conformational state or whether there is no perceivable change in state with respect to the starting conformation. The model was trained and tested on 904 trajectories from 42 different protein systems with a cumulative number of more than 1.7 million snapshots. We show that our model outperforms other state of the art machine-learning algorithms that do not consider temporal dependencies. To our knowledge, DeepTrajectory is the first implementation of a time-dependent deep-learning protocol that is re-trainable and able to adapt to any new MD based sampling procedure, thereby demonstrating how a neural network can be used to learn the latter part of the protein folding funnel.
Collapse
Affiliation(s)
- Erik Pfeiffenberger
- Biomolecular Modelling Laboratory, The Francis Crick Institute, 1 Midland Road, London NW1 1AT, United Kingdom
| | - Paul A. Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, 1 Midland Road, London NW1 1AT, United Kingdom
| |
Collapse
|
29
|
PPInS: a repository of protein-protein interaction sitesbase. Sci Rep 2018; 8:12453. [PMID: 30127348 PMCID: PMC6102274 DOI: 10.1038/s41598-018-30999-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 08/03/2018] [Indexed: 01/14/2023] Open
Abstract
Protein-Protein Interaction Sitesbase (PPInS), a high-performance database of protein-protein interacting interfaces, is presented. The atomic level information of the molecular interaction happening amongst various protein chains in protein-protein complexes (as reported in the Protein Data Bank [PDB]) together with their evolutionary information in Structural Classification of Proteins (SCOPe release 2.06), is made available in PPInS. Total 32468 PDB files representing X-ray crystallized multimeric protein-protein complexes with structural resolution better than 2.5 Å had been shortlisted to demarcate the protein-protein interaction interfaces (PPIIs). A total of 111857 PPIIs with ~32.24 million atomic contact pairs (ACPs) were generated and made available on a web server for on-site analysis and downloading purpose. All these PPIIs and protein-protein interacting patches (PPIPs) involved in them, were also analyzed in terms of a number of residues contributing in patch formation, their hydrophobic nature, amount of surface area they contributed in binding, and their homo and heterodimeric nature, to describe the diversity of information covered in PPInS. It was observed that 42.37% of total PPIPs were made up of 6–20 interacting residues, 53.08% PPIPs had interface area ≤1000 Å2 in PPII formation, 82.64% PPIPs were reported with hydrophobicity score of ≤10, and 73.26% PPIPs were homologous to each other with the sequence similarity score ranging from 75–100%. A subset “Non-Redundant Database (NRDB)” of the PPInS containing 2265 PPIIs, with over 1.8 million ACPs corresponding to the 1931 protein-protein complexes (PDBs), was also designed by removing structural redundancies at the level of SCOP superfamily (SCOP release 1.75). The web interface of the PPInS (http://www.cup.edu.in:99/ppins/home.php) offers an easy-to-navigate, intuitive and user-friendly environment, and can be accessed by providing PDB ID, SCOP superfamily ID, and protein sequence.
Collapse
|
30
|
Raucci R, Laine E, Carbone A. Local Interaction Signal Analysis Predicts Protein-Protein Binding Affinity. Structure 2018; 26:905-915.e4. [PMID: 29779789 DOI: 10.1016/j.str.2018.04.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2017] [Revised: 02/06/2018] [Accepted: 04/10/2018] [Indexed: 12/27/2022]
Abstract
Several models estimating the strength of the interaction between proteins in a complex have been proposed. By exploring the geometry of contact distribution at protein-protein interfaces, we provide an improved model of binding energy. Local interaction signal analysis (LISA) is a radial function based on terms describing favorable and non-favorable contacts obtained by density functional theory, the support-core-rim interface residue distribution, non-interacting charged residues and secondary structures contribution. The three-dimensional organization of the contacts and their contribution on localized hot-sites over the entire interaction surface were numerically evaluated. LISA achieves a correlation of 0.81 (and a root-mean-square error of 2.35 ± 0.38 kcal/mol) when tested on 125 complexes for which experimental measurements were realized. LISA's performance is stable for subsets defined by functional composition and extent of conformational changes upon complex formation. A large-scale comparison with 17 other functions demonstrated the power of the geometrical model in the understanding of complex binding.
Collapse
Affiliation(s)
- Raffaele Raucci
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 4 Place Jussieu, 75005 Paris, France; Sorbonne Université, Institut des Sciences du Calcul et des Données (ISCD), 75005 Paris, France
| | - Elodie Laine
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 4 Place Jussieu, 75005 Paris, France
| | - Alessandra Carbone
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 4 Place Jussieu, 75005 Paris, France; Institut Universitaire de France, 75005 Paris, France.
| |
Collapse
|
31
|
Rosell M, Fernández-Recio J. Hot-spot analysis for drug discovery targeting protein-protein interactions. Expert Opin Drug Discov 2018; 13:327-338. [PMID: 29376444 DOI: 10.1080/17460441.2018.1430763] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
INTRODUCTION Protein-protein interactions are important for biological processes and pathological situations, and are attractive targets for drug discovery. However, rational drug design targeting protein-protein interactions is still highly challenging. Hot-spot residues are seen as the best option to target such interactions, but their identification requires detailed structural and energetic characterization, which is only available for a tiny fraction of protein interactions. Areas covered: In this review, the authors cover a variety of computational methods that have been reported for the energetic analysis of protein-protein interfaces in search of hot-spots, and the structural modeling of protein-protein complexes by docking. This can help to rationalize the discovery of small-molecule inhibitors of protein-protein interfaces of therapeutic interest. Computational analysis and docking can help to locate the interface, molecular dynamics can be used to find suitable cavities, and hot-spot predictions can focus the search for inhibitors of protein-protein interactions. Expert opinion: A major difficulty for applying rational drug design methods to protein-protein interactions is that in the majority of cases the complex structure is not available. Fortunately, computational docking can complement experimental data. An interesting aspect to explore in the future is the integration of these strategies for targeting PPIs with large-scale mutational analysis.
Collapse
Affiliation(s)
- Mireia Rosell
- a Department of Life Sciences , Barcelona Supercomputing Center (BSC) , Barcelona , Spain
| | - Juan Fernández-Recio
- a Department of Life Sciences , Barcelona Supercomputing Center (BSC) , Barcelona , Spain.,b Structural Biology Unit , Institut de Biologia Molecular de Barcelona (IBMB), CSIC , Barcelona , Spain
| |
Collapse
|
32
|
Abstract
The atomic structures of protein complexes can provide useful information for drug design, protein engineering, systems biology, and understanding pathology. Obtaining this information experimentally can be challenging. However, if the structures of the subunits are known, then it is often possible to model the complex computationally. This chapter provide practical guidelines for docking proteins using the SwarmDock flexible protein-protein docking method, providing an overview of the factors that need to be considered when deciding whether docking is likely to be successful, the preparation of structural input, generation of docked poses, analysis and ranking of docked poses, and the validation of models using external data.
Collapse
Affiliation(s)
- Iain H Moal
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK.
| | | | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
| |
Collapse
|
33
|
Understanding Insulin Endocrinology in Decapod Crustacea: Molecular Modelling Characterization of an Insulin-Binding Protein and Insulin-Like Peptides in the Eastern Spiny Lobster, Sagmariasus verreauxi. Int J Mol Sci 2017; 18:ijms18091832. [PMID: 28832524 PMCID: PMC5618481 DOI: 10.3390/ijms18091832] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 08/18/2017] [Accepted: 08/19/2017] [Indexed: 12/13/2022] Open
Abstract
The insulin signalling system is one of the most conserved endocrine systems of Animalia from mollusc to man. In decapod Crustacea, such as the Eastern spiny lobster, Sagmariasus verreauxi (Sv) and the red-claw crayfish, Cherax quadricarinatus (Cq), insulin endocrinology governs male sexual differentiation through the action of a male-specific, insulin-like androgenic gland peptide (IAG). To understand the bioactivity of IAG it is necessary to consider its bio-regulators such as the insulin-like growth factor binding protein (IGFBP). This work has employed various molecular modelling approaches to represent S. verreauxi IGFBP and IAG, along with additional Sv-ILP ligands, in order to characterise their binding interactions. Firstly, we present Sv- and Cq-ILP2: neuroendocrine factors that share closest homology with Drosophila ILP8 (Dilp8). We then describe the binding interaction of the N-terminal domain of Sv-IGFBP and each ILP through a synergy of computational analyses. In-depth interaction mapping and computational alanine scanning of IGFBP_N' highlight the conserved involvement of the hotspot residues Q67, G70, D71, S72, G91, G92, T93 and D94. The significance of the negatively charged residues D71 and D94 was then further exemplified by structural electrostatics. The functional importance of the negative surface charge of IGFBP is exemplified in the complementary electropositive charge on the reciprocal binding interface of all three ILP ligands. When examined, this electrostatic complementarity is the inverse of vertebrate homologues; such physicochemical divergences elucidate towards ligand-binding specificity between Phyla.
Collapse
|
34
|
Structure-based cross-docking analysis of antibody-antigen interactions. Sci Rep 2017; 7:8145. [PMID: 28811664 PMCID: PMC5557897 DOI: 10.1038/s41598-017-08414-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Accepted: 07/10/2017] [Indexed: 12/02/2022] Open
Abstract
Antibody–antigen interactions are critical to our immune response, and understanding the structure-based biophysical determinants for their binding specificity and affinity is of fundamental importance. We present a computational structure-based cross-docking study to test the identification of native antibody–antigen interaction pairs among cognate and non-cognate complexes. We picked a dataset of 17 antibody–antigen complexes of which 11 have both bound and unbound structures available, and we generated a representative ensemble of cognate and non-cognate complexes. Using the Rosetta interface score as a classifier, the cognate pair was the top-ranked model in 80% (14/17) of the antigen targets using bound monomer structures in docking, 35% (6/17) when using unbound, and 12% (2/17) when using the homology-modeled backbones to generate the complexes. Increasing rigid-body diversity of the models using RosettaDock’s local dock routine lowers the discrimination accuracy with the cognate antibody–antigen pair ranking in bound and unbound models but recovers additional top-ranked cognate complexes when using homology models. The study is the first structure-based cross-docking attempt aimed at distinguishing antibody–antigen binders from non-binders and demonstrates the challenges to address for the methods to be widely applicable to supplement high-throughput experimental antibody sequencing workflows.
Collapse
|
35
|
Moal IH, Barradas-Bautista D, Jiménez-García B, Torchala M, van der Velde A, Vreven T, Weng Z, Bates PA, Fernández-Recio J. IRaPPA: information retrieval based integration of biophysical models for protein assembly selection. Bioinformatics 2017; 33:1806-1813. [PMID: 28200016 PMCID: PMC5783285 DOI: 10.1093/bioinformatics/btx068] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2016] [Revised: 01/26/2017] [Accepted: 02/12/2017] [Indexed: 01/23/2023] Open
Abstract
MOTIVATION In order to function, proteins frequently bind to one another and form 3D assemblies. Knowledge of the atomic details of these structures helps our understanding of how proteins work together, how mutations can lead to disease, and facilitates the designing of drugs which prevent or mimic the interaction. RESULTS Atomic modeling of protein-protein interactions requires the selection of near-native structures from a set of docked poses based on their calculable properties. By considering this as an information retrieval problem, we have adapted methods developed for Internet search ranking and electoral voting into IRaPPA, a pipeline integrating biophysical properties. The approach enhances the identification of near-native structures when applied to four docking methods, resulting in a near-native appearing in the top 10 solutions for up to 50% of complexes benchmarked, and up to 70% in the top 100. AVAILABILITY AND IMPLEMENTATION IRaPPA has been implemented in the SwarmDock server ( http://bmm.crick.ac.uk/∼SwarmDock/ ), pyDock server ( http://life.bsc.es/pid/pydockrescoring/ ) and ZDOCK server ( http://zdock.umassmed.edu/ ), with code available on request. CONTACT moal@ebi.ac.uk. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Iain H Moal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
- Life Science Department, Joint BSC-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain
| | - Didier Barradas-Bautista
- Life Science Department, Joint BSC-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain
| | - Brian Jiménez-García
- Life Science Department, Joint BSC-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain
| | | | - Arjan van der Velde
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, USA
- Bioinformatics Program, Boston University, Boston, MA, USA
| | - Thom Vreven
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, USA
| | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
| | - Juan Fernández-Recio
- Life Science Department, Joint BSC-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain
| |
Collapse
|
36
|
Agostino M, Pohl SÖG, Dharmarajan A. Structure-based prediction of Wnt binding affinities for Frizzled-type cysteine-rich domains. J Biol Chem 2017; 292:11218-11229. [PMID: 28533339 DOI: 10.1074/jbc.m117.786269] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Revised: 05/09/2017] [Indexed: 11/06/2022] Open
Abstract
Wnt signaling pathways are of significant interest in development and oncogenesis. The first step in these pathways typically involves the binding of a Wnt protein to the cysteine-rich domain (CRD) of a Frizzled receptor. Wnt-Frizzled interactions can be antagonized by secreted Frizzled-related proteins (SFRPs), which also contain a Frizzled-like CRD. The large number of Wnts, Frizzleds, and SFRPs, as well as the hydrophobic nature of Wnt, poses challenges to laboratory-based investigations of interactions involving Wnt. Here, utilizing structural knowledge of a representative Wnt-Frizzled CRD interaction, as well as experimentally determined binding affinities for a selection of Wnt-Frizzled CRD interactions, we generated homology models of Wnt-Frizzled CRD interactions and developed a quantitative structure-activity relationship for predicting their binding affinities. The derived model incorporates a small selection of terms derived from scoring functions used in protein-protein docking, as well as an energetic term considering the contribution made by the lipid of Wnt to the Wnt-Frizzled binding affinity. Validation with an external test set suggests that the model can accurately predict binding affinity for 75% of cases and that the error associated with the predictions is comparable with the experimental error. The model was applied to predict the binding affinities of the full range of mouse and human Wnt-Frizzled and Wnt-SFRP interactions, indicating trends in Wnt binding affinity for Frizzled and SFRP CRDs. The comprehensive predictions made in this study provide the basis for laboratory-based studies of previously unexplored Wnt-Frizzled and Wnt-SFRP interactions, which, in turn, may reveal further Wnt signaling pathways.
Collapse
Affiliation(s)
- Mark Agostino
- From the Stem Cell and Cancer Biology Laboratory, School of Biomedical Sciences and Curtin Health Innovation Research Institute and .,Curtin Institute of Computation, Curtin University, Kent Street, Bentley, Western Australia 6102, Australia
| | - Sebastian Öther-Gee Pohl
- From the Stem Cell and Cancer Biology Laboratory, School of Biomedical Sciences and Curtin Health Innovation Research Institute and
| | - Arun Dharmarajan
- From the Stem Cell and Cancer Biology Laboratory, School of Biomedical Sciences and Curtin Health Innovation Research Institute and
| |
Collapse
|
37
|
Zarei O, Hamzeh-Mivehroud M, Benvenuti S, Ustun-Alkan F, Dastmalchi S. Characterizing the Hot Spots Involved in RON-MSPβ Complex Formation Using In Silico Alanine Scanning Mutagenesis and Molecular Dynamics Simulation. Adv Pharm Bull 2017; 7:141-150. [PMID: 28507948 PMCID: PMC5426727 DOI: 10.15171/apb.2017.018] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 03/18/2017] [Accepted: 03/20/2017] [Indexed: 12/30/2022] Open
Abstract
Purpose: Implication of protein-protein interactions (PPIs) in development of many diseases such as cancer makes them attractive for therapeutic intervention and rational drug design. RON (Recepteur d'Origine Nantais) tyrosine kinase receptor has gained considerable attention as promising target in cancer therapy. The activation of RON via its ligand, macrophage stimulation protein (MSP) is the most common mechanism of activation for this receptor. The aim of the current study was to perform in silico alanine scanning mutagenesis and to calculate binding energy for prediction of hot spots in protein-protein interface between RON and MSPβ chain (MSPβ). Methods: In this work the residues at the interface of RON-MSPβ complex were mutated to alanine and then molecular dynamics simulation was used to calculate binding free energy. Results: The results revealed that Gln193, Arg220, Glu287, Pro288, Glu289, and His424 residues from RON and Arg521, His528, Ser565, Glu658, and Arg683 from MSPβ may play important roles in protein-protein interaction between RON and MSP. Conclusion: Identification of these RON hot spots is important in designing anti-RON drugs when the aim is to disrupt RON-MSP interaction. In the same way, the acquired information regarding the critical amino acids of MSPβ can be used in the process of rational drug design for developing MSP antagonizing agents, the development of novel MSP mimicking peptides where inhibition of RON activation is required, and the design of experimental site directed mutagenesis studies.
Collapse
Affiliation(s)
- Omid Zarei
- Department of Pharmaceutical Biotechnology, Faculty of Pharmacy, Tabriz University of Medical Sciences, Tabriz, Iran.,Biotechnology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.,Students Research Committee, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Maryam Hamzeh-Mivehroud
- Biotechnology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Medicinal Chemistry, Faculty of Pharmacy, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Silvia Benvenuti
- Molecular Therapeutics and Exploratory Research Laboratory, Candiolo Cancer Institute-FPO-IRCCS, Candiolo, Turin, Italy
| | - Fulya Ustun-Alkan
- Department of Pharmacology and Toxicology, Faculty of Veterinary Medicine, Istanbul University, Istanbul, Turkey
| | - Siavoush Dastmalchi
- Biotechnology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Medicinal Chemistry, Faculty of Pharmacy, Tabriz University of Medical Sciences, Tabriz, Iran
| |
Collapse
|
38
|
Barradas-Bautista D, Moal IH, Fernández-Recio J. A systematic analysis of scoring functions in rigid-body protein docking: The delicate balance between the predictive rate improvement and the risk of overtraining. Proteins 2017; 85:1287-1297. [DOI: 10.1002/prot.25289] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Revised: 03/08/2017] [Accepted: 03/20/2017] [Indexed: 12/24/2022]
Affiliation(s)
- Didier Barradas-Bautista
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Joint BSC-CRG-IRB Research Program in Computational Biology; Barcelona 08034 Spain
| | - Iain H. Moal
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Joint BSC-CRG-IRB Research Program in Computational Biology; Barcelona 08034 Spain
- European Molecular Biology Laboratory; European Bioinformatics Institute, Wellcome Trust Genome Campus; Hinxton Cambridge CB10 1SD United Kingdom
| | - Juan Fernández-Recio
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Joint BSC-CRG-IRB Research Program in Computational Biology; Barcelona 08034 Spain
| |
Collapse
|
39
|
Computationally Design of Inhibitory Peptides Against Wnt Signaling Pathway: In Silico Insight on Complex of DKK1 and LRP6. Int J Pept Res Ther 2017. [DOI: 10.1007/s10989-017-9589-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
40
|
Pfeiffenberger E, Chaleil RA, Moal IH, Bates PA. A machine learning approach for ranking clusters of docked protein-protein complexes by pairwise cluster comparison. Proteins 2017; 85:528-543. [PMID: 27935158 PMCID: PMC5396268 DOI: 10.1002/prot.25218] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Revised: 11/14/2016] [Accepted: 11/21/2016] [Indexed: 01/28/2023]
Abstract
Reliable identification of near-native poses of docked protein-protein complexes is still an unsolved problem. The intrinsic heterogeneity of protein-protein interactions is challenging for traditional biophysical or knowledge based potentials and the identification of many false positive binding sites is not unusual. Often, ranking protocols are based on initial clustering of docked poses followed by the application of an energy function to rank each cluster according to its lowest energy member. Here, we present an approach of cluster ranking based not only on one molecular descriptor (e.g., an energy function) but also employing a large number of descriptors that are integrated in a machine learning model, whereby, an extremely randomized tree classifier based on 109 molecular descriptors is trained. The protocol is based on first locally enriching clusters with additional poses, the clusters are then characterized using features describing the distribution of molecular descriptors within the cluster, which are combined into a pairwise cluster comparison model to discriminate near-native from incorrect clusters. The results show that our approach is able to identify clusters containing near-native protein-protein complexes. In addition, we present an analysis of the descriptors with respect to their power to discriminate near native from incorrect clusters and how data transformations and recursive feature elimination can improve the ranking performance. Proteins 2017; 85:528-543. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
| | | | - Iain H. Moal
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute, Wellcome Trust Genome Campus, HinxtonCambridgeCB10 1SDUK
| | - Paul A. Bates
- Biomolecular Modelling LaboratoryThe Francis Crick InstituteLondonNW1 1ATUK
| |
Collapse
|
41
|
Vishwanath S, Sukhwal A, Sowdhamini R, Srinivasan N. Specificity and stability of transient protein-protein interactions. Curr Opin Struct Biol 2017; 44:77-86. [PMID: 28088083 DOI: 10.1016/j.sbi.2016.12.010] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2016] [Revised: 11/03/2016] [Accepted: 12/19/2016] [Indexed: 11/18/2022]
Abstract
Remarkable features that are achieved in a protein-protein complex to precise levels are stability and specificity. Deviation from the normal levels of specificity and stability, which is often caused by mutations, could result in disease conditions. Chemical nature, 3-D arrangement and dynamics of interface residues code for both specificity and stability. This article reviews roles of interfacial residues in transient protein-protein complexes. It is proposed that aside from hotspot residues conferring stability to the complex, a small set of 'rigid' residues at the interface that maintain conformation between complexed and uncomplexed forms, play a major role in conferring specificity. Exceptionally, 'super hotspot' residues, which confer both stability and specificity, are attractive sites for interaction with small molecule inhibitors.
Collapse
Affiliation(s)
- Sneha Vishwanath
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | - Anshul Sukhwal
- National Centre for Biological Sciences, TIFR, UAS-GKVK Campus, Bellary road, Bangalore 560065, India; SASTRA Deemed University, Tirumalai Samudram, Thanjavur 613402, India
| | - Ramanathan Sowdhamini
- National Centre for Biological Sciences, TIFR, UAS-GKVK Campus, Bellary road, Bangalore 560065, India
| | | |
Collapse
|
42
|
Gromiha MM, Yugandhar K, Jemimah S. Protein-protein interactions: scoring schemes and binding affinity. Curr Opin Struct Biol 2016; 44:31-38. [PMID: 27866112 DOI: 10.1016/j.sbi.2016.10.016] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 09/30/2016] [Accepted: 10/25/2016] [Indexed: 01/16/2023]
Abstract
Protein-protein interactions mediate several cellular functions, which can be understood from the information obtained using the three-dimensional structures of protein-protein complexes and binding affinity data. This review focuses on computational aspects of predicting the best native-like complex structure and binding affinities. The first part covers the prediction of protein-protein complex structures and the advantages of conformational searching and scoring functions in protein-protein docking. The second part is devoted to various aspects of protein-protein interaction thermodynamics, such as databases for binding affinities and other thermodynamic parameters, computational methods to predict the binding affinity using either the three-dimensional structures of complexes or amino acid sequences, and change in binding affinities of the complexes upon mutations. We provide the latest developments on protein-protein docking and binding affinity studies along with a list of available computational resources for understanding protein-protein interactions.
Collapse
Affiliation(s)
- M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India.
| | - K Yugandhar
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| | - Sherlyn Jemimah
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| |
Collapse
|
43
|
Xue LC, Rodrigues JP, Kastritis PL, Bonvin AM, Vangone A. PRODIGY: a web server for predicting the binding affinity of protein-protein complexes. Bioinformatics 2016; 32:3676-3678. [PMID: 27503228 DOI: 10.1093/bioinformatics/btw514] [Citation(s) in RCA: 455] [Impact Index Per Article: 56.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2016] [Revised: 07/17/2016] [Accepted: 07/30/2016] [Indexed: 11/13/2022] Open
Abstract
Gaining insights into the structural determinants of protein-protein interactions holds the key for a deeper understanding of biological functions, diseases and development of therapeutics. An important aspect of this is the ability to accurately predict the binding strength for a given protein-protein complex. Here we present PROtein binDIng enerGY prediction (PRODIGY), a web server to predict the binding affinity of protein-protein complexes from their 3D structure. The PRODIGY server implements our simple but highly effective predictive model based on intermolecular contacts and properties derived from non-interface surface. AVAILABILITY AND IMPLEMENTATION PRODIGY is freely available at: http://milou.science.uu.nl/services/PRODIGY CONTACT: a.m.j.j.bonvin@uu.nl, a.vangone@uu.nl.
Collapse
Affiliation(s)
- Li C Xue
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science - Department of Chemistry, Utrecht University, 3584CH Utrecht, The Netherlands
| | - João Pglm Rodrigues
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science - Department of Chemistry, Utrecht University, 3584CH Utrecht, The Netherlands
| | - Panagiotis L Kastritis
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science - Department of Chemistry, Utrecht University, 3584CH Utrecht, The Netherlands
| | - Alexandre Mjj Bonvin
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science - Department of Chemistry, Utrecht University, 3584CH Utrecht, The Netherlands
| | - Anna Vangone
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science - Department of Chemistry, Utrecht University, 3584CH Utrecht, The Netherlands
| |
Collapse
|
44
|
Agostino M, Mancera RL, Ramsland PA, Fernández-Recio J. Optimization of protein-protein docking for predicting Fc-protein interactions. J Mol Recognit 2016; 29:555-568. [PMID: 27445195 DOI: 10.1002/jmr.2555] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Revised: 06/12/2016] [Accepted: 06/14/2016] [Indexed: 01/08/2023]
Abstract
The antibody crystallizable fragment (Fc) is recognized by effector proteins as part of the immune system. Pathogens produce proteins that bind Fc in order to subvert or evade the immune response. The structural characterization of the determinants of Fc-protein association is essential to improve our understanding of the immune system at the molecular level and to develop new therapeutic agents. Furthermore, Fc-binding peptides and proteins are frequently used to purify therapeutic antibodies. Although several structures of Fc-protein complexes are available, numerous others have not yet been determined. Protein-protein docking could be used to investigate Fc-protein complexes; however, improved approaches are necessary to efficiently model such cases. In this study, a docking-based structural bioinformatics approach is developed for predicting the structures of Fc-protein complexes. Based on the available set of X-ray structures of Fc-protein complexes, three regions of the Fc, loosely corresponding to three turns within the structure, were defined as containing the essential features for protein recognition and used as restraints to filter the initial docking search. Rescoring the filtered poses with an optimal scoring strategy provided a success rate of approximately 80% of the test cases examined within the top ranked 20 poses, compared to approximately 20% by the initial unrestrained docking. The developed docking protocol provides a significant improvement over the initial unrestrained docking and will be valuable for predicting the structures of currently undetermined Fc-protein complexes, as well as in the design of peptides and proteins that target Fc.
Collapse
Affiliation(s)
- Mark Agostino
- School of Biomedical Sciences, Curtin Health Innovation Research Institute and Curtin Institute for Computation, Curtin University, Perth, Australia.,Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain.,Centre for Biomedical Research, Burnet Institute, Melbourne, Australia
| | - Ricardo L Mancera
- School of Biomedical Sciences, Curtin Health Innovation Research Institute and Curtin Institute for Computation, Curtin University, Perth, Australia
| | - Paul A Ramsland
- Centre for Biomedical Research, Burnet Institute, Melbourne, Australia. .,School of Science, RMIT University, Bundoora, Australia. .,Department of Surgery Austin Health, University of Melbourne, Heidelberg, Australia. .,Department of Immunology, Monash University, Alfred Medical Research and Education Precinct, Melbourne, Australia.
| | - Juan Fernández-Recio
- Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain.
| |
Collapse
|
45
|
Rigid-Docking Approaches to Explore Protein-Protein Interaction Space. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2016; 160:33-55. [PMID: 27830312 DOI: 10.1007/10_2016_41] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Protein-protein interactions play core roles in living cells, especially in the regulatory systems. As information on proteins has rapidly accumulated on publicly available databases, much effort has been made to obtain a better picture of protein-protein interaction networks using protein tertiary structure data. Predicting relevant interacting partners from their tertiary structure is a challenging task and computer science methods have the potential to assist with this. Protein-protein rigid docking has been utilized by several projects, docking-based approaches having the advantages that they can suggest binding poses of predicted binding partners which would help in understanding the interaction mechanisms and that comparing docking results of both non-binders and binders can lead to understanding the specificity of protein-protein interactions from structural viewpoints. In this review we focus on explaining current computational prediction methods to predict pairwise direct protein-protein interactions that form protein complexes.
Collapse
|
46
|
Vreven T, Moal IH, Vangone A, Pierce BG, Kastritis PL, Torchala M, Chaleil R, Jiménez-García B, Bates PA, Fernandez-Recio J, Bonvin AMJJ, Weng Z. Updates to the Integrated Protein-Protein Interaction Benchmarks: Docking Benchmark Version 5 and Affinity Benchmark Version 2. J Mol Biol 2015; 427:3031-41. [PMID: 26231283 PMCID: PMC4677049 DOI: 10.1016/j.jmb.2015.07.016] [Citation(s) in RCA: 248] [Impact Index Per Article: 27.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Revised: 07/17/2015] [Accepted: 07/17/2015] [Indexed: 01/31/2023]
Abstract
We present an updated and integrated version of our widely used protein-protein docking and binding affinity benchmarks. The benchmarks consist of non-redundant, high-quality structures of protein-protein complexes along with the unbound structures of their components. Fifty-five new complexes were added to the docking benchmark, 35 of which have experimentally measured binding affinities. These updated docking and affinity benchmarks now contain 230 and 179 entries, respectively. In particular, the number of antibody-antigen complexes has increased significantly, by 67% and 74% in the docking and affinity benchmarks, respectively. We tested previously developed docking and affinity prediction algorithms on the new cases. Considering only the top 10 docking predictions per benchmark case, a prediction accuracy of 38% is achieved on all 55 cases and up to 50% for the 32 rigid-body cases only. Predicted affinity scores are found to correlate with experimental binding energies up to r=0.52 overall and r=0.72 for the rigid complexes.
Collapse
Affiliation(s)
- Thom Vreven
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Iain H Moal
- Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center, C/Jordi Girona 29, 08034 Barcelona, Spain
| | - Anna Vangone
- Bijvoet Center for Biomolecular Research, Faculty of Science, Utrecht University, 3584CH Utrecht, The Netherlands
| | - Brian G Pierce
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Panagiotis L Kastritis
- Bijvoet Center for Biomolecular Research, Faculty of Science, Utrecht University, 3584CH Utrecht, The Netherlands
| | - Mieczyslaw Torchala
- Biomolecular Modelling Laboratory, The Francis Crick Institute, Lincoln's Inn Fields Laboratory, London WC2A 3LY, United Kingdom
| | - Raphael Chaleil
- Biomolecular Modelling Laboratory, The Francis Crick Institute, Lincoln's Inn Fields Laboratory, London WC2A 3LY, United Kingdom
| | - Brian Jiménez-García
- Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center, C/Jordi Girona 29, 08034 Barcelona, Spain
| | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, Lincoln's Inn Fields Laboratory, London WC2A 3LY, United Kingdom.
| | - Juan Fernandez-Recio
- Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center, C/Jordi Girona 29, 08034 Barcelona, Spain.
| | - Alexandre M J J Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science, Utrecht University, 3584CH Utrecht, The Netherlands.
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| |
Collapse
|
47
|
Vangone A, Bonvin AM. Contacts-based prediction of binding affinity in protein-protein complexes. eLife 2015. [PMID: 26193119 DOI: 10.7554/elife07454] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/12/2023] Open
Abstract
Almost all critical functions in cells rely on specific protein-protein interactions. Understanding these is therefore crucial in the investigation of biological systems. Despite all past efforts, we still lack a thorough understanding of the energetics of association of proteins. Here, we introduce a new and simple approach to predict binding affinity based on functional and structural features of the biological system, namely the network of interfacial contacts. We assess its performance against a protein-protein binding affinity benchmark and show that both experimental methods used for affinity measurements and conformational changes have a strong impact on prediction accuracy. Using a subset of complexes with reliable experimental binding affinities and combining our contacts and contact-types-based model with recent observations on the role of the non-interacting surface in protein-protein interactions, we reach a high prediction accuracy for such a diverse dataset outperforming all other tested methods.
Collapse
Affiliation(s)
- Anna Vangone
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science-Chemistry, Utrecht University, Utrecht, Netherlands
| | - Alexandre Mjj Bonvin
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science-Chemistry, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
48
|
Vangone A, Bonvin AMJJ. Contacts-based prediction of binding affinity in protein-protein complexes. eLife 2015; 4:e07454. [PMID: 26193119 PMCID: PMC4523921 DOI: 10.7554/elife.07454] [Citation(s) in RCA: 302] [Impact Index Per Article: 33.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Accepted: 07/08/2015] [Indexed: 12/13/2022] Open
Abstract
Almost all critical functions in cells rely on specific protein-protein interactions. Understanding these is therefore crucial in the investigation of biological systems. Despite all past efforts, we still lack a thorough understanding of the energetics of association of proteins. Here, we introduce a new and simple approach to predict binding affinity based on functional and structural features of the biological system, namely the network of interfacial contacts. We assess its performance against a protein-protein binding affinity benchmark and show that both experimental methods used for affinity measurements and conformational changes have a strong impact on prediction accuracy. Using a subset of complexes with reliable experimental binding affinities and combining our contacts and contact-types-based model with recent observations on the role of the non-interacting surface in protein-protein interactions, we reach a high prediction accuracy for such a diverse dataset outperforming all other tested methods.
Collapse
Affiliation(s)
- Anna Vangone
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry, Utrecht University, Utrecht, Netherlands
| | - Alexandre MJJ Bonvin
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
49
|
Moal IH, Dapkūnas J, Fernández-Recio J. Inferring the microscopic surface energy of protein-protein interfaces from mutation data. Proteins 2015; 83:640-50. [PMID: 25586563 DOI: 10.1002/prot.24761] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 12/04/2014] [Accepted: 12/21/2014] [Indexed: 11/11/2022]
Abstract
Mutations at protein-protein recognition sites alter binding strength by altering the chemical nature of the interacting surfaces. We present a simple surface energy model, parameterized with empirical ΔΔG values, yielding mean energies of -48 cal mol(-1) Å(-2) for interactions between hydrophobic surfaces, -51 to -80 cal mol(-1) Å(-2) for surfaces of complementary charge, and 66-83 cal mol(-1) Å(-2) for electrostatically repelling surfaces, relative to the aqueous phase. This places the mean energy of hydrophobic surface burial at -24 cal mol(-1) Å(-2) . Despite neglecting configurational entropy and intramolecular changes, the model correlates with empirical binding free energies of a functionally diverse set of rigid-body interactions (r = 0.66). When used to rerank docking poses, it can place near-native solutions in the top 10 for 37% of the complexes evaluated, and 82% in the top 100. The method shows that hydrophobic burial is the driving force for protein association, accounting for 50-95% of the cohesive energy. The model is available open-source from http://life.bsc.es/pid/web/surface_energy/ and via the CCharpPPI web server http://life.bsc.es/pid/ccharppi/.
Collapse
Affiliation(s)
- Iain H Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, 08034, Spain
| | | | | |
Collapse
|