1
|
Nikam R, Yugandhar K, Gromiha MM. DeepBSRPred: deep learning-based binding site residue prediction for proteins. Amino Acids 2023; 55:1305-1316. [PMID: 36574037 DOI: 10.1007/s00726-022-03228-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 12/15/2022] [Indexed: 12/28/2022]
Abstract
MOTIVATION Proteins-protein interactions (PPIs) are important to govern several cellular activities. Amino acid residues, which are located at the interface are known as the binding sites and the information about binding sites helps to understand the binding affinities and functions of protein-protein complexes. RESULTS We have developed a deep neural network-based method, DeepBSRPred, for predicting the binding sites using protein sequence information and predicted structures from AlphaFold2. Specific sequence and structure-based features include position-specific scoring matrix (PSSM), solvent accessible surface area, conservation score and amino acid properties, and residue depth, respectively. Our method predicted the binding sites with an average F1 score of 0.73 in a dataset of 1236 proteins. Further, we compared the performance with other existing methods in the literature using four benchmark datasets and our method outperformed those methods. AVAILABILITY AND IMPLEMENTATION The DeepBSRPred web server can be found at https://web.iitm.ac.in/bioinfo2/deepbsrpred/index.html , along with all datasets used in this study. The trained models, the DeepBSRPred standalone source code, and the feature computation pipeline are freely available at https://web.iitm.ac.in/bioinfo2/deepbsrpred/download.html .
Collapse
Affiliation(s)
- Rahul Nikam
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
| | - Kumar Yugandhar
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
- Department of Computational Biology, Cornell University, New York, NY, USA
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India.
- Department of Computer Science, Tokyo Institute of Technology, Yokohama, Japan.
| |
Collapse
|
2
|
Avery C, Patterson J, Grear T, Frater T, Jacobs DJ. Protein Function Analysis through Machine Learning. Biomolecules 2022; 12:1246. [PMID: 36139085 PMCID: PMC9496392 DOI: 10.3390/biom12091246] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 08/22/2022] [Accepted: 08/31/2022] [Indexed: 11/16/2022] Open
Abstract
Machine learning (ML) has been an important arsenal in computational biology used to elucidate protein function for decades. With the recent burgeoning of novel ML methods and applications, new ML approaches have been incorporated into many areas of computational biology dealing with protein function. We examine how ML has been integrated into a wide range of computational models to improve prediction accuracy and gain a better understanding of protein function. The applications discussed are protein structure prediction, protein engineering using sequence modifications to achieve stability and druggability characteristics, molecular docking in terms of protein-ligand binding, including allosteric effects, protein-protein interactions and protein-centric drug discovery. To quantify the mechanisms underlying protein function, a holistic approach that takes structure, flexibility, stability, and dynamics into account is required, as these aspects become inseparable through their interdependence. Another key component of protein function is conformational dynamics, which often manifest as protein kinetics. Computational methods that use ML to generate representative conformational ensembles and quantify differences in conformational ensembles important for function are included in this review. Future opportunities are highlighted for each of these topics.
Collapse
Affiliation(s)
- Chris Avery
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - John Patterson
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Tyler Grear
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Theodore Frater
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Donald J. Jacobs
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| |
Collapse
|
3
|
Molecular determinants of inhibition of the human proton channel hHv1 by the designer peptide C6 and a bivalent derivative. Proc Natl Acad Sci U S A 2022; 119:e2120750119. [PMID: 35648818 PMCID: PMC9191634 DOI: 10.1073/pnas.2120750119] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We designed C6 peptide to address the absence of specific inhibitors of human voltage-gated proton channels (hHv1). Two C6 bind to the two hHv1 voltage sensors at the resting state, inhibiting activation on depolarization. Here, we identify the C6–hHv1 binding interface using tethered-toxin variants and channel mutants, unveil an important role for negatively charged lipids, and present a model of the C6–hHv1 complex. Inspired by nature, we create a peptide with two C6 epitopes (C62) that binds to both channel subunits simultaneously, yielding picomolar affinity and significantly improved inhibition at high potentials. C6 and C62 are peptides designed to regulate hHv1, a channel involved in innate immune-system inflammatory pathophysiology, sperm capacitation, cancer-cell proliferation, and tissue damage in ischemic stroke. The human voltage-gated proton channel (hHv1) is important for control of intracellular pH. We designed C6, a specific peptide inhibitor of hHv1, to evaluate the roles of the channel in sperm capacitation and in the inflammatory immune response of neutrophils [R. Zhao et al., Proc. Natl. Acad. Sci. U.S.A. 115, E11847–E11856 (2018)]. One C6 binds with nanomolar affinity to each of the two S3–S4 voltage-sensor loops in hHv1 in cooperative fashion so that C6-bound channels require greater depolarization to open and do so more slowly. As depolarization drives hHv1 sensors outwardly, C6 affinity decreases, and inhibition is partial. Here, we identified residues essential to C6–hHv1 binding by scanning mutagenesis, five in the hHv1 S3–S4 loops and seven on C6. A structural model of the C6–hHv1 complex was then generated by molecular dynamics simulations and validated by mutant-cycle analysis. Guided by this model, we created a bivalent C6 peptide (C62) that binds simultaneously to both hHv1 subunits and fully inhibits current with picomolar affinity. The results help delineate the structural basis for C6 state-dependent inhibition, support an anionic lipid-mediated binding mechanism, and offer molecular insight into the effectiveness of engineered C6 as a therapeutic agent or lead.
Collapse
|
4
|
Lee YCJ, Shirkey JD, Park J, Bisht K, Cowan AJ. An Overview of Antiviral Peptides and Rational Biodesign Considerations. BIODESIGN RESEARCH 2022; 2022:9898241. [PMID: 37850133 PMCID: PMC10521750 DOI: 10.34133/2022/9898241] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 04/04/2022] [Indexed: 10/19/2023] Open
Abstract
Viral diseases have contributed significantly to worldwide morbidity and mortality throughout history. Despite the existence of therapeutic treatments for many viral infections, antiviral resistance and the threat posed by novel viruses highlight the need for an increased number of effective therapeutics. In addition to small molecule drugs and biologics, antimicrobial peptides (AMPs) represent an emerging class of potential antiviral therapeutics. While AMPs have traditionally been regarded in the context of their antibacterial activities, many AMPs are now known to be antiviral. These antiviral peptides (AVPs) have been shown to target and perturb viral membrane envelopes and inhibit various stages of the viral life cycle, from preattachment inhibition through viral release from infected host cells. Rational design of AMPs has also proven effective in identifying highly active and specific peptides and can aid in the discovery of lead peptides with high therapeutic selectivity. In this review, we highlight AVPs with strong antiviral activity largely curated from a publicly available AMP database. We then compile the sequences present in our AVP database to generate structural predictions of generic AVP motifs. Finally, we cover the rational design approaches available for AVPs taking into account approaches currently used for the rational design of AMPs.
Collapse
Affiliation(s)
- Ying-Chiang J. Lee
- Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA
| | - Jaden D. Shirkey
- Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA
| | - Jongbeom Park
- Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA
| | - Karishma Bisht
- Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA
| | - Alexis J. Cowan
- Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA
| |
Collapse
|
5
|
Singh A, Kumar P, Sarvagalla S, Bharadwaj T, Nayak N, Coumar MS, Giri R, Garg N. Functional inhibition of c-Myc using novel inhibitors identified through “hot spot” targeting. J Biol Chem 2022; 298:101898. [PMID: 35378126 PMCID: PMC9065629 DOI: 10.1016/j.jbc.2022.101898] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 03/18/2022] [Accepted: 03/28/2022] [Indexed: 12/14/2022] Open
Abstract
Protein–protein interactions drive various biological processes in healthy as well as disease states. The transcription factor c-Myc plays a crucial role in maintaining cellular homeostasis, and its deregulated expression is linked to various human cancers; therefore, it can be considered a viable target for cancer therapeutics. However, the structural heterogeneity of c-Myc due to its disordered nature poses a major challenge to drug discovery. In the present study, we used an in silico alanine scanning mutagenesis approach to identify “hot spot” residues within the c-Myc/Myc-associated factor X interface, which is highly disordered and has not yet been systematically analyzed for potential small molecule binding sites. We then used the information gained from this analysis to screen potential inhibitors using a conformation ensemble approach. The fluorescence-based biophysical experiments showed that the identified hit molecules displayed noncovalent interactions with these hot spot residues, and further cell-based experiments showed substantial in vitro potency against diverse c-Myc-expressing cancer/stem cells by deregulating c-Myc activity. These biophysical and computational studies demonstrated stable binding of the hit compounds with the disordered c-Myc protein. Collectively, our data indicated effective drug targeting of the disordered c-Myc protein via the determination of hot spot residues in the c-Myc/Myc-associated factor X heterodimer.
Collapse
|
6
|
Engineering of Cytolethal Distending Toxin B by Its Reducing Immunogenicity and Maintaining Stability as a New Drug Candidate for Tumor Therapy; an In Silico Study. Toxins (Basel) 2021; 13:toxins13110785. [PMID: 34822569 PMCID: PMC8624547 DOI: 10.3390/toxins13110785] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Revised: 10/30/2021] [Accepted: 11/01/2021] [Indexed: 12/25/2022] Open
Abstract
The cytolethal distending toxin (CDT), Haemophilus ducreyi, is one of the bacterial toxins that have recently been considered for targeted therapies, especially in cancer therapies. CDT is an A-B2 exotoxin. Its catalytic subunit (CdtB) is capable of inducing DNA double strand breaks, cell cycle arrest and apoptosis in host eukaryotic cells. The sequence alignment indicates that the CdtB is structurally homologyr to phosphatases and deoxyribonucleases I (DNase I). Recently, it has been found that CdtB toxicity is mainly related to its nuclease activity. The immunogenicity of CDT can reduce its effectiveness in targeted therapies. However, the toxin can be very useful if its immunogenicity is significantly reduced. Detecting hotspot ectopic residues by computational servers and then mutating them to eliminate B-cell epitopes is a promising approach to reduce the immunogenicity of foreign protein-based therapeutics. By the mentioned method, in this study, we try to reduce the immunogenicity of the CdtB- protein sequence. This study initially screened residue of the CdtB is B-cell epitopes both linearly and conformationally. By overlapping the B-cell epitopes with the excluded conserve residues, and active and enzymatic sites, four residues were allowed to be mutated. There were two mutein options that show reduced antigenicity probability. Option one was N19F, G74I, and S161F with a VaxiJen score of 0.45 and the immune epitope database (IEDB) score of 1.80, and option two was N19F, G74I, and S161W with a VaxiJen score of 0.45 and IEDB score of 1.88. The 3D structure of the proposed sequences was evaluated and refined. The structural stability of native and mutant proteins was accessed through molecular dynamic simulation. The results showed that the mutations in the mutants caused no considerable changes in their structural stability. However, mutant 1 reveals more thermodynamic stability during the simulation. The applied approaches in this study can be used as rough guidelines for finding hot spot immunogen regions in the therapeutic proteins. Our results provide a new version of CdtB that, due to reduced immunogenicity and increased stability, can be used in toxin-based drugs such as immunotoxins.
Collapse
|
7
|
Parate S, Rampogu S, Lee G, Hong JC, Lee KW. Exploring the Binding Interaction of Raf Kinase Inhibitory Protein With the N-Terminal of C-Raf Through Molecular Docking and Molecular Dynamics Simulation. Front Mol Biosci 2021; 8:655035. [PMID: 34124147 PMCID: PMC8194344 DOI: 10.3389/fmolb.2021.655035] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Accepted: 05/14/2021] [Indexed: 12/12/2022] Open
Abstract
Protein-protein interactions are indispensable physiological processes regulating several biological functions. Despite the availability of structural information on protein-protein complexes, deciphering their complex topology remains an outstanding challenge. Raf kinase inhibitory protein (RKIP) has gained substantial attention as a favorable molecular target for numerous pathologies including cancer and Alzheimer’s disease. RKIP interferes with the RAF/MEK/ERK signaling cascade by endogenously binding with C-Raf (Raf-1 kinase) and preventing its activation. In the current investigation, the binding of RKIP with C-Raf was explored by knowledge-based protein-protein docking web-servers including HADDOCK and ZDOCK and a consensus binding mode of C-Raf/RKIP structural complex was obtained. Molecular dynamics (MD) simulations were further performed in an explicit solvent to sample the conformations for when RKIP binds to C-Raf. Some of the conserved interface residues were mutated to alanine, phenylalanine and leucine and the impact of mutations was estimated by additional MD simulations and MM/PBSA analysis for the wild-type (WT) and constructed mutant complexes. Substantial decrease in binding free energy was observed for the mutant complexes as compared to the binding free energy of WT C-Raf/RKIP structural complex. Furthermore, a considerable increase in average backbone root mean square deviation and fluctuation was perceived for the mutant complexes. Moreover, per-residue energy contribution analysis of the equilibrated simulation trajectory by HawkDock and ANCHOR web-servers was conducted to characterize the key residues for the complex formation. One residue each from C-Raf (Arg398) and RKIP (Lys80) were identified as the druggable “hot spots” constituting the core of the binding interface and corroborated by additional long-time scale (300 ns) MD simulation of Arg398Ala mutant complex. A notable conformational change in Arg398Ala mutant occurred near the mutation site as compared to the equilibrated C-Raf/RKIP native state conformation and an essential hydrogen bonding interaction was lost. The thirteen binding sites assimilated from the overall analysis were mapped onto the complex as surface and divided into active and allosteric binding sites, depending on their location at the interface. The acquired information on the predicted 3D structural complex and the detected sites aid as promising targets in designing novel inhibitors to block the C-Raf/RKIP interaction.
Collapse
Affiliation(s)
- Shraddha Parate
- Division of Life Sciences, Division of Applied Life Science (BK21 Plus), Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| | - Shailima Rampogu
- Division of Life Sciences, Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Department of Bio and Medical Big Data (BK21 Four Program), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| | - Gihwan Lee
- Division of Life Sciences, Division of Applied Life Science (BK21 Plus), Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| | - Jong Chan Hong
- Division of Life Sciences, Division of Applied Life Science (BK21 Plus), Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| | - Keun Woo Lee
- Division of Life Sciences, Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Department of Bio and Medical Big Data (BK21 Four Program), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| |
Collapse
|
8
|
Hashemi ZS, Zarei M, Fath MK, Ganji M, Farahani MS, Afsharnouri F, Pourzardosht N, Khalesi B, Jahangiri A, Rahbar MR, Khalili S. In silico Approaches for the Design and Optimization of Interfering Peptides Against Protein-Protein Interactions. Front Mol Biosci 2021; 8:669431. [PMID: 33996914 PMCID: PMC8113820 DOI: 10.3389/fmolb.2021.669431] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 04/06/2021] [Indexed: 01/01/2023] Open
Abstract
Large contact surfaces of protein-protein interactions (PPIs) remain to be an ongoing issue in the discovery and design of small molecule modulators. Peptides are intrinsically capable of exploring larger surfaces, stable, and bioavailable, and therefore bear a high therapeutic value in the treatment of various diseases, including cancer, infectious diseases, and neurodegenerative diseases. Given these promising properties, a long way has been covered in the field of targeting PPIs via peptide design strategies. In silico tools have recently become an inevitable approach for the design and optimization of these interfering peptides. Various algorithms have been developed to scrutinize the PPI interfaces. Moreover, different databases and software tools have been created to predict the peptide structures and their interactions with target protein complexes. High-throughput screening of large peptide libraries against PPIs; "hotspot" identification; structure-based and off-structure approaches of peptide design; 3D peptide modeling; peptide optimization strategies like cyclization; and peptide binding energy evaluation are among the capabilities of in silico tools. In the present study, the most recent advances in the field of in silico approaches for the design of interfering peptides against PPIs will be reviewed. The future perspective of the field and its advantages and limitations will also be pinpointed.
Collapse
Affiliation(s)
- Zahra Sadat Hashemi
- ATMP Department, Breast Cancer Research Center, Motamed Cancer Institute, Academic Center for Education, Culture and Research, Tehran, Iran
| | - Mahboubeh Zarei
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Mohsen Karami Fath
- Department of Cellular and Molecular Biology, Faculty of Biological Sciences, Kharazmi University, Tehran, Iran
| | - Mahmoud Ganji
- Department of Medical Biotechnology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Mahboube Shahrabi Farahani
- Department of Medical Biotechnology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Fatemeh Afsharnouri
- Department of Medical Biotechnology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Navid Pourzardosht
- Cellular and Molecular Research Center, Faculty of Medicine, Guilan University of Medical Sciences, Rasht, Iran
- Department of Biochemistry, Guilan University of Medical Sciences, Rasht, Iran
| | - Bahman Khalesi
- Department of Research and Production of Poultry Viral Vaccine, Razi Vaccine and Serum Research Institute, Agricultural Research Education and Extension Organization, Karaj, Iran
| | - Abolfazl Jahangiri
- Applied Microbiology Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - Mohammad Reza Rahbar
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Saeed Khalili
- Department of Biology Sciences, Shahid Rajaee Teacher Training University, Tehran, Iran
| |
Collapse
|
9
|
Meng F, Liang Z, Zhao K, Luo C. Drug design targeting active posttranslational modification protein isoforms. Med Res Rev 2020; 41:1701-1750. [PMID: 33355944 DOI: 10.1002/med.21774] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 11/29/2020] [Accepted: 12/03/2020] [Indexed: 12/11/2022]
Abstract
Modern drug design aims to discover novel lead compounds with attractable chemical profiles to enable further exploration of the intersection of chemical space and biological space. Identification of small molecules with good ligand efficiency, high activity, and selectivity is crucial toward developing effective and safe drugs. However, the intersection is one of the most challenging tasks in the pharmaceutical industry, as chemical space is almost infinity and continuous, whereas the biological space is very limited and discrete. This bottleneck potentially limits the discovery of molecules with desirable properties for lead optimization. Herein, we present a new direction leveraging posttranslational modification (PTM) protein isoforms target space to inspire drug design termed as "Post-translational Modification Inspired Drug Design (PTMI-DD)." PTMI-DD aims to extend the intersections of chemical space and biological space. We further rationalized and highlighted the importance of PTM protein isoforms and their roles in various diseases and biological functions. We then laid out a few directions to elaborate the PTMI-DD in drug design including discovering covalent binding inhibitors mimicking PTMs, targeting PTM protein isoforms with distinctive binding sites from that of wild-type counterpart, targeting protein-protein interactions involving PTMs, and hijacking protein degeneration by ubiquitination for PTM protein isoforms. These directions will lead to a significant expansion of the biological space and/or increase the tractability of compounds, primarily due to precisely targeting PTM protein isoforms or complexes which are highly relevant to biological functions. Importantly, this new avenue will further enrich the personalized treatment opportunity through precision medicine targeting PTM isoforms.
Collapse
Affiliation(s)
- Fanwang Meng
- Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,Department of Chemistry and Chemical Biology, McMaster University, Hamilton, Ontario, Canada
| | - Zhongjie Liang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Kehao Zhao
- School of Pharmacy, Key Laboratory of Molecular Pharmacology and Drug Evaluation (Yantai University), Ministry of Education, Collaborative Innovation Center of Advanced Drug Delivery System and Biotech Drugs in Universities of Shandong, Yantai University, Yantai, China
| | - Cheng Luo
- Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
10
|
Perišić O. Recognition of Potential COVID-19 Drug Treatments through the Study of Existing Protein-Drug and Protein-Protein Structures: An Analysis of Kinetically Active Residues. Biomolecules 2020; 10:biom10091346. [PMID: 32967116 PMCID: PMC7565175 DOI: 10.3390/biom10091346] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2020] [Revised: 09/09/2020] [Accepted: 09/15/2020] [Indexed: 01/18/2023] Open
Abstract
We report the results of our in silico study of approved drugs as potential treatments for COVID-19. The study is based on the analysis of normal modes of proteins. The drugs studied include chloroquine, ivermectin, remdesivir, sofosbuvir, boceprevir, and α-difluoromethylornithine (DMFO). We applied the tools we developed and standard tools used in the structural biology community. Our results indicate that small molecules selectively bind to stable, kinetically active residues and residues adjoining them on the surface of proteins and inside protein pockets, and that some prefer hydrophobic sites over other active sites. Our approach is not restricted to viruses and can facilitate rational drug design, as well as improve our understanding of molecular interactions, in general.
Collapse
Affiliation(s)
- Ognjen Perišić
- Big Blue Genomics, Vojvode Brane 32, 11000 Belgrade, Serbia
| |
Collapse
|
11
|
Tököli A, Mag B, Bartus É, Wéber E, Szakonyi G, Simon MA, Czibula Á, Monostori É, Nyitray L, Martinek TA. Proteomimetic surface fragments distinguish targets by function. Chem Sci 2020; 11:10390-10398. [PMID: 34094300 PMCID: PMC8162404 DOI: 10.1039/d0sc03525d] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Accepted: 09/09/2020] [Indexed: 11/21/2022] Open
Abstract
The fragment-centric design promises a means to develop complex xenobiotic protein surface mimetics, but it is challenging to find locally biomimetic structures. To address this issue, foldameric local surface mimetic (LSM) libraries were constructed. Protein affinity patterns, ligand promiscuity and protein druggability were evaluated using pull-down data for targets with various interaction tendencies and levels of homology. LSM probes based on H14 helices exhibited sufficient binding affinities for the detection of both orthosteric and non-orthosteric spots, and overall binding tendencies correlated with the magnitude of the target interactome. Binding was driven by two proteinogenic side chains and LSM probes could distinguish structurally similar proteins with different functions, indicating limited promiscuity. Binding patterns displayed similar side chain enrichment values to those for native protein-protein interfaces implying locally biomimetic behavior. These analyses suggest that in a fragment-centric approach foldameric LSMs can serve as useful probes and building blocks for undruggable protein interfaces.
Collapse
Affiliation(s)
- Attila Tököli
- Department of Medical Chemistry, University of Szeged Dóm tér 8 H6720 Szeged Hungary
| | - Beáta Mag
- Department of Medical Chemistry, University of Szeged Dóm tér 8 H6720 Szeged Hungary
| | - Éva Bartus
- Department of Medical Chemistry, University of Szeged Dóm tér 8 H6720 Szeged Hungary
- MTA-SZTE Biomimetic Systems Research Group, University of Szeged Dóm tér 8 H6720 Szeged Hungary
| | - Edit Wéber
- Department of Medical Chemistry, University of Szeged Dóm tér 8 H6720 Szeged Hungary
| | - Gerda Szakonyi
- Institute of Pharmaceutical Analysis, University of Szeged Somogyi u. 4. H6720 Szeged Hungary
| | - Márton A Simon
- Department of Biochemistry, Eötvös Loránd University Pázmány Péter sétány 1/C H1077 Budapest Hungary
| | - Ágnes Czibula
- Lymphocyte Signal Transduction Laboratory, Institute of Genetics, Biological Research Centre Temesvári krt. 62 H6726 Szeged Hungary
| | - Éva Monostori
- Lymphocyte Signal Transduction Laboratory, Institute of Genetics, Biological Research Centre Temesvári krt. 62 H6726 Szeged Hungary
| | - László Nyitray
- Department of Biochemistry, Eötvös Loránd University Pázmány Péter sétány 1/C H1077 Budapest Hungary
| | - Tamás A Martinek
- Department of Medical Chemistry, University of Szeged Dóm tér 8 H6720 Szeged Hungary
- MTA-SZTE Biomimetic Systems Research Group, University of Szeged Dóm tér 8 H6720 Szeged Hungary
| |
Collapse
|
12
|
Oña Chuquimarca S, Ayala-Ruano S, Goossens J, Pauwels L, Goossens A, Leon-Reyes A, Ángel Méndez M. The Molecular Basis of JAZ-MYC Coupling, a Protein-Protein Interface Essential for Plant Response to Stressors. FRONTIERS IN PLANT SCIENCE 2020; 11:1139. [PMID: 32973821 PMCID: PMC7468482 DOI: 10.3389/fpls.2020.01139] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Accepted: 07/14/2020] [Indexed: 05/29/2023]
Abstract
The jasmonic acid (JA) signaling pathway is one of the primary mechanisms that allow plants to respond to a variety of biotic and abiotic stressors. Within this pathway, the JAZ repressor proteins and the basic helix-loop-helix (bHLH) transcription factor MYC3 play a critical role. JA is a volatile organic compound with an essential role in plant immunity. The increase in the concentration of JA leads to the decoupling of the JAZ repressor proteins and the bHLH transcription factor MYC3 causing the induction of genes of interest. The primary goal of this study was to identify the molecular basis of JAZ-MYC coupling. For this purpose, we modeled and validated 12 JAZ-MYC3 3D in silico structures and developed a molecular dynamics/machine learning pipeline to obtain two outcomes. First, we calculated the average free binding energy of JAZ-MYC3 complexes, which was predicted to be -10.94 +/-2.67 kJ/mol. Second, we predicted which ones should be the interface residues that make the predominant contribution to the free energy of binding (molecular hotspots). The predicted protein hotspots matched a conserved linear motif SL••FL•••R, which may have a crucial role during MYC3 recognition of JAZ proteins. As a proof of concept, we tested, both in silico and in vitro, the importance of this motif on PEAPOD (PPD) proteins, which also belong to the TIFY protein family, like the JAZ proteins, but cannot bind to MYC3. By mutating these proteins to match the SL••FL•••R motif, we could force PPDs to bind the MYC3 transcription factor. Taken together, modeling protein-protein interactions and using machine learning will help to find essential motifs and molecular mechanisms in the JA pathway.
Collapse
Affiliation(s)
- Samara Oña Chuquimarca
- Grupo de Química Computacional y Teórica, Departamento de Ingeniería Química, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Instituto de Simulación Computacional (ISC-USFQ), Universidad San Francisco de Quito USFQ, Quito, Ecuador
| | - Sebastián Ayala-Ruano
- Grupo de Química Computacional y Teórica, Departamento de Ingeniería Química, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Instituto de Simulación Computacional (ISC-USFQ), Universidad San Francisco de Quito USFQ, Quito, Ecuador
| | - Jonas Goossens
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Laurens Pauwels
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Alain Goossens
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Antonio Leon-Reyes
- Laboratorio de Biotecnología Agrícola y de Alimentos, Ingeniería en Agronomía, Colegio de Ciencias e Ingenierías, Universidad San Francisco de Quito, Campus Cumbayá, Quito, Ecuador
- Colegio de Ciencias Biológicas y Ambientales COCIBA, Instituto de Microbiología, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Colegio de Ciencias Biológicas y Ambientales COCIBA, Instituto de Investigaciones Biológicas y Ambientales BIÓSFERA, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Department of Biology, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Miguel Ángel Méndez
- Grupo de Química Computacional y Teórica, Departamento de Ingeniería Química, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Instituto de Simulación Computacional (ISC-USFQ), Universidad San Francisco de Quito USFQ, Quito, Ecuador
| |
Collapse
|
13
|
Zhu X, Liu L, He J, Fang T, Xiong Y, Mitchell JC. iPNHOT: a knowledge-based approach for identifying protein-nucleic acid interaction hot spots. BMC Bioinformatics 2020; 21:289. [PMID: 32631222 PMCID: PMC7336410 DOI: 10.1186/s12859-020-03636-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Accepted: 06/25/2020] [Indexed: 12/20/2022] Open
Abstract
Background The interaction between proteins and nucleic acids plays pivotal roles in various biological processes such as transcription, translation, and gene regulation. Hot spots are a small set of residues that contribute most to the binding affinity of a protein-nucleic acid interaction. Compared to the extensive studies of the hot spots on protein-protein interfaces, the hot spot residues within protein-nucleic acids interfaces remain less well-studied, in part because mutagenesis data for protein-nucleic acids interaction are not as abundant as that for protein-protein interactions. Results In this study, we built a new computational model, iPNHOT, to effectively predict hot spot residues on protein-nucleic acids interfaces. One training data set and an independent test set were collected from dbAMEPNI and some recent literature, respectively. To build our model, we generated 97 different sequential and structural features and used a two-step strategy to select the relevant features. The final model was built based only on 7 features using a support vector machine (SVM). The features include two unique features such as ∆SASsa1/2 and esp3, which are newly proposed in this study. Based on the cross validation results, our model gave F1 score and AUROC as 0.725 and 0.807 on the subset collected from ProNIT, respectively, compared to 0.407 and 0.670 of mCSM-NA, a state-of-the art model to predict the thermodynamic effects of protein-nucleic acid interaction. The iPNHOT model was further tested on the independent test set, which showed that our model outperformed other methods. Conclusion In this study, by collecting data from a recently published database dbAMEPNI, we proposed a new model, iPNHOT, to predict hotspots on both protein-DNA and protein-RNA interfaces. The results show that our model outperforms the existing state-of-art models. Our model is available for users through a webserver: http://zhulab.ahu.edu.cn/iPNHOT/.
Collapse
Affiliation(s)
- Xiaolei Zhu
- School of Sciences, Anhui Agricultural University, Hefei, Anhui, China. .,School of Life Sciences, Anhui University, Hefei, Anhui, China.
| | - Ling Liu
- School of Life Sciences, Anhui University, Hefei, Anhui, China
| | - Jingjing He
- School of Life Sciences, Anhui University, Hefei, Anhui, China
| | - Ting Fang
- School of Life Sciences, Anhui University, Hefei, Anhui, China
| | - Yi Xiong
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Julie C Mitchell
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA.
| |
Collapse
|
14
|
Díaz-Valle A, Falcón-González JM, Carrillo-Tripp M. Hot Spots and Their Contribution to the Self-Assembly of the Viral Capsid: In Silico Prediction and Analysis. Int J Mol Sci 2019; 20:E5966. [PMID: 31783519 PMCID: PMC6928768 DOI: 10.3390/ijms20235966] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2019] [Revised: 11/11/2019] [Accepted: 11/15/2019] [Indexed: 02/06/2023] Open
Abstract
The viral capsid is a macromolecular complex formed by a defined number of self-assembled proteins, which, in many cases, are biopolymers with an identical amino acid sequence. Specific protein-protein interactions (PPI) drive the capsid self-assembly process, leading to several distinct protein interfaces. Following the PPI hot spot hypothesis, we present a conservation-based methodology to identify those interface residues hypothesized to be crucial elements on the self-assembly and thermodynamic stability of the capsid. We validate the predictions through a rigorous physical framework which integrates molecular dynamics simulations and free energy calculations by Umbrella sampling and the potential of mean force using an all-atom molecular representation of the capsid proteins of an icosahedral virus in an explicit solvent. Our results show that a single mutation in any of the structure-conserved hot spots significantly perturbs the quaternary protein-protein interaction, decreasing the absolute value of the binding free energy, without altering the protein's secondary nor tertiary structure. Our conservation-based hot spot prediction methodology can lead to strategies to rationally modulate the capsid's thermodynamic properties.
Collapse
Affiliation(s)
- Armando Díaz-Valle
- Biomolecular Diversity Laboratory, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional Unidad Monterrey, Vía del Conocimiento 201, Parque PIIT, C.P. 66600 Apodaca, Nuevo León, Mexico;
| | - José Marcos Falcón-González
- Unidad Profesional Interdisciplinaria de Ingeniería Campus Guanajuato, Instituto Politécnico Nacional, Av. Mineral de Valenciana No. 200, Col. Fraccionamiento Industrial Puerto Interior, C.P. 36275 Silao de la Victoria, Guanajuato, Mexico;
| | - Mauricio Carrillo-Tripp
- Biomolecular Diversity Laboratory, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional Unidad Monterrey, Vía del Conocimiento 201, Parque PIIT, C.P. 66600 Apodaca, Nuevo León, Mexico;
| |
Collapse
|
15
|
Machine Learning Approaches for Protein⁻Protein Interaction Hot Spot Prediction: Progress and Comparative Assessment. Molecules 2018; 23:molecules23102535. [PMID: 30287797 PMCID: PMC6222875 DOI: 10.3390/molecules23102535] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Revised: 09/27/2018] [Accepted: 10/02/2018] [Indexed: 12/27/2022] Open
Abstract
Hot spots are the subset of interface residues that account for most of the binding free energy, and they play essential roles in the stability of protein binding. Effectively identifying which specific interface residues of protein–protein complexes form the hot spots is critical for understanding the principles of protein interactions, and it has broad application prospects in protein design and drug development. Experimental methods like alanine scanning mutagenesis are labor-intensive and time-consuming. At present, the experimentally measured hot spots are very limited. Hence, the use of computational approaches to predicting hot spots is becoming increasingly important. Here, we describe the basic concepts and recent advances of machine learning applications in inferring the protein–protein interaction hot spots, and assess the performance of widely used features, machine learning algorithms, and existing state-of-the-art approaches. We also discuss the challenges and future directions in the prediction of hot spots.
Collapse
|
16
|
Enhanced Prediction of Hot Spots at Protein-Protein Interfaces Using Extreme Gradient Boosting. Sci Rep 2018; 8:14285. [PMID: 30250210 PMCID: PMC6155324 DOI: 10.1038/s41598-018-32511-1] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Accepted: 09/10/2018] [Indexed: 12/11/2022] Open
Abstract
Identification of hot spots, a small portion of protein-protein interface residues that contribute the majority of the binding free energy, can provide crucial information for understanding the function of proteins and studying their interactions. Based on our previous method (PredHS), we propose a new computational approach, PredHS2, that can further improve the accuracy of predicting hot spots at protein-protein interfaces. Firstly we build a new training dataset of 313 alanine-mutated interface residues extracted from 34 protein complexes. Then we generate a wide variety of 600 sequence, structure, exposure and energy features, together with Euclidean and Voronoi neighborhood properties. To remove redundant and irrelevant information, we select a set of 26 optimal features utilizing a two-step feature selection method, which consist of a minimum Redundancy Maximum Relevance (mRMR) procedure and a sequential forward selection process. Based on the selected 26 features, we use Extreme Gradient Boosting (XGBoost) to build our prediction model. Performance of our PredHS2 approach outperforms other machine learning algorithms and other state-of-the-art hot spot prediction methods on the training dataset and the independent test set (BID) respectively. Several novel features, such as solvent exposure characteristics, second structure features and disorder scores, are found to be more effective in discriminating hot spots. Moreover, the update of the training dataset and the new feature selection and classification algorithms play a vital role in improving the prediction quality.
Collapse
|
17
|
Lin X, Zhang X. Prediction of Hot Regions in PPIs Based on Improved Local Community Structure Detecting. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1470-1479. [PMID: 29994749 DOI: 10.1109/tcbb.2018.2793858] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The hot regions in PPIs are some assembly regions which are composed of the tightly packed HotSpots. The discovery of hot regions helps to understand life activities and has very important value for biological applications. The identification of hot regions is the basis for protein design and cancer prevention. The existing algorithms of predicting hot regions often have some defects, such as low accuracy and unstability. This paper proposes a novel hot region prediction method based on diverse biological characteristics. First, feature evaluation is employed by using an impoved mRMR method. Then, SVM is adopted to create cassification model based on the features selected. In addition, a new clustering algorithm, namely LCSD (Local community structure detecting), is developed to detect and analyze the conformation of hot regions. In the clustering process, the link similarity of protein residues is introduced to handle the boundary nodes. This algorithm can effectively deal with the missing residue nodes and control the local community boundaries. The results indicate that the spatial structure of hot regions can be obtained more effectively, and that our method is more effective than previous methods for precise identification of hot regions.
Collapse
|
18
|
PPInS: a repository of protein-protein interaction sitesbase. Sci Rep 2018; 8:12453. [PMID: 30127348 PMCID: PMC6102274 DOI: 10.1038/s41598-018-30999-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 08/03/2018] [Indexed: 01/14/2023] Open
Abstract
Protein-Protein Interaction Sitesbase (PPInS), a high-performance database of protein-protein interacting interfaces, is presented. The atomic level information of the molecular interaction happening amongst various protein chains in protein-protein complexes (as reported in the Protein Data Bank [PDB]) together with their evolutionary information in Structural Classification of Proteins (SCOPe release 2.06), is made available in PPInS. Total 32468 PDB files representing X-ray crystallized multimeric protein-protein complexes with structural resolution better than 2.5 Å had been shortlisted to demarcate the protein-protein interaction interfaces (PPIIs). A total of 111857 PPIIs with ~32.24 million atomic contact pairs (ACPs) were generated and made available on a web server for on-site analysis and downloading purpose. All these PPIIs and protein-protein interacting patches (PPIPs) involved in them, were also analyzed in terms of a number of residues contributing in patch formation, their hydrophobic nature, amount of surface area they contributed in binding, and their homo and heterodimeric nature, to describe the diversity of information covered in PPInS. It was observed that 42.37% of total PPIPs were made up of 6–20 interacting residues, 53.08% PPIPs had interface area ≤1000 Å2 in PPII formation, 82.64% PPIPs were reported with hydrophobicity score of ≤10, and 73.26% PPIPs were homologous to each other with the sequence similarity score ranging from 75–100%. A subset “Non-Redundant Database (NRDB)” of the PPInS containing 2265 PPIIs, with over 1.8 million ACPs corresponding to the 1931 protein-protein complexes (PDBs), was also designed by removing structural redundancies at the level of SCOP superfamily (SCOP release 1.75). The web interface of the PPInS (http://www.cup.edu.in:99/ppins/home.php) offers an easy-to-navigate, intuitive and user-friendly environment, and can be accessed by providing PDB ID, SCOP superfamily ID, and protein sequence.
Collapse
|
19
|
Heterodimer Binding Scaffolds Recognition via the Analysis of Kinetically Hot Residues. Pharmaceuticals (Basel) 2018; 11:ph11010029. [PMID: 29547506 PMCID: PMC5874725 DOI: 10.3390/ph11010029] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2017] [Revised: 03/06/2018] [Accepted: 03/08/2018] [Indexed: 12/13/2022] Open
Abstract
Physical interactions between proteins are often difficult to decipher. The aim of this paper is to present an algorithm that is designed to recognize binding patches and supporting structural scaffolds of interacting heterodimer proteins using the Gaussian Network Model (GNM). The recognition is based on the (self) adjustable identification of kinetically hot residues and their connection to possible binding scaffolds. The kinetically hot residues are residues with the lowest entropy, i.e., the highest contribution to the weighted sum of the fastest modes per chain extracted via GNM. The algorithm adjusts the number of fast modes in the GNM's weighted sum calculation using the ratio of predicted and expected numbers of target residues (contact and the neighboring first-layer residues). This approach produces very good results when applied to dimers with high protein sequence length ratios. The protocol's ability to recognize near native decoys was compared to the ability of the residue-level statistical potential of Lu and Skolnick using the Sternberg and Vakser decoy dimers sets. The statistical potential produced better overall results, but in a number of cases its predicting ability was comparable, or even inferior, to the prediction ability of the adjustable GNM approach. The results presented in this paper suggest that in heterodimers at least one protein has interacting scaffold determined by the immovable, kinetically hot residues. In many cases, interacting proteins (especially if being of noticeably different sizes) either behave as a rigid lock and key or, presumably, exhibit the opposite dynamic behavior. While the binding surface of one protein is rigid and stable, its partner's interacting scaffold is more flexible and adaptable.
Collapse
|
20
|
Qiao Y, Xiong Y, Gao H, Zhu X, Chen P. Protein-protein interface hot spots prediction based on a hybrid feature selection strategy. BMC Bioinformatics 2018; 19:14. [PMID: 29334889 PMCID: PMC5769548 DOI: 10.1186/s12859-018-2009-5] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 01/03/2018] [Indexed: 11/16/2022] Open
Abstract
Background Hot spots are interface residues that contribute most binding affinity to protein-protein interaction. A compact and relevant feature subset is important for building machine learning methods to predict hot spots on protein-protein interfaces. Although different methods have been used to detect the relevant feature subset from a variety of features related to interface residues, it is still a challenge to detect the optimal feature subset for building the final model. Results In this study, three different feature selection methods were compared to propose a new hybrid feature selection strategy. This new strategy was proved to effectively reduce the feature space when we were building the prediction models for identifying hotspot residues. It was tested on eighty-two features, both conventional and newly proposed. According to the strategy, combining the feature subsets selected by decision tree and mRMR (maximum Relevance Minimum Redundancy) individually, we were able to build a model with 6 features by using a PSFS (Pseudo Sequential Forward Selection) process. Compared with other state-of-art methods for the independent test set, our model had shown better or comparable predictive performances (with F-measure 0.622 and recall 0.821). Analysis of the 6 features confirmed that our newly proposed feature CNSV_REL1 was important for our model. The analysis also showed that the complementarity between features should be considered as an important aspect when conducting the feature selection. Conclusion In this study, most important of all, a new strategy for feature selection was proposed and proved to be effective in selecting the optimal feature subset for building prediction models, which can be used to predict hot spot residues on protein-protein interfaces. Moreover, two aspects, the generalization of the single feature and the complementarity between features, were proved to be of great importance and should be considered in feature selection methods. Finally, our newly proposed feature CNSV_REL1 had been proved an alternative and effective feature in predicting hot spots by our study. Our model is available for users through a webserver: http://zhulab.ahu.edu.cn/iPPHOT/. Electronic supplementary material The online version of this article (10.1186/s12859-018-2009-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yanhua Qiao
- School of Life Sciences, Anhui University, Hefei, Anhui, 230601, China
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, Shanghai JiaoTong University, Shanghai, 200240, China.,School of Life Sciences and Biotechnology, Shanghai JiaoTong University, Shanghai, 200240, China
| | - Hongyun Gao
- Information and Engineering College, Dalian University, Dalian, Liaoning, 116622, China
| | - Xiaolei Zhu
- School of Life Sciences, Anhui University, Hefei, Anhui, 230601, China.
| | - Peng Chen
- Institute of Health Sciences, Anhui University, Hefei, Anhui, 230601, China.
| |
Collapse
|
21
|
Jiang J, Wang N, Chen P, Zheng C, Wang B. Prediction of Protein Hotspots from Whole Protein Sequences by a Random Projection Ensemble System. Int J Mol Sci 2017; 18:ijms18071543. [PMID: 28718782 PMCID: PMC5536031 DOI: 10.3390/ijms18071543] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2017] [Revised: 07/03/2017] [Accepted: 07/05/2017] [Indexed: 11/30/2022] Open
Abstract
Hotspot residues are important in the determination of protein-protein interactions, and they always perform specific functions in biological processes. The determination of hotspot residues is by the commonly-used method of alanine scanning mutagenesis experiments, which is always costly and time consuming. To address this issue, computational methods have been developed. Most of them are structure based, i.e., using the information of solved protein structures. However, the number of solved protein structures is extremely less than that of sequences. Moreover, almost all of the predictors identified hotspots from the interfaces of protein complexes, seldom from the whole protein sequences. Therefore, determining hotspots from whole protein sequences by sequence information alone is urgent. To address the issue of hotspot predictions from the whole sequences of proteins, we proposed an ensemble system with random projections using statistical physicochemical properties of amino acids. First, an encoding scheme involving sequence profiles of residues and physicochemical properties from the AAindex1 dataset is developed. Then, the random projection technique was adopted to project the encoding instances into a reduced space. Then, several better random projections were obtained by training an IBk classifier based on the training dataset, which were thus applied to the test dataset. The ensemble of random projection classifiers is therefore obtained. Experimental results showed that although the performance of our method is not good enough for real applications of hotspots, it is very promising in the determination of hotspot residues from whole sequences.
Collapse
Affiliation(s)
- Jinjian Jiang
- School of Electronics and Information Engineering, Anhui University, Hefei 230601, China.
- School of Computer and Information, Anqing Normal University, Anqing 246133, China.
| | - Nian Wang
- School of Electronics and Information Engineering, Anhui University, Hefei 230601, China.
| | - Peng Chen
- Institute of Health Sciences, Anhui University, Hefei 230601, China.
| | - Chunhou Zheng
- School of Electronic Engineering & Automation, Anhui University, Hefei 230601, China.
| | - Bing Wang
- School of Electrical and Information Engineering, Anhui University of Technology, Ma'anshan 243032, China.
| |
Collapse
|
22
|
Zhou S, Wang Q, Ren M, Zhang A, Liu H, Yao X. Molecular dynamics simulation on the inhibition mechanism of peptide-based inhibitor of islet amyloid polypeptide (IAPP) to islet amyloid polypeptide (IAPP22-28) oligomers. Chem Biol Drug Des 2017; 90:31-39. [DOI: 10.1111/cbdd.12924] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2016] [Revised: 11/18/2016] [Accepted: 11/27/2016] [Indexed: 12/21/2022]
Affiliation(s)
- Shuangyan Zhou
- State Key Laboratory of Applied Organic Chemistry and Department of Chemistry; Lanzhou University; Lanzhou China
- School of Pharmacy; Lanzhou University; Lanzhou China
| | - Qianqian Wang
- State Key Laboratory of Quality Research in Chinese Medicine; Macau Institute for Applied Research in Medicine and Health; Macau University of Science and Technology; Taipa Macau China
| | - Mengdan Ren
- State Key Laboratory of Applied Organic Chemistry and Department of Chemistry; Lanzhou University; Lanzhou China
| | - Ai Zhang
- School of Pharmacy; Lanzhou University; Lanzhou China
| | - Huanxiang Liu
- School of Pharmacy; Lanzhou University; Lanzhou China
| | - Xiaojun Yao
- State Key Laboratory of Applied Organic Chemistry and Department of Chemistry; Lanzhou University; Lanzhou China
- State Key Laboratory of Quality Research in Chinese Medicine; Macau Institute for Applied Research in Medicine and Health; Macau University of Science and Technology; Taipa Macau China
| |
Collapse
|
23
|
Vishwanath S, Sukhwal A, Sowdhamini R, Srinivasan N. Specificity and stability of transient protein-protein interactions. Curr Opin Struct Biol 2017; 44:77-86. [PMID: 28088083 DOI: 10.1016/j.sbi.2016.12.010] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2016] [Revised: 11/03/2016] [Accepted: 12/19/2016] [Indexed: 11/18/2022]
Abstract
Remarkable features that are achieved in a protein-protein complex to precise levels are stability and specificity. Deviation from the normal levels of specificity and stability, which is often caused by mutations, could result in disease conditions. Chemical nature, 3-D arrangement and dynamics of interface residues code for both specificity and stability. This article reviews roles of interfacial residues in transient protein-protein complexes. It is proposed that aside from hotspot residues conferring stability to the complex, a small set of 'rigid' residues at the interface that maintain conformation between complexed and uncomplexed forms, play a major role in conferring specificity. Exceptionally, 'super hotspot' residues, which confer both stability and specificity, are attractive sites for interaction with small molecule inhibitors.
Collapse
Affiliation(s)
- Sneha Vishwanath
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | - Anshul Sukhwal
- National Centre for Biological Sciences, TIFR, UAS-GKVK Campus, Bellary road, Bangalore 560065, India; SASTRA Deemed University, Tirumalai Samudram, Thanjavur 613402, India
| | - Ramanathan Sowdhamini
- National Centre for Biological Sciences, TIFR, UAS-GKVK Campus, Bellary road, Bangalore 560065, India
| | | |
Collapse
|
24
|
Lugo-Martinez J, Pejaver V, Pagel KA, Jain S, Mort M, Cooper DN, Mooney SD, Radivojac P. The Loss and Gain of Functional Amino Acid Residues Is a Common Mechanism Causing Human Inherited Disease. PLoS Comput Biol 2016; 12:e1005091. [PMID: 27564311 PMCID: PMC5001644 DOI: 10.1371/journal.pcbi.1005091] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2015] [Accepted: 08/02/2016] [Indexed: 01/12/2023] Open
Abstract
Elucidating the precise molecular events altered by disease-causing genetic variants represents a major challenge in translational bioinformatics. To this end, many studies have investigated the structural and functional impact of amino acid substitutions. Most of these studies were however limited in scope to either individual molecular functions or were concerned with functional effects (e.g. deleterious vs. neutral) without specifically considering possible molecular alterations. The recent growth of structural, molecular and genetic data presents an opportunity for more comprehensive studies to consider the structural environment of a residue of interest, to hypothesize specific molecular effects of sequence variants and to statistically associate these effects with genetic disease. In this study, we analyzed data sets of disease-causing and putatively neutral human variants mapped to protein 3D structures as part of a systematic study of the loss and gain of various types of functional attribute potentially underlying pathogenic molecular alterations. We first propose a formal model to assess probabilistically function-impacting variants. We then develop an array of structure-based functional residue predictors, evaluate their performance, and use them to quantify the impact of disease-causing amino acid substitutions on catalytic activity, metal binding, macromolecular binding, ligand binding, allosteric regulation and post-translational modifications. We show that our methodology generates actionable biological hypotheses for up to 41% of disease-causing genetic variants mapped to protein structures suggesting that it can be reliably used to guide experimental validation. Our results suggest that a significant fraction of disease-causing human variants mapping to protein structures are function-altering both in the presence and absence of stability disruption. Identifying the molecular changes caused by mutations is a major challenge in understanding and treating human genetic disease. To address this problem, we have developed a wide range of profiling tools designed to predict specific types of functional site from protein 3D structures. We then apply these tools to data sets of inherited disease-associated and putatively neutral amino acid substitutions and estimate the relative contribution of the loss and gain of functional residues in disease. Our results suggest that alterations of molecular function are involved in a significant number of cases of human genetic disease and are over-represented as compared to putatively neutral variants. Additionally, we use experimental data to show that it is possible to computationally identify the loss of specific functional events in disease pathogenesis. Finally, our methodology can be used to reliably identify the potential molecular consequences of disease-causing genetic variants and hence prioritize experimental validation.
Collapse
Affiliation(s)
- Jose Lugo-Martinez
- Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana, United States of America
| | - Vikas Pejaver
- Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana, United States of America
| | - Kymberleigh A. Pagel
- Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana, United States of America
| | - Shantanu Jain
- Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana, United States of America
| | - Matthew Mort
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - David N. Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - Sean D. Mooney
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington, United States of America
- * E-mail: (SDM); (PR)
| | - Predrag Radivojac
- Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana, United States of America
- * E-mail: (SDM); (PR)
| |
Collapse
|
25
|
Moreira C, Ramos MJ, Fernandes PA. Glutamine Synthetase Drugability beyond Its Active Site: Exploring Oligomerization Interfaces and Pockets. Molecules 2016; 21:E1028. [PMID: 27509490 PMCID: PMC6274088 DOI: 10.3390/molecules21081028] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Revised: 08/01/2016] [Accepted: 08/04/2016] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND Glutamine synthetase (GS) is a crucial enzyme to the nitrogen cycle with great commercial and pharmaceutical value. Current inhibitors target the active site, affecting GS activity indiscriminately in all organisms. As the active site is located at the interface between two monomers, the protein-protein interface (PPI) of GSs gains a new role, by providing new targets for enzyme inhibition. Exploring GSs PPI could allow for the development of inhibitors selective for specific organisms. Here we map the PPI of three GSs-human (hsGS), maize (zmGS) and Mycobacterium tuberculosis (mtGS)-and unravel new drugable pockets. METHODS The PPI binding free energy coming from key residues on three GSs from different organisms were mapped by computational alanine scan mutagenesis, applying a multiple dielectric constant MM-PBSA methodology. The most relevant residues for binding are referred as hot-spots. Drugable pockets on GS were detected with the Fpocket software. RESULTS AND CONCLUSIONS A total of 23, 19 and 30 hot-spots were identified on hsGS, zmGS and mtGS PPI. Even possessing differences in the hot-spots, hsGS and zmGS PPI are overall very similar. On the other hand, mtGS PPI differs greatly from hsGS and zmGS PPI. A novel drugable pocket was detected on the mtGS PPI. It seems particularly promising for the development of selective anti-tuberculosis drugs given its location on a PPI region that is highly populated with hot-spots and is completely different from the hsGS and zmGS PPIs. Drugs targeting this pockets should be inactive on eukaryotic GS II enzymes.
Collapse
Affiliation(s)
- Cátia Moreira
- UCIBIO, REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade do Porto, 4169-007 Porto, Portugal.
| | - Maria J Ramos
- UCIBIO, REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade do Porto, 4169-007 Porto, Portugal.
| | - Pedro A Fernandes
- UCIBIO, REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade do Porto, 4169-007 Porto, Portugal.
| |
Collapse
|
26
|
Wei G, Xi W, Nussinov R, Ma B. Protein Ensembles: How Does Nature Harness Thermodynamic Fluctuations for Life? The Diverse Functional Roles of Conformational Ensembles in the Cell. Chem Rev 2016; 116:6516-51. [PMID: 26807783 PMCID: PMC6407618 DOI: 10.1021/acs.chemrev.5b00562] [Citation(s) in RCA: 253] [Impact Index Per Article: 31.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
All soluble proteins populate conformational ensembles that together constitute the native state. Their fluctuations in water are intrinsic thermodynamic phenomena, and the distributions of the states on the energy landscape are determined by statistical thermodynamics; however, they are optimized to perform their biological functions. In this review we briefly describe advances in free energy landscape studies of protein conformational ensembles. Experimental (nuclear magnetic resonance, small-angle X-ray scattering, single-molecule spectroscopy, and cryo-electron microscopy) and computational (replica-exchange molecular dynamics, metadynamics, and Markov state models) approaches have made great progress in recent years. These address the challenging characterization of the highly flexible and heterogeneous protein ensembles. We focus on structural aspects of protein conformational distributions, from collective motions of single- and multi-domain proteins, intrinsically disordered proteins, to multiprotein complexes. Importantly, we highlight recent studies that illustrate functional adjustment of protein conformational ensembles in the crowded cellular environment. We center on the role of the ensemble in recognition of small- and macro-molecules (protein and RNA/DNA) and emphasize emerging concepts of protein dynamics in enzyme catalysis. Overall, protein ensembles link fundamental physicochemical principles and protein behavior and the cellular network and its regulation.
Collapse
Affiliation(s)
- Guanghong Wei
- State Key Laboratory of Surface Physics, Key Laboratory for Computational Physical Sciences (MOE), and Department of Physics, Fudan University, Shanghai, P. R. China
| | - Wenhui Xi
- State Key Laboratory of Surface Physics, Key Laboratory for Computational Physical Sciences (MOE), and Department of Physics, Fudan University, Shanghai, P. R. China
| | - Ruth Nussinov
- Basic Science Program, Leidos Biomedical Research, Inc. Cancer and Inflammation Program, National Cancer Institute, Frederick, Maryland 21702, USA
- Sackler Inst. of Molecular Medicine Department of Human Genetics and Molecular Medicine Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Buyong Ma
- Basic Science Program, Leidos Biomedical Research, Inc. Cancer and Inflammation Program, National Cancer Institute, Frederick, Maryland 21702, USA
| |
Collapse
|
27
|
Kieslich CA, Tamamis P, Guzman YA, Onel M, Floudas CA. Highly Accurate Structure-Based Prediction of HIV-1 Coreceptor Usage Suggests Intermolecular Interactions Driving Tropism. PLoS One 2016; 11:e0148974. [PMID: 26859389 PMCID: PMC4747591 DOI: 10.1371/journal.pone.0148974] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 01/26/2016] [Indexed: 01/21/2023] Open
Abstract
HIV-1 entry into host cells is mediated by interactions between the V3-loop of viral glycoprotein gp120 and chemokine receptor CCR5 or CXCR4, collectively known as HIV-1 coreceptors. Accurate genotypic prediction of coreceptor usage is of significant clinical interest and determination of the factors driving tropism has been the focus of extensive study. We have developed a method based on nonlinear support vector machines to elucidate the interacting residue pairs driving coreceptor usage and provide highly accurate coreceptor usage predictions. Our models utilize centroid-centroid interaction energies from computationally derived structures of the V3-loop:coreceptor complexes as primary features, while additional features based on established rules regarding V3-loop sequences are also investigated. We tested our method on 2455 V3-loop sequences of various lengths and subtypes, and produce a median area under the receiver operator curve of 0.977 based on 500 runs of 10-fold cross validation. Our study is the first to elucidate a small set of specific interacting residue pairs between the V3-loop and coreceptors capable of predicting coreceptor usage with high accuracy across major HIV-1 subtypes. The developed method has been implemented as a web tool named CRUSH, CoReceptor USage prediction for HIV-1, which is available at http://ares.tamu.edu/CRUSH/.
Collapse
Affiliation(s)
- Chris A Kieslich
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, United States of America.,Texas A&M Energy Institute, Texas A&M University, College Station, TX, United States of America
| | - Phanourios Tamamis
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, United States of America.,Texas A&M Energy Institute, Texas A&M University, College Station, TX, United States of America
| | - Yannis A Guzman
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, United States of America.,Texas A&M Energy Institute, Texas A&M University, College Station, TX, United States of America.,Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, United States of America
| | - Melis Onel
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, United States of America.,Texas A&M Energy Institute, Texas A&M University, College Station, TX, United States of America
| | - Christodoulos A Floudas
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, United States of America.,Texas A&M Energy Institute, Texas A&M University, College Station, TX, United States of America
| |
Collapse
|
28
|
Sivakumar D, Surapaneni KM, Prabu PC, Hari N, Thiruvasagam P, Rajasekaran M, Sivaraman T. Evaluation of the anticancer properties of the predicted hBaxBH3-mimetic compound 2-hydroxy-3,5-dinitrobenzamide in a mammary carcinogenesis-induced rat model. RSC Adv 2016. [DOI: 10.1039/c5ra23005e] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Designing small molecular prototypes having potential to disrupt binding interfaces of pro-apoptotic–anti-apoptotic/BH3-only proteins is a promising strategy in cancer chemotherapy.
Collapse
Affiliation(s)
- Dakshinamurthy Sivakumar
- Structural Biology Lab
- Department of Bioinformatics
- School of Chemical and Biotechnology
- SASTRA University
- Thanjavur-613 401
| | | | | | - Natarajan Hari
- Department of Chemistry
- School of Chemical and Biotechnology
- SASTRA University
- Thanjavur-613 401
- India
| | - Ponnusamy Thiruvasagam
- Department of Chemistry
- School of Chemical and Biotechnology
- SASTRA University
- Thanjavur-613 401
- India
| | - Muthu Rajasekaran
- Department of Biotechnology
- School of Chemical and Biotechnology
- SASTRA University
- Thanjavur-613 401
- India
| | - Thirunavukkarasu Sivaraman
- Structural Biology Lab
- Department of Bioinformatics
- School of Chemical and Biotechnology
- SASTRA University
- Thanjavur-613 401
| |
Collapse
|
29
|
Antibody Binding Selectivity: Alternative Sets of Antigen Residues Entail High-Affinity Recognition. PLoS One 2015; 10:e0143374. [PMID: 26629896 PMCID: PMC4667898 DOI: 10.1371/journal.pone.0143374] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2015] [Accepted: 11/04/2015] [Indexed: 11/19/2022] Open
Abstract
Understanding the relationship between protein sequence and molecular recognition selectivity remains a major challenge. The antibody fragment scFv1F4 recognizes with sub nM affinity a decapeptide (sequence 6TAMFQDPQER15) derived from the N-terminal end of human papilloma virus E6 oncoprotein. Using this decapeptide as antigen, we had previously shown that only the wild type amino-acid or conservative replacements were allowed at positions 9 to 12 and 15 of the peptide, indicating a strong binding selectivity. Nevertheless phenylalanine (F) was equally well tolerated as the wild type glutamine (Q) at position 13, while all other amino acids led to weaker scFv binding. The interfaces of complexes involving either Q or F are expected to diverge, due to the different physico-chemistry of these residues. This would imply that high-affinity binding can be achieved through distinct interfacial geometries. In order to investigate this point, we disrupted the scFv-peptide interface by modifying one or several peptide positions. We then analyzed the effect on binding of amino acid changes at the remaining positions, an altered susceptibility being indicative of an altered role in complex formation. The 23 starting variants analyzed contained replacements whose effects on scFv1F4 binding ranged from minor to drastic. A permutation analysis (effect of replacing each peptide position by all other amino acids except cysteine) was carried out on the 23 variants using the PEPperCHIP® Platform technology. A comparison of their permutation patterns with that of the wild type peptide indicated that starting replacements at position 11, 12 or 13 modified the tolerance to amino-acid changes at the other two positions. The interdependence between the three positions was confirmed by SPR (Biacore® technology). Our data demonstrate that binding selectivity does not preclude the existence of alternative high-affinity recognition modes.
Collapse
|
30
|
Alsop JD, Mitchell JC. Interolog interfaces in protein-protein docking. Proteins 2015; 83:1940-6. [PMID: 25740680 PMCID: PMC5054918 DOI: 10.1002/prot.24788] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Revised: 02/17/2015] [Accepted: 02/24/2015] [Indexed: 12/26/2022]
Abstract
Proteins are essential elements of biological systems, and their function typically relies on their ability to successfully bind to specific partners. Recently, an emphasis of study into protein interactions has been on hot spots, or residues in the binding interface that make a significant contribution to the binding energetics. In this study, we investigate how conservation of hot spots can be used to guide docking prediction. We show that the use of evolutionary data combined with hot spot prediction highlights near‐native structures across a range of benchmark examples. Our approach explores various strategies for using hot spots and evolutionary data to score protein complexes, using both absolute and chemical definitions of conservation along with refinements to these strategies that look at windowed conservation and filtering to ensure a minimum number of hot spots in each binding partner. Finally, structure‐based models of orthologs were generated for comparison with sequence‐based scoring. Using two data sets of 22 and 85 examples, a high rate of top 10 and top 1 predictions are observed, with up to 82% of examples returning a top 10 hit and 35% returning top 1 hit depending on the data set and strategy applied; upon inclusion of the native structure among the decoys, up to 55% of examples yielded a top 1 hit. The 20 common examples between data sets show that more carefully curated interolog data yields better predictions, particularly in achieving top 1 hits. Proteins 2015; 83:1940–1946. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- James D Alsop
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin
| | - Julie C Mitchell
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin.,Department of Mathematics, University of Wisconsin, Madison, Wisconsin
| |
Collapse
|
31
|
Sukhwal A, Sowdhamini R. PPCheck: A Webserver for the Quantitative Analysis of Protein-Protein Interfaces and Prediction of Residue Hotspots. Bioinform Biol Insights 2015; 9:141-51. [PMID: 26448684 PMCID: PMC4578551 DOI: 10.4137/bbi.s25928] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Revised: 04/21/2015] [Accepted: 04/28/2015] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Modeling protein-protein interactions (PPIs) using docking algorithms is useful for understanding biomolecular interactions and mechanisms. Typically, a docking algorithm generates a large number of docking poses, and it is often challenging to select the best native-like pose. A further challenge is to recognize key residues, termed as hotspots, at protein-protein interfaces, which contribute more in stabilizing a protein-protein interface. RESULTS We had earlier developed a computer algorithm, called PPCheck, which ascribes pseudoenergies to measure the strength of PPIs. Native-like poses could be successfully identified in 27 out of 30 test cases, when applied on a separate set of decoys that were generated using FRODOCK. PPCheck, along with conservation and accessibility scores, was able to differentiate 'native-like and non-native-like poses from 1883 decoys of Critical Assessment of Prediction of Interactions (CAPRI) targets with an accuracy of 60%. PPCheck was trained on a 10-fold mixed dataset and tested on a 10-fold mixed test set for hotspot prediction. We obtain an accuracy of 72%, which is in par with other methods, and a sensitivity of 59%, which is better than most existing methods available for hotspot prediction that uses similar datasets. Other relevant tests suggest that PPCheck can also be reliably used to identify conserved residues in a protein and to perform computational alanine scanning. CONCLUSIONS PPCheck webserver can be successfully used to differentiate native-like and non-native-like docking poses, as generated by docking algorithms. The webserver can also be a convenient platform for calculating residue conservation, for performing computational alanine scanning, and for predicting protein-protein interface hotspots. While PPCheck can differentiate the generated decoys into native-like and non-native-like decoys with a fairly good accuracy, the results improve dramatically when features like conservation and accessibility are included. The method can be successfully used in ranking/scoring the decoys, as obtained from docking algorithms.
Collapse
Affiliation(s)
- Anshul Sukhwal
- National Centre for Biological Sciences, Bangalore, Karnataka, India. ; SASTRA University, Tirumalaisamudram, Thanjavur, Tamil Nadu, India
| | | |
Collapse
|
32
|
Aumentado-Armstrong TT, Istrate B, Murgita RA. Algorithmic approaches to protein-protein interaction site prediction. Algorithms Mol Biol 2015; 10:7. [PMID: 25713596 PMCID: PMC4338852 DOI: 10.1186/s13015-015-0033-9] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2014] [Accepted: 01/07/2015] [Indexed: 12/19/2022] Open
Abstract
Interaction sites on protein surfaces mediate virtually all biological activities, and their identification holds promise for disease treatment and drug design. Novel algorithmic approaches for the prediction of these sites have been produced at a rapid rate, and the field has seen significant advancement over the past decade. However, the most current methods have not yet been reviewed in a systematic and comprehensive fashion. Herein, we describe the intricacies of the biological theory, datasets, and features required for modern protein-protein interaction site (PPIS) prediction, and present an integrative analysis of the state-of-the-art algorithms and their performance. First, the major sources of data used by predictors are reviewed, including training sets, evaluation sets, and methods for their procurement. Then, the features employed and their importance in the biological characterization of PPISs are explored. This is followed by a discussion of the methodologies adopted in contemporary prediction programs, as well as their relative performance on the datasets most recently used for evaluation. In addition, the potential utility that PPIS identification holds for rational drug design, hotspot prediction, and computational molecular docking is described. Finally, an analysis of the most promising areas for future development of the field is presented.
Collapse
|
33
|
Cukuroglu E, Engin HB, Gursoy A, Keskin O. Hot spots in protein–protein interfaces: Towards drug discovery. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2014; 116:165-73. [DOI: 10.1016/j.pbiomolbio.2014.06.003] [Citation(s) in RCA: 113] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Revised: 05/30/2014] [Accepted: 06/12/2014] [Indexed: 11/16/2022]
|
34
|
Shingate P, Manoharan M, Sukhwal A, Sowdhamini R. ECMIS: computational approach for the identification of hotspots at protein-protein interfaces. BMC Bioinformatics 2014; 15:303. [PMID: 25228146 PMCID: PMC4177600 DOI: 10.1186/1471-2105-15-303] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2014] [Accepted: 08/11/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Various methods have been developed to computationally predict hotspot residues at novel protein-protein interfaces. However, there are various challenges in obtaining accurate prediction. We have developed a novel method which uses different aspects of protein structure and sequence space at residue level to highlight interface residues crucial for the protein-protein complex formation. RESULTS ECMIS (Energetic Conservation Mass Index and Spatial Clustering) algorithm was able to outperform existing hotspot identification methods. It was able to achieve around 80% accuracy with incredible increase in sensitivity and outperforms other existing methods. This method is even sensitive towards the hotspot residues contributing only small-scale hydrophobic interactions. CONCLUSION Combination of diverse features of the protein viz. energy contribution, extent of conservation, location and surrounding environment, along with optimized weightage for each feature, was the key for the success of the algorithm. The academic version of the algorithm is available at http://caps.ncbs.res.in/download/ECMIS/ECMIS.zip.
Collapse
Affiliation(s)
| | | | | | - Ramanathan Sowdhamini
- National Centre for Biological Sciences (TIFR), GKVK Campus, Bellary Road, Bangalore 560065, India.
| |
Collapse
|
35
|
Moal IH, Jiménez-García B, Fernández-Recio J. CCharPPI web server: computational characterization of protein-protein interactions from structure. Bioinformatics 2014; 31:123-5. [PMID: 25183488 DOI: 10.1093/bioinformatics/btu594] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
SUMMARY The atomic structures of protein-protein interactions are central to understanding their role in biological systems, and a wide variety of biophysical functions and potentials have been developed for their characterization and the construction of predictive models. These tools are scattered across a multitude of stand-alone programs, and are often available only as model parameters requiring reimplementation. This acts as a significant barrier to their widespread adoption. CCharPPI integrates many of these tools into a single web server. It calculates up to 108 parameters, including models of electrostatics, desolvation and hydrogen bonding, as well as interface packing and complementarity scores, empirical potentials at various resolutions, docking potentials and composite scoring functions. AVAILABILITY AND IMPLEMENTATION The server does not require registration by the user and is freely available for non-commercial academic use at http://life.bsc.es/pid/ccharppi.
Collapse
Affiliation(s)
- Iain H Moal
- Joint BSC-IRB Research Programme in Computational Biology, Department of Life Sciences, Barcelona Supercomputing Center, C/Jordi Girona 29, 08034 Barcelona, Spain
| | - Brian Jiménez-García
- Joint BSC-IRB Research Programme in Computational Biology, Department of Life Sciences, Barcelona Supercomputing Center, C/Jordi Girona 29, 08034 Barcelona, Spain
| | - Juan Fernández-Recio
- Joint BSC-IRB Research Programme in Computational Biology, Department of Life Sciences, Barcelona Supercomputing Center, C/Jordi Girona 29, 08034 Barcelona, Spain
| |
Collapse
|
36
|
Guo W, Wisniewski JA, Ji H. Hot spot-based design of small-molecule inhibitors for protein-protein interactions. Bioorg Med Chem Lett 2014; 24:2546-54. [PMID: 24751445 DOI: 10.1016/j.bmcl.2014.03.095] [Citation(s) in RCA: 107] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2014] [Revised: 03/26/2014] [Accepted: 03/28/2014] [Indexed: 12/27/2022]
Abstract
Protein-protein interactions (PPIs) are important targets for the development of chemical probes and therapeutic agents. From the initial discovery of the existence of hot spots at PPI interfaces, it has been proposed that hot spots might provide the key for developing small-molecule PPI inhibitors. However, there has been no review on the ways in which the knowledge of hot spots can be used to achieve inhibitor design, nor critical examination of successful examples. This Digest discusses the characteristics of hot spots and the identification of druggable hot spot pockets. An analysis of four examples of hot spot-based design reveals the importance of this strategy in discovering potent and selective PPI inhibitors. A general procedure for hot spot-based design of PPI inhibitors is outlined.
Collapse
Affiliation(s)
- Wenxing Guo
- Department of Chemistry, Center for Cell and Genome Science, University of Utah, 315 South 1400 East, Salt Lake City, UT 84112-0850, USA
| | - John A Wisniewski
- Department of Chemistry, Center for Cell and Genome Science, University of Utah, 315 South 1400 East, Salt Lake City, UT 84112-0850, USA
| | - Haitao Ji
- Department of Chemistry, Center for Cell and Genome Science, University of Utah, 315 South 1400 East, Salt Lake City, UT 84112-0850, USA.
| |
Collapse
|
37
|
Bandyopadhyay S, Mallik S, Mukhopadhyay A. A Survey and Comparative Study of Statistical Tests for Identifying Differential Expression from Microarray Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:95-115. [PMID: 26355511 DOI: 10.1109/tcbb.2013.147] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
DNA microarray is a powerful technology that can simultaneously determine the levels of thousands of transcripts (generated, for example, from genes/miRNAs) across different experimental conditions or tissue samples. The motto of differential expression analysis is to identify the transcripts whose expressions change significantly across different types of samples or experimental conditions. A number of statistical testing methods are available for this purpose. In this paper, we provide a comprehensive survey on different parametric and non-parametric testing methodologies for identifying differential expression from microarray data sets. The performances of the different testing methods have been compared based on some real-life miRNA and mRNA expression data sets. For validating the resulting differentially expressed miRNAs, the outcomes of each test are checked with the information available for miRNA in the standard miRNA database PhenomiR 2.0. Subsequently, we have prepared different simulated data sets of different sample sizes (from 10 to 100 per group/population) and thereafter the power of each test have been calculated individually. The comparative simulated study might lead to formulate robust and comprehensive judgements about the performance of each test in the basis of assumption of data distribution. Finally, a list of advantages and limitations of the different statistical tests has been provided, along with indications of some areas where further studies are required.
Collapse
|
38
|
Ma B, Nussinov R. Druggable orthosteric and allosteric hot spots to target protein-protein interactions. Curr Pharm Des 2014; 20:1293-301. [PMID: 23713780 PMCID: PMC6361532 DOI: 10.2174/13816128113199990073] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Accepted: 05/21/2013] [Indexed: 11/22/2022]
Abstract
Drug designing targeting protein-protein interactions is challenging. Because structural elucidation and computational analysis have revealed the importance of hot spot residues in stabilizing these interactions, there have been on-going efforts to develop drugs which bind the hot spots and out-compete the native protein partners. The question arises as to what are the key 'druggable' properties of hot spots in protein-protein interactions and whether these mimic the general hot spot definition. Identification of orthosteric (at the protein- protein interaction site) and allosteric (elsewhere) druggable hot spots is expected to help in discovering compounds that can more effectively modulate protein-protein interactions. For example, are there any other significant features beyond their location in pockets in the interface? The interactions of protein-protein hot spots are coupled with conformational dynamics of protein complexes. Currently increasing efforts focus on the allosteric drug discovery. Allosteric drugs bind away from the native binding site and can modulate the native interactions. We propose that identification of allosteric hot spots could similarly help in more effective allosteric drug discovery. While detection of allosteric hot spots is challenging, targeting drugs to these residues has the potential of greatly increasing the hot spot and protein druggability.
Collapse
Affiliation(s)
| | - Ruth Nussinov
- Basic Science Program, Leidos Biomedical Research, Inc. Cancer and Inflammation Program, NCIFrederick, Frederick, MD 21702.
| |
Collapse
|
39
|
Agrawal NJ, Helk B, Trout BL. A computational tool to predict the evolutionarily conserved protein-protein interaction hot-spot residues from the structure of the unbound protein. FEBS Lett 2013; 588:326-33. [PMID: 24239538 DOI: 10.1016/j.febslet.2013.11.004] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2013] [Revised: 11/01/2013] [Accepted: 11/04/2013] [Indexed: 11/29/2022]
Abstract
Identifying hot-spot residues - residues that are critical to protein-protein binding - can help to elucidate a protein's function and assist in designing therapeutic molecules to target those residues. We present a novel computational tool, termed spatial-interaction-map (SIM), to predict the hot-spot residues of an evolutionarily conserved protein-protein interaction from the structure of an unbound protein alone. SIM can predict the protein hot-spot residues with an accuracy of 36-57%. Thus, the SIM tool can be used to predict the yet unknown hot-spot residues for many proteins for which the structure of the protein-protein complexes are not available, thereby providing a clue to their functions and an opportunity to design therapeutic molecules to target these proteins.
Collapse
Affiliation(s)
- Neeraj J Agrawal
- Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, E19-502b, Cambridge, MA 02139, USA
| | | | - Bernhardt L Trout
- Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, E19-502b, Cambridge, MA 02139, USA.
| |
Collapse
|
40
|
Li Y, Liu Z, Han L, Li C, Wang R. Mining the Characteristic Interaction Patterns on Protein–Protein Binding Interfaces. J Chem Inf Model 2013; 53:2437-47. [DOI: 10.1021/ci400241s] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Yan Li
- State
Key Laboratory of Bioorganic and Natural
Products Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Zhihai Liu
- State
Key Laboratory of Bioorganic and Natural
Products Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Li Han
- State
Key Laboratory of Bioorganic and Natural
Products Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Chengke Li
- State
Key Laboratory of Bioorganic and Natural
Products Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Renxiao Wang
- State
Key Laboratory of Bioorganic and Natural
Products Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- State Key Laboratory of Quality Research
in Chinese
Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau, People’s Republic of China
| |
Collapse
|
41
|
Agius R, Torchala M, Moal IH, Fernández-Recio J, Bates PA. Characterizing changes in the rate of protein-protein dissociation upon interface mutation using hotspot energy and organization. PLoS Comput Biol 2013; 9:e1003216. [PMID: 24039569 PMCID: PMC3764008 DOI: 10.1371/journal.pcbi.1003216] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2013] [Accepted: 07/25/2013] [Indexed: 12/21/2022] Open
Abstract
Predicting the effects of mutations on the kinetic rate constants of protein-protein interactions is central to both the modeling of complex diseases and the design of effective peptide drug inhibitors. However, while most studies have concentrated on the determination of association rate constants, dissociation rates have received less attention. In this work we take a novel approach by relating the changes in dissociation rates upon mutation to the energetics and architecture of hotspots and hotregions, by performing alanine scans pre- and post-mutation. From these scans, we design a set of descriptors that capture the change in hotspot energy and distribution. The method is benchmarked on 713 kinetically characterized mutations from the SKEMPI database. Our investigations show that, with the use of hotspot descriptors, energies from single-point alanine mutations may be used for the estimation of off-rate mutations to any residue type and also multi-point mutations. A number of machine learning models are built from a combination of molecular and hotspot descriptors, with the best models achieving a Pearson's Correlation Coefficient of 0.79 with experimental off-rates and a Matthew's Correlation Coefficient of 0.6 in the detection of rare stabilizing mutations. Using specialized feature selection models we identify descriptors that are highly specific and, conversely, broadly important to predicting the effects of different classes of mutations, interface regions and complexes. Our results also indicate that the distribution of the critical stability regions across protein-protein interfaces is a function of complex size more strongly than interface area. In addition, mutations at the rim are critical for the stability of small complexes, but consistently harder to characterize. The relationship between hotregion size and the dissociation rate is also investigated and, using hotspot descriptors which model cooperative effects within hotregions, we show how the contribution of hotregions of different sizes, changes under different cooperative effects.
Collapse
Affiliation(s)
- Rudi Agius
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | - Mieczyslaw Torchala
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | - Iain H. Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, Spain
| | - Juan Fernández-Recio
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, Spain
| | - Paul A. Bates
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| |
Collapse
|
42
|
Chen P, Li J, Wong L, Kuwahara H, Huang JZ, Gao X. Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences. Proteins 2013; 81:1351-62. [DOI: 10.1002/prot.24278] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Revised: 02/07/2013] [Accepted: 02/23/2013] [Indexed: 11/11/2022]
Affiliation(s)
- Peng Chen
- Computer, Electrical and Mathematical Sciences and Engineering Division; King Abdullah University of Science and Technology (KAUST); Thuwal 23955-6900 Saudi Arabia
| | - Jinyan Li
- Advanced Analytics Institute; University of Technology; Sydney New South Wales Australia
| | - Limsoon Wong
- School of Computing; National University of Singapore; Singapore 117417
| | - Hiroyuki Kuwahara
- Computer, Electrical and Mathematical Sciences and Engineering Division; King Abdullah University of Science and Technology (KAUST); Thuwal 23955-6900 Saudi Arabia
| | - Jianhua Z. Huang
- Department of Statistics; Texas A&M University; College Station Texas 77843-3143
| | - Xin Gao
- Computer, Electrical and Mathematical Sciences and Engineering Division; King Abdullah University of Science and Technology (KAUST); Thuwal 23955-6900 Saudi Arabia
- Computational Bioscience Research Center; King Abdullah University of Science and Technology (KAUST); Thuwal 23955-6900 Saudi Arabia
| |
Collapse
|
43
|
Wang L, Hou Y, Quan H, Xu W, Bao Y, Li Y, Fu Y, Zou S. A compound-based computational approach for the accurate determination of hot spots. Protein Sci 2013; 22:1060-70. [PMID: 23776011 DOI: 10.1002/pro.2296] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2013] [Revised: 05/27/2013] [Accepted: 06/01/2013] [Indexed: 12/21/2022]
Abstract
A plethora of both experimental and computational methods have been proposed in the past 20 years for the identification of hot spots at a protein-protein interface. The experimental determination of a protein-protein complex followed by alanine scanning mutagenesis, though able to determine hot spots with much precision, is expensive and has no guarantee of success while the accuracy of the current computational methods for hot-spot identification remains low. Here, we present a novel structure-based computational approach that accurately determines hot spots through docking into a set of proteins homologous to only one of the two interacting partners of a compound capable of disrupting the protein-protein interaction (PPI). This approach has been applied to identify the hot spots of human activin receptor type II (ActRII) critical for its binding toward Cripto-I. The subsequent experimental confirmation of the computationally identified hot spots portends a potentially accurate method for hot-spot determination in silico given a compound capable of disrupting the PPI in question. The hot spots of human ActRII first reported here may well become the focal points for the design of small molecule drugs that target the PPI. The determination of their interface may have significant biological implications in that it suggests that Cripto-I plays an important role in both activin and nodal signal pathways.
Collapse
Affiliation(s)
- Lincong Wang
- The College of Computer Science and Technology, Jilin University, Changchun, Jilin, China.
| | | | | | | | | | | | | | | |
Collapse
|
44
|
Dehouck Y, Kwasigroch JM, Rooman M, Gilis D. BeAtMuSiC: Prediction of changes in protein-protein binding affinity on mutations. Nucleic Acids Res 2013; 41:W333-9. [PMID: 23723246 PMCID: PMC3692068 DOI: 10.1093/nar/gkt450] [Citation(s) in RCA: 239] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
The ability of proteins to establish highly selective interactions with a variety of (macro)molecular partners is a crucial prerequisite to the realization of their biological functions. The availability of computational tools to evaluate the impact of mutations on protein–protein binding can therefore be valuable in a wide range of industrial and biomedical applications, and help rationalize the consequences of non-synonymous single-nucleotide polymorphisms. BeAtMuSiC (http://babylone.ulb.ac.be/beatmusic) is a coarse-grained predictor of the changes in binding free energy induced by point mutations. It relies on a set of statistical potentials derived from known protein structures, and combines the effect of the mutation on the strength of the interactions at the interface, and on the overall stability of the complex. The BeAtMuSiC server requires as input the structure of the protein–protein complex, and gives the possibility to assess rapidly all possible mutations in a protein chain or at the interface, with predictive performances that are in line with the best current methodologies.
Collapse
Affiliation(s)
- Yves Dehouck
- Department of BioModelling, BioInformatics and BioProcesses, Université Libre de Bruxelles, CP165/61, Av. Fr. Roosevelt 50, 1050 Brussels, Belgium.
| | | | | | | |
Collapse
|
45
|
Shah S, Butler NM, Hannapel DJ, Rao AG. Mapping and characterization of the interaction interface between two polypyrimidine-tract binding proteins and a nova-type protein of Solanum tuberosum. PLoS One 2013; 8:e64783. [PMID: 23717658 PMCID: PMC3663837 DOI: 10.1371/journal.pone.0064783] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 04/18/2013] [Indexed: 11/21/2022] Open
Abstract
Polypyrimidine tract-binding (PTB) proteins are RNA-binding proteins that generally contain four RNA recognition motifs (RRMs). In potato, six cDNAs encoding full-length PTB proteins have been identified. In the present study Nova1-like protein, designated StNova1, was identified as a potential interacting partner of the StPTB proteins via yeast two-hybrid screening. Nova protein is a RNA-binding protein that contains three K-homology (KH) domains. In humans, these proteins are involved in regulation of neuronal RNA metabolism but the role of Nova-like proteins in plants is poorly understood. We have validated this interaction and mapped the protein binding region on StNova1 and StPTB1 and −6 using a novel domain interaction phage display (DIPP) technique. The interaction between the two RNA-binding proteins StPTB1/6 and StNova1 is mediated through linker regions that are distinctly separated from the RRMs. Furthermore, using a random 21-mer phage-peptide library, we have identified a number of peptides with the consensus sequence motif [S/G][V/I][L/V]G that recognize the StPTB proteins. One over-represented peptide that recognizes StPTB6 contains the GVLGPWP sequence that is similar to the GIGGRYP sequence in the glycine-rich linker region between the KH2 and KH3 domains of StNova1. We show, through site-specific mutations, the importance of glycine and proline residues in StNova1-StPTB interactions.
Collapse
Affiliation(s)
- Shweta Shah
- Roy J. Carver Department of Biochemistry Biophysics and Molecular Biology, Iowa State University, Ames, Iowa, United States of America
| | - Nathaniel M. Butler
- Plant Biology Major, Iowa State University, Ames, Iowa, United States of America
| | - David J. Hannapel
- Plant Biology Major, Iowa State University, Ames, Iowa, United States of America
| | - A. Gururaj Rao
- Roy J. Carver Department of Biochemistry Biophysics and Molecular Biology, Iowa State University, Ames, Iowa, United States of America
- * E-mail:
| |
Collapse
|
46
|
Masso M. Improving Prediction Accuracy via Subspace Modeling in a Statistical Geometry Based Computational Protein Mutagenesis. Bioinformatics 2013. [DOI: 10.4018/978-1-4666-3604-0.ch054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
A computational mutagenesis is detailed whereby each single residue substitution in a protein chain of primary sequence length N is represented as a sparse N-dimensional feature vector, whose M << N nonzero components locally quantify environmental perturbations occurring at the mutated position and its neighbors in the protein structure. The methodology makes use of both the Delaunay tessellation algorithm for representing protein structures, as well as a four-body, knowledge based, statistical contact potential. Feature vectors for each subset of mutants due to all possible residue substitutions at a particular position cohabit the same M-dimensional subspace, where the value of M and the identities of the M nonzero components are similarly position dependent. The approach is used to characterize a large experimental dataset of single residue substitutions in bacteriophage T4 lysozyme, each categorized as either unaffected or affected based on the measured level of mutant activity relative to that of the native protein. Performance of a single classifier trained with the collective set of mutants in N-space is compared to that of an ensemble of position-specific classifiers trained using disjoint mutant subsets residing in significantly smaller subspaces. Results suggest that significant improvements can be achieved through subspace modeling.
Collapse
|
47
|
Xu B, Wei X, Deng L, Guan J, Zhou S. A semi-supervised boosting SVM for predicting hot spots at protein-protein interfaces. BMC SYSTEMS BIOLOGY 2012; 6 Suppl 2:S6. [PMID: 23282146 PMCID: PMC3521187 DOI: 10.1186/1752-0509-6-s2-s6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
BACKGROUND Hot spots are residues contributing the most of binding free energy yet accounting for a small portion of a protein interface. Experimental approaches to identify hot spots such as alanine scanning mutagenesis are expensive and time-consuming, while computational methods are emerging as effective alternatives to experimental approaches. RESULTS In this study, we propose a semi-supervised boosting SVM, which is called sbSVM, to computationally predict hot spots at protein-protein interfaces by combining protein sequence and structure features. Here, feature selection is performed using random forests to avoid over-fitting. Due to the deficiency of positive samples, our approach samples useful unlabeled data iteratively to boost the performance of hot spots prediction. The performance evaluation of our method is carried out on a dataset generated from the ASEdb database for cross-validation and a dataset from the BID database for independent test. Furthermore, a balanced dataset with similar amounts of hot spots and non-hot spots (65 and 66 respectively) derived from the first training dataset is used to further validate our method. All results show that our method yields good sensitivity, accuracy and F1 score comparing with the existing methods. CONCLUSION Our method boosts prediction performance of hot spots by using unlabeled data to overcome the deficiency of available training data. Experimental results show that our approach is more effective than the traditional supervised algorithms and major existing hot spot prediction methods.
Collapse
Affiliation(s)
- Bin Xu
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China
| | | | | | | | | |
Collapse
|
48
|
Morrow JK, Zhang S. Computational prediction of protein hot spot residues. Curr Pharm Des 2012; 18:1255-65. [PMID: 22316154 DOI: 10.2174/138161212799436412] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2011] [Accepted: 12/06/2011] [Indexed: 11/22/2022]
Abstract
Most biological processes involve multiple proteins interacting with each other. It has been recently discovered that certain residues in these protein-protein interactions, which are called hot spots, contribute more significantly to binding affinity than others. Hot spot residues have unique and diverse energetic properties that make them challenging yet important targets in the modulation of protein-protein complexes. Design of therapeutic agents that interact with hot spot residues has proven to be a valid methodology in disrupting unwanted protein-protein interactions. Using biological methods to determine which residues are hot spots can be costly and time consuming. Recent advances in computational approaches to predict hot spots have incorporated a myriad of features, and have shown increasing predictive successes. Here we review the state of knowledge around protein-protein interactions, hot spots, and give an overview of multiple in silico prediction techniques of hot spot residues.
Collapse
Affiliation(s)
- John Kenneth Morrow
- Department of Experimental Therapeutics, The University of Texas M.D. Anderson Cancer Center, Houston, Texas 77054, USA
| | | |
Collapse
|
49
|
Liu Q, Wong L, Li J. Z-score biological significance of binding hot spots of protein interfaces by using crystal packing as the reference state. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2012; 1824:1457-67. [PMID: 22728649 DOI: 10.1016/j.bbapap.2012.05.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2012] [Revised: 05/12/2012] [Accepted: 05/31/2012] [Indexed: 11/19/2022]
Abstract
Characterization of binding hot spots of protein interfaces is a fundamental study in molecular biology. Many computational methods have been proposed to identify binding hot spots. However, there are few studies to assess the biological significance of binding hot spots. We introduce the notion of biological significance of a contact residue for capturing the probability of the residue occurring in or contributing to protein binding interfaces. We take a statistical Z-score approach to the assessment of the biological significance. The method has three main steps. First, the potential score of a residue is defined by using a knowledge-based potential function with relative accessible surface area calculations. A null distribution of this potential score is then generated from artifact crystal packing contacts. Finally, the Z-score significance of a contact residue with a specific potential score is determined according to this null distribution. We hypothesize that residues at binding hot spots have big absolute values of Z-score as they contribute greatly to binding free energy. Thus, we propose to use Z-score to predict whether a contact residue is a hot spot residue. Comparison with previously reported methods on two benchmark datasets shows that this Z-score method is mostly superior to earlier methods. This article is part of a Special Issue entitled: Computational Methods for Protein Interaction and Structural Prediction.
Collapse
Affiliation(s)
- Qian Liu
- BIRC, SCE, Nanyang Technological University, Singapore 639798, Singapore
| | | | | |
Collapse
|
50
|
Koes DR, Camacho CJ. PocketQuery: protein-protein interaction inhibitor starting points from protein-protein interaction structure. Nucleic Acids Res 2012; 40:W387-92. [PMID: 22523085 PMCID: PMC3394328 DOI: 10.1093/nar/gks336] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
PocketQuery (http://pocketquery.csb.pitt.edu) is a web interface for exploring the properties of protein–protein interaction (PPI) interfaces with a focus on the discovery of promising starting points for small-molecule design. PocketQuery rapidly focuses attention on the key interacting residues of an interaction using a ‘druggability’ score that provides an estimate of how likely the chemical mimicry of a cluster of interface residues would result in a small-molecule inhibitor of an interaction. These residue clusters are chemical starting points that can be seamlessly exported to a pharmacophore-based drug discovery workflow. PocketQuery is updated on a weekly basis to contain all applicable PPI structures deposited in the Protein Data Bank and allows users to upload their own custom structures for analysis.
Collapse
Affiliation(s)
- David Ryan Koes
- Department of Computational and Systems Biology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA.
| | | |
Collapse
|