1
|
Kurcinski M, Blaszczyk M, Ciemny MP, Kolinski A, Kmiecik S. A protocol for CABS-dock protein-peptide docking driven by side-chain contact information. Biomed Eng Online 2017; 16:73. [PMID: 28830545 PMCID: PMC5568604 DOI: 10.1186/s12938-017-0363-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Background The characterization of protein–peptide interactions is a challenge for computational molecular docking. Protein–peptide docking tools face at least two major difficulties: (1) efficient sampling of large-scale conformational changes induced by binding and (2) selection of the best models from a large set of predicted structures. In this paper, we merge an efficient sampling technique with external information about side-chain contacts to sample and select the best possible models. Methods In this paper we test a new protocol that uses information about side-chain contacts in CABS-dock protein–peptide docking. As shown in our recent studies, CABS-dock enables efficient modeling of large-scale conformational changes without knowledge about the binding site. However, the resulting set of binding sites and poses is in many cases highly diverse and difficult to score. Results As we demonstrate here, information about a single side-chain contact can significantly improve the prediction accuracy. Importantly, the imposed constraints for side-chain contacts are quite soft. Therefore, the developed protocol does not require precise contact information and ensures large-scale peptide flexibility in the broad contact area. Conclusions The demonstrated protocol provides the extension of the CABS-dock method that can be practically used in the structure prediction of protein–peptide complexes guided by the knowledge of the binding interface.
Collapse
Affiliation(s)
- Mateusz Kurcinski
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland
| | - Maciej Blaszczyk
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland
| | - Maciej Pawel Ciemny
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland.,Faculty of Physics, University of Warsaw, Pasteura 5, Warsaw, 02-093, Poland
| | - Andrzej Kolinski
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland
| | - Sebastian Kmiecik
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland.
| |
Collapse
|
2
|
Ciemny MP, Kurcinski M, Blaszczyk M, Kolinski A, Kmiecik S. Modeling EphB4-EphrinB2 protein-protein interaction using flexible docking of a short linear motif. Biomed Eng Online 2017; 16:71. [PMID: 28830442 PMCID: PMC5568603 DOI: 10.1186/s12938-017-0362-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Background Many protein–protein interactions are mediated by a short linear motif. Usually, amino acid sequences of those motifs are known or can be predicted. It is much harder to experimentally characterize or predict their structure in the bound form. In this work, we test a possibility of using flexible docking of a short linear motif to predict the interaction interface of the EphB4-EphrinB2 complex (a system extensively studied for its significance in tumor progression). Methods In the modeling, we only use knowledge about the motif sequence and experimental structures of EphB4-EphrinB2 complex partners. The proposed protocol enables efficient modeling of significant conformational changes in the short linear motif fragment during molecular docking simulation. For the docking simulations, we use the CABS-dock method for docking fully flexible peptides to flexible protein receptors (available as a server at http://biocomp.chem.uw.edu.pl/CABSdock/). Based on the docking result, the protein–protein complex is reconstructed and refined. Results Using this novel protocol, we obtained an accurate EphB4-EphrinB2 interaction model. Conclusions The results show that the CABS-dock method may be useful as the primary docking tool in specific protein–protein docking cases similar to EphB4-EphrinB2 complex—that is, where a short linear motif fragment can be identified.
Collapse
Affiliation(s)
- Maciej Pawel Ciemny
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland.,Faculty of Physics, University of Warsaw, Pasteura 5, Warsaw, Poland
| | - Mateusz Kurcinski
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland
| | - Maciej Blaszczyk
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland
| | - Andrzej Kolinski
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland
| | - Sebastian Kmiecik
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland.
| |
Collapse
|
3
|
Kmiecik S, Kolinski A. One-Dimensional Structural Properties of Proteins in the Coarse-Grained CABS Model. Methods Mol Biol 2017; 1484:83-113. [PMID: 27787822 DOI: 10.1007/978-1-4939-6406-2_8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Despite the significant increase in computational power, molecular modeling of protein structure using classical all-atom approaches remains inefficient, at least for most of the protein targets in the focus of biomedical research. Perhaps the most successful strategy to overcome the inefficiency problem is multiscale modeling to merge all-atom and coarse-grained models. This chapter describes a well-established CABS coarse-grained protein model. The CABS (C-Alpha, C-Beta, and Side chains) model assumes a 2-4 united-atom representation of amino acids, knowledge-based force field (derived from the statistical regularities seen in known protein sequences and structures) and efficient Monte Carlo sampling schemes (MC dynamics, MC replica-exchange, and combinations). A particular emphasis is given to the unique design of the CABS force-field, which is largely defined using one-dimensional structural properties of proteins, including protein secondary structure. This chapter also presents CABS-based modeling methods, including multiscale tools for de novo structure prediction, modeling of protein dynamics and prediction of protein-peptide complexes. CABS-based tools are freely available at http://biocomp.chem.uw.edu.pl/tools.
Collapse
Affiliation(s)
- Sebastian Kmiecik
- Faculty of Chemistry, University of Warsaw, Pasteura 1, Warszawa, 02-093, Poland
| | - Andrzej Kolinski
- Faculty of Chemistry, University of Warsaw, Pasteura 1, Warszawa, 02-093, Poland.
| |
Collapse
|
4
|
Modeling of protein-peptide interactions using the CABS-dock web server for binding site search and flexible docking. Methods 2015; 93:72-83. [PMID: 26165956 DOI: 10.1016/j.ymeth.2015.07.004] [Citation(s) in RCA: 113] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2015] [Revised: 07/06/2015] [Accepted: 07/08/2015] [Indexed: 11/22/2022] Open
Abstract
Protein-peptide interactions play essential functional roles in living organisms and their structural characterization is a hot subject of current experimental and theoretical research. Computational modeling of the structure of protein-peptide interactions is usually divided into two stages: prediction of the binding site at a protein receptor surface, and then docking (and modeling) the peptide structure into the known binding site. This paper presents a comprehensive CABS-dock method for the simultaneous search of binding sites and flexible protein-peptide docking, available as a user's friendly web server. We present example CABS-dock results obtained in the default CABS-dock mode and using its advanced options that enable the user to increase the range of flexibility for chosen receptor fragments or to exclude user-selected binding modes from docking search. Furthermore, we demonstrate a strategy to improve CABS-dock performance by assessing the quality of models with classical molecular dynamics. Finally, we discuss the promising extensions and applications of the CABS-dock method and provide a tutorial appendix for the convenient analysis and visualization of CABS-dock results. The CABS-dock web server is freely available at http://biocomp.chem.uw.edu.pl/CABSdock/.
Collapse
|
5
|
Wabik J, Kurcinski M, Kolinski A. Coarse-Grained Modeling of Peptide Docking Associated with Large Conformation Transitions of the Binding Protein: Troponin I Fragment-Troponin C System. Molecules 2015; 20:10763-80. [PMID: 26111167 PMCID: PMC6272278 DOI: 10.3390/molecules200610763] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Revised: 05/14/2015] [Accepted: 05/21/2015] [Indexed: 11/25/2022] Open
Abstract
Most of the current docking procedures are focused on fine conformational adjustments of assembled complexes and fail to reproduce large-scale protein motion. In this paper, we test a new modeling approach developed to address this problem. CABS-dock is a versatile and efficient tool for modeling the structure, dynamics and interactions of protein complexes. The docking protocol employs a coarse-grained representation of proteins, a simplified model of interactions and advanced protocols for conformational sampling. CABS-dock is one of the very few tools that allow unrestrained docking with large conformational freedom of the receptor. In an example application we modeled the process of complex assembly between two proteins: Troponin C (TnC) and the N-terminal helix of Troponin I (TnI N-helix), which occurs in vivo during muscle contraction. Docking simulations illustrated how the TnC molecule undergoes significant conformational transition on complex formation, a phenomenon that can be modeled only when protein flexibility is properly accounted for. This way our procedure opens up a new possibility for studying mechanisms of protein complex assembly, which may be a supporting tool for rational drug design.
Collapse
Affiliation(s)
- Jacek Wabik
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | - Mateusz Kurcinski
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | - Andrzej Kolinski
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| |
Collapse
|
6
|
Kmiecik S, Jamroz M, Kolinski M. Structure prediction of the second extracellular loop in G-protein-coupled receptors. Biophys J 2015; 106:2408-16. [PMID: 24896119 PMCID: PMC4052351 DOI: 10.1016/j.bpj.2014.04.022] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2014] [Revised: 03/26/2014] [Accepted: 04/17/2014] [Indexed: 12/29/2022] Open
Abstract
G-protein-coupled receptors (GPCRs) play key roles in living organisms. Therefore, it is important to determine their functional structures. The second extracellular loop (ECL2) is a functionally important region of GPCRs, which poses significant challenge for computational structure prediction methods. In this work, we evaluated CABS, a well-established protein modeling tool for predicting ECL2 structure in 13 GPCRs. The ECL2s (with between 13 and 34 residues) are predicted in an environment of other extracellular loops being fully flexible and the transmembrane domain fixed in its x-ray conformation. The modeling procedure used theoretical predictions of ECL2 secondary structure and experimental constraints on disulfide bridges. Our approach yielded ensembles of low-energy conformers and the most populated conformers that contained models close to the available x-ray structures. The level of similarity between the predicted models and x-ray structures is comparable to that of other state-of-the-art computational methods. Our results extend other studies by including newly crystallized GPCRs.
Collapse
Affiliation(s)
- Sebastian Kmiecik
- University of Warsaw, Faculty of Chemistry, Laboratory of Theory of Biopolymers, Pasteura 1, 02-093 Warsaw, Poland
| | - Michal Jamroz
- University of Warsaw, Faculty of Chemistry, Laboratory of Theory of Biopolymers, Pasteura 1, 02-093 Warsaw, Poland
| | - Michal Kolinski
- Mossakowski Medical Research Center, Polish Academy of Sciences, Bioinformatics Laboratory, Pawinskiego 5, 02-106 Warsaw, Poland.
| |
Collapse
|
7
|
Krüger DM, Ignacio Garzón J, Chacón P, Gohlke H. DrugScorePPI knowledge-based potentials used as scoring and objective function in protein-protein docking. PLoS One 2014; 9:e89466. [PMID: 24586799 PMCID: PMC3931789 DOI: 10.1371/journal.pone.0089466] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Accepted: 01/20/2014] [Indexed: 02/06/2023] Open
Abstract
The distance-dependent knowledge-based DrugScorePPI potentials, previously developed for in silico alanine scanning and hot spot prediction on given structures of protein-protein complexes, are evaluated as a scoring and objective function for the structure prediction of protein-protein complexes. When applied for ranking “unbound perturbation” (“unbound docking”) decoys generated by Baker and coworkers a 4-fold (1.5-fold) enrichment of acceptable docking solutions in the top ranks compared to a random selection is found. When applied as an objective function in FRODOCK for bound protein-protein docking on 97 complexes of the ZDOCK benchmark 3.0, DrugScorePPI/FRODOCK finds up to 10% (15%) more high accuracy solutions in the top 1 (top 10) predictions than the original FRODOCK implementation. When used as an objective function for global unbound protein-protein docking, fair docking success rates are obtained, which improve by ∼2-fold to 18% (58%) for an at least acceptable solution in the top 10 (top 100) predictions when performing knowledge-driven unbound docking. This suggests that DrugScorePPI balances well several different types of interactions important for protein-protein recognition. The results are discussed in view of the influence of crystal packing and the type of protein-protein complex docked. Finally, a simple criterion is provided with which to estimate a priori if unbound docking with DrugScorePPI/FRODOCK will be successful.
Collapse
Affiliation(s)
- Dennis M. Krüger
- Institute for Pharmaceutical and Medicinal Chemistry, Heinrich-Heine-University, Düsseldorf, Germany
| | - José Ignacio Garzón
- Rocasolano Physical Chemistry Institute, Consejo Superior de Investigaciones Científicas, Madrid, Spain
| | - Pablo Chacón
- Rocasolano Physical Chemistry Institute, Consejo Superior de Investigaciones Científicas, Madrid, Spain
| | - Holger Gohlke
- Institute for Pharmaceutical and Medicinal Chemistry, Heinrich-Heine-University, Düsseldorf, Germany
- * E-mail:
| |
Collapse
|
8
|
Ceres N, Lavery R. Coarse-grain Protein Models. INNOVATIONS IN BIOMOLECULAR MODELING AND SIMULATIONS 2012. [DOI: 10.1039/9781849735049-00219] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Coarse-graining is a powerful approach for modeling biomolecules that, over the last few decades, has been extensively applied to proteins. Coarse-grain models offer access to large systems and to slow processes without becoming computationally unmanageable. In addition, they are very versatile, enabling both the protein representation and the energy function to be adapted to the biological problem in hand. This review concentrates on modeling soluble proteins and their assemblies. It presents an overview of the coarse-grain representations, of the associated interaction potentials, and of the optimization procedures used to define them. It then shows how coarse-grain models have been used to understand processes involving proteins, from their initial folding to their functional properties, their binary interactions, and the assembly of large complexes.
Collapse
Affiliation(s)
- N. Ceres
- Bases Moléculaires et Structurales des Systèmes Infectieux Université Lyon1/CNRS UMR 5086, IBCP, 7 Passage du Vercors, 69367, Lyon France
| | - R. Lavery
- Bases Moléculaires et Structurales des Systèmes Infectieux Université Lyon1/CNRS UMR 5086, IBCP, 7 Passage du Vercors, 69367, Lyon France
| |
Collapse
|
9
|
Latek D, Kolinski A. CABS-NMR-De novo tool for rapid global fold determination from chemical shifts, residual dipolar couplings and sparse methyl-methyl noes. J Comput Chem 2010; 32:536-44. [DOI: 10.1002/jcc.21640] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2009] [Revised: 06/27/2010] [Accepted: 06/27/2010] [Indexed: 01/20/2023]
|
10
|
Kurcinski M, Kolinski A. Theoretical study of molecular mechanism of binding TRAP220 coactivator to Retinoid X Receptor alpha, activated by 9-cis retinoic acid. J Steroid Biochem Mol Biol 2010; 121:124-9. [PMID: 20398753 PMCID: PMC2906686 DOI: 10.1016/j.jsbmb.2010.03.086] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/09/2009] [Accepted: 03/26/2010] [Indexed: 01/22/2023]
Abstract
Study on molecular mechanism of conformational reorientation of RXR-alpha ligand binding domain is presented. We employed CABS--a reduced model of protein dynamics to model folding pathways of binding 9-cis retinoic acid to apo-RXR molecule and TRAP220 peptide fragment to the holo form. Based on obtained results we also propose a sequential model of RXR activation by 9-cis retinoic acid and TRAP220 coactivator. Methodology presented here may be used for investigation of binding pathways of other NR/hormone/cofactor sets.
Collapse
Affiliation(s)
- Mateusz Kurcinski
- Department of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | | |
Collapse
|
11
|
Jamroz M, Kolinski A. Modeling of loops in proteins: a multi-method approach. BMC STRUCTURAL BIOLOGY 2010; 10:5. [PMID: 20149252 PMCID: PMC2837870 DOI: 10.1186/1472-6807-10-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2009] [Accepted: 02/11/2010] [Indexed: 11/23/2022]
Abstract
Background Template-target sequence alignment and loop modeling are key components of protein comparative modeling. Short loops can be predicted with high accuracy using structural fragments from other, not necessairly homologous proteins, or by various minimization methods. For longer loops multiscale approaches employing coarse-grained de novo modeling techniques should be more effective. Results For a representative set of protein structures of various structural classes test predictions of loop regions have been performed using MODELLER, ROSETTA, and a CABS coarse-grained de novo modeling tool. Loops of various length, from 4 to 25 residues, were modeled assuming an ideal target-template alignment of the remaining portions of the protein. It has been shown that classical modeling with MODELLER is usually better for short loops, while coarse-grained de novo modeling is more effective for longer loops. Even very long missing fragments in protein structures could be effectively modeled. Resolution of such models is usually on the level 2-6 Å, which could be sufficient for guiding protein engineering. Further improvement of modeling accuracy could be achieved by the combination of different methods. In particular, we used 10 top ranked models from sets of 500 models generated by MODELLER as multiple templates for CABS modeling. On average, the resulting molecular models were better than the models from individual methods. Conclusions Accuracy of protein modeling, as demonstrated for the problem of loop modeling, could be improved by the combinations of different modeling techniques.
Collapse
Affiliation(s)
- Michal Jamroz
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | | |
Collapse
|
12
|
Latek D, Kolinski A. Contact prediction in protein modeling: scoring, folding and refinement of coarse-grained models. BMC STRUCTURAL BIOLOGY 2008; 8:36. [PMID: 18694501 PMCID: PMC2527566 DOI: 10.1186/1472-6807-8-36] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/16/2008] [Accepted: 08/11/2008] [Indexed: 11/10/2022]
Abstract
BACKGROUND Several different methods for contact prediction succeeded within the Sixth Critical Assessment of Techniques for Protein Structure Prediction (CASP6). The most relevant were non-local contact predictions for targets from the most difficult categories: fold recognition-analogy and new fold. Such contacts could provide valuable structural information in case a template structure cannot be found in the PDB. RESULTS We described comprehensive tests of the effectiveness of contact data in various aspects of de novo modeling with CABS, an algorithm which was used successfully in CASP6 by the Kolinski-Bujnicki group. We used the predicted contacts in a simple scoring function for the post-simulation ranking of protein models and as a soft bias in the folding simulations and in the fold-refinement procedure. The latter approach turned out to be the most successful. The CABS force field used in the Replica Exchange Monte Carlo simulations cooperated with the true contacts and discriminated the false ones, which resulted in an improvement of the majority of Kolinski-Bujnicki's protein models. In the modeling we tested different sets of predicted contact data submitted to the CASP6 server. According to our results, the best performing were the contacts with the accuracy balanced with the coverage, obtained either from the best two predictors only or by a consensus from as many predictors as possible. CONCLUSION Our tests have shown that theoretically predicted contacts can be very beneficial for protein structure prediction. Depending on the protein modeling method, a contact data set applied should be prepared with differently balanced coverage and accuracy of predicted contacts. Namely, high coverage of contact data is important for the model ranking and high accuracy for the folding simulations.
Collapse
Affiliation(s)
- Dorota Latek
- Faculty of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | | |
Collapse
|
13
|
Predicting the complex structure and functional motions of the outer membrane transporter and signal transducer FecA. Biophys J 2008; 94:2482-91. [PMID: 18178655 DOI: 10.1529/biophysj.107.116046] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Escherichia coli requires an efficient transport and signaling system to successfully sequester iron from its environment. FecA, a TonB-dependent protein, serves a critical role in this process: first, it binds and transports iron in the form of ferric citrate, and second, it initiates a signaling cascade that results in the transcription of several iron transporter genes in interaction with inner membrane proteins. The structure of the plug and barrel domains and the periplasmic N-terminal domain (NTD) are separately available. However, the linker connecting the plug and barrel and the NTD domains is highly mobile, which may prevent the determination of the FecA structure as a whole assembly. Here, we reduce the conformation space of this linker into most probable structural models using the modeling tool CABS, then apply normal-mode analysis to investigate the motions of the whole structure of FecA by using elastic network models. We relate the FecA domain motions to the outer-inner membrane communication, which initiates transcription. We observe that the global motions of FecA assign flexibility to the TonB box and the NTD, and control the exposure of the TonB box for binding to the TonB inner membrane protein, suggesting how these motions relate to FecA function. Our simulations suggest the presence of a communication between the loops on both ends of the protein, a signaling mechanism by which a signal could be transmitted by conformational transitions in response to the binding of ferric citrate.
Collapse
|