1
|
Nordquist E, Zhang G, Barethiya S, Ji N, White KM, Han L, Jia Z, Shi J, Cui J, Chen J. Incorporating physics to overcome data scarcity in predictive modeling of protein function: A case study of BK channels. PLoS Comput Biol 2023; 19:e1011460. [PMID: 37713443 PMCID: PMC10529646 DOI: 10.1371/journal.pcbi.1011460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 09/27/2023] [Accepted: 08/24/2023] [Indexed: 09/17/2023] Open
Abstract
Machine learning has played transformative roles in numerous chemical and biophysical problems such as protein folding where large amount of data exists. Nonetheless, many important problems remain challenging for data-driven machine learning approaches due to the limitation of data scarcity. One approach to overcome data scarcity is to incorporate physical principles such as through molecular modeling and simulation. Here, we focus on the big potassium (BK) channels that play important roles in cardiovascular and neural systems. Many mutants of BK channel are associated with various neurological and cardiovascular diseases, but the molecular effects are unknown. The voltage gating properties of BK channels have been characterized for 473 site-specific mutations experimentally over the last three decades; yet, these functional data by themselves remain far too sparse to derive a predictive model of BK channel voltage gating. Using physics-based modeling, we quantify the energetic effects of all single mutations on both open and closed states of the channel. Together with dynamic properties derived from atomistic simulations, these physical descriptors allow the training of random forest models that could reproduce unseen experimentally measured shifts in gating voltage, ∆V1/2, with a RMSE ~ 32 mV and correlation coefficient of R ~ 0.7. Importantly, the model appears capable of uncovering nontrivial physical principles underlying the gating of the channel, including a central role of hydrophobic gating. The model was further evaluated using four novel mutations of L235 and V236 on the S5 helix, mutations of which are predicted to have opposing effects on V1/2 and suggest a key role of S5 in mediating voltage sensor-pore coupling. The measured ∆V1/2 agree quantitatively with prediction for all four mutations, with a high correlation of R = 0.92 and RMSE = 18 mV. Therefore, the model can capture nontrivial voltage gating properties in regions where few mutations are known. The success of predictive modeling of BK voltage gating demonstrates the potential of combining physics and statistical learning for overcoming data scarcity in nontrivial protein function prediction.
Collapse
Affiliation(s)
- Erik Nordquist
- Department of Chemistry, University of Massachusetts Amherst, Amherst, Massachusetts, United States of America
| | - Guohui Zhang
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Shrishti Barethiya
- Department of Chemistry, University of Massachusetts Amherst, Amherst, Massachusetts, United States of America
| | - Nathan Ji
- Department of Biology, Boston College, Chestnut Hill, Massachusetts, United States of America
| | - Kelli M. White
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Lu Han
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Zhiguang Jia
- Department of Chemistry, University of Massachusetts Amherst, Amherst, Massachusetts, United States of America
| | - Jingyi Shi
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Jianmin Cui
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Jianhan Chen
- Department of Chemistry, University of Massachusetts Amherst, Amherst, Massachusetts, United States of America
| |
Collapse
|
2
|
Nordquist E, Zhang G, Barethiya S, Ji N, White KM, Han L, Jia Z, Shi J, Cui J, Chen J. Incorporating physics to overcome data scarcity in predictive modeling of protein function: a case study of BK channels. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.24.546384. [PMID: 37425916 PMCID: PMC10327070 DOI: 10.1101/2023.06.24.546384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
Machine learning has played transformative roles in numerous chemical and biophysical problems such as protein folding where large amount of data exists. Nonetheless, many important problems remain challenging for data-driven machine learning approaches due to the limitation of data scarcity. One approach to overcome data scarcity is to incorporate physical principles such as through molecular modeling and simulation. Here, we focus on the big potassium (BK) channels that play important roles in cardiovascular and neural systems. Many mutants of BK channel are associated with various neurological and cardiovascular diseases, but the molecular effects are unknown. The voltage gating properties of BK channels have been characterized for 473 site-specific mutations experimentally over the last three decades; yet, these functional data by themselves remain far too sparse to derive a predictive model of BK channel voltage gating. Using physics-based modeling, we quantify the energetic effects of all single mutations on both open and closed states of the channel. Together with dynamic properties derived from atomistic simulations, these physical descriptors allow the training of random forest models that could reproduce unseen experimentally measured shifts in gating voltage, ΔV 1/2 , with a RMSE ∼ 32 mV and correlation coefficient of R ∼ 0.7. Importantly, the model appears capable of uncovering nontrivial physical principles underlying the gating of the channel, including a central role of hydrophobic gating. The model was further evaluated using four novel mutations of L235 and V236 on the S5 helix, mutations of which are predicted to have opposing effects on V 1/2 and suggest a key role of S5 in mediating voltage sensor-pore coupling. The measured ΔV 1/2 agree quantitatively with prediction for all four mutations, with a high correlation of R = 0.92 and RMSE = 18 mV. Therefore, the model can capture nontrivial voltage gating properties in regions where few mutations are known. The success of predictive modeling of BK voltage gating demonstrates the potential of combining physics and statistical learning for overcoming data scarcity in nontrivial protein function prediction. Author Summary Deep machine learning has brought many exciting breakthroughs in chemistry, physics and biology. These models require large amount of training data and struggle when the data is scarce. The latter is true for predictive modeling of the function of complex proteins such as ion channels, where only hundreds of mutational data may be available. Using the big potassium (BK) channel as a biologically important model system, we demonstrate that a reliable predictive model of its voltage gating property could be derived from only 473 mutational data by incorporating physics-derived features, which include dynamic properties from molecular dynamics simulations and energetic quantities from Rosetta mutation calculations. We show that the final random forest model captures key trends and hotspots in mutational effects of BK voltage gating, such as the important role of pore hydrophobicity. A particularly curious prediction is that mutations of two adjacent residues on the S5 helix would always have opposite effects on the gating voltage, which was confirmed by experimental characterization of four novel mutations. The current work demonstrates the importance and effectiveness of incorporating physics in predictive modeling of protein function with scarce data.
Collapse
Affiliation(s)
- Erik Nordquist
- Department of Chemistry, University of Massachusetts Amherst, Amherst, Massachusetts, USA
| | - Guohui Zhang
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Shrishti Barethiya
- Department of Chemistry, University of Massachusetts Amherst, Amherst, Massachusetts, USA
| | - Nathan Ji
- Department of Biology, Boston College, Chestnut Hill, Massachusetts, USA
| | - Kelli M. White
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Lu Han
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Zhiguang Jia
- Department of Chemistry, University of Massachusetts Amherst, Amherst, Massachusetts, USA
| | - Jingyi Shi
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Jianmin Cui
- Department of Biomedical Engineering, Center for the Investigation of Membrane Excitability Disorders, Cardiac Bioelectricity and Arrhythmia Center, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Jianhan Chen
- Department of Chemistry, University of Massachusetts Amherst, Amherst, Massachusetts, USA
| |
Collapse
|
3
|
Nordquist EB, Clerico EM, Chen J, Gierasch LM. Computationally-Aided Modeling of Hsp70-Client Interactions: Past, Present, and Future. J Phys Chem B 2022; 126:6780-6791. [PMID: 36040440 PMCID: PMC10309085 DOI: 10.1021/acs.jpcb.2c03806] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Hsp70 molecular chaperones play central roles in maintaining a healthy cellular proteome. Hsp70s function by binding to short peptide sequences in incompletely folded client proteins, thus preventing them from misfolding and/or aggregating, and in many cases holding them in a state that is competent for subsequent processes like translocation across membranes. There is considerable interest in predicting the sites where Hsp70s may bind their clients, as the ability to do so sheds light on the cellular functions of the chaperone. In addition, the capacity of the Hsp70 chaperone family to bind to a broad array of clients and to identify accessible sequences that enable discrimination of those that are folded from those that are not fully folded, which is essential to their cellular roles, is a fascinating puzzle in molecular recognition. In this article we discuss efforts to harness computational modeling with input from experimental data to develop a predictive understanding of the promiscuous yet selective binding of Hsp70 molecular chaperones to accessible sequences within their client proteins. We trace how an increasing understanding of the complexities of Hsp70-client interactions has led computational modeling to new underlying assumptions and design features. We describe the trend from purely data-driven analysis toward increased reliance on physics-based modeling that deeply integrates structural information and sequence-based functional data with physics-based binding energies. Notably, new experimental insights are adding to our understanding of the molecular origins of "selective promiscuity" in substrate binding by Hsp70 chaperones and challenging the underlying assumptions and design used in earlier predictive models. Taking the new experimental findings together with exciting progress in computational modeling of protein structures leads us to foresee a bright future for a predictive understanding of selective-yet-promiscuous binding exploited by Hsp70 molecular chaperones; the resulting new insights will also apply to substrate binding by other chaperones and by signaling proteins.
Collapse
Affiliation(s)
- Erik B. Nordquist
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts, 01003, United States
| | - Eugenia M. Clerico
- Department of Biochemistry and Molecular Biology, University of Massachusetts, Amherst, Massachusetts, 01003, United States
| | - Jianhan Chen
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts, 01003, United States
- Department of Biochemistry and Molecular Biology, University of Massachusetts, Amherst, Massachusetts, 01003, United States
| | - Lila M. Gierasch
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts, 01003, United States
- Department of Biochemistry and Molecular Biology, University of Massachusetts, Amherst, Massachusetts, 01003, United States
| |
Collapse
|
4
|
Martinez JC, Castillo F, Ruiz-Sanz J, Murciano-Calles J, Camara-Artigas A, Luque I. Understanding binding affinity and specificity of modular protein domains: A focus in ligand design for the polyproline-binding families. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 130:161-188. [PMID: 35534107 DOI: 10.1016/bs.apcsb.2021.12.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Within the modular protein domains there are five families that recognize proline-rich sequences: SH3, WW, EVH1, GYF and UEV domains. This chapter reviews the main strategies developed for the design of ligands for these families, including peptides, peptidomimetics and drugs. We also describe some studies aimed to understand the molecular reasons responsible for the intrinsic affinity and specificity of these domains.
Collapse
Affiliation(s)
- Jose C Martinez
- Departamento de Química Física e Instituto de Biotecnología, Facultad de Ciencias, Universidad de Granada, Granada, Spain.
| | - Francisco Castillo
- Fundación MEDINA, Centro de Excelencia en Investigación de Medicamentos Innovadores en Andalucía, Parque Tecnológico Ciencias de la Salud, Granada, Spain
| | - Javier Ruiz-Sanz
- Departamento de Química Física e Instituto de Biotecnología, Facultad de Ciencias, Universidad de Granada, Granada, Spain
| | - Javier Murciano-Calles
- Departamento de Química Física e Instituto de Biotecnología, Facultad de Ciencias, Universidad de Granada, Granada, Spain
| | - Ana Camara-Artigas
- Departamento de Química Física, Universidad de Almería, Campus de Excelencia Internacional Agroalimentario ceiA3 y CIAMBITAL, Almeria, Spain
| | - Irene Luque
- Departamento de Química Física e Instituto de Biotecnología, Facultad de Ciencias, Universidad de Granada, Granada, Spain
| |
Collapse
|
5
|
Kazlauskas A, Schmotz C, Kesti T, Hepojoki J, Kleino I, Kaneko T, Li SSC, Saksela K. Large-Scale Screening of Preferred Interactions of Human Src Homology-3 (SH3) Domains Using Native Target Proteins as Affinity Ligands. Mol Cell Proteomics 2016; 15:3270-3281. [PMID: 27440912 DOI: 10.1074/mcp.m116.060483] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2016] [Indexed: 12/17/2022] Open
Abstract
The Src Homology-3 (SH3) domains are ubiquitous protein modules that mediate important intracellular protein interactions via binding to short proline-rich consensus motifs in their target proteins. The affinity and specificity of such core SH3 - ligand contacts are typically modest, but additional binding interfaces can give rise to stronger and more specific SH3-mediated interactions. To understand how commonly such robust SH3 interactions occur in the human protein interactome, and to identify these in an unbiased manner we have expressed 324 predicted human SH3 ligands as full-length proteins in mammalian cells, and screened for their preferred SH3 partners using a phage display-based approach. This discovery platform contains an essentially complete repertoire of the ∼300 human SH3 domains, and involves an inherent binding threshold that ensures selective identification of only SH3 interactions with relatively high affinity. Such strong and selective SH3 partners could be identified for only 19 of these 324 predicted ligand proteins, suggesting that the majority of human SH3 interactions are relatively weak, and thereby have capacity for only modest inherent selectivity. The panel of exceptionally robust SH3 interactions identified here provides a rich source of leads and hypotheses for further studies. However, a truly comprehensive characterization of the human SH3 interactome will require novel high-throughput methods based on function instead of absolute binding affinity.
Collapse
Affiliation(s)
- Arunas Kazlauskas
- From the ‡Department of Virology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Constanze Schmotz
- From the ‡Department of Virology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Tapio Kesti
- From the ‡Department of Virology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Jussi Hepojoki
- From the ‡Department of Virology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Iivari Kleino
- From the ‡Department of Virology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Tomonori Kaneko
- §Department of Biochemistry and the Siebens-Drake Research Institute, Schulich School of Medicine and Dentistry, University of Western Ontario, London, Ontario N6A 5C1, Canada
| | - Shawn S C Li
- §Department of Biochemistry and the Siebens-Drake Research Institute, Schulich School of Medicine and Dentistry, University of Western Ontario, London, Ontario N6A 5C1, Canada
| | - Kalle Saksela
- From the ‡Department of Virology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland;
| |
Collapse
|
6
|
Kamisetty H, Ghosh B, Langmead CJ, Bailey-Kellogg C. Learning sequence determinants of protein:protein interaction specificity with sparse graphical models. J Comput Biol 2015; 22:474-86. [PMID: 25973864 DOI: 10.1089/cmb.2014.0289] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In studying the strength and specificity of interaction between members of two protein families, key questions center on which pairs of possible partners actually interact, how well they interact, and why they interact while others do not. The advent of large-scale experimental studies of interactions between members of a target family and a diverse set of possible interaction partners offers the opportunity to address these questions. We develop here a method, DgSpi (data-driven graphical models of specificity in protein:protein interactions), for learning and using graphical models that explicitly represent the amino acid basis for interaction specificity (why) and extend earlier classification-oriented approaches (which) to predict the ΔG of binding (how well). We demonstrate the effectiveness of our approach in analyzing and predicting interactions between a set of 82 PDZ recognition modules against a panel of 217 possible peptide partners, based on data from MacBeath and colleagues. Our predicted ΔG values are highly predictive of the experimentally measured ones, reaching correlation coefficients of 0.69 in 10-fold cross-validation and 0.63 in leave-one-PDZ-out cross-validation. Furthermore, the model serves as a compact representation of amino acid constraints underlying the interactions, enabling protein-level ΔG predictions to be naturally understood in terms of residue-level constraints. Finally, the model DgSpi readily enables the design of new interacting partners, and we demonstrate that designed ligands are novel and diverse.
Collapse
Affiliation(s)
| | - Bornika Ghosh
- 3Department of Computer Science, Dartmouth, Hanover, New Hampshire
| | | | | |
Collapse
|
7
|
Kundu K, Mann M, Costa F, Backofen R. MoDPepInt: an interactive web server for prediction of modular domain-peptide interactions. ACTA ACUST UNITED AC 2014; 30:2668-9. [PMID: 24872426 PMCID: PMC4155253 DOI: 10.1093/bioinformatics/btu350] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Summary: MoDPepInt (Modular Domain Peptide Interaction) is a new easy-to-use web server for the prediction of binding partners for modular protein domains. Currently, we offer models for SH2, SH3 and PDZ domains via the tools SH2PepInt, SH3PepInt and PDZPepInt, respectively. More specifically, our server offers predictions for 51 SH2 human domains and 69 SH3 human domains via single domain models, and predictions for 226 PDZ domains across several species, via 43 multidomain models. All models are based on support vector machines with different kernel functions ranging from polynomial, to Gaussian, to advanced graph kernels. In this way, we model non-linear interactions between amino acid residues. Results were validated on manually curated datasets achieving competitive performance against various state-of-the-art approaches. Availability and implementation: The MoDPepInt server is available under the URL http://modpepint.informatik.uni-freiburg.de/ Contact: backofen@informatik.uni-freiburg.de Supplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kousik Kundu
- Bioinformatics Group, Department of Computer Science, 79110 Freiburg and Centre for Biological Signalling Studies (BIOSS), 79104 Freiburg, University of Freiburg, Germany
| | - Martin Mann
- Bioinformatics Group, Department of Computer Science, 79110 Freiburg and Centre for Biological Signalling Studies (BIOSS), 79104 Freiburg, University of Freiburg, Germany
| | - Fabrizio Costa
- Bioinformatics Group, Department of Computer Science, 79110 Freiburg and Centre for Biological Signalling Studies (BIOSS), 79104 Freiburg, University of Freiburg, Germany
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, 79110 Freiburg and Centre for Biological Signalling Studies (BIOSS), 79104 Freiburg, University of Freiburg, Germany Bioinformatics Group, Department of Computer Science, 79110 Freiburg and Centre for Biological Signalling Studies (BIOSS), 79104 Freiburg, University of Freiburg, Germany
| |
Collapse
|
8
|
Kamisetty H, Ghosh B, Langmead CJ, Bailey-Kellogg C. Learning Sequence Determinants of Protein:protein Interaction Specificity with Sparse Graphical Models. RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY : ... ANNUAL INTERNATIONAL CONFERENCE, RECOMB ... : PROCEEDINGS. RECOMB (CONFERENCE : 2005- ) 2014; 8394:129-143. [PMID: 25414914 DOI: 10.1007/978-3-319-05269-4_10] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
In studying the strength and specificity of interaction between members of two protein families, key questions center on which pairs of possible partners actually interact, how well they interact, and why they interact while others do not. The advent of large-scale experimental studies of interactions between members of a target family and a diverse set of possible interaction partners offers the opportunity to address these questions. We develop here a method, DgSpi (Data-driven Graphical models of Specificity in Protein:protein Interactions), for learning and using graphical models that explicitly represent the amino acid basis for interaction specificity (why) and extend earlier classification-oriented approaches (which) to predict the ΔG of binding (how well). We demonstrate the effectiveness of our approach in analyzing and predicting interactions between a set of 82 PDZ recognition modules, against a panel of 217 possible peptide partners, based on data from MacBeath and colleagues. Our predicted ΔG values are highly predictive of the experimentally measured ones, reaching correlation coefficients of 0.69 in 10-fold cross-validation and 0.63 in leave-one-PDZ-out cross-validation. Furthermore, the model serves as a compact representation of amino acid constraints underlying the interactions, enabling protein-level ΔG predictions to be naturally understood in terms of residue-level constraints. Finally, as a generative model, DgSpi readily enables the design of new interacting partners, and we demonstrate that designed ligands are novel and diverse.
Collapse
|
9
|
Kundu K, Costa F, Backofen R. A graph kernel approach for alignment-free domain-peptide interaction prediction with an application to human SH3 domains. Bioinformatics 2013; 29:i335-43. [PMID: 23813002 PMCID: PMC3694653 DOI: 10.1093/bioinformatics/btt220] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
MOTIVATION State-of-the-art experimental data for determining binding specificities of peptide recognition modules (PRMs) is obtained by high-throughput approaches like peptide arrays. Most prediction tools applicable to this kind of data are based on an initial multiple alignment of the peptide ligands. Building an initial alignment can be error-prone, especially in the case of the proline-rich peptides bound by the SH3 domains. RESULTS Here, we present a machine-learning approach based on an efficient graph-kernel technique to predict the specificity of a large set of 70 human SH3 domains, which are an important class of PRMs. The graph-kernel strategy allows us to (i) integrate several types of physico-chemical information for each amino acid, (ii) consider high-order correlations between these features and (iii) eliminate the need for an initial peptide alignment. We build specialized models for each human SH3 domain and achieve competitive predictive performance of 0.73 area under precision-recall curve, compared with 0.27 area under precision-recall curve for state-of-the-art methods based on position weight matrices. We show that better models can be obtained when we use information on the noninteracting peptides (negative examples), which is currently not used by the state-of-the art approaches based on position weight matrices. To this end, we analyze two strategies to identify subsets of high confidence negative data. The techniques introduced here are more general and hence can also be used for any other protein domains, which interact with short peptides (i.e. other PRMs). AVAILABILITY The program with the predictive models can be found at http://www.bioinf.uni-freiburg.de/Software/SH3PepInt/SH3PepInt.tar.gz. We also provide a genome-wide prediction for all 70 human SH3 domains, which can be found under http://www.bioinf.uni-freiburg.de/Software/SH3PepInt/Genome-Wide-Predictions.tar.gz. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kousik Kundu
- Bioinformatics Group, Department of Computer Science, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
| | | | | |
Collapse
|
10
|
Li N, Stein RSL, He W, Komives E, Wang W. Identification of methyllysine peptides binding to chromobox protein homolog 6 chromodomain in the human proteome. Mol Cell Proteomics 2013; 12:2750-60. [PMID: 23842000 DOI: 10.1074/mcp.o112.025015] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Methylation is one of the important post-translational modifications that play critical roles in regulating protein functions. Proteomic identification of this post-translational modification and understanding how it affects protein activity remain great challenges. We tackled this problem from the aspect of methylation mediating protein-protein interaction. Using the chromodomain of human chromobox protein homolog 6 as a model system, we developed a systematic approach that integrates structure modeling, bioinformatics analysis, and peptide microarray experiments to identify lysine residues that are methylated and recognized by the chromodomain in the human proteome. Given the important role of chromobox protein homolog 6 as a reader of histone modifications, it was interesting to find that the majority of its interacting partners identified via this approach function in chromatin remodeling and transcriptional regulation. Our study not only illustrates a novel angle for identifying methyllysines on a proteome-wide scale and elucidating their potential roles in regulating protein function, but also suggests possible strategies for engineering the chromodomain-peptide interface to enhance the recognition of and manipulate the signal transduction mediated by such interactions.
Collapse
Affiliation(s)
- Nan Li
- Department of Chemistry and Biochemistry, 9500 Gilman Drive, University of California, San Diego, La Jolla, California 92093-0359
| | | | | | | | | |
Collapse
|
11
|
González AJ, Liao L, Wu CH. Prediction of contact matrix for protein-protein interaction. ACTA ACUST UNITED AC 2013; 29:1018-25. [PMID: 23418186 DOI: 10.1093/bioinformatics/btt076] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
MOTIVATION Prediction of protein-protein interaction has become an important part of systems biology in reverse engineering the biological networks for better understanding the molecular biology of the cell. Although significant progress has been made in terms of prediction accuracy, most computational methods only predict whether two proteins interact but not their interacting residues-the information that can be very valuable for understanding the interaction mechanisms and designing modulation of the interaction. In this work, we developed a computational method to predict the interacting residue pairs-contact matrix for interacting protein domains, whose rows and columns correspond to the residues in the two interacting domains respectively and whose values (1 or 0) indicate whether the corresponding residues (do or do not) interact. RESULTS Our method is based on supervised learning using support vector machines. For each domain involved in a given domain-domain interaction (DDI), an interaction profile hidden Markov model (ipHMM) is first built for the domain family, and then each residue position for a member domain sequence is represented as a 20-dimension vector of Fisher scores, characterizing how similar it is as compared with the family profile at that position. Each element of the contact matrix for a sequence pair is now represented by a feature vector from concatenating the vectors of the two corresponding residues, and the task is to predict the element value (1 or 0) from the feature vector. A support vector machine is trained for a given DDI, using either a consensus contact matrix or contact matrices for individual sequence pairs, and is tested by leave-one-out cross validation. The performance averaged over a set of 115 DDIs collected from the 3 DID database shows significant improvement (sensitivity up to 85%, and specificity up to 85%), as compared with a multiple sequence alignment-based method (sensitivity 57%, and specificity 78%) previously reported in the literature. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Alvaro J González
- Department of Computer and Information Sciences, University of Delaware, Newark, DE 19716, USA
| | | | | |
Collapse
|
12
|
Hou T, Li N, Li Y, Wang W. Characterization of domain-peptide interaction interface: prediction of SH3 domain-mediated protein-protein interaction network in yeast by generic structure-based models. J Proteome Res 2012; 11:2982-95. [PMID: 22468754 PMCID: PMC3345086 DOI: 10.1021/pr3000688] [Citation(s) in RCA: 176] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Determination of the binding specificity of SH3 domain, a peptide recognition module (PRM), is important to understand their biological functions and reconstruct the SH3-mediated protein-protein interaction network. In the present study, the SH3-peptide interactions for both class I and II SH3 domains were characterized by the intermolecular residue-residue interaction network. We developed generic MIEC-SVM models to infer SH3 domain-peptide recognition specificity that achieved satisfactory prediction accuracy. By investigating the domain-peptide recognition mechanisms at the residue level, we found that the class-I and class-II binding peptides have different binding modes even though they occupy the same binding site of SH3. Furthermore, we predicted the potential binding partners of SH3 domains in the yeast proteome and constructed the SH3-mediated protein-protein interaction network. Comparison with the experimentally determined interactions confirmed the effectiveness of our approach. This study showed that our sophisticated computational approach not only provides a powerful platform to decipher protein recognition code at the molecular level but also allows identification of peptide-mediated protein interactions at a proteomic scale. We believe that such an approach is general to be applicable to other domain-peptide interactions.
Collapse
Affiliation(s)
- Tingjun Hou
- Institute of Functional Nano & Soft Materials (FUNSOM) and Jiangsu Key Laboratory for Carbon-Based Functional Materials & Devices, Soochow University, Suzhou, Jiangsu 215123, China
- College of Pharmaceutical Science, Soochow University, Suzhou, Jiangsu 215123, China
| | - Nan Li
- Department of Chemistry and Biochemistry, University of California at San Diego, La Jolla, California 92093, United States
| | - Youyong Li
- Institute of Functional Nano & Soft Materials (FUNSOM) and Jiangsu Key Laboratory for Carbon-Based Functional Materials & Devices, Soochow University, Suzhou, Jiangsu 215123, China
| | - Wei Wang
- Department of Chemistry and Biochemistry, University of California at San Diego, La Jolla, California 92093, United States
| |
Collapse
|
13
|
Li L, Zhao B, Du J, Zhang K, Ling CX, Li SSC. DomPep--a general method for predicting modular domain-mediated protein-protein interactions. PLoS One 2011; 6:e25528. [PMID: 22003397 PMCID: PMC3189207 DOI: 10.1371/journal.pone.0025528] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2011] [Accepted: 09/05/2011] [Indexed: 01/07/2023] Open
Abstract
Protein-protein interactions (PPIs) are frequently mediated by the binding of a modular domain in one protein to a short, linear peptide motif in its partner. The advent of proteomic methods such as peptide and protein arrays has led to the accumulation of a wealth of interaction data for modular interaction domains. Although several computational programs have been developed to predict modular domain-mediated PPI events, they are often restricted to a given domain type. We describe DomPep, a method that can potentially be used to predict PPIs mediated by any modular domains. DomPep combines proteomic data with sequence information to achieve high accuracy and high coverage in PPI prediction. Proteomic binding data were employed to determine a simple yet novel parameter Ligand-Binding Similarity which, in turn, is used to calibrate Domain Sequence Identity and Position-Weighted-Matrix distance, two parameters that are used in constructing prediction models. Moreover, DomPep can be used to predict PPIs for both domains with experimental binding data and those without. Using the PDZ and SH2 domain families as test cases, we show that DomPep can predict PPIs with accuracies superior to existing methods. To evaluate DomPep as a discovery tool, we deployed DomPep to identify interactions mediated by three human PDZ domains. Subsequent in-solution binding assays validated the high accuracy of DomPep in predicting authentic PPIs at the proteome scale. Because DomPep makes use of only interaction data and the primary sequence of a domain, it can be readily expanded to include other types of modular domains.
Collapse
Affiliation(s)
- Lei Li
- Department of Biochemistry, Siebens-Drake Medical Research Institute, Schulich School of Medicine and Dentistry, The University of Western Ontario, London, Ontario, Canada
| | - Bing Zhao
- Department of Biochemistry, Siebens-Drake Medical Research Institute, Schulich School of Medicine and Dentistry, The University of Western Ontario, London, Ontario, Canada
| | - Jun Du
- Department of Computer Science, The University of Western Ontario, London, Ontario, Canada
| | - Kaizhong Zhang
- Department of Computer Science, The University of Western Ontario, London, Ontario, Canada
| | - Charles X. Ling
- Department of Computer Science, The University of Western Ontario, London, Ontario, Canada
| | - Shawn Shun-Cheng Li
- Department of Biochemistry, Siebens-Drake Medical Research Institute, Schulich School of Medicine and Dentistry, The University of Western Ontario, London, Ontario, Canada
- * E-mail:
| |
Collapse
|
14
|
Hou T, Li Y, Wang W. Prediction of peptides binding to the PKA RIIalpha subunit using a hierarchical strategy. ACTA ACUST UNITED AC 2011; 27:1814-21. [PMID: 21586518 DOI: 10.1093/bioinformatics/btr294] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Favorable interaction between the regulatory subunit of the cAMP-dependent protein kinase (PKA) and a peptide in A-kinase anchoring proteins (AKAPs) is critical for translocating PKA to the subcellular sites where the enzyme phosphorylates its substrates. It is very hard to identify AKAPs peptides binding to PKA due to the high sequence diversity of AKAPs. RESULTS We propose a hierarchical and efficient approach, which combines molecular dynamics (MD) simulations, free energy calculations, virtual mutagenesis (VM) and bioinformatics analyses, to predict peptides binding to the PKA RIIα regulatory subunit in the human proteome systematically. Our approach successfully retrieved 15 out of 18 documented RIIα-binding peptides. Literature curation supported that many newly predicted peptides might be true AKAPs. Here, we present the first systematic search for AKAP peptides in the human proteome, which is useful to further experimental identification of AKAPs and functional analysis of their biological roles.
Collapse
Affiliation(s)
- Tingjun Hou
- Institute of Functional Nano & Soft Materials (FUNSOM) and Jiangsu Key Laboratory for Carbon-Based Functional Materials & Devices, Soochow University, Suzhou, Jiangsu 215123, PR China.
| | | | | |
Collapse
|
15
|
He P, Wu W, Yang K, Jing T, Liao KL, Zhang W, Wang HD, Hua X. Exploring the activity space of peptides binding to diverse SH3 domains using principal property descriptors derived from amino acid rotamers. Biopolymers 2011; 96:288-301. [DOI: 10.1002/bip.21531] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
16
|
McDonald CB, Seldeen KL, Deegan BJ, Bhat V, Farooq A. Binding of the cSH3 domain of Grb2 adaptor to two distinct RXXK motifs within Gab1 docker employs differential mechanisms. J Mol Recognit 2010; 24:585-96. [PMID: 21472810 DOI: 10.1002/jmr.1080] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2010] [Revised: 07/26/2010] [Accepted: 07/26/2010] [Indexed: 12/29/2022]
Abstract
A ubiquitous component of cellular signaling machinery, Gab1 docker plays a pivotal role in routing extracellular information in the form of growth factors and cytokines to downstream targets such as transcription factors within the nucleus. Here, using isothermal titration calorimetry (ITC) in combination with macromolecular modeling (MM), we show that although Gab1 contains four distinct RXXK motifs, designated G1, G2, G3, and G4, only G1 and G2 motifs bind to the cSH3 domain of Grb2 adaptor and do so with distinct mechanisms. Thus, while the G1 motif strictly requires the PPRPPKP consensus sequence for high-affinity binding to the cSH3 domain, the G2 motif displays preference for the PXVXRXLKPXR consensus. Such sequential differences in the binding of G1 and G2 motifs arise from their ability to adopt distinct polyproline type II (PPII)- and 3(10) -helical conformations upon binding to the cSH3 domain, respectively. Collectively, our study provides detailed biophysical insights into a key protein-protein interaction involved in a diverse array of signaling cascades central to health and disease.
Collapse
Affiliation(s)
- Caleb B McDonald
- Department of Biochemistry & Molecular Biology, USylvester Braman Family Breast Cancer Institute, Leonard Miller School of Medicine, University of Miami, Miami, FL 33136, USA
| | | | | | | | | |
Collapse
|
17
|
Hong S, Chung T, Kim D. SH3 domain-peptide binding energy calculations based on structural ensemble and multiple peptide templates. PLoS One 2010; 5:e12654. [PMID: 20856816 PMCID: PMC2939891 DOI: 10.1371/journal.pone.0012654] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2010] [Accepted: 08/16/2010] [Indexed: 11/26/2022] Open
Abstract
SH3 domains mediate signal transduction by recognizing short peptides. Understanding of the driving forces in peptide recognitions will help us to predict the binding specificity of the domain-peptide recognition and to understand the molecular interaction networks of cells. However, accurate calculation of the binding energy is a tough challenge. In this study, we propose three ideas for improving our ability to predict the binding energy between SH3 domains and peptides: (1) utilizing the structural ensembles sampled from a molecular dynamics simulation trajectory, (2) utilizing multiple peptide templates, and (3) optimizing the sequence-structure mapping. We tested these three ideas on ten previously studied SH3 domains for which SPOT analysis data were available. The results indicate that calculating binding energy using the structural ensemble was most effective, clearly increasing the prediction accuracy, while the second and third ideas tended to give better binding energy predictions. We applied our method to the five SH3 targets in DREAM4 Challenge and selected the best performing method.
Collapse
Affiliation(s)
- Seungpyo Hong
- Department of Bio and Brain Engineering, KAIST, Daejeon, South Korea
| | | | | |
Collapse
|
18
|
Dergai M, Tsyba L, Dergai O, Zlatskii I, Skrypkina I, Kovalenko V, Rynditch A. Microexon-based regulation of ITSN1 and Src SH3 domains specificity relies on introduction of charged amino acids into the interaction interface. Biochem Biophys Res Commun 2010; 399:307-12. [PMID: 20659428 DOI: 10.1016/j.bbrc.2010.07.080] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2010] [Accepted: 07/22/2010] [Indexed: 11/25/2022]
Abstract
SH3 domains function as protein-protein interaction modules in assembly of signalling and endocytic protein complexes. Here we report investigations of the mechanism of regulation of the binding properties of the SH3 domains of intersectin (ITSN1) and Src kinase by alternative splicing. Comparative sequence analysis of ITSN1 and Src genes revealed the conservation of alternatively spliced microexons affecting the structure of the SH3 domains in vertebrates. We show that neuron-specific ITSN1 transcripts containing the microexon 20 that encodes five amino acid residues within the SH3A domain are expressed in zebrafish from the earliest stages of the development of the nervous system. Models of alternative isoforms of the ITSN1 SH3A domain revealed that the insertion encoded by the microexon is located at the beginning of the n-Src loop of this domain causing a shift of negatively charged amino acids towards the interaction interface. Mutational analysis confirmed the importance of translocation of these negatively charged amino acids for interaction with dynamin 1. We also identified a residue within the microexon-encoded insert in the SH3 domain of brain-specific variant of Src that abolishes interaction of the domain with dynamin 1. Thus microexons provide a mechanism for the control of tissue-specific interactions of ITSN1 and Src with their partners.
Collapse
Affiliation(s)
- Mykola Dergai
- Institute of Molecular Biology and Genetics, 150 Zabolotnogo Street, Kyiv 03680, Ukraine.
| | | | | | | | | | | | | |
Collapse
|
19
|
Amoutzias GD, Robertson DL, Bornberg-Bauer E. The evolution of protein interaction networks in regulatory proteins. Comp Funct Genomics 2010; 5:79-84. [PMID: 18629034 PMCID: PMC2447317 DOI: 10.1002/cfg.365] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2003] [Revised: 11/18/2003] [Accepted: 11/25/2003] [Indexed: 12/05/2022] Open
Abstract
Interactions between proteins are essential for intracellular communication. They
form complex networks which have become an important source for functional
analysis of proteins. Combining phylogenies with network analysis, we investigate
the evolutionary history of interaction networks from the bHLH, NR and bZIP
transcription-factor families. The bHLH and NR networks show a hub-like structure
with varying γ values. Mutation and gene duplication play an important role
in adding and removing interactions. We conclude that in several of the protein
families that we have studied, networks have primarily arisen by the development of
heterodimerizing transcription factors, from an ancestral gene which interacts with
any of the newly emerging proteins but also homodimerizes.
Collapse
Affiliation(s)
- Gregory D Amoutzias
- School of Biological Sciences, University of Manchester, 2.205 Stopford Building, Oxford Road, Manchester M13 9PT, UK
| | | | | |
Collapse
|
20
|
Brannetti B, Zanzoni A, Montecchi-Palazzi L, Cesareni G, Helmer-Citterich M. iSPOT: a web tool for the analysis and recognition of protein domain specificity. Comp Funct Genomics 2010; 2:314-8. [PMID: 18629248 PMCID: PMC2448410 DOI: 10.1002/cfg.104] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2001] [Accepted: 07/27/2001] [Indexed: 11/08/2022] Open
Abstract
Methods that aim at predicting interaction partners are very likely to play an important
role in the interpretation of genomic information. iSPOT (iSpecificity Prediction Of
Target) is a web tool (accessible at http://cbm.bio.uniroma2.it/iSPOT) developed for the
prediction of protein-protein interaction mediated by families of peptide recognition
modules. iSPOT accesses a database of position specific residue-residue interaction
frequencies for members of the SH3 and PDZ protein domain families. The software
utilises this database to provide a score for any potential domain peptide interaction. iSPOT: 1. evaluates the likelihood of the interaction between any of the peptides
contained in an input protein and a list of domains of the two different families; 2. searches
in the SWISS-PROT database for potential partners of a query domain; and 3. has access
to a repository of all the domain/target peptide interaction data.
Collapse
Affiliation(s)
- B Brannetti
- Department of Biology, University of Rome Tor Vergata, Rome, Italy
| | | | | | | | | |
Collapse
|
21
|
Stein A, Aloy P. Novel peptide-mediated interactions derived from high-resolution 3-dimensional structures. PLoS Comput Biol 2010; 6:e1000789. [PMID: 20502673 PMCID: PMC2873903 DOI: 10.1371/journal.pcbi.1000789] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2009] [Accepted: 04/15/2010] [Indexed: 11/18/2022] Open
Abstract
Many biological responses to intra- and extracellular stimuli are regulated through complex networks of transient protein interactions where a globular domain in one protein recognizes a linear peptide from another, creating a relatively small contact interface. These peptide stretches are often found in unstructured regions of proteins, and contain a consensus motif complementary to the interaction surface displayed by their binding partners. While most current methods for the de novo discovery of such motifs exploit their tendency to occur in disordered regions, our work here focuses on another observation: upon binding to their partner domain, motifs adopt a well-defined structure. Indeed, through the analysis of all peptide-mediated interactions of known high-resolution three-dimensional (3D) structure, we found that the structure of the peptide may be as characteristic as the consensus motif, and help identify target peptides even though they do not match the established patterns. Our analyses of the structural features of known motifs reveal that they tend to have a particular stretched and elongated structure, unlike most other peptides of the same length. Accordingly, we have implemented a strategy based on a Support Vector Machine that uses this features, along with other structure-encoded information about binding interfaces, to search the set of protein interactions of known 3D structure and to identify unnoticed peptide-mediated interactions among them. We have also derived consensus patterns for these interactions, whenever enough information was available, and compared our results with established linear motif patterns and their binding domains. Finally, to cross-validate our identification strategy, we scanned interactome networks from four model organisms with our newly derived patterns to see if any of them occurred more often than expected. Indeed, we found significant over-representations for 64 domain-motif interactions, 46 of which had not been described before, involving over 6,000 interactions in total for which we could suggest the molecular details determining the binding. Protein-protein interactions are paramount in any aspect of the cellular life. Some proteins form large macromolecular complexes that execute core functionalities of the cell, while others transmit information in signalling networks to co-ordinate these processes. The latter type, of more transient nature, often occurs through the recognition of a small linear sequence motif in one protein by a specialized globular domain in the other. These peptide stretches often contain a consensus pattern complementary to the interaction surface displayed by their binding partners, and adopt a well-defined structure upon binding. Information that is currently available only from high-resolution three-dimensional (3D) structures, and that can be as characteristic as the consensus motif itself. In this manuscript, we present a strategy to identify novel domain-motif interactions (DMIs) among the set of protein complexes of known 3D structures, which provides information on the consensus motif and binding domain and also allows ready identification of the key interacting residues. A detailed knowledge of the interface is critical to plan further functional studies and for the development of interfering elements, be it drug-like compounds or novel engineered binding proteins or peptides. The small interfaces typical for DMIs make them interesting candidates for all these applications.
Collapse
Affiliation(s)
- Amelie Stein
- Institute for Research in Biomedicine, Joint IRB-BSC Program in Computational Biology, Barcelona, Spain
| | - Patrick Aloy
- Institute for Research in Biomedicine, Joint IRB-BSC Program in Computational Biology, Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
- * E-mail:
| |
Collapse
|
22
|
Gherardini PF, Ausiello G, Russell RB, Helmer-Citterich M. Modular architecture of nucleotide-binding pockets. Nucleic Acids Res 2010; 38:3809-16. [PMID: 20185567 PMCID: PMC2887960 DOI: 10.1093/nar/gkq090] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Recently, modularity has emerged as a general attribute of complex biological systems. This is probably because modular systems lend themselves readily to optimization via random mutation followed by natural selection. Although they are not traditionally considered to evolve by this process, biological ligands are also modular, being composed of recurring chemical fragments, and moreover they exhibit similarities reminiscent of mutations (e.g. the few atoms differentiating adenine and guanine). Many ligands are also promiscuous in the sense that they bind to many different protein folds. Here, we investigated whether ligand chemical modularity is reflected in an underlying modularity of binding sites across unrelated proteins. We chose nucleotides as paradigmatic ligands, because they can be described as composed of well-defined fragments (nucleobase, ribose and phosphates) and are quite abundant both in nature and in protein structure databases. We found that nucleotide-binding sites do indeed show a modular organization and are composed of fragment-specific protein structural motifs, which parallel the modular structure of their ligands. Through an analysis of the distribution of these motifs in different proteins and in different folds, we discuss the evolutionary implications of these findings and argue that the structural features we observed can arise both as a result of divergence from a common ancestor or convergent evolution.
Collapse
Affiliation(s)
- Pier Federico Gherardini
- Centre for Molecular Bioinformatics, Department of Biology, University of Tor Vergata, Via della Ricerca Scientifica snc, 00133 Rome, Italy
| | | | | | | |
Collapse
|
23
|
Thomas J, Ramakrishnan N, Bailey-Kellogg C. Graphical models of protein-protein interaction specificity from correlated mutations and interaction data. Proteins 2009; 76:911-29. [DOI: 10.1002/prot.22398] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
24
|
Stein A, Pache RA, Bernadó P, Pons M, Aloy P. Dynamic interactions of proteins in complex networks: a more structured view. FEBS J 2009; 276:5390-405. [DOI: 10.1111/j.1742-4658.2009.07251.x] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
25
|
Van Durme J, Maurer-Stroh S, Gallardo R, Wilkinson H, Rousseau F, Schymkowitz J. Accurate prediction of DnaK-peptide binding via homology modelling and experimental data. PLoS Comput Biol 2009; 5:e1000475. [PMID: 19696878 PMCID: PMC2717214 DOI: 10.1371/journal.pcbi.1000475] [Citation(s) in RCA: 99] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2007] [Accepted: 07/17/2009] [Indexed: 11/28/2022] Open
Abstract
Molecular chaperones are essential elements of the protein quality control machinery that governs translocation and folding of nascent polypeptides, refolding and degradation of misfolded proteins, and activation of a wide range of client proteins. The prokaryotic heat-shock protein DnaK is the E. coli representative of the ubiquitous Hsp70 family, which specializes in the binding of exposed hydrophobic regions in unfolded polypeptides. Accurate prediction of DnaK binding sites in E. coli proteins is an essential prerequisite to understand the precise function of this chaperone and the properties of its substrate proteins. In order to map DnaK binding sites in protein sequences, we have developed an algorithm that combines sequence information from peptide binding experiments and structural parameters from homology modelling. We show that this combination significantly outperforms either single approach. The final predictor had a Matthews correlation coefficient (MCC) of 0.819 when assessed over the 144 tested peptide sequences to detect true positives and true negatives. To test the robustness of the learning set, we have conducted a simulated cross-validation, where we omit sequences from the learning sets and calculate the rate of repredicting them. This resulted in a surprisingly good MCC of 0.703. The algorithm was also able to perform equally well on a blind test set of binders and non-binders, of which there was no prior knowledge in the learning sets. The algorithm is freely available at http://limbo.vib.be. Molecular chaperones are essential elements of the protein quality control machinery that governs translocation and folding of nascent polypeptides, refolding and degradation of misfolded proteins, and activation of a wide range of client proteins. This variety of functions results from the existence of multiple chaperones with different structures. Chaperones bind to exposed regions of proteins to fulfil their function. The chaperone must hereby recognise a certain signal sequence on the substrate protein. The nature of the sequence that is exposed will determine the types of chaperones that can interact with it, and in the end will also determine the fate of the substrate protein: refolding, translocation, degradation or activation. Knowledge of the binding sequence determinants of molecular chaperones will shed more light on the mechanism of how each chaperone contributes to the cellular protein quality control system. In this study we have made an algorithm which accurately predicts binding sites for the well studied E. coli Hsp70 chaperone, DnaK, which is implicated in folding efficiency and prevention of aggregation. The ability to detect and design high-affinity DnaK binding sites enhances our understanding of chaperone-substrate recognition and opens great opportunities to enhance protein solubility using protein-DnaK binding motif fusions.
Collapse
Affiliation(s)
- Joost Van Durme
- VIB Switch Laboratory, Vrije Universiteit Brussel, Brussels, Belgium
| | | | - Rodrigo Gallardo
- VIB Switch Laboratory, Vrije Universiteit Brussel, Brussels, Belgium
| | - Hannah Wilkinson
- VIB Switch Laboratory, Vrije Universiteit Brussel, Brussels, Belgium
| | - Frederic Rousseau
- VIB Switch Laboratory, Vrije Universiteit Brussel, Brussels, Belgium
- * E-mail: (FR); (JS)
| | - Joost Schymkowitz
- VIB Switch Laboratory, Vrije Universiteit Brussel, Brussels, Belgium
- * E-mail: (FR); (JS)
| |
Collapse
|
26
|
Toward quantitative characterization of the binding profile between the human amphiphysin-1 SH3 domain and its peptide ligands. Amino Acids 2009; 38:1209-18. [DOI: 10.1007/s00726-009-0332-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2009] [Accepted: 07/22/2009] [Indexed: 10/20/2022]
|
27
|
McDonald CB, Seldeen KL, Deegan BJ, Farooq A. SH3 domains of Grb2 adaptor bind to PXpsiPXR motifs within the Sos1 nucleotide exchange factor in a discriminate manner. Biochemistry 2009; 48:4074-85. [PMID: 19323566 DOI: 10.1021/bi802291y] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Ubiquitously encountered in a wide variety of cellular processes, the Grb2-Sos1 interaction is mediated through the combinatorial binding of nSH3 and cSH3 domains of Grb2 to various sites containing PXpsiPXR motifs within Sos1. Here, using isothermal titration calorimetry, we demonstrate that while the nSH3 domain binds with affinities in the physiological range to all four sites containing PXpsiPXR motifs, designated S1, S2, S3, and S4, the cSH3 domain can only do so at the S1 site. Further scrutiny of these sites yields rationale for the recognition of various PXpsiPXR motifs by the SH3 domains in a discriminate manner. Unlike the PXpsiPXR motifs at S2, S3, and S4 sites, the PXpsiPXR motif at the S1 site is flanked at its C-terminus with two additional arginine residues that are absolutely required for high-affinity binding of the cSH3 domain. In striking contrast, these two additional arginine residues augment the binding of the nSH3 domain to the S1 site, but their role is not critical for the recognition of S2, S3, and S4 sites. Site-directed mutagenesis suggests that the two additional arginine residues flanking the PXpsiPXR motif at the S1 site contribute to free energy of binding via the formation of salt bridges with specific acidic residues in SH3 domains. Molecular modeling is employed to project these novel findings into the 3D structures of SH3 domains in complex with a peptide containing the PXpsiPXR motif and flanking arginine residues at the S1 site. Taken together, this study furthers our understanding of the assembly of a key signaling complex central to cellular machinery.
Collapse
Affiliation(s)
- Caleb B McDonald
- Department of Biochemistry & Molecular Biology and the UM/Sylvester Braman Family Breast Cancer Institute, Leonard Miller School of Medicine, University of Miami, Miami, Florida 33136, USA
| | | | | | | |
Collapse
|
28
|
Wunderlich Z, Mirny LA. Using genome-wide measurements for computational prediction of SH2-peptide interactions. Nucleic Acids Res 2009; 37:4629-41. [PMID: 19502496 PMCID: PMC2724268 DOI: 10.1093/nar/gkp394] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Peptide-recognition modules (PRMs) are used throughout biology to mediate protein–protein interactions, and many PRMs are members of large protein domain families. Recent genome-wide measurements describe networks of peptide–PRM interactions. In these networks, very similar PRMs recognize distinct sets of peptides, raising the question of how peptide-recognition specificity is achieved using similar protein domains. The analysis of individual protein complex structures often gives answers that are not easily applicable to other members of the same PRM family. Bioinformatics-based approaches, one the other hand, may be difficult to interpret physically. Here we integrate structural information with a large, quantitative data set of SH2 domain–peptide interactions to study the physical origin of domain–peptide specificity. We develop an energy model, inspired by protein folding, based on interactions between the amino-acid positions in the domain and peptide. We use this model to successfully predict which SH2 domains and peptides interact and uncover the positions in each that are important for specificity. The energy model is general enough that it can be applied to other members of the SH2 family or to new peptides, and the cross-validation results suggest that these energy calculations will be useful for predicting binding interactions. It can also be adapted to study other PRM families, predict optimal peptides for a given SH2 domain, or study other biological interactions, e.g. protein–DNA interactions.
Collapse
Affiliation(s)
- Zeba Wunderlich
- Biophysics Program, Harvard University, Cambridge, MA 02138, USA
| | | |
Collapse
|
29
|
Schillinger C, Boisguerin P, Krause G. Domain Interaction Footprint: a multi-classification approach to predict domain-peptide interactions. ACTA ACUST UNITED AC 2009; 25:1632-9. [PMID: 19376827 DOI: 10.1093/bioinformatics/btp264] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION The flow of information within cellular pathways largely relies on specific protein-protein interactions. Discovering such interactions that are mostly mediated by peptide recognition modules (PRM) is therefore a fundamental step towards unravelling the complexity of varying pathways. Since peptides can be recognized by more than one PRM and high-throughput experiments are both time consuming and expensive, it would be preferable to narrow down all potential peptide ligands for one specific PRM by a computational method. We at first present Domain Interaction Footprint (DIF) a new approach to predict binding peptides to PRMs merely based on the sequence of the peptides. Second, we show that our method is able to create a multi-classification model that assesses the binding specificity of a given peptide to all examined PRMs at once. RESULTS We first applied our approach to a previously investigated dataset of different SH3 domains and predicted their appropriate peptide ligands with an exceptionally high accuracy. This result outperforms all recent methods trained on the same dataset. Furthermore, we used our technique to build two multi-classification models (SH3 and PDZ domains) to predict the interaction preference between a peptide and every single domain in the corresponding domain family at once. Predicting the domain specificity most reliably, our proposed approach can be seen as a first step towards a complete multi-domain classification model comprised of all domains of one family. Such a comprehensive domain specificity model would benefit the quest for highly specific peptide ligands interacting solely with the domain of choice. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Christian Schillinger
- Leibniz Institute for Molecular Pharmacology, Robert-Roessle-Strasse 10, Berlin, FU-Berlin, Germany
| | | | | |
Collapse
|
30
|
Accurate prediction of peptide binding sites on protein surfaces. PLoS Comput Biol 2009; 5:e1000335. [PMID: 19325869 PMCID: PMC2653190 DOI: 10.1371/journal.pcbi.1000335] [Citation(s) in RCA: 120] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2008] [Accepted: 02/18/2009] [Indexed: 11/19/2022] Open
Abstract
Many important protein-protein interactions are mediated by the binding of a short peptide stretch in one protein to a large globular segment in another. Recent efforts have provided hundreds of examples of new peptides binding to proteins for which a three-dimensional structure is available (either known experimentally or readily modeled) but where no structure of the protein-peptide complex is known. To address this gap, we present an approach that can accurately predict peptide binding sites on protein surfaces. For peptides known to bind a particular protein, the method predicts binding sites with great accuracy, and the specificity of the approach means that it can also be used to predict whether or not a putative or predicted peptide partner will bind. We used known protein-peptide complexes to derive preferences, in the form of spatial position specific scoring matrices, which describe the binding-site environment in globular proteins for each type of amino acid in bound peptides. We then scan the surface of a putative binding protein for sites for each of the amino acids present in a peptide partner and search for combinations of high-scoring amino acid sites that satisfy constraints deduced from the peptide sequence. The method performed well in a benchmark and largely agreed with experimental data mapping binding sites for several recently discovered interactions mediated by peptides, including RG-rich proteins with SMN domains, Epstein-Barr virus LMP1 with TRADD domains, DBC1 with Sir2, and the Ago hook with Argonaute PIWI domain. The method, and associated statistics, is an excellent tool for predicting and studying binding sites for newly discovered peptides mediating critical events in biology.
Collapse
|
31
|
Identification of the variant Ala335Val of MED25 as responsible for CMT2B2: molecular data, functional studies of the SH3 recognition motif and correlation between wild-type MED25 and PMP22 RNA levels in CMT1A animal models. Neurogenetics 2009; 10:275-87. [PMID: 19290556 PMCID: PMC2847151 DOI: 10.1007/s10048-009-0183-3] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2008] [Accepted: 02/19/2009] [Indexed: 01/30/2023]
Abstract
Charcot-Marie-Tooth (CMT) disease is a clinically and genetically heterogeneous disorder. All mendelian patterns of inheritance have been described. We identified a homozygous p.A335V mutation in the MED25 gene in an extended Costa Rican family with autosomal recessively inherited Charcot-Marie-Tooth neuropathy linked to the CMT2B2 locus in chromosome 19q13.3. MED25, also known as ARC92 and ACID1, is a subunit of the human activator-recruited cofactor (ARC), a family of large transcriptional coactivator complexes related to the yeast Mediator. MED25 was identified by virtue of functional association with the activator domains of multiple cellular and viral transcriptional activators. Its exact physiological function in transcriptional regulation remains obscure. The CMT2B2-associated missense amino acid substitution p.A335V is located in a proline-rich region with high affinity for SH3 domains of the Abelson type. The mutation causes a decrease in binding specificity leading to the recognition of a broader range of SH3 domain proteins. Furthermore, Med25 is coordinately expressed with Pmp22 gene dosage and expression in transgenic mice and rats. These results suggest a potential role of this protein in the molecular etiology of CMT2B2 and suggest a potential, more general role of MED25 in gene dosage sensitive peripheral neuropathy pathogenesis.
Collapse
|
32
|
Hou T, Xu Z, Zhang W, McLaughlin WA, Case DA, Xu Y, Wang W. Characterization of domain-peptide interaction interface: a generic structure-based model to decipher the binding specificity of SH3 domains. Mol Cell Proteomics 2008; 8:639-49. [PMID: 19023120 DOI: 10.1074/mcp.m800450-mcp200] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Extensive efforts have been devoted to determining the binding specificity of Src homology 3 (SH3) domains usually in a case-by-case manner. A generic structure-based model is necessary to decipher the protein recognition code of the entire domain family. In this study, we have developed a general framework that combines molecular modeling and a machine learning algorithm to capture the energetic characteristics of the domain-peptide interactions and predict the binding specificity of the SH3 domain family. Our model is not trained for individual SH3 domains; rather it is a generic model for the entire domain family. Our model not only achieved satisfactory prediction accuracy but also provided structural insights into which residues are important for the binding specificity. The success of our framework on SH3 domains suggests that it is possible to establish a theoretical model to decipher the protein recognition code of any modular domain.
Collapse
Affiliation(s)
- Tingjun Hou
- Department of Chemistry and Biochemistry, University of California at San Diego, La Jolla, California 92093, USA
| | | | | | | | | | | | | |
Collapse
|
33
|
Apgar JR, Gutwin KN, Keating AE. Predicting helix orientation for coiled-coil dimers. Proteins 2008; 72:1048-65. [PMID: 18506779 DOI: 10.1002/prot.22118] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The alpha-helical coiled coil is a structurally simple protein oligomerization or interaction motif consisting of two or more alpha helices twisted into a supercoiled bundle. Coiled coils can differ in their stoichiometry, helix orientation, and axial alignment. Because of the near degeneracy of many of these variants, coiled coils pose a challenge to fold recognition methods for structure prediction. Whereas distinctions between some protein folds can be discriminated on the basis of hydrophobic/polar patterning or secondary structure propensities, the sequence differences that encode important details of coiled-coil structure can be subtle. This is emblematic of a larger problem in the field of protein structure and interaction prediction: that of establishing specificity between closely similar structures. We tested the behavior of different computational models on the problem of recognizing the correct orientation--parallel vs. antiparallel--of pairs of alpha helices that can form a dimeric coiled coil. For each of 131 examples of known structure, we constructed a large number of both parallel and antiparallel structural models and used these to assess the ability of five energy functions to recognize the correct fold. We also developed and tested three sequence-based approaches that make use of varying degrees of implicit structural information. The best structural methods performed similarly to the best sequence methods, correctly categorizing approximately 81% of dimers. Steric compatibility with the fold was important for some coiled coils we investigated. For many examples, the correct orientation was determined by smaller energy differences between parallel and antiparallel structures distributed over many residues and energy components. Prediction methods that used structure but incorporated varying approximations and assumptions showed quite different behaviors when used to investigate energetic contributions to orientation preference. Sequence based methods were sensitive to the choice of residue-pair interactions scored.
Collapse
Affiliation(s)
- James R Apgar
- MIT Department of Chemistry, Cambridge, Massachusetts 02139, USA
| | | | | |
Collapse
|
34
|
Identification and rational redesign of peptide ligands to CRIP1, a novel biomarker for cancers. PLoS Comput Biol 2008; 4:e1000138. [PMID: 18670594 PMCID: PMC2453235 DOI: 10.1371/journal.pcbi.1000138] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2008] [Accepted: 06/22/2008] [Indexed: 12/04/2022] Open
Abstract
Cysteine-rich intestinal protein 1 (CRIP1) has been identified as a novel marker for early detection of cancers. Here we report on the use of phage display in combination with molecular modeling to identify a high-affinity ligand for CRIP1. Panning experiments using a circularized C7C phage library yielded several consensus sequences with modest binding affinities to purified CRIP1. Two sequence motifs, A1 and B5, having the highest affinities for CRIP1, were chosen for further study. With peptide structure information and the NMR structure of CRIP1, the higher-affinity A1 peptide was computationally redesigned, yielding a novel peptide, A1M, whose affinity was predicted to be much improved. Synthesis of the peptide and saturation and competitive binding studies demonstrated approximately a 10–28-fold improvement in the affinity of A1M compared to that of either A1 or B5 peptide. These techniques have broad application to the design of novel ligand peptides. Breast cancer is one of the most frequently diagnosed malignancies in American females and is the second leading cause of cancer deaths in women. Several improvements in diagnostic protocols have enhanced our ability for earlier detection of breast cancer, resulting in improvement of therapeutic outcome and an increased survival rate for breast cancer patients. However, current early screening techniques are neither comprehensive nor infallible. Imaging techniques that improve breast cancer detection, localization, and evaluation of therapy are essential in combating the disease. Cysteine-rich intestinal protein 1 (CRIP1) has been identified as a novel marker for early detection of breast cancers. Here, we report the use of phage display and computational molecular modeling to identify a high-affinity ligand for CRIP1. Phage display panning experiments initially identified consensus peptide sequences with modest binding affinity to purified CRIP1. Using ab initio modeling of binding peptide structures, computational docking, and recently developed free energy estimation protocols, we redesigned the peptides to increase their affinity for CRIP1. Synthesis of the redesigned peptide and binding studies demonstrated approximately a 10–28-fold improvement in the binding affinity. The combination of computational and experimental techniques in this study demonstrates a potentially powerful tool in modulating protein–protein interactions.
Collapse
|
35
|
Petsalaki E, Russell RB. Peptide-mediated interactions in biological systems: new discoveries and applications. Curr Opin Biotechnol 2008; 19:344-50. [PMID: 18602004 DOI: 10.1016/j.copbio.2008.06.004] [Citation(s) in RCA: 188] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2008] [Revised: 06/04/2008] [Accepted: 06/06/2008] [Indexed: 12/14/2022]
Abstract
Peptide-mediated interactions play very important roles in cellular processes. Recent years have seen much activity in the discovery of new bioactive peptides, and interactions mediated by protein-peptide binding events. At the same time, computational approaches continue to be developed that allow protein-peptide interactions to be discovered with great accuracy. There are also a growing number of chemicals that can target these interactions with various applications in disease. Both new discoveries and predictions suggest that these protein-peptide interactions play greater roles in cellular processes than previously thought. We propose that projects to uncover the protein-peptide repertoire used in Nature in a systematic way will have numerous applications in molecular biology and medicine.
Collapse
|
36
|
Kiel C, Beltrao P, Serrano L. Analyzing Protein Interaction Networks Using Structural Information. Annu Rev Biochem 2008; 77:415-41. [DOI: 10.1146/annurev.biochem.77.062706.133317] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Christina Kiel
- EMBL-CRG Systems Biology Unit, Center de Regulacio Genomica, Barcelona 08003, Spain; ,
| | - Pedro Beltrao
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany;
| | - Luis Serrano
- EMBL-CRG Systems Biology Unit, Center de Regulacio Genomica, Barcelona 08003, Spain; ,
| |
Collapse
|
37
|
Benz PM, Blume C, Moebius J, Oschatz C, Schuh K, Sickmann A, Walter U, Feller SM, Renné T. Cytoskeleton assembly at endothelial cell-cell contacts is regulated by alphaII-spectrin-VASP complexes. ACTA ACUST UNITED AC 2008; 180:205-19. [PMID: 18195108 PMCID: PMC2213610 DOI: 10.1083/jcb.200709181] [Citation(s) in RCA: 97] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Directed cortical actin assembly is the driving force for intercellular adhesion. Regulated by phosphorylation, vasodilator-stimulated phosphoprotein (VASP) participates in actin fiber formation. We screened for endothelial proteins, which bind to VASP, dependent on its phosphorylation status. Differential proteomics identified αII-spectrin as such a VASP-interacting protein. αII-Spectrin binds to the VASP triple GP5-motif via its SH3 domain. cAMP-dependent protein kinase–mediated VASP phosphorylation at Ser157 inhibits αII-spectrin–VASP binding. VASP is dephosphorylated upon formation of cell–cell contacts and in confluent, but not in sparse cells, αII-spectrin colocalizes with nonphosphorylated VASP at cell–cell junctions. Ectopic expression of the αII-spectrin SH3 domain at cell–cell contacts translocates VASP, initiates cortical actin cytoskeleton formation, stabilizes cell–cell contacts, and decreases endothelial permeability. Conversely, the permeability of VASP-deficient endothelial cells (ECs) and microvessels of VASP-null mice increases. Reconstitution of VASP-deficient ECs rescues barrier function, whereas αII-spectrin binding-deficient VASP mutants fail to restore elevated permeability. We propose that αII-spectrin–VASP complexes regulate cortical actin cytoskeleton assembly with implications for vascular permeability.
Collapse
Affiliation(s)
- Peter M Benz
- Institute of Clinical Biochemistry and Pathobiochemistry, University of Würzburg, D-97080 Würzburg, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Hou T, Zhang W, Case DA, Wang W. Characterization of Domain–Peptide Interaction Interface: A Case Study on the Amphiphysin-1 SH3 Domain. J Mol Biol 2008; 376:1201-14. [DOI: 10.1016/j.jmb.2007.12.054] [Citation(s) in RCA: 176] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2007] [Revised: 12/14/2007] [Accepted: 12/20/2007] [Indexed: 11/25/2022]
|
39
|
Ferraro E, Peluso D, Via A, Ausiello G, Helmer-Citterich M. SH3-Hunter: discovery of SH3 domain interaction sites in proteins. Nucleic Acids Res 2007; 35:W451-4. [PMID: 17485474 PMCID: PMC1933191 DOI: 10.1093/nar/gkm296] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
SH3-Hunter (http://cbm.bio.uniroma2.it/SH3-Hunter/) is a web server for the recognition of putative SH3 domain interaction sites on protein sequences. Given an input query consisting of one or more protein sequences, the server identifies peptides containing poly-proline binding motifs and associates them to a list of SH3 domains, in order to compose peptide-domain pairs. The server can accept a list of peptides and allows users to upload an input file in a proper format. An accurate selection of SH3 domains is available and users can also submit their own SH3 domain sequence. SH3-Hunter evaluates which peptide-domain pair represents a possible interaction pair and produces as output a list of significant interaction sites for each query protein. Each proposed interaction site is associated to a propensity score and sensitivity and precision levels for the prediction. The server prediction capability is based on a neural network model integrating high-throughput pep-spot data with structural information extracted from known SH3-peptide complexes.
Collapse
Affiliation(s)
- Enrico Ferraro
- Centre for Molecular Bioinformatics, Department of Biology, University of Rome Tor Vergata, Via della Ricerca Scientifica, 00133, Rome, Italy.
| | | | | | | | | |
Collapse
|
40
|
Liu S, Liu S, Zhu X, Liang H, Cao A, Chang Z, Lai L. Nonnatural protein-protein interaction-pair design by key residues grafting. Proc Natl Acad Sci U S A 2007; 104:5330-5. [PMID: 17372228 PMCID: PMC1838465 DOI: 10.1073/pnas.0606198104] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Protein-protein interface design is one of the most exciting fields in protein science; however, designing nonnatural protein-protein interaction pairs remains difficult. In this article we report a de novo design of a nonnatural protein-protein interaction pair by scanning the Protein Data Bank for suitable scaffold proteins that can be used for grafting key interaction residues and can form stable complexes with the target protein after additional mutations. Using our design algorithm, an unrelated protein, rat PLCdelta(1)-PH (pleckstrin homology domain of phospholipase C-delta1), was successfully designed to bind the human erythropoietin receptor (EPOR) after grafting the key interaction residues of human erythropoietin binding to EPOR. The designed mutants of rat PLCdelta(1)-PH were expressed and purified to test their binding affinities with EPOR. A designed triple mutation of PLCdelta(1)-PH (ERPH1) was found to bind EPOR with high affinity (K(D) of 24 nM and an IC(50) of 5.7 microM) both in vitro and in a cell-based assay, respectively, although the WT PLCdelta(1)-PH did not show any detectable binding under the assay conditions. The in vitro binding affinities of the PLCdelta(1)-PH mutants correlate qualitatively to the computational binding affinities, validating the design and the protein-protein interaction model. The successful practice of finding a proper protein scaffold and making it bind with EPOR demonstrates a prospective application in protein engineering targeting protein-protein interfaces.
Collapse
Affiliation(s)
- Sen Liu
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Center for Theoretical Biology, Peking University, Beijing 100871, China; and
| | - Shiyong Liu
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Center for Theoretical Biology, Peking University, Beijing 100871, China; and
| | - Xiaolei Zhu
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Huanhuan Liang
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Aoneng Cao
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Zhijie Chang
- Department of Biological Sciences and Biotechnology, School of Medicine, Tsinghua University, Beijing 100084, China
| | - Luhua Lai
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Center for Theoretical Biology, Peking University, Beijing 100871, China; and
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
41
|
Schmidt H, Hoffmann S, Tran T, Stoldt M, Stangler T, Wiesehan K, Willbold D. Solution structure of a Hck SH3 domain ligand complex reveals novel interaction modes. J Mol Biol 2006; 365:1517-32. [PMID: 17141806 DOI: 10.1016/j.jmb.2006.11.013] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2006] [Revised: 10/24/2006] [Accepted: 11/03/2006] [Indexed: 12/01/2022]
Abstract
We studied the interaction of hematopoietic cell kinase SH3 domain (HckSH3) with an artificial 12-residue proline-rich peptide PD1 (HSKYPLPPLPSL) identified as high affinity ligand (K(D)=0.2 muM). PD1 shows an unusual ligand sequence for SH3 binding in type I orientation because it lacks the typical basic anchor residue at position P(-3), but instead has a tyrosine residue at this position. A basic lysine residue, however, is present at position P(-4). The solution structure of the HckSH3:PD1 complex, which is the first HckSH3 complex structure available, clearly reveals that the P(-3) tyrosine residue of PD1 does not take the position of the typical anchor residue but rather forms additional van der Waals interactions with the HckSH3 RT loop. Instead, lysine at position P(-4) of PD1 substitutes the function of the P(-3) anchor residue. This finding expands the well known ligand consensus sequence +xxPpxP by +xxxPpxP. Thus, software tools like iSPOT fail to identify PD1 as a high-affinity HckSH3 ligand so far. In addition, a short antiparallel beta-sheet in the RT loop of HckSH3 is observed upon PD1 binding. The structure of the HckSH3:PD1 complex reveals novel features of SH3 ligand binding and yields new insights into the structural basics of SH3-ligand interactions. Consequences for computational prediction tools adressing SH3-ligand interactions as well as the biological relevance of our findings are discussed.
Collapse
Affiliation(s)
- Holger Schmidt
- Forschungszentrum Jülich GmbH, INB, Biomolecular NMR, 52425 Jülich, Germany
| | | | | | | | | | | | | |
Collapse
|
42
|
Joachimiak LA, Kortemme T, Stoddard BL, Baker D. Computational Design of a New Hydrogen Bond Network and at Least a 300-fold Specificity Switch at a Protein−Protein Interface. J Mol Biol 2006; 361:195-208. [PMID: 16831445 DOI: 10.1016/j.jmb.2006.05.022] [Citation(s) in RCA: 120] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2006] [Revised: 05/06/2006] [Accepted: 05/10/2006] [Indexed: 11/17/2022]
Abstract
The redesign of protein-protein interactions is a stringent test of our understanding of molecular recognition and specificity. Previously we engineered a modest specificity switch into the colicin E7 DNase-Im7 immunity protein complex by identifying mutations that are disruptive in the native complex, but can be compensated by mutations on the interacting partner. Here we extend the approach by systematically sampling alternate rigid body orientations to optimize the interactions in a binding mode specific manner. Using this protocol we designed a de novo hydrogen bond network at the DNase-immunity protein interface and confirmed the design with X-ray crystallographic analysis. Subsequent design of the second shell of interactions guided by insights from the crystal structure on tightly bound water molecules, conformational strain, and packing defects yielded new binding partners that exhibited specificities of at least 300-fold between the cognate and the non-cognate complexes. This multi-step approach should be applicable to the design of polar protein-protein interactions and contribute to the re-engineering of regulatory networks mediated by protein-protein interactions.
Collapse
Affiliation(s)
- Lukasz A Joachimiak
- Howard Hughes Medical Institute & Department of Biochemistry, University of Washington, Seattle, 98195-7350, USA
| | | | | | | |
Collapse
|
43
|
Ferraro E, Via A, Ausiello G, Helmer-Citterich M. A novel structure-based encoding for machine-learning applied to the inference of SH3 domain specificity. ACTA ACUST UNITED AC 2006; 22:2333-9. [PMID: 16870929 DOI: 10.1093/bioinformatics/btl403] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Unravelling the rules underlying protein-protein and protein-ligand interactions is a crucial step in understanding cell machinery. Peptide recognition modules (PRMs) are globular protein domains which focus their binding targets on short protein sequences and play a key role in the frame of protein-protein interactions. High-throughput techniques permit the whole proteome scanning of each domain, but they are characterized by a high incidence of false positives. In this context, there is a pressing need for the development of in silico experiments to validate experimental results and of computational tools for the inference of domain-peptide interactions. RESULTS We focused on the SH3 domain family and developed a machine-learning approach for inferring interaction specificity. SH3 domains are well-studied PRMs which typically bind proline-rich short sequences characterized by the PxxP consensus. The binding information is known to be held in the conformation of the domain surface and in the short sequence of the peptide. Our method relies on interaction data from high-throughput techniques and benefits from the integration of sequence and structure data of the interacting partners. Here, we propose a novel encoding technique aimed at representing binding information on the basis of the domain-peptide contact residues in complexes of known structure. Remarkably, the new encoding requires few variables to represent an interaction, thus avoiding the 'curse of dimension'. Our results display an accuracy >90% in detecting new binders of known SH3 domains, thus outperforming neural models on standard binary encodings, profile methods and recent statistical predictors. The method, moreover, shows a generalization capability, inferring specificity of unknown SH3 domains displaying some degree of similarity with the known data.
Collapse
Affiliation(s)
- E Ferraro
- Centre of Molecular Bioinformatics, Department of Biology, University of Tor Vergata Rome, Italy.
| | | | | | | |
Collapse
|
44
|
Kärkkäinen S, Hiipakka M, Wang JH, Kleino I, Vähä-Jaakkola M, Renkema GH, Liss M, Wagner R, Saksela K. Identification of preferred protein interactions by phage-display of the human Src homology-3 proteome. EMBO Rep 2006; 7:186-91. [PMID: 16374509 PMCID: PMC1369250 DOI: 10.1038/sj.embor.7400596] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2005] [Revised: 11/10/2005] [Accepted: 11/14/2005] [Indexed: 01/24/2023] Open
Abstract
We have determined the human genome to contain 296 different Src homology-3 (SH3) domains and cloned them into a phage-display vector. This provided a powerful and unbiased system for simultaneous assaying of the complete human SH3 proteome for the strongest binding to target proteins of interest, without the limitations posed by short linear peptide ligands or confounding variables of more indirect methods for protein interaction screening. Studies involving three ligand proteins, human immunodeficiency virus-1 Nef, p21-activated kinase (PAK)2 and ADAM15, showed previously reported as well as novel SH3 partners with nanomolar affinities specific for them. This argues that SH3 domains may have a more dominant role in directing cellular protein interactions than has been assumed. Besides showing potentially important new SH3-directed interactions, these studies also led to the discovery of novel signalling proteins, such as the PAK2-binding adaptor protein POSH2 and the ADAM15-binding sorting nexin family member SNX30.
Collapse
Affiliation(s)
- Satu Kärkkäinen
- Institute of Medical Technology, University of Tampere and Tampere University Hospital, Biokatu 8, Tampere 33014, Finland
| | - Marita Hiipakka
- Institute of Medical Technology, University of Tampere and Tampere University Hospital, Biokatu 8, Tampere 33014, Finland
| | - Jing-Huan Wang
- Institute of Medical Technology, University of Tampere and Tampere University Hospital, Biokatu 8, Tampere 33014, Finland
| | - Iivari Kleino
- Institute of Medical Technology, University of Tampere and Tampere University Hospital, Biokatu 8, Tampere 33014, Finland
| | - Marika Vähä-Jaakkola
- Institute of Medical Technology, University of Tampere and Tampere University Hospital, Biokatu 8, Tampere 33014, Finland
| | - G Herma Renkema
- Institute of Medical Technology, University of Tampere and Tampere University Hospital, Biokatu 8, Tampere 33014, Finland
| | - Michael Liss
- Geneart GmbH, Josef-Engert-Strasse 9, Regensburg 93053, Germany
| | - Ralf Wagner
- Geneart GmbH, Josef-Engert-Strasse 9, Regensburg 93053, Germany
- Institute of Medical Microbiology & Hygiene, University of Regensburg, Franz-Josef-Strauß, Allee 11, Regensburg 93053, Germany
| | - Kalle Saksela
- Institute of Medical Technology, University of Tampere and Tampere University Hospital, Biokatu 8, Tampere 33014, Finland
- Department of Virology, Haartman Institute, University of Helsinki, POB21, Haartmaninkatu 3, Helsinki 00014, Finland
- Department of Virology, Helsinki University Central Hospital, 00029, Finland
- Tel: +358 9 191 26770; Fax: +358 9 191 26491; E-mail:
| |
Collapse
|
45
|
Zhang L, Shao C, Zheng D, Gao Y. An integrated machine learning system to computationally screen protein databases for protein binding peptide ligands. Mol Cell Proteomics 2006; 5:1224-32. [PMID: 16574641 DOI: 10.1074/mcp.m500346-mcp200] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
A fairly large set of protein interactions is mediated by families of peptide binding domains, such as Src homology 2 (SH2), SH3, PDZ, major histocompatibility complex, etc. To identify their ligands by experimental screening is not only labor-intensive but almost futile in screening low abundance species due to the suppression by high abundance species. An ideal way of studying protein-protein interactions is to use high throughput computational approaches to screen protein sequence databases to direct the validating experiments toward the most promising peptides. Predictors with only good cross-validation were not good enough to screen protein databases. In the current study we built integrated machine learning systems using three novel coding methods and screened the Swiss-Prot and GenBank protein databases for potential ligands of 10 SH3 and three PDZ domains. A large fraction of predictions has already been experimentally confirmed by other independent research groups, indicating a satisfying generalization capability for future applications in identifying protein interactions.
Collapse
Affiliation(s)
- Ling Zhang
- Proteomics Research Center, National Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences/Peking Union Medical College, 5 Dong Dan San Tiao, 100005 Beijing, China
| | | | | | | |
Collapse
|
46
|
Grigoryan G, Keating AE. Structure-based Prediction of bZIP Partnering Specificity. J Mol Biol 2006; 355:1125-42. [PMID: 16359704 DOI: 10.1016/j.jmb.2005.11.036] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2005] [Revised: 11/10/2005] [Accepted: 11/11/2005] [Indexed: 10/25/2022]
Abstract
Predicting protein interaction specificity from sequence is an important goal in computational biology. We present a model for predicting the interaction preferences of coiled-coil peptides derived from bZIP transcription factors that performs very well when tested against experimental protein microarray data. We used only sequence information to build atomic-resolution structures for 1711 dimeric complexes, and evaluated these with a variety of functions based on physics, learned empirical weights or experimental coupling energies. A purely physical model, similar to those used for protein design studies, gave reasonable performance. The results were improved significantly when helix propensities were used in place of a structurally explicit model to represent the unfolded reference state. Further improvement resulted upon accounting for residue-residue interactions in competing states in a generic way. Purely physical structure-based methods had difficulty capturing core interactions accurately, especially those involving polar residues such as asparagine. When these terms were replaced with weights from a machine-learning approach, the resulting model was able to correctly order the stabilities of over 6000 pairs of complexes with greater than 90% accuracy. The final model is physically interpretable, and suggests specific pairs of residues that are important for bZIP interaction specificity. Our results illustrate the power and potential of structural modeling as a method for predicting protein interactions and highlight obstacles that must be overcome to reach quantitative accuracy using a de novo approach. Our method shows unprecedented performance in predicting protein-protein interaction specificity accurately using structural modeling and suggests that predicting coiled-coil interactions generally may be within reach.
Collapse
|
47
|
Hou T, Chen K, McLaughlin WA, Lu B, Wang W. Computational analysis and prediction of the binding motif and protein interacting partners of the Abl SH3 domain. PLoS Comput Biol 2006; 2:e1. [PMID: 16446784 PMCID: PMC1356089 DOI: 10.1371/journal.pcbi.0020001] [Citation(s) in RCA: 131] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2005] [Accepted: 12/05/2005] [Indexed: 11/18/2022] Open
Abstract
Protein-protein interactions, particularly weak and transient ones, are often mediated by peptide recognition domains, such as Src Homology 2 and 3 (SH2 and SH3) domains, which bind to specific sequence and structural motifs. It is important but challenging to determine the binding specificity of these domains accurately and to predict their physiological interacting partners. In this study, the interactions between 35 peptide ligands (15 binders and 20 non-binders) and the Abl SH3 domain were analyzed using molecular dynamics simulation and the Molecular Mechanics/Poisson-Boltzmann Solvent Area method. The calculated binding free energies correlated well with the rank order of the binding peptides and clearly distinguished binders from non-binders. Free energy component analysis revealed that the van der Waals interactions dictate the binding strength of peptides, whereas the binding specificity is determined by the electrostatic interaction and the polar contribution of desolvation. The binding motif of the Abl SH3 domain was then determined by a virtual mutagenesis method, which mutates the residue at each position of the template peptide relative to all other 19 amino acids and calculates the binding free energy difference between the template and the mutated peptides using the Molecular Mechanics/Poisson-Boltzmann Solvent Area method. A single position mutation free energy profile was thus established and used as a scoring matrix to search peptides recognized by the Abl SH3 domain in the human genome. Our approach successfully picked ten out of 13 experimentally determined binding partners of the Abl SH3 domain among the top 600 candidates from the 218,540 decapeptides with the PXXP motif in the SWISS-PROT database. We expect that this physical-principle based method can be applied to other protein domains as well. One of the central questions of molecular biology is to understand how signals are transduced in the cell. Intracellular signal transduction is mainly achieved through cascades of protein-protein interactions, which are often mediated by peptide-binding modular domains, such as Src Homology 2 and 3 (SH2 and SH3). Each family of these domains binds to peptides with specific sequence and structural characteristics. To reconstruct the protein-protein interaction networks mediated by modular domains, one must identify the peptide motifs recognized by these domains and understand the mechanism of binding specificity. These questions are challenging because the domain-peptide interactions are usually weak and transient. Here, the authors took a physical-principles approach to address these difficult questions for the SH3 domain of human protein Abl, which binds to peptides containing the PXXP motif (where P is proline and X is any amino acid). They generated a position-specific scoring matrix to represent the binding motif of the Abl SH3 domain. Analysis on the binding free energy components suggested insights into how the binding specificity is achieved. Most known protein interacting partners of the Abl SH3 domain were correctly identified using the position-specific scoring matrix, and other potential interacting partners were also suggested.
Collapse
Affiliation(s)
- Tingjun Hou
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California, United States of America
- Center for Theoretical Biological Physics, University of California San Diego, La Jolla, California, United States of America
| | - Ken Chen
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California, United States of America
- Center for Theoretical Biological Physics, University of California San Diego, La Jolla, California, United States of America
| | - William A McLaughlin
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California, United States of America
- Center for Theoretical Biological Physics, University of California San Diego, La Jolla, California, United States of America
| | - Benzhuo Lu
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California, United States of America
- Center for Theoretical Biological Physics, University of California San Diego, La Jolla, California, United States of America
| | - Wei Wang
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California, United States of America
- Center for Theoretical Biological Physics, University of California San Diego, La Jolla, California, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
48
|
Tran T, Hoffmann S, Wiesehan K, Jonas E, Luge C, Aladag A, Willbold D. Insights into human Lck SH3 domain binding specificity: different binding modes of artificial and native ligands. Biochemistry 2006; 44:15042-52. [PMID: 16274251 DOI: 10.1021/bi051403k] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We analyzed the ligand binding specificity of the lymphocyte specific kinase (Lck) SH3 domain. We identified artificial Lck SH3 ligands using phage display. In addition, we analyzed Lck SH3 binding sites within known natural Lck SH3 binding proteins using an Lck specific binding assay on membrane-immobilized synthetic peptides. On one hand, from the phage-selected peptides, representing mostly special class I' ligands, a well-defined consensus sequence was obtained. Interestingly, a histidine outside the central polyproline motif contributes significantly to Lck SH3 binding affinity and specificity. On the other hand, we confirmed previously mapped Lck SH3 binding sites in ADAM15, HS1, SLP76, and NS5A, and identified putative Lck SH3 binding sites of Sam68, FasL, c-Cbl, and Cbl-b. Without exception, the comparatively diverse Lck SH3 binding sites of all analyzed natural Lck SH3 binding proteins emerged as class II proteins. Possible explanations for the observed variations between artificial and native ligands-which are not due to significant K(D) value differences as shown by calculating Lck SH3 affinities of artificial peptide PD1-Y(-3)R as well as for peptides comprising putative Lck SH3 binding sites of NS5A, Sos, and Sam68-are discussed. Our data suggest that phage display, a popular tool for determining SH3 binding specificity, must-at least in the case of Lck-not irrevocably mirror physiologically relevant protein-ligand interactions.
Collapse
Affiliation(s)
- Tuyen Tran
- Institut für Physikalische Biologie, Heinrich-Heine-Universität, 40225 Düsseldorf, Germany
| | | | | | | | | | | | | |
Collapse
|
49
|
Kobe B, Kampmann T, Forwood JK, Listwan P, Brinkworth RI. Substrate specificity of protein kinases and computational prediction of substrates. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2005; 1754:200-9. [PMID: 16172032 DOI: 10.1016/j.bbapap.2005.07.036] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2005] [Revised: 07/13/2005] [Accepted: 07/14/2005] [Indexed: 10/25/2022]
Abstract
To ensure signalling fidelity, kinases must act only on a defined subset of cellular targets. Appreciating the basis for this substrate specificity is essential for understanding the role of an individual protein kinase in a particular cellular process. The specificity in the cell is determined by a combination of "peptide specificity" of the kinase (the molecular recognition of the sequence surrounding the phosphorylation site), substrate recruitment and phosphatase activity. Peptide specificity plays a crucial role and depends on the complementarity between the kinase and the substrate and therefore on their three-dimensional structures. Methods for experimental identification of kinase substrates and characterization of specificity are expensive and laborious, therefore, computational approaches are being developed to reduce the amount of experimental work required in substrate identification. We discuss the structural basis of substrate specificity of protein kinases and review the experimental and computational methods used to obtain specificity information.
Collapse
Affiliation(s)
- Bostjan Kobe
- School of Molecular and Microbial Sciences, University of Queensland, Brisbane, Australia.
| | | | | | | | | |
Collapse
|
50
|
Ferraro E, Via A, Ausiello G, Helmer-Citterich M. A neural strategy for the inference of SH3 domain-peptide interaction specificity. BMC Bioinformatics 2005; 6 Suppl 4:S13. [PMID: 16351739 PMCID: PMC1866395 DOI: 10.1186/1471-2105-6-s4-s13] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Background The SH3 domain family is one of the most representative and widely studied cases of so-called Peptide Recognition Modules (PRM). The polyproline II motif PxxP that generally characterizes its ligands does not reflect the complex interaction spectrum of the over 1500 different SH3 domains, and the requirement of a more refined knowledge of their specificity implies the setting up of appropriate experimental and theoretical strategies. Due to the limitations of the current technology for peptide synthesis, several experimental high-throughput approaches have been devised to elucidate protein-protein interaction mechanisms. Such approaches can rely on and take advantage of computational techniques, such as regular expressions or position specific scoring matrices (PSSMs) to pre-process entire proteomes in the search for putative SH3 targets. In this regard, a reliable inference methodology to be used for reducing the sequence space of putative binding peptides represents a valuable support for molecular and cellular biologists. Results Using as benchmark the peptide sequences obtained from in vitro binding experiments, we set up a neural network model that performs better than PSSM in the detection of SH3 domain interactors. In particular our model is more precise in its predictions, even if its performance can vary among different SH3 domains and is strongly dependent on the number of binding peptides in the benchmark. Conclusion We show that a neural network can be more effective than standard methods in SH3 domain specificity detection. Neural classifiers identify general SH3 domain binders and domain-specific interactors from a PxxP peptide population, provided that there are a sufficient proportion of true positives in the training sets. This capability can also improve peptide selection for library definition in array experiments. Further advances can be achieved, including properly encoded domain sequences and structural information as input for a global neural network.
Collapse
Affiliation(s)
- Enrico Ferraro
- Centre for Molecular Bioinformatics, Department of Biology, University of Tor Vergata, Rome, Italy
| | - Allegra Via
- Centre for Molecular Bioinformatics, Department of Biology, University of Tor Vergata, Rome, Italy
| | - Gabriele Ausiello
- Centre for Molecular Bioinformatics, Department of Biology, University of Tor Vergata, Rome, Italy
| | - Manuela Helmer-Citterich
- Centre for Molecular Bioinformatics, Department of Biology, University of Tor Vergata, Rome, Italy
| |
Collapse
|