1
|
Cheng H, Wang GG, Chen L, Wang R. A dual-population multi-objective evolutionary algorithm driven by generative adversarial networks for benchmarking and protein-peptide docking. Comput Biol Med 2024; 168:107727. [PMID: 38029532 DOI: 10.1016/j.compbiomed.2023.107727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 09/28/2023] [Accepted: 11/15/2023] [Indexed: 12/01/2023]
Abstract
Multi-objective optimization problems (MOPs) are characterized as optimization problems in which multiple conflicting objective functions are optimized simultaneously. To solve MOPs, some algorithms used machine learning models to drive the evolutionary algorithms, leading to the design of a variety of model-based evolutionary algorithms. However, model collapse occurs during the generation of candidate solutions, which results in local optima and poor diversity in model-based evolutionary algorithms. To address this problem, we propose a dual-population multi-objective evolutionary algorithm driven by Wasserstein generative adversarial network with gradient penalty (DGMOEA), where the dual-populations coordinate and cooperate to generate high-quality solutions, thus improving the performance of the evolutionary algorithm. We compare the proposed algorithm with the 7 state-of-the-art algorithms on 20 multi-objective benchmark functions. Experimental results indicate that DGMOEA achieves significant results in solving MOPs, where the metrics IGD and HV outperform the other comparative algorithms on 15 and 18 out of 20 benchmarks, respectively. Our algorithm is evaluated on the LEADS-PEP dataset containing 53 protein-peptide complexes, and the experimental results on solving the protein-peptide docking problem indicated that DGMOEA can effectively reduce the RMSD between the generated and the original peptide's 3D poses and achieve more competitive results.
Collapse
Affiliation(s)
- Honglei Cheng
- School of Computer Science and Technology, Ocean University of China, Qingdao, China
| | - Gai-Ge Wang
- School of Computer Science and Technology, Ocean University of China, Qingdao, China.
| | - Liyan Chen
- Institute of Big Data and Information Technology, Wenzhou University, Wenzhou, China
| | - Rui Wang
- College of Systems Engineering, National University of Defense Technology, Changsha, China; Xiangjiang Laboratory, Changsha, China
| |
Collapse
|
2
|
Kang JE, Jun JH, Kwon JH, Lee JH, Hwang K, Kim S, Jeong N. Arabidopsis Transcription Regulatory Factor Domain/Domain Interaction Analysis Tool-Liquid/Liquid Phase Separation, Oligomerization, GO Analysis: A Toolkit for Interaction Data-Based Domain Analysis. Genes (Basel) 2023; 14:1476. [PMID: 37510380 PMCID: PMC10379056 DOI: 10.3390/genes14071476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 07/04/2023] [Accepted: 07/14/2023] [Indexed: 07/30/2023] Open
Abstract
Although a large number of databases are available for regulatory elements, a bottleneck has been created by the lack of bioinformatics tools to predict the interaction modes of regulatory elements. To reduce this gap, we developed the Arabidopsis Transcription Regulatory Factor Domain/Domain Interaction Analysis Tool-liquid/liquid phase separation (LLPS), oligomerization, GO analysis (ART FOUNDATION-LOG), a useful toolkit for protein-nucleic acid interaction (PNI) and protein-protein interaction (PPI) analysis based on domain-domain interactions (DDIs). LLPS, protein oligomerization, the structural properties of protein domains, and protein modifications are major components in the orchestration of the spatiotemporal dynamics of PPIs and PNIs. Our goal is to integrate PPI/PNI information into the development of a prediction model for identifying important genetic variants in peaches. Our program unified interdatabase relational keys based on protein domains to facilitate inference from the model species. A key advantage of this program lies in the integrated information of related features, such as protein oligomerization, LOG analysis, structural characterizations of domains (e.g., domain linkers, intrinsically disordered regions, DDIs, domain-motif (peptide) interactions, beta sheets, and transmembrane helices), and post-translational modification. We provided simple tests to demonstrate how to use this program, which can be applied to other eukaryotic organisms.
Collapse
Affiliation(s)
- Jee Eun Kang
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Ji Hae Jun
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Jung Hyun Kwon
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Ju-Hyun Lee
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Kidong Hwang
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Sungjong Kim
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Namhee Jeong
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| |
Collapse
|
3
|
Monti A, Vitagliano L, Caporale A, Ruvo M, Doti N. Targeting Protein-Protein Interfaces with Peptides: The Contribution of Chemical Combinatorial Peptide Library Approaches. Int J Mol Sci 2023; 24:7842. [PMID: 37175549 PMCID: PMC10178479 DOI: 10.3390/ijms24097842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 04/22/2023] [Accepted: 04/23/2023] [Indexed: 05/15/2023] Open
Abstract
Protein-protein interfaces play fundamental roles in the molecular mechanisms underlying pathophysiological pathways and are important targets for the design of compounds of therapeutic interest. However, the identification of binding sites on protein surfaces and the development of modulators of protein-protein interactions still represent a major challenge due to their highly dynamic and extensive interfacial areas. Over the years, multiple strategies including structural, computational, and combinatorial approaches have been developed to characterize PPI and to date, several successful examples of small molecules, antibodies, peptides, and aptamers able to modulate these interfaces have been determined. Notably, peptides are a particularly useful tool for inhibiting PPIs due to their exquisite potency, specificity, and selectivity. Here, after an overview of PPIs and of the commonly used approaches to identify and characterize them, we describe and evaluate the impact of chemical peptide libraries in medicinal chemistry with a special focus on the results achieved through recent applications of this methodology. Finally, we also discuss the role that this methodology can have in the framework of the opportunities, and challenges that the application of new predictive approaches based on artificial intelligence is generating in structural biology.
Collapse
Affiliation(s)
- Alessandra Monti
- Institute of Biostructures and Bioimaging (IBB), National Research Council (CNR), 80131 Napoli, Italy; (A.M.); (L.V.); (M.R.)
| | - Luigi Vitagliano
- Institute of Biostructures and Bioimaging (IBB), National Research Council (CNR), 80131 Napoli, Italy; (A.M.); (L.V.); (M.R.)
| | - Andrea Caporale
- Institute of Crystallography (IC), National Research Council (CNR), Strada Statale 14 km 163.5, Basovizza, 34149 Triese, Italy;
| | - Menotti Ruvo
- Institute of Biostructures and Bioimaging (IBB), National Research Council (CNR), 80131 Napoli, Italy; (A.M.); (L.V.); (M.R.)
| | - Nunzianna Doti
- Institute of Biostructures and Bioimaging (IBB), National Research Council (CNR), 80131 Napoli, Italy; (A.M.); (L.V.); (M.R.)
| |
Collapse
|
4
|
Lin J, Wang S, Wen L, Ye H, Shang S, Li J, Shu J, Zhou P. Targeting peptide-mediated interactions in omics. Proteomics 2023; 23:e2200175. [PMID: 36461811 DOI: 10.1002/pmic.202200175] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 11/28/2022] [Accepted: 11/28/2022] [Indexed: 12/05/2022]
Abstract
Peptide-mediated interactions (PMIs) play a crucial role in cell signaling network, which are responsible for about half of cellular protein-protein associations in the human interactome and have recently been recognized as a new kind of promising druggable target for drug development and disease therapy. In this article, we give a systematic review regarding the proteome-wide discovery of PMIs and targeting druggable PMIs (dPMIs) with chemical drugs, self-inhibitory peptides (SIPs) and protein agents, particularly focusing on their implications and applications for therapeutic purpose in omics. We also introduce computational peptidology strategies used to model, analyze, and design PMI-targeted molecular entities and further extend the concepts of protein context, direct/indirect readout, and enthalpy/entropy effect involved in PMIs. Current issues and future perspective on this topic are discussed. There is still a long way to go before establishment of efficient therapeutic strategies to target PMIs on the omics scale.
Collapse
Affiliation(s)
- Jing Lin
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu, China
| | - Shaozhou Wang
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu, China
| | - Li Wen
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu, China
| | - Haiyang Ye
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu, China
| | - Shuyong Shang
- Institute of Ecological Environment Protection, Chengdu Normal University, Chengdu, China
| | - Juelin Li
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu, China
| | - Jianping Shu
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu, China
| | - Peng Zhou
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu, China
| |
Collapse
|
5
|
Martín M, Brunello FG, Modenutti CP, Nicola JP, Marti MA. MotSASi: Functional short linear motifs (SLiMs) prediction based on genomic single nucleotide variants and structural data. Biochimie 2022; 197:59-73. [DOI: 10.1016/j.biochi.2022.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 01/17/2022] [Accepted: 02/02/2022] [Indexed: 11/28/2022]
|
6
|
Dwyer DS. Protein Receptors Evolved from Homologous Cohesion Modules That Self-Associated and Are Encoded by Interactive Networked Genes. Life (Basel) 2021; 11:life11121335. [PMID: 34947866 PMCID: PMC8707797 DOI: 10.3390/life11121335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 11/08/2021] [Accepted: 11/30/2021] [Indexed: 11/16/2022] Open
Abstract
Previously, it was proposed that protein receptors evolved from self-binding peptides that were encoded by self-interacting gene segments (inverted repeats) widely dispersed in the genome. In addition, self-association of the peptides was thought to be mediated by regions of amino acid sequence similarity. To extend these ideas, special features of receptors have been explored, such as their degree of homology to other proteins, and the arrangement of their genes for clues about their evolutionary origins and dynamics in the genome. As predicted, BLASTP searches for homologous proteins detected a greater number of unique hits for queries with receptor sequences than for sequences of randomly-selected, non-receptor proteins. This suggested that the building blocks (cohesion modules) for receptors were duplicated, dispersed, and maintained in the genome, due to structure/function relationships discussed here. Furthermore, the genes coding for a representative panel of receptors participated in a larger number of gene-gene interactions than for randomly-selected genes. This could conceivably reflect a greater evolutionary conservation of the receptor genes, with their more extensive integration into networks along with inherent properties of the genes themselves. In support of the latter possibility, some receptor genes were located in active areas of adaptive gene relocation/amalgamation to form functional blocks of related genes. It is suggested that adaptive relocation might allow for their joint regulation by common promoters and enhancers, and affect local chromatin structural domains to facilitate or repress gene expression. Speculation is included about the nature of the coordinated communication between receptors and the genes that encode them.
Collapse
Affiliation(s)
- Donard S Dwyer
- Departments of Psychiatry and Behavioral Medicine and Pharmacology, Toxicology and Neuroscience, LSU Health Shreveport, 1501 Kings Highway, Shreveport, LA 71130, USA
| |
Collapse
|
7
|
Teyra J, Kelil A, Jain S, Helmy M, Jajodia R, Hooda Y, Gu J, D’Cruz AA, Nicholson SE, Min J, Sudol M, Kim PM, Bader GD, Sidhu SS. Large-scale survey and database of high affinity ligands for peptide recognition modules. Mol Syst Biol 2020; 16:e9310. [PMID: 33438817 PMCID: PMC7724964 DOI: 10.15252/msb.20199310] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Revised: 11/03/2020] [Accepted: 11/04/2020] [Indexed: 12/13/2022] Open
Abstract
Many proteins involved in signal transduction contain peptide recognition modules (PRMs) that recognize short linear motifs (SLiMs) within their interaction partners. Here, we used large-scale peptide-phage display methods to derive optimal ligands for 163 unique PRMs representing 79 distinct structural families. We combined the new data with previous data that we collected for the large SH3, PDZ, and WW domain families to assemble a database containing 7,984 unique peptide ligands for 500 PRMs representing 82 structural families. For 74 PRMs, we acquired enough new data to map the specificity profiles in detail and derived position weight matrices and binding specificity logos based on multiple peptide ligands. These analyses showed that optimal peptide ligands resembled peptides observed in existing structures of PRM-ligand complexes, indicating that a large majority of the phage-derived peptides are likely to target natural peptide-binding sites and could thus act as inhibitors of natural protein-protein interactions. The complete dataset has been assembled in an online database (http://www.prm-db.org) that will enable many structural, functional, and biological studies of PRMs and SLiMs.
Collapse
Affiliation(s)
- Joan Teyra
- The Donnelly CentreUniversity of TorontoTorontoONCanada
| | | | - Shobhit Jain
- The Donnelly CentreUniversity of TorontoTorontoONCanada
- Department of Computer ScienceUniversity of TorontoTorontoONCanada
| | - Mohamed Helmy
- The Donnelly CentreUniversity of TorontoTorontoONCanada
- Present address:
Singapore Institute of Food and Biotechnology Innovation (SIFBI)Agency for ScienceTechnology and Research (A*STAR)Singapore CitySingapore
| | - Raghav Jajodia
- Indian Institute of Engineering Science and TechnologyShibpurIndia
| | - Yogesh Hooda
- The Donnelly CentreUniversity of TorontoTorontoONCanada
- Present address:
MRC Laboratory of Molecular BiologyCambridgeUK
| | - Jun Gu
- The Donnelly CentreUniversity of TorontoTorontoONCanada
| | - Akshay A D’Cruz
- The Walter and Eliza Hall Institute of Medical ResearchParkvilleVic.Australia
| | - Sandra E Nicholson
- The Walter and Eliza Hall Institute of Medical ResearchParkvilleVic.Australia
| | - Jinrong Min
- Structural Genomics ConsortiumUniversity of TorontoTorontoONCanada
- Department of PhysiologyUniversity of TorontoTorontoONCanada
| | - Marius Sudol
- Department of MedicineIcahn School of Medicine at Mount SinaiNew YorkNYUSA
| | - Philip M Kim
- The Donnelly CentreUniversity of TorontoTorontoONCanada
- Department of Computer ScienceUniversity of TorontoTorontoONCanada
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
| | - Gary D Bader
- The Donnelly CentreUniversity of TorontoTorontoONCanada
- Department of Computer ScienceUniversity of TorontoTorontoONCanada
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
| | - Sachdev S Sidhu
- The Donnelly CentreUniversity of TorontoTorontoONCanada
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
| |
Collapse
|
8
|
Dar KB, Bhat AH, Amin S, Anjum S, Reshi BA, Zargar MA, Masood A, Ganie SA. Exploring Proteomic Drug Targets, Therapeutic Strategies and Protein - Protein Interactions in Cancer: Mechanistic View. Curr Cancer Drug Targets 2020; 19:430-448. [PMID: 30073927 DOI: 10.2174/1568009618666180803104631] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2018] [Revised: 07/17/2018] [Accepted: 07/19/2018] [Indexed: 12/31/2022]
Abstract
Protein-Protein Interactions (PPIs) drive major signalling cascades and play critical role in cell proliferation, apoptosis, angiogenesis and trafficking. Deregulated PPIs are implicated in multiple malignancies and represent the critical targets for treating cancer. Herein, we discuss the key protein-protein interacting domains implicated in cancer notably PDZ, SH2, SH3, LIM, PTB, SAM and PH. These domains are present in numerous enzymes/kinases, growth factors, transcription factors, adaptor proteins, receptors and scaffolding proteins and thus represent essential sites for targeting cancer. This review explores the candidature of various proteins involved in cellular trafficking (small GTPases, molecular motors, matrix-degrading enzymes, integrin), transcription (p53, cMyc), signalling (membrane receptor proteins), angiogenesis (VEGFs) and apoptosis (BCL-2family), which could possibly serve as targets for developing effective anti-cancer regimen. Interactions between Ras/Raf; X-linked inhibitor of apoptosis protein (XIAP)/second mitochondria-derived activator of caspases (Smac/DIABLO); Frizzled (FRZ)/Dishevelled (DVL) protein; beta-catenin/T Cell Factor (TCF) have also been studied as prospective anticancer targets. Efficacy of diverse molecules/ drugs targeting such PPIs although evaluated in various animal models/cell lines, there is an essential need for human-based clinical trials. Therapeutic strategies like the use of biologicals, high throughput screening (HTS) and fragment-based technology could play an imperative role in designing cancer therapeutics. Moreover, bioinformatic/computational strategies based on genome sequence, protein sequence/structure and domain data could serve as competent tools for predicting PPIs. Exploring hot spots in proteomic networks represents another approach for developing targetspecific therapeutics. Overall, this review lays emphasis on a productive amalgamation of proteomics, genomics, biochemistry, and molecular dynamics for successful treatment of cancer.
Collapse
Affiliation(s)
- Khalid Bashir Dar
- Department of Clinical Biochemistry, School of Biological Sciences, University of Kashmir, Srinagar, India.,Department of Biochemistry, School of Biological Sciences, University of Kashmir, Srinagar, India
| | - Aashiq Hussain Bhat
- Department of Clinical Biochemistry, School of Biological Sciences, University of Kashmir, Srinagar, India.,Department of Biochemistry, School of Biological Sciences, University of Kashmir, Srinagar, India
| | - Shajrul Amin
- Department of Biochemistry, School of Biological Sciences, University of Kashmir, Srinagar, India
| | - Syed Anjum
- Amity Institute of Biotechnology, Amity University, Rajasthan, India
| | - Bilal Ahmad Reshi
- Department of Biotechnology, School of Biological Sciences, University of Kashmir, Srinagar, India
| | - Mohammad Afzal Zargar
- Department of Clinical Biochemistry, School of Biological Sciences, University of Kashmir, Srinagar, India
| | - Akbar Masood
- Department of Clinical Biochemistry, School of Biological Sciences, University of Kashmir, Srinagar, India
| | - Showkat Ahmad Ganie
- Department of Clinical Biochemistry, School of Biological Sciences, University of Kashmir, Srinagar, India
| |
Collapse
|
9
|
Zhou P, Miao Q, Yan F, Li Z, Jiang Q, Wen L, Meng Y. Is protein context responsible for peptide-mediated interactions? Mol Omics 2019; 15:280-295. [DOI: 10.1039/c9mo00041k] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Many cell signaling pathways are orchestrated by the weak, transient, and reversible peptide-mediated interactions (PMIs). Here, the role of protein context in contributing to the stability and specificity of PMIs is investigated systematically.
Collapse
Affiliation(s)
- Peng Zhou
- Center for Informational Biology
- University of Electronic Science and Technology of China (UESTC)
- Chengdu 611731
- China
- School of Life Science and Technology
| | - Qingqing Miao
- Center for Informational Biology
- University of Electronic Science and Technology of China (UESTC)
- Chengdu 611731
- China
- School of Life Science and Technology
| | - Fugang Yan
- Center for Informational Biology
- University of Electronic Science and Technology of China (UESTC)
- Chengdu 611731
- China
- School of Life Science and Technology
| | - Zhongyan Li
- Center for Informational Biology
- University of Electronic Science and Technology of China (UESTC)
- Chengdu 611731
- China
- School of Life Science and Technology
| | - Qianhu Jiang
- School of Life Science and Technology
- University of Electronic Science and Technology of China (UESTC)
- Chengdu 610054
- China
| | - Li Wen
- School of Life Science and Technology
- University of Electronic Science and Technology of China (UESTC)
- Chengdu 610054
- China
| | - Yang Meng
- School of Life Science and Technology
- University of Electronic Science and Technology of China (UESTC)
- Chengdu 610054
- China
| |
Collapse
|
10
|
Zhang W, Zhang C, Luo C, Zhan Y, Zhong B. Design, cyclization, and optimization of MMP13-TIMP1 interaction-derived self-inhibitory peptides against chondrocyte senescence in osteoarthritis. Int J Biol Macromol 2018; 121:921-929. [PMID: 30352228 DOI: 10.1016/j.ijbiomac.2018.10.141] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 10/18/2018] [Accepted: 10/18/2018] [Indexed: 02/07/2023]
Abstract
The matrix metallopeptidase 13 (MMP13) is a central regulator of chondrocyte senescence that contributes to the development and progression of osteoarthritis (OA). In the present study, the native inhibitory structure of MMP13 in complex with its natural cognate inhibitor, the tissue inhibitor of metalloproteinases 1 (TIMP1), was modeled at atomic level using a grafting-based structural bioinformatics method with existing crystal structures. The modeled complex structure was then examined in detail, from which a TIMP1 inhibitory site that directly inserts into the active site of MMP13 enzyme was identified. The inhibitory site contains a coiled inhibitory loop (ILP) and a stretched N-terminal tail (NTT); they are highly structured in the intact MMP13-TIMP1 complex interface, but exhibit a large flexibility and intrinsic disorder when split from the interface context. In vitro binding assays demonstrated that the isolated ILP and NTT peptides cannot effectively rebind at the MMP13 active site (Kd > ~100 μM or = n.d.), although they have all key interacting residues in the enzyme inhibition. In silico simulations revealed that splitting of the peptide segments from TIMP1 inhibitory site does not influence the direct intermolecular interaction between MMP13 and the peptides substantially; instead, the large conformational flexibility of these isolated peptides in absence of interface context is primarily responsible for the affinity impairment, which would incur a considerable entropy penalty upon the peptide binding to MMP13. An extended version of ILP peptide, namely eILP (63TPAMESVCGY72), was redesigned with a rational strategy to derive a number of its cyclized counterparts by introducing a disulfide bridge across the peptide two-termini; the redesign reduces the peptide flexibility in free state and constrains the peptide pre-folding to a native-like conformation, which would help the peptide binding with minimized entropy penalty. Binding assays substantiated that the affinity Kd values of four designed cyclic peptides (, , and ) were improved to 23, 67, 42 and 18 μM, respectively, from the 96 μM of linear eILP peptide.
Collapse
Affiliation(s)
- Wei Zhang
- Department of Orthopaedics, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China.
| | - Chi Zhang
- Department of Orthopaedics, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| | - Congfeng Luo
- Department of Orthopaedics, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| | - Yulin Zhan
- Department of Orthopaedics, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| | - Biao Zhong
- Department of Orthopaedics, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| |
Collapse
|
11
|
Kelil A, Dubreuil B, Levy ED, Michnick SW. Exhaustive search of linear information encoding protein-peptide recognition. PLoS Comput Biol 2017; 13:e1005499. [PMID: 28426660 PMCID: PMC5417721 DOI: 10.1371/journal.pcbi.1005499] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2015] [Revised: 05/04/2017] [Accepted: 04/04/2017] [Indexed: 11/24/2022] Open
Abstract
High-throughput in vitro methods have been extensively applied to identify linear information that encodes peptide recognition. However, these methods are limited in number of peptides, sequence variation, and length of peptides that can be explored, and often produce solutions that are not found in the cell. Despite the large number of methods developed to attempt addressing these issues, the exhaustive search of linear information encoding protein-peptide recognition has been so far physically unfeasible. Here, we describe a strategy, called DALEL, for the exhaustive search of linear sequence information encoded in proteins that bind to a common partner. We applied DALEL to explore binding specificity of SH3 domains in the budding yeast Saccharomyces cerevisiae. Using only the polypeptide sequences of SH3 domain binding proteins, we succeeded in identifying the majority of known SH3 binding sites previously discovered either in vitro or in vivo. Moreover, we discovered a number of sites with both non-canonical sequences and distinct properties that may serve ancillary roles in peptide recognition. We compared DALEL to a variety of state-of-the-art algorithms in the blind identification of known binding sites of the human Grb2 SH3 domain. We also benchmarked DALEL on curated biological motifs derived from the ELM database to evaluate the effect of increasing/decreasing the enrichment of the motifs. Our strategy can be applied in conjunction with experimental data of proteins interacting with a common partner to identify binding sites among them. Yet, our strategy can also be applied to any group of proteins of interest to identify enriched linear motifs or to exhaustively explore the space of linear information encoded in a polypeptide sequence. Finally, we have developed a webserver located at http://michnick.bcm.umontreal.ca/dalel, offering user-friendly interface and providing different scenarios utilizing DALEL. Here we describe the first strategy for the exhaustive search of the linear information encoding protein-peptide recognition; an approach that has previously been physically unfeasible because the combinatorial space of polypeptide sequences is too vast. The search covers the entire space of sequences with no restriction on motif length or composition, and includes all possible combinations of amino acids at distinct positions of each sequence, as well as positions with correlated preferences for amino acids.
Collapse
Affiliation(s)
- Abdellali Kelil
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
- Department of Biochemistry and Molecular Medicine, University of Montreal, Montreal, Quebec, Canada
| | - Benjamin Dubreuil
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D. Levy
- Department of Biochemistry and Molecular Medicine, University of Montreal, Montreal, Quebec, Canada
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Stephen W. Michnick
- Department of Biochemistry and Molecular Medicine, University of Montreal, Montreal, Quebec, Canada
- * E-mail:
| |
Collapse
|
12
|
Bai Z, Hou S, Zhang S, Li Z, Zhou P. Targeting Self-Binding Peptides as a Novel Strategy To Regulate Protein Activity and Function: A Case Study on the Proto-oncogene Tyrosine Protein Kinase c-Src. J Chem Inf Model 2017; 57:835-845. [DOI: 10.1021/acs.jcim.6b00673] [Citation(s) in RCA: 109] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Zhengya Bai
- Center for Informational Biology, School of Life Science and Technology, ‡Center for Information
in BioMedicine, and §Key Laboratory for Neuroinformation of the Ministry of Education, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Shasha Hou
- Center for Informational Biology, School of Life Science and Technology, ‡Center for Information
in BioMedicine, and §Key Laboratory for Neuroinformation of the Ministry of Education, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Shilei Zhang
- Center for Informational Biology, School of Life Science and Technology, ‡Center for Information
in BioMedicine, and §Key Laboratory for Neuroinformation of the Ministry of Education, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Zhongyan Li
- Center for Informational Biology, School of Life Science and Technology, ‡Center for Information
in BioMedicine, and §Key Laboratory for Neuroinformation of the Ministry of Education, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Peng Zhou
- Center for Informational Biology, School of Life Science and Technology, ‡Center for Information
in BioMedicine, and §Key Laboratory for Neuroinformation of the Ministry of Education, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| |
Collapse
|
13
|
Zhang A, He L, Wang Y. Prediction of GCRV virus-host protein interactome based on structural motif-domain interactions. BMC Bioinformatics 2017; 18:145. [PMID: 28253857 PMCID: PMC5335770 DOI: 10.1186/s12859-017-1500-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2016] [Accepted: 01/27/2017] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Grass carp hemorrhagic disease, caused by grass carp reovirus (GCRV), is the most fatal causative agent in grass carp aquaculture. Protein-protein interactions between virus and host are one avenue through which GCRV can trigger infection and induce disease. Experimental approaches for the detection of host-virus interactome have many inherent limitations, and studies on protein-protein interactions between GCRV and its host remain rare. RESULTS In this study, based on known motif-domain interaction information, we systematically predicted the GCRV virus-host protein interactome by using motif-domain interaction pair searching strategy. These proteins derived from different domain families and were predicted to interact with different motif patterns in GCRV. JAM-A protein was successfully predicted to interact with motifs of GCRV Sigma1-like protein, and shared the similar binding mode compared with orthoreovirus. Differentially expressed genes during GCRV infection process were extracted and mapped to our predicted interactome, the overlapped genes displayed different tissue expression distributions on the whole, the overall expression level in intestinal is higher than that of other three tissues, which may suggest that the functions of these genes are more active in intestinal. Function annotation and pathway enrichment analysis revealed that the host targets were largely involved in signaling pathway and immune pathway, such as interferon-gamma signaling pathway, VEGF signaling pathway, EGF receptor signaling pathway, B cell activation, and T cell activation. CONCLUSIONS Although the predicted PPIs may contain some false positives due to limited data resource and poor research background in non-model species, the computational method still provide reasonable amount of interactions, which can be further validated by high throughput experiments. The findings of this work will contribute to the development of system biology for GCRV infectious diseases, and help guide the identification of novel receptors of GCRV in its host.
Collapse
Affiliation(s)
- Aidi Zhang
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China
| | - Libo He
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China
| | - Yaping Wang
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China.
| |
Collapse
|
14
|
Evolution of domain-peptide interactions to coadapt specificity and affinity to functional diversity. Proc Natl Acad Sci U S A 2016; 113:E3862-71. [PMID: 27317745 DOI: 10.1073/pnas.1518469113] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Evolution of complexity in eukaryotic proteomes has arisen, in part, through emergence of modular independently folded domains mediating protein interactions via binding to short linear peptides in proteins. Over 30 years, structural properties and sequence preferences of these peptides have been extensively characterized. Less successful, however, were efforts to establish relationships between physicochemical properties and functions of domain-peptide interactions. To our knowledge, we have devised the first strategy to exhaustively explore the binding specificity of protein domain-peptide interactions. We applied the strategy to SH3 domains to determine the properties of their binding peptides starting from various experimental data. The strategy identified the majority (∼70%) of experimentally determined SH3 binding sites. We discovered mutual relationships among binding specificity, binding affinity, and structural properties and evolution of linear peptides. Remarkably, we found that these properties are also related to functional diversity, defined by depth of proteins within hierarchies of gene ontologies. Our results revealed that linear peptides evolved to coadapt specificity and affinity to functional diversity of domain-peptide interactions. Thus, domain-peptide interactions follow human-constructed gene ontologies, which suggest that our understanding of biological process hierarchies reflect the way chemical and thermodynamic properties of linear peptides and their interaction networks, in general, have evolved.
Collapse
|
15
|
Corbi-Verge C, Kim PM. Motif mediated protein-protein interactions as drug targets. Cell Commun Signal 2016; 14:8. [PMID: 26936767 PMCID: PMC4776425 DOI: 10.1186/s12964-016-0131-4] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2015] [Accepted: 02/25/2016] [Indexed: 12/17/2022] Open
Abstract
Protein-protein interactions (PPI) are involved in virtually every cellular process and thus represent an attractive target for therapeutic interventions. A significant number of protein interactions are frequently formed between globular domains and short linear peptide motifs (DMI). Targeting these DMIs has proven challenging and classical approaches to inhibiting such interactions with small molecules have had limited success. However, recent new approaches have led to the discovery of potent inhibitors, some of them, such as Obatoclax, ABT-199, AEG-40826 and SAH-p53-8 are likely to become approved drugs. These novel inhibitors belong to a wide range of different molecule classes, ranging from small molecules to peptidomimetics and biologicals. This article reviews the main reasons for limited success in targeting PPIs, discusses how successful approaches overcome these obstacles to discovery promising inhibitors for human protein double minute 2 (HDM2), B-cell lymphoma 2 (Bcl-2), X-linked inhibitor of apoptosis protein (XIAP), and provides a summary of the promising approaches currently in development that indicate the future potential of PPI inhibitors in drug discovery.
Collapse
Affiliation(s)
- Carles Corbi-Verge
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, M5S 3E1, Canada.
| | - Philip M Kim
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, M5S 3E1, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 3E1, Canada.
- Department of Computer Science, University of Toronto, Toronto, ON, M5S 3E1, Canada.
| |
Collapse
|
16
|
Blikstad C, Ivarsson Y. High-throughput methods for identification of protein-protein interactions involving short linear motifs. Cell Commun Signal 2015; 13:38. [PMID: 26297553 PMCID: PMC4546347 DOI: 10.1186/s12964-015-0116-8] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Accepted: 08/11/2015] [Indexed: 02/07/2023] Open
Abstract
Interactions between modular domains and short linear motifs (3–10 amino acids peptide stretches) are crucial for cell signaling. The motifs typically reside in the disordered regions of the proteome and the interactions are often transient, allowing for rapid changes in response to changing stimuli. The properties that make domain-motif interactions suitable for cell signaling also make them difficult to capture experimentally and they are therefore largely underrepresented in the known protein-protein interaction networks. Most of the knowledge on domain-motif interactions is derived from low-throughput studies, although there exist dedicated high-throughput methods for the identification of domain-motif interactions. The methods include arrays of peptides or proteins, display of peptides on phage or yeast, and yeast-two-hybrid experiments. We here provide a survey of scalable methods for domain-motif interaction profiling. These methods have frequently been applied to a limited number of ubiquitous domain families. It is now time to apply them to a broader set of peptide binding proteins, to provide a comprehensive picture of the linear motifs in the human proteome and to link them to their potential binding partners. Despite the plethora of methods, it is still a challenge for most approaches to identify interactions that rely on post-translational modification or context dependent or conditional interactions, suggesting directions for further method development.
Collapse
Affiliation(s)
- Cecilia Blikstad
- Department of Chemistry - BMC, Husargatan 3, 751 23, Uppsala, Sweden
| | - Ylva Ivarsson
- Department of Chemistry - BMC, Husargatan 3, 751 23, Uppsala, Sweden.
| |
Collapse
|
17
|
Schindler CEM, de Vries SJ, Zacharias M. Fully Blind Peptide-Protein Docking with pepATTRACT. Structure 2015; 23:1507-1515. [PMID: 26146186 DOI: 10.1016/j.str.2015.05.021] [Citation(s) in RCA: 87] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2015] [Revised: 05/21/2015] [Accepted: 05/25/2015] [Indexed: 02/02/2023]
Abstract
Peptide-protein interactions are ubiquitous in the cell and form an important part of the interactome. Computational docking methods can complement experimental characterization of these complexes, but current protocols are not applicable on the proteome scale. Here, we present a new fully blind flexible peptide-protein docking protocol, pepATTRACT, which combines a rapid coarse-grained global peptide docking search of the entire protein surface with a two-stage atomistic flexible refinement. Global unbound-unbound docking yielded near-native models for 70% of the docking cases when testing the protocol on the largest benchmark of peptide-protein complexes available to date. This performance is similar to that of state-of-the-art local docking protocols that rely on information about the binding site. Upon restricting the search to the peptide binding region, the resulting pepATTRACT-local approach outperformed existing methods. Docking scripts for pepATTRACT and pepATTRACT-local can be generated via a web interface at www.attract.ph.tum.de/peptide.html.
Collapse
Affiliation(s)
- Christina E M Schindler
- Physics Department T38, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Sjoerd J de Vries
- Physics Department T38, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Martin Zacharias
- Physics Department T38, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany.
| |
Collapse
|
18
|
Chen TS, Petrey D, Garzon JI, Honig B. Predicting peptide-mediated interactions on a genome-wide scale. PLoS Comput Biol 2015; 11:e1004248. [PMID: 25938916 PMCID: PMC4418708 DOI: 10.1371/journal.pcbi.1004248] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Accepted: 03/18/2015] [Indexed: 12/20/2022] Open
Abstract
We describe a method to predict protein-protein interactions (PPIs) formed between structured domains and short peptide motifs. We take an integrative approach based on consensus patterns of known motifs in databases, structures of domain-motif complexes from the PDB and various sources of non-structural evidence. We combine this set of clues using a Bayesian classifier that reports the likelihood of an interaction and obtain significantly improved prediction performance when compared to individual sources of evidence and to previously reported algorithms. Our Bayesian approach was integrated into PrePPI, a structure-based PPI prediction method that, so far, has been limited to interactions formed between two structured domains. Around 80,000 new domain-motif mediated interactions were predicted, thus enhancing PrePPI’s coverage of the human protein interactome. Complexes formed between a structured domain on one protein and an unstructured peptide on another are ubiquitous. However, they are often quite difficult to detect experimentally. The development of computational approaches to predict domain-motif interactions is therefore an important goal. We report a method to predict domain-motif interactions using a Bayesian approach to integrate evidence from a variety of sources, including three-dimensional structural and non-structural information. The method was applied to the entire human proteome and showed significant improvement over existing methods. The method was incorporated into PrePPI, a computational pipeline for the prediction of protein-protein interactions that relies heavily on structural information. Approximately 80,000 new interactions were detected. The new PrePPI database provides easy access to about 400,000 human protein-protein interactions and should thus constitute a valuable resource in a variety of biological applications including the characterization of molecular interaction networks and, more generally, in the study of interactions mediated by proteins in families that may not be extensively studied experimentally.
Collapse
Affiliation(s)
- T. Scott Chen
- Howard Hughes Medical Institute, Columbia University, New York, New York, United States of America
- Department of Systems Biology, Columbia University, New York, New York, United States of America
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, United States of America
- Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
| | - Donald Petrey
- Howard Hughes Medical Institute, Columbia University, New York, New York, United States of America
- Department of Systems Biology, Columbia University, New York, New York, United States of America
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, United States of America
- Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
| | - Jose Ignacio Garzon
- Howard Hughes Medical Institute, Columbia University, New York, New York, United States of America
- Department of Systems Biology, Columbia University, New York, New York, United States of America
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, United States of America
- Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
| | - Barry Honig
- Howard Hughes Medical Institute, Columbia University, New York, New York, United States of America
- Department of Systems Biology, Columbia University, New York, New York, United States of America
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, United States of America
- Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
- * E-mail:
| |
Collapse
|
19
|
Palopoli N, Lythgow KT, Edwards RJ. QSLiMFinder: improved short linear motif prediction using specific query protein data. Bioinformatics 2015; 31:2284-93. [PMID: 25792551 PMCID: PMC4495300 DOI: 10.1093/bioinformatics/btv155] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2014] [Accepted: 03/16/2015] [Indexed: 12/16/2022] Open
Abstract
Motivation: The sensitivity of de novo short linear motif (SLiM) prediction is limited by the number of patterns (the motif space) being assessed for enrichment. QSLiMFinder uses specific query protein information to restrict the motif space and thereby increase the sensitivity and specificity of predictions. Results: QSLiMFinder was extensively benchmarked using known SLiM-containing proteins and simulated protein interaction datasets of real human proteins. Exploiting prior knowledge of a query protein likely to be involved in a SLiM-mediated interaction increased the proportion of true positives correctly returned and reduced the proportion of datasets returning a false positive prediction. The biggest improvement was seen if a short region of the query protein flanking the interaction site was known. Availability and implementation: All the tools and data used in this study, including QSLiMFinder and the SLiMBench benchmarking software, are freely available under a GNU license as part of SLiMSuite, at: http://bioware.soton.ac.uk. Contact:richard.edwards@unsw.edu.au Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Nicolas Palopoli
- Centre for Biological Sciences, University of Southampton, Southampton, UK
| | - Kieren T Lythgow
- Centre for Biological Sciences, University of Southampton, Southampton, UK, Public Health England, London, UK
| | - Richard J Edwards
- Centre for Biological Sciences, University of Southampton, Southampton, UK, Institute for Life Sciences, University of Southampton, Southampton, UK and School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| |
Collapse
|
20
|
Abstract
Regulated interactions between proteins govern signaling pathways within and between cells. Structural studies on protein complexes formed reversibly and/or transiently illustrate the remarkable diversity of interactions, both in terms of interfacial size and nature. In recent years, "domain-peptide" interactions have gained much greater recognition and may be viewed as both pre-translational and posttranslational-dependent functional switches. Our understanding of the multistep regulation of auto-inhibited multidomain proteins has also grown. Their activity may be understood as the "combinatorial" output of multiple input signals, including phosphorylation, location, and mechanical force. The prospects for bridging the gap between the new "systems biology" data and the traditional "reductionist" data are also discussed.
Collapse
Affiliation(s)
- Robert C Liddington
- Sanford-Burnham Medical Research Institute, 10901 North Torrey Pines Road, La Jolla, CA, 92037, USA,
| |
Collapse
|
21
|
Schoenrock A, Samanfar B, Pitre S, Hooshyar M, Jin K, Phillips CA, Wang H, Phanse S, Omidi K, Gui Y, Alamgir M, Wong A, Barrenäs F, Babu M, Benson M, Langston MA, Green JR, Dehne F, Golshani A. Efficient prediction of human protein-protein interactions at a global scale. BMC Bioinformatics 2014; 15:383. [PMID: 25492630 PMCID: PMC4272565 DOI: 10.1186/s12859-014-0383-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Accepted: 11/12/2014] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Our knowledge of global protein-protein interaction (PPI) networks in complex organisms such as humans is hindered by technical limitations of current methods. RESULTS On the basis of short co-occurring polypeptide regions, we developed a tool called MP-PIPE capable of predicting a global human PPI network within 3 months. With a recall of 23% at a precision of 82.1%, we predicted 172,132 putative PPIs. We demonstrate the usefulness of these predictions through a range of experiments. CONCLUSIONS The speed and accuracy associated with MP-PIPE can make this a potential tool to study individual human PPI networks (from genomic sequences alone) for personalized medicine.
Collapse
Affiliation(s)
| | | | - Sylvain Pitre
- School of Computer Science, Carleton University, Ottawa, Canada.
| | | | - Ke Jin
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada.
| | - Charles A Phillips
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, Tennessee, USA.
| | - Hui Wang
- Department of Pediatrics, Gothenburg University, Gothenburg, Sweden. .,The Centre for Individualized Medication, Linköping University, Linköping, Sweden.
| | - Sadhna Phanse
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada.
| | - Katayoun Omidi
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Yuan Gui
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Md Alamgir
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Alex Wong
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Fredrik Barrenäs
- Department of Pediatrics, Gothenburg University, Gothenburg, Sweden. .,The Centre for Individualized Medication, Linköping University, Linköping, Sweden.
| | - Mohan Babu
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan, Canada.
| | - Mikael Benson
- Department of Pediatrics, Gothenburg University, Gothenburg, Sweden. .,The Centre for Individualized Medication, Linköping University, Linköping, Sweden.
| | - Michael A Langston
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, Tennessee, USA.
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Canada.
| | - Frank Dehne
- School of Computer Science, Carleton University, Ottawa, Canada.
| | | |
Collapse
|
22
|
Yu H, Zhou P, Deng M, Shang Z. Indirect Readout in Protein-Peptide Recognition: A Different Story from Classical Biomolecular Recognition. J Chem Inf Model 2014; 54:2022-32. [DOI: 10.1021/ci5000246] [Citation(s) in RCA: 96] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
| | - Peng Zhou
- Center
of Bioinformatics (COBI), School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu, Sichuan 610054, China
| | | | | |
Collapse
|
23
|
Cukuroglu E, Gursoy A, Nussinov R, Keskin O. Non-redundant unique interface structures as templates for modeling protein interactions. PLoS One 2014; 9:e86738. [PMID: 24475173 PMCID: PMC3903793 DOI: 10.1371/journal.pone.0086738] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2013] [Accepted: 12/18/2013] [Indexed: 01/16/2023] Open
Abstract
Improvements in experimental techniques increasingly provide structural data relating to protein-protein interactions. Classification of structural details of protein-protein interactions can provide valuable insights for modeling and abstracting design principles. Here, we aim to cluster protein-protein interactions by their interface structures, and to exploit these clusters to obtain and study shared and distinct protein binding sites. We find that there are 22604 unique interface structures in the PDB. These unique interfaces, which provide a rich resource of structural data of protein-protein interactions, can be used for template-based docking. We test the specificity of these non-redundant unique interface structures by finding protein pairs which have multiple binding sites. We suggest that residues with more than 40% relative accessible surface area should be considered as surface residues in template-based docking studies. This comprehensive study of protein interface structures can serve as a resource for the community. The dataset can be accessed at http://prism.ccbb.ku.edu.tr/piface.
Collapse
Affiliation(s)
- Engin Cukuroglu
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, Istanbul, Turkey
| | - Attila Gursoy
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, Istanbul, Turkey
| | - Ruth Nussinov
- National Cancer Institute, Cancer and Inflammation Program, Frederick National Laboratory for Cancer Research, Leidos Biomedical Research, Inc., National Cancer Institute, Frederick, Maryland, United States of America
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Ozlem Keskin
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, Istanbul, Turkey
| |
Collapse
|
24
|
Chandrasekaran P, Rajasekaran R. Structural characterization of disease-causing mutations on SAP and the functional impact on the SLAM peptide: a molecular dynamics approach. MOLECULAR BIOSYSTEMS 2014; 10:1869-80. [DOI: 10.1039/c4mb00177j] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
25
|
Mosca R, Pons T, Céol A, Valencia A, Aloy P. Towards a detailed atlas of protein–protein interactions. Curr Opin Struct Biol 2013; 23:929-40. [DOI: 10.1016/j.sbi.2013.07.005] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2013] [Revised: 07/04/2013] [Accepted: 07/08/2013] [Indexed: 12/30/2022]
|
26
|
London N, Raveh B, Schueler-Furman O. Peptide docking and structure-based characterization of peptide binding: from knowledge to know-how. Curr Opin Struct Biol 2013; 23:894-902. [PMID: 24138780 DOI: 10.1016/j.sbi.2013.07.006] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2013] [Revised: 07/04/2013] [Accepted: 07/08/2013] [Indexed: 11/25/2022]
Abstract
Peptide-mediated interactions are gaining increased attention due to their predominant roles in the many regulatory processes that involve dynamic interactions between proteins. The structures of such interactions provide an excellent starting point for their characterization and manipulation, and can provide leads for targeted inhibitor design. The relatively few experimentally determined structures of peptide-protein complexes can be complemented with an outburst of modeling approaches that have been introduced in recent years, with increasing accuracy and applicability to ever more systems. We review different methods to address the considerable challenges in modeling the binding of a short yet highly flexible peptide to its partner. These methods apply an array of sampling strategies and draw from a recent amassing of knowledge about the biophysical nature of peptide-protein interactions. We elaborate on applications of these structure-based approaches and in particular on the characterization of peptide binding specificity to different peptide-binding domains and enzymes. Such applications can identify new biological targets and thus complement our current view of protein-protein interactions in living organisms. Accurate peptide-protein docking is of particular importance in the light of increased appreciation of the crucial functional roles of disordered regions and the many linear binding motifs embedded within.
Collapse
Affiliation(s)
- Nir London
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | | | | |
Collapse
|
27
|
Mosca R, Céol A, Stein A, Olivella R, Aloy P. 3did: a catalog of domain-based interactions of known three-dimensional structure. Nucleic Acids Res 2013; 42:D374-9. [PMID: 24081580 PMCID: PMC3965002 DOI: 10.1093/nar/gkt887] [Citation(s) in RCA: 166] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The database of 3D interacting domains (3did, available online for browsing and bulk download at http://3did.irbbarcelona.org) is a catalog of protein–protein interactions for which a high-resolution 3D structure is known. 3did collects and classifies all structural templates of domain–domain interactions in the Protein Data Bank, providing molecular details for such interactions. The current version also includes a pipeline for the discovery and annotation of novel domain–motif interactions. For every interaction, 3did identifies and groups different binding modes by clustering similar interfaces into ‘interaction topologies’. By maintaining a constantly updated collection of domain-based structural interaction templates, 3did is a reference source of information for the structural characterization of protein interaction networks. 3did is updated every 6 months.
Collapse
Affiliation(s)
- Roberto Mosca
- Joint IRB-BSC Program in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona), c/ Baldiri Reixac 10-12, 08028 Barcelona, Spain, Center for Genomic Science of IIT@SEMM, Istituto Italiano di Tecnologia (IIT), Via Adamello 16, 20139 Milan, Italy, California Institute for Quantitative Biomedical Research (qb3) and Department of Bioengineering and Therapeutic Sciences, MC 2530, University of California San Francisco (UCSF) CA 94158-2330, USA and Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluís Companys 23, 08010 Barcelona, Spain
| | | | | | | | | |
Collapse
|
28
|
Functional characterization of human genes from exon expression and RNA interference results. Methods Mol Biol 2013; 910:33-53. [PMID: 22821591 DOI: 10.1007/978-1-61779-965-5_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
Complex biological systems comprise a large number of interacting molecules. The identification and detailed characterization of the functions of the involved genes and proteins are crucial for modeling and understanding such systems. To interrogate the various cellular processes, high-throughput techniques such as the Affymetrix Exon Array or RNA interference (RNAi) screens are powerful experimental approaches for functional genomics. However, they typically yield long gene lists that require computational methods to further analyze and functionally annotate the experimental results and to gain more insight into important molecular interactions. Here, we focus on bioinformatics software tools for the functional interpretation of exon expression data to discover alternative splicing events and their impact on gene and protein architecture, molecular networks, and pathways. We additionally demonstrate how to explore large lists of candidate genes as they also result from RNAi screens. In particular, our exemplary application studies show how to analyze the function of human genes that play a major role in human stem cells or viral infections.
Collapse
|
29
|
Predicting binding within disordered protein regions to structurally characterised peptide-binding domains. PLoS One 2013; 8:e72838. [PMID: 24019881 PMCID: PMC3760854 DOI: 10.1371/journal.pone.0072838] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2013] [Accepted: 07/12/2013] [Indexed: 11/19/2022] Open
Abstract
Disordered regions of proteins often bind to structured domains, mediating interactions within and between proteins. However, it is difficult to identify a priori the short disordered regions involved in binding. We set out to determine if docking such peptide regions to peptide binding domains would assist in these predictions.We assembled a redundancy reduced dataset of SLiM (Short Linear Motif) containing proteins from the ELM database. We selected 84 sequences which had an associated PDB structures showing the SLiM bound to a protein receptor, where the SLiM was found within a 50 residue region of the protein sequence which was predicted to be disordered. First, we investigated the Vina docking scores of overlapping tripeptides from the 50 residue SLiM containing disordered regions of the protein sequence to the corresponding PDB domain. We found only weak discrimination of docking scores between peptides involved in binding and adjacent non-binding peptides in this context (AUC 0.58).Next, we trained a bidirectional recurrent neural network (BRNN) using as input the protein sequence, predicted secondary structure, Vina docking score and predicted disorder score. The results were very promising (AUC 0.72) showing that multiple sources of information can be combined to produce results which are clearly superior to any single source.We conclude that the Vina docking score alone has only modest power to define the location of a peptide within a larger protein region known to contain it. However, combining this information with other knowledge (using machine learning methods) clearly improves the identification of peptide binding regions within a protein sequence. This approach combining docking with machine learning is primarily a predictor of binding to peptide-binding sites, and is not intended as a predictor of specificity of binding to particular receptors.
Collapse
|
30
|
Segura-Cabrera A, García-Pérez CA, Guo X, Rodríguez-Pérez MA. A viral-human interactome based on structural motif-domain interactions captures the human infectome. PLoS One 2013; 8:e71526. [PMID: 23951184 PMCID: PMC3738538 DOI: 10.1371/journal.pone.0071526] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2013] [Accepted: 06/28/2013] [Indexed: 11/23/2022] Open
Abstract
Protein interactions between a pathogen and its host are fundamental in the establishment of the pathogen and underline the infection mechanism. In the present work, we developed a single predictive model for building a host-viral interactome based on the identification of structural descriptors from motif-domain interactions of protein complexes deposited in the Protein Data Bank (PDB). The structural descriptors were used for searching, in a database of protein sequences of human and five clinically important viruses; therefore, viral and human proteins sharing a descriptor were predicted as interacting proteins. The analysis of the host-viral interactome allowed to identify a set of new interactions that further explain molecular mechanism associated with viral infections and showed that it was able to capture human proteins already associated to viral infections (human infectome) and non-infectious diseases (human diseasome). The analysis of human proteins targeted by viral proteins in the context of a human interactome showed that their neighbors are enriched in proteins reported with differential expression under infection and disease conditions. It is expected that the findings of this work will contribute to the development of systems biology for infectious diseases, and help guide the rational identification and prioritization of novel drug targets.
Collapse
Affiliation(s)
- Aldo Segura-Cabrera
- Laboratorio de Bioinformática, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa, México.
| | | | | | | |
Collapse
|
31
|
Hugo W, Sung WK, Ng SK. Discovering interacting domains and motifs in protein-protein interactions. Methods Mol Biol 2013. [PMID: 23192537 DOI: 10.1007/978-1-62703-107-3_2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
Abstract
Many important biological processes, such as the signaling pathways, require protein-protein interactions (PPIs) that are designed for fast response to stimuli. These interactions are usually transient, easily formed, and disrupted, yet specific. Many of these transient interactions involve the binding of a protein domain to a short stretch (3-10) of amino acid residues, which can be characterized by a sequence pattern, i.e., a short linear motif (SLiM). We call these interacting domains and motifs domain-SLiM interactions. Existing methods have focused on discovering SLiMs in the interacting proteins' sequence data. With the recent increase in protein structures, we have a new opportunity to detect SLiMs directly from the proteins' 3D structures instead of their linear sequences. In this chapter, we describe a computational method called SLiMDIet to directly detect SLiMs on domain interfaces extracted from 3D structures of PPIs. SLiMDIet comprises two steps: (1) interaction interfaces belonging to the same domain are extracted and grouped together using structural clustering and (2) the extracted interaction interfaces in each cluster are structurally aligned to extract the corresponding SLiM. Using SLiMDIet, de novo SLiMs interacting with protein domains can be computationally detected from structurally clustered domain-SLiM interactions for PFAM domains which have available 3D structures in the PDB database.
Collapse
Affiliation(s)
- Willy Hugo
- School of Computing, National University of Singapore, Singapore, Singapore
| | | | | |
Collapse
|
32
|
Buljan M, Chalancon G, Dunker AK, Bateman A, Balaji S, Fuxreiter M, Babu MM. Alternative splicing of intrinsically disordered regions and rewiring of protein interactions. Curr Opin Struct Biol 2013; 23:443-50. [DOI: 10.1016/j.sbi.2013.03.006] [Citation(s) in RCA: 145] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2013] [Revised: 03/19/2013] [Accepted: 03/25/2013] [Indexed: 12/31/2022]
|
33
|
Duran-Frigola M, Mosca R, Aloy P. Structural Systems Pharmacology: The Role of 3D Structures in Next-Generation Drug Development. ACTA ACUST UNITED AC 2013; 20:674-84. [DOI: 10.1016/j.chembiol.2013.03.004] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2013] [Revised: 02/28/2013] [Accepted: 03/05/2013] [Indexed: 01/12/2023]
|
34
|
Goodman SR, Daescu O, Kakhniashvili DG, Zivanic M. The proteomics and interactomics of human erythrocytes. Exp Biol Med (Maywood) 2013; 238:509-18. [DOI: 10.1177/1535370213488474] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
In this minireview, we focus on advances in our knowledge of the human erythrocyte proteome and interactome that have occurred since our seminal review on the topic published in 2007. As will be explained, the number of unique proteins has grown from 751 in 2007 to 2289 as of today. We describe how proteomics and interactomics tools have been used to probe critical protein changes in disorders impacting the blood. The primary example used is the work done on sickle cell disease where biomarkers of severity have been identified, protein changes in the erythrocyte membranes identified, pharmacoproteomic impact of hydroxyurea studied and interactomics used to identify erythrocyte protein changes that are predicted to have the greatest impact on protein interaction networks.
Collapse
Affiliation(s)
- Steven R Goodman
- Department of Biochemistry & Molecular Biology, SUNY Upstate Medical University, Syracuse, NY 13210, USA
| | - Ovidiu Daescu
- Department of Computer Science, University of Texas at Dallas, Richardson, TX 75080, USA
| | - David G Kakhniashvili
- Department of Biochemistry & Molecular Biology, SUNY Upstate Medical University, Syracuse, NY 13210, USA
| | - Marko Zivanic
- Department of Computer Science, University of Texas at Dallas, Richardson, TX 75080, USA
| |
Collapse
|
35
|
Verschueren E, Vanhee P, Rousseau F, Schymkowitz J, Serrano L. Protein-peptide complex prediction through fragment interaction patterns. Structure 2013; 21:789-97. [PMID: 23583037 DOI: 10.1016/j.str.2013.02.023] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2012] [Revised: 02/04/2013] [Accepted: 02/25/2013] [Indexed: 01/13/2023]
Abstract
The number of protein-peptide interactions in a cell is so large that experimental determination of all these complex structures would be a daunting task. Although homology modeling and refinement protocols have vastly improved the number and quality of predicted structural models, ab initio methods are still challenged by both the large number of possible docking sites and the conformational space accessible to flexible peptides. We present a method that addresses these challenges by sampling the entire accessible surface of a protein with a reduced conformational space of interacting backbone fragment pairs from unrelated structures. We demonstrate its potential by predicting ab initio the bound structure for a variety of protein-peptide complexes. In addition, we show the potential of our method for the discovery of domain interaction sites and domain-domain docking.
Collapse
Affiliation(s)
- Erik Verschueren
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation-CRG, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | | | | | | |
Collapse
|
36
|
Design of peptide affinity ligands for S-protein: a comparison of combinatorial and de novo design strategies. Mol Divers 2013; 17:357-69. [DOI: 10.1007/s11030-013-9436-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2013] [Accepted: 03/11/2013] [Indexed: 12/11/2022]
|
37
|
Mosca R, Céol A, Aloy P. Interactome3D: adding structural details to protein networks. Nat Methods 2013; 10:47-53. [DOI: 10.1038/nmeth.2289] [Citation(s) in RCA: 339] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Accepted: 10/30/2012] [Indexed: 01/13/2023]
|
38
|
Disordered binding regions and linear motifs--bridging the gap between two models of molecular recognition. PLoS One 2012; 7:e46829. [PMID: 23056474 PMCID: PMC3463566 DOI: 10.1371/journal.pone.0046829] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2012] [Accepted: 09/05/2012] [Indexed: 12/25/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) exist without the presence of a stable tertiary structure in isolation. These proteins are often involved in molecular recognition processes via their disordered binding regions that can recognize partner molecules by undergoing a coupled folding and binding process. The specific properties of disordered binding regions give way to specific, yet transient interactions that enable IDPs to play central roles in signaling pathways and act as hubs of protein interaction networks. An alternative model of protein-protein interactions with largely overlapping functional properties is offered by the concept of linear interaction motifs. This approach focuses on distilling a short consensus sequence pattern from proteins with a common interaction partner. These motifs often reside in disordered regions and are considered to mediate the interaction roughly independent from the rest of the protein. Although a connection between linear motifs and disordered binding regions has been established through common examples, the complementary nature of the two concepts has yet to be fully explored. In many cases the sequence based definition of linear motifs and the structural context based definition of disordered binding regions describe two aspects of the same phenomenon. To gain insight into the connection between the two models, prediction methods were utilized. We combined the regular expression based prediction of linear motifs with the disordered binding region prediction method ANCHOR, each specialized for either model to get the best of both worlds. The thorough analysis of the overlap of the two methods offers a bioinformatics tool for more efficient binding site prediction that can serve a wide range of practical implications. At the same time it can also shed light on the theoretical connection between the two co-existing interaction models.
Collapse
|
39
|
Kuzu G, Keskin O, Gursoy A, Nussinov R. Constructing structural networks of signaling pathways on the proteome scale. Curr Opin Struct Biol 2012; 22:367-77. [PMID: 22575757 DOI: 10.1016/j.sbi.2012.04.004] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Revised: 03/20/2012] [Accepted: 04/18/2012] [Indexed: 11/30/2022]
Abstract
Proteins function through their interactions, and the availability of protein interaction networks could help in understanding cellular processes. However, the known structural data are limited and the classical network node-and-edge representation, where proteins are nodes and interactions are edges, shows only which proteins interact; not how they interact. Structural networks provide this information. Protein-protein interface structures can also indicate which binding partners can interact simultaneously and which are competitive, and can help forecasting potentially harmful drug side effects. Here, we use a powerful protein-protein interactions prediction tool which is able to carry out accurate predictions on the proteome scale to construct the structural network of the extracellular signal-regulated kinases (ERK) in the mitogen-activated protein kinase (MAPK) signaling pathway. This knowledge-based method, PRISM, is motif-based, and is combined with flexible refinement and energy scoring. PRISM predicts protein interactions based on structural and evolutionary similarity to known protein interfaces.
Collapse
Affiliation(s)
- Guray Kuzu
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University Rumelifeneri Yolu, 34450 Sariyer Istanbul, Turkey
| | | | | | | |
Collapse
|
40
|
Weatheritt RJ, Luck K, Petsalaki E, Davey NE, Gibson TJ. The identification of short linear motif-mediated interfaces within the human interactome. ACTA ACUST UNITED AC 2012; 28:976-82. [PMID: 22328783 PMCID: PMC3315716 DOI: 10.1093/bioinformatics/bts072] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Eukaryotic proteins are highly modular, containing multiple interaction interfaces that mediate binding to a network of regulators and effectors. Recent advances in high-throughput proteomics have rapidly expanded the number of known protein-protein interactions (PPIs); however, the molecular basis for the majority of these interactions remains to be elucidated. There has been a growing appreciation of the importance of a subset of these PPIs, namely those mediated by short linear motifs (SLiMs), particularly the canonical and ubiquitous SH2, SH3 and PDZ domain-binding motifs. However, these motif classes represent only a small fraction of known SLiMs and outside these examples little effort has been made, either bioinformatically or experimentally, to discover the full complement of motif instances. RESULTS In this article, interaction data are analysed to identify and characterize an important subset of PPIs, those involving SLiMs binding to globular domains. To do this, we introduce iELM, a method to identify interactions mediated by SLiMs and add molecular details of the interaction interfaces to both interacting proteins. The method identifies SLiM-mediated interfaces from PPI data by searching for known SLiM-domain pairs. This approach was applied to the human interactome to identify a set of high-confidence putative SLiM-mediated PPIs. AVAILABILITY iELM is freely available at http://elmint.embl.de CONTACT toby.gibson@embl.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- R J Weatheritt
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | | | | | | | | |
Collapse
|
41
|
Hugo W, Ng SK, Sung WK. D-SLIMMER: Domain–SLiM Interaction Motifs Miner for Sequence Based Protein–Protein Interaction Data. J Proteome Res 2011; 10:5285-95. [DOI: 10.1021/pr200312e] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Affiliation(s)
- Willy Hugo
- Department of Computer Science, National University of Singapore, 13 Computing Drive, 117417 Singapore
| | - See-Kiong Ng
- Data Mining Department, Institute for Infocomm Research, 1 Fusionopolis Way, 138632 Singapore
| | - Wing-Kin Sung
- Department of Computer Science, National University of Singapore, 13 Computing Drive, 117417 Singapore
- Department of Information and Mathematical Science, Genome Institute of Singapore, 60 Biopolis Street, 138672 Singapore
| |
Collapse
|
42
|
Mooney C, Pollastri G, Shields DC, Haslam NJ. Prediction of short linear protein binding regions. J Mol Biol 2011; 415:193-204. [PMID: 22079048 DOI: 10.1016/j.jmb.2011.10.025] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2011] [Revised: 09/13/2011] [Accepted: 10/13/2011] [Indexed: 12/17/2022]
Abstract
Short linear motifs in proteins (typically 3-12 residues in length) play key roles in protein-protein interactions by frequently binding specifically to peptide binding domains within interacting proteins. Their tendency to be found in disordered segments of proteins has meant that they have often been overlooked. Here we present SLiMPred (short linear motif predictor), the first general de novo method designed to computationally predict such regions in protein primary sequences independent of experimentally defined homologs and interactors. The method applies machine learning techniques to predict new motifs based on annotated instances from the Eukaryotic Linear Motif database, as well as structural, biophysical, and biochemical features derived from the protein primary sequence. We have integrated these data sources and benchmarked the predictive accuracy of the method, and found that it performs equivalently to a predictor of protein binding regions in disordered regions, in addition to having predictive power for other classes of motif sites such as polyproline II helix motifs and short linear motifs lying in ordered regions. It will be useful in predicting peptides involved in potential protein associations and will aid in the functional characterization of proteins, especially of proteins lacking experimental information on structures and interactions. We conclude that, despite the diversity of motif sequences and structures, SLiMPred is a valuable tool for prioritizing potential interaction motifs in proteins.
Collapse
Affiliation(s)
- Catherine Mooney
- Complex and Adaptive Systems Laboratory, University College Dublin, Dublin, Ireland
| | | | | | | |
Collapse
|
43
|
Sequence- and interactome-based prediction of viral protein hotspots targeting host proteins: a case study for HIV Nef. PLoS One 2011; 6:e20735. [PMID: 21738584 PMCID: PMC3125164 DOI: 10.1371/journal.pone.0020735] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2011] [Accepted: 05/08/2011] [Indexed: 01/03/2023] Open
Abstract
Virus proteins alter protein pathways of the host toward the synthesis of viral particles by breaking and making edges via binding to host proteins. In this study, we developed a computational approach to predict viral sequence hotspots for binding to host proteins based on sequences of viral and host proteins and literature-curated virus-host protein interactome data. We use a motif discovery algorithm repeatedly on collections of sequences of viral proteins and immediate binding partners of their host targets and choose only those motifs that are conserved on viral sequences and highly statistically enriched among binding partners of virus protein targeted host proteins. Our results match experimental data on binding sites of Nef to host proteins such as MAPK1, VAV1, LCK, HCK, HLA-A, CD4, FYN, and GNB2L1 with high statistical significance but is a poor predictor of Nef binding sites on highly flexible, hoop-like regions. Predicted hotspots recapture CD8 cell epitopes of HIV Nef highlighting their importance in modulating virus-host interactions. Host proteins potentially targeted or outcompeted by Nef appear crowding the T cell receptor, natural killer cell mediated cytotoxicity, and neurotrophin signaling pathways. Scanning of HIV Nef motifs on multiple alignments of hepatitis C protein NS5A produces results consistent with literature, indicating the potential value of the hotspot discovery in advancing our understanding of virus-host crosstalk.
Collapse
|
44
|
Verschueren E, Vanhee P, van der Sloot AM, Serrano L, Rousseau F, Schymkowitz J. Protein design with fragment databases. Curr Opin Struct Biol 2011; 21:452-9. [PMID: 21684149 DOI: 10.1016/j.sbi.2011.05.002] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2011] [Accepted: 05/25/2011] [Indexed: 11/25/2022]
Abstract
Structure-based computational methods are popular tools for designing proteins and interactions between proteins because they provide the necessary insight and details required for rational engineering. Here, we first argue that large-scale databases of fragments contain a discrete but complete set of building blocks that can be used to design structures. We show that these structural alphabets can be saturated to provide conformational ensembles that sample the native structure space around energetic minima. Second, we show that catalogs of interaction patterns hold the key to overcome the lack of scaffolds when computationally designing protein interactions. Finally, we illustrate the power of database-driven computational protein design methods by recent successful applications and discuss what challenges remain to push this field forward.
Collapse
Affiliation(s)
- Erik Verschueren
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG) and UPF, Barcelona, Spain
| | | | | | | | | | | |
Collapse
|
45
|
Stein A, Rueda M, Panjkovich A, Orozco M, Aloy P. A Systematic Study of the Energetics Involved in Structural Changes upon Association and Connectivity in Protein Interaction Networks. Structure 2011; 19:881-9. [DOI: 10.1016/j.str.2011.03.009] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2010] [Revised: 02/22/2011] [Accepted: 03/13/2011] [Indexed: 01/26/2023]
|
46
|
Vanhee P, van der Sloot AM, Verschueren E, Serrano L, Rousseau F, Schymkowitz J. Computational design of peptide ligands. Trends Biotechnol 2011; 29:231-9. [PMID: 21316780 DOI: 10.1016/j.tibtech.2011.01.004] [Citation(s) in RCA: 118] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2010] [Revised: 01/11/2011] [Accepted: 01/12/2011] [Indexed: 12/19/2022]
Abstract
Peptides possess several attractive features when compared to small molecule and protein therapeutics, such as high structural compatibility with target proteins, the ability to disrupt protein-protein interfaces, and small size. Efficient design of high-affinity peptide ligands via rational methods has been a major obstacle to the development of this potential drug class. However, structural insights into the architecture of protein-peptide interfaces have recently culminated in several computational approaches for the rational design of peptides that target proteins. These methods provide a valuable alternative to experimental high-resolution structures of target protein-peptide complexes, bringing closer the dream of in silico designed peptides for therapeutic applications.
Collapse
Affiliation(s)
- Peter Vanhee
- VIB SWITCH Laboratory, Flanders Institute of Biotechnology (VIB), Pleinlaan 2, 1050 Brussels, Belgium
| | | | | | | | | | | |
Collapse
|
47
|
Stein A, Mosca R, Aloy P. Three-dimensional modeling of protein interactions and complexes is going 'omics. Curr Opin Struct Biol 2011; 21:200-8. [PMID: 21320770 DOI: 10.1016/j.sbi.2011.01.005] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2010] [Revised: 01/11/2011] [Accepted: 01/13/2011] [Indexed: 10/18/2022]
Abstract
High-throughput interaction discovery initiatives have revealed the existence of hundreds of multiprotein complexes whose functions are regulated through thousands of protein-protein interactions (PPIs). However, the structural details of these interactions, often necessary to understand their function, are only available for a tiny fraction, and the experimental difficulties surrounding complex structure determination make computational modeling techniques paramount. In this manuscript, we critically review some of the most recent developments in the field of structural bioinformatics applied to the modeling of protein interactions and complexes, from large macromolecular machines to domain-domain and peptide-mediated interactions. In particular, we place a special emphasis on those methods that can be applied in a proteome-wide manner, and discuss how they will help in the ultimate objective of building 3D interactome networks.
Collapse
Affiliation(s)
- Amelie Stein
- Institute for Research in Biomedicine (IRB Barcelona), Joint IRB-BSC Program in Computational Biology, c/Baldiri i Reixac 10-12, 08028 Barcelona, Spain
| | | | | |
Collapse
|
48
|
Shao X, Tan CSH, Voss C, Li SSC, Deng N, Bader GD. A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain-peptide interaction from primary sequence. ACTA ACUST UNITED AC 2010; 27:383-90. [PMID: 21127034 PMCID: PMC3031032 DOI: 10.1093/bioinformatics/btq657] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Motivation: Predicting protein interactions involving peptide recognition domains is essential for understanding the many important biological processes they mediate. It is important to consider the binding strength of these interactions to help us construct more biologically relevant protein interaction networks that consider cellular context and competition between potential binders. Results: We developed a novel regression framework that considers both positive (quantitative) and negative (qualitative) interaction data available for mouse PDZ domains to quantitatively predict interactions between PDZ domains, a large peptide recognition domain family, and their peptide ligands using primary sequence information. First, we show that it is possible to learn from existing quantitative and negative interaction data to infer the relative binding strength of interactions involving previously unseen PDZ domains and/or peptides given their primary sequence. Performance was measured using cross-validated hold out testing and testing with previously unseen PDZ domain–peptide interactions. Second, we find that incorporating negative data improves quantitative interaction prediction. Third, we show that sequence similarity is an important prediction performance determinant, which suggests that experimentally collecting additional quantitative interaction data for underrepresented PDZ domain subfamilies will improve prediction. Availability and Implementation: The Matlab code for our SemiSVR predictor and all data used here are available at http://baderlab.org/Data/PDZAffinity. Contact:gary.bader@utoronto.ca; dengnaiyang@cau.edu.cn Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiaojian Shao
- Department of Applied Mathematics, College of Science, China Agricultural University, Beijing, 100083, China
| | | | | | | | | | | |
Collapse
|
49
|
Stein A, Céol A, Aloy P. 3did: identification and classification of domain-based interactions of known three-dimensional structure. Nucleic Acids Res 2010; 39:D718-23. [PMID: 20965963 PMCID: PMC3013799 DOI: 10.1093/nar/gkq962] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
The database of three-dimensional interacting domains (3did) is a collection of protein interactions for which high-resolution three-dimensional structures are known. 3did exploits the availability of structural data to provide molecular details on interactions between two globular domains as well as novel domain–peptide interactions, derived using a recently published method from our lab. The interface residues are presented for each interaction type individually, plus global domain interfaces at which one or more partners (domains or peptides) bind. The 3did web server at http://3did.irbbarcelona.org visualizes these interfaces along with atomic details of individual interactions using Jmol. The complete contents are also available for download.
Collapse
Affiliation(s)
- Amelie Stein
- Institute for Research in Biomedicine, Join IRB-BSC Program in Computational Biology, 08028 Barcelona, Spain
| | | | | |
Collapse
|