1
|
Yin S, Mi X, Shukla D. Leveraging machine learning models for peptide-protein interaction prediction. RSC Chem Biol 2024; 5:401-417. [PMID: 38725911 PMCID: PMC11078210 DOI: 10.1039/d3cb00208j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 02/07/2024] [Indexed: 05/12/2024] Open
Abstract
Peptides play a pivotal role in a wide range of biological activities through participating in up to 40% protein-protein interactions in cellular processes. They also demonstrate remarkable specificity and efficacy, making them promising candidates for drug development. However, predicting peptide-protein complexes by traditional computational approaches, such as docking and molecular dynamics simulations, still remains a challenge due to high computational cost, flexible nature of peptides, and limited structural information of peptide-protein complexes. In recent years, the surge of available biological data has given rise to the development of an increasing number of machine learning models for predicting peptide-protein interactions. These models offer efficient solutions to address the challenges associated with traditional computational approaches. Furthermore, they offer enhanced accuracy, robustness, and interpretability in their predictive outcomes. This review presents a comprehensive overview of machine learning and deep learning models that have emerged in recent years for the prediction of peptide-protein interactions.
Collapse
Affiliation(s)
- Song Yin
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign Urbana 61801 Illinois USA
| | - Xuenan Mi
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign Urbana IL 61801 USA
| | - Diwakar Shukla
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign Urbana 61801 Illinois USA
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign Urbana IL 61801 USA
- Department of Bioengineering, University of Illinois Urbana-Champaign Urbana IL 61801 USA
| |
Collapse
|
2
|
Shin WH, Kihara D. PL-PatchSurfer3: Improved Structure-Based Virtual Screening for Structure Variation Using 3D Zernike Descriptors. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.22.581511. [PMID: 38464318 PMCID: PMC10925112 DOI: 10.1101/2024.02.22.581511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Structure-based virtual screening (SBVS) is a widely used method in silico drug discovery. It necessitates a receptor structure or binding site to predict the binding pose and fitness of a ligand. Therefore, the performance of the SBVS is affected by the protein conformation. The most frequently used method in SBVS is the protein-ligand docking program, which utilizes atomic distance-based scoring functions. Hence, they are highly prone to sensitivity towards variation in receptor structure, and it is reported that the conformational change significantly drops the performance of the docking program. To address the problem, we have introduced a novel program of SBVS, named PL-PatchSurfer. This program makes use of molecular surface patches and the Zernike descriptor. The surfaces of the pocket and ligand are segmented into several patches by the program. These patches are then mapped with physico-chemical properties such as shape and electrostatic potential before being converted into the Zernike descriptor, which is rotationally invariant. A complementarity between the protein and the ligand is assessed by comparing the descriptors and geometric distribution of the patches in the molecules. A benchmarking study showed that PL-PatchSurfer2 was able to screen active molecules regardless of the receptor structure change with fast speed. However, the program could not achieve high performance for the targets that the hydrogen bonding feature is important such as nuclear hormone receptors. In this paper, we present the newer version of PL-PatchSurfer, PL-PatchSurfer3, which incorporates two new features: a change in the definition of hydrogen bond complementarity and consideration of visibility that contains curvature information of a patch. Our evaluation demonstrates that the new program outperforms its predecessor and other SBVS methods while retaining its characteristic tolerance to receptor structure changes. Interested individuals can access the program at kiharalab.org/plps3.
Collapse
Affiliation(s)
- Woong-Hee Shin
- Department of Biomedical Informatics, Korea University College of Medicine, Seoul, Republic of Korea
| | - Daisuke Kihara
- Department of Biological Science, Purdue University, West Lafayette, IN, USA
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
- Center for Cancer Research, Purdue University, West Lafayette, IN, USA
| |
Collapse
|
3
|
Shin WH, Kumazawa K, Imai K, Hirokawa T, Kihara D. Quantitative comparison of protein-protein interaction interface using physicochemical feature-based descriptors of surface patches. Front Mol Biosci 2023; 10:1110567. [PMID: 36814641 PMCID: PMC9939524 DOI: 10.3389/fmolb.2023.1110567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 01/24/2023] [Indexed: 02/09/2023] Open
Abstract
Driving mechanisms of many biological functions in a cell include physical interactions of proteins. As protein-protein interactions (PPIs) are also important in disease development, protein-protein interactions are highlighted in the pharmaceutical industry as possible therapeutic targets in recent years. To understand the variety of protein-protein interactions in a proteome, it is essential to establish a method that can identify similarity and dissimilarity between protein-protein interactions for inferring the binding of similar molecules, including drugs and other proteins. In this study, we developed a novel method, protein-protein interaction-Surfer, which compares and quantifies similarity of local surface regions of protein-protein interactions. protein-protein interaction-Surfer represents a protein-protein interaction surface with overlapping surface patches, each of which is described with a three-dimensional Zernike descriptor (3DZD), a compact mathematical representation of 3D function. 3DZD captures both the 3D shape and physicochemical properties of the protein surface. The performance of protein-protein interaction-Surfer was benchmarked on datasets of protein-protein interactions, where we were able to show that protein-protein interaction-Surfer finds similar potential drug binding regions that do not share sequence and structure similarity. protein-protein interaction-Surfer is available at https://kiharalab.org/ppi-surfer.
Collapse
Affiliation(s)
- Woong-Hee Shin
- Department of Chemistry Education, Sunchon National University, Suncheon, South Korea,Department of Advanced Components and Materials Engineering, Sunchon National University, Suncheon, South Korea
| | - Keiko Kumazawa
- Pharmaceutical Discovery Research Laboratories, Teijin Pharma Limited, Tokyo, Japan
| | - Kenichiro Imai
- Cellular and Molecular Biotechnology Research Institute, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | - Takatsugu Hirokawa
- Division of Biomedical Science, Faculty of Medicine, University of Tsukuba, Tsukuba, Japan,Transborder Medical Research Center, University of Tsukuba, Tsukuba, Japan
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, United States,Department of Computer Science, Purdue University, West Lafayette, IN, United States,Center for Cancer Research, Purdue University, West Lafayette, IN, United States,*Correspondence: Daisuke Kihara,
| |
Collapse
|
4
|
Olek AT, Rushton PS, Kihara D, Ciesielski P, Aryal UK, Zhang Z, Stauffacher CV, McCann MC, Carpita NC. Essential amino acids in the Plant-Conserved and Class-Specific Regions of cellulose synthases. PLANT PHYSIOLOGY 2023; 191:142-160. [PMID: 36250895 PMCID: PMC9806608 DOI: 10.1093/plphys/kiac479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 09/24/2022] [Indexed: 05/05/2023]
Abstract
The Plant-Conserved Region (P-CR) and the Class-Specific Region (CSR) are two plant-unique sequences in the catalytic core of cellulose synthases (CESAs) for which specific functions have not been established. Here, we used site-directed mutagenesis to replace amino acids and motifs within these sequences predicted to be essential for assembly and function of CESAs. We developed an in vivo method to determine the ability of mutated CesA1 transgenes to complement an Arabidopsis (Arabidopsis thaliana) temperature-sensitive root-swelling1 (rsw1) mutant. Replacement of a Cys residue in the CSR, which blocks dimerization in vitro, rendered the AtCesA1 transgene unable to complement the rsw1 mutation. Examination of the CSR sequences from 33 diverse angiosperm species showed domains of high-sequence conservation in a class-specific manner but with variation in the degrees of disorder, indicating a nonredundant role of the CSR structures in different CESA isoform classes. The Cys residue essential for dimerization was not always located in domains of intrinsic disorder. Expression of AtCesA1 transgene constructs, in which Pro417 and Arg453 were substituted for Ala or Lys in the coiled-coil of the P-CR, were also unable to complement the rsw1 mutation. Despite an expected role for Arg457 in trimerization of CESA proteins, AtCesA1 transgenes with Arg457Ala mutations were able to fully restore the wild-type phenotype in rsw1. Our data support that Cys662 within the CSR and Pro417 and Arg453 within the P-CR of Arabidopsis CESA1 are essential residues for functional synthase complex formation, but our data do not support a specific role for Arg457 in trimerization in native CESA complexes.
Collapse
Affiliation(s)
- Anna T Olek
- Department of Botany and Plant Pathology, Purdue University, West Lafayette, Indiana 47907, USA
| | - Phillip S Rushton
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
- Department of Computer Science, Purdue University, West Lafayette, Indiana 47907, USA
| | - Peter Ciesielski
- Renewable Resources & Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401, USA
| | - Uma K Aryal
- Bindley Biosciences Center, Purdue University, West Lafayette, Indiana 47907, USA
- Department of Comparative Pathobiology, College of Veterinary Medicine, Purdue University, West Lafayette, Indiana 47907, USA
| | - Zicong Zhang
- Department of Computer Science, Purdue University, West Lafayette, Indiana 47907, USA
| | - Cynthia V Stauffacher
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
| | - Maureen C McCann
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
- Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401, USA
| | - Nicholas C Carpita
- Department of Botany and Plant Pathology, Purdue University, West Lafayette, Indiana 47907, USA
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
- Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401, USA
| |
Collapse
|
5
|
Mahtarin R, Islam S, Islam MJ, Ullah MO, Ali MA, Halim MA. Structure and dynamics of membrane protein in SARS-CoV-2. J Biomol Struct Dyn 2022; 40:4725-4738. [PMID: 33353499 PMCID: PMC7784837 DOI: 10.1080/07391102.2020.1861983] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 12/05/2020] [Indexed: 12/15/2022]
Abstract
SARS-CoV-2 membrane (M) protein performs a variety of critical functions in virus infection cycle. However, the expression and purification of membrane protein structure is difficult despite tremendous progress. In this study, the 3 D structure is modeled followed by intensive validation and molecular dynamics simulation. The lack of suitable homologous templates (>30% sequence identities) leads us to construct the membrane protein models using template-free modeling (de novo or ab initio) approach with Robetta and trRosetta servers. Comparing with other model structures, it is evident that trRosetta (TM-score: 0.64; TM region RMSD: 2 Å) can provide the best model than Robetta (TM-score: 0.61; TM region RMSD: 3.3 Å) and I-TASSER (TM-score: 0.45; TM region RMSD: 6.5 Å). 100 ns molecular dynamics simulations are performed on the model structures by incorporating membrane environment. Moreover, secondary structure elements and principal component analysis (PCA) have also been performed on MD simulation data. Finally, trRosetta model is utilized for interpretation and visualization of interacting residues during protein-protein interactions. The common interacting residues including Phe103, Arg107, Met109, Trp110, Arg131, and Glu135 in the C-terminal domain of M protein are identified in membrane-spike and membrane-nucleocapsid protein complexes. The active site residues are also predicted for potential drug and peptide binding. Overall, this study might be helpful to design drugs and peptides against the modeled membrane protein of SARS-CoV-2 to accelerate further investigation. Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Rumana Mahtarin
- Division of Infectious Diseases and Division of Computer Aided Drug Design, The Red-Green Research Centre, BICCB, Tejgaon, Dhaka, Bangladesh
| | - Shafiqul Islam
- Division of Infectious Diseases and Division of Computer Aided Drug Design, The Red-Green Research Centre, BICCB, Tejgaon, Dhaka, Bangladesh
| | - Md. Jahirul Islam
- Division of Infectious Diseases and Division of Computer Aided Drug Design, The Red-Green Research Centre, BICCB, Tejgaon, Dhaka, Bangladesh
| | - M Obayed Ullah
- Division of Infectious Diseases and Division of Computer Aided Drug Design, The Red-Green Research Centre, BICCB, Tejgaon, Dhaka, Bangladesh
| | - Md Ackas Ali
- Division of Infectious Diseases and Division of Computer Aided Drug Design, The Red-Green Research Centre, BICCB, Tejgaon, Dhaka, Bangladesh
| | - Mohammad A. Halim
- Division of Infectious Diseases and Division of Computer Aided Drug Design, The Red-Green Research Centre, BICCB, Tejgaon, Dhaka, Bangladesh
- Department of Physical Sciences, University of Arkansas - Fort Smith, Fort Smith, AR, USA
| |
Collapse
|
6
|
Aderinwale T, Bharadwaj V, Christoffer C, Terashi G, Zhang Z, Jahandideh R, Kagaya Y, Kihara D. Real-time structure search and structure classification for AlphaFold protein models. Commun Biol 2022; 5:316. [PMID: 35383281 PMCID: PMC8983703 DOI: 10.1038/s42003-022-03261-8] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 03/11/2022] [Indexed: 11/17/2022] Open
Abstract
Last year saw a breakthrough in protein structure prediction, where the AlphaFold2 method showed a substantial improvement in the modeling accuracy. Following the software release of AlphaFold2, predicted structures by AlphaFold2 for proteins in 21 species were made publicly available via the AlphaFold Database. Here, to facilitate structural analysis and application of AlphaFold2 models, we provide the infrastructure, 3D-AF-Surfer, which allows real-time structure-based search for the AlphaFold2 models. In 3D-AF-Surfer, structures are represented with 3D Zernike descriptors (3DZD), which is a rotationally invariant, mathematical representation of 3D shapes. We developed a neural network that takes 3DZDs of proteins as input and retrieves proteins of the same fold more accurately than direct comparison of 3DZDs. Using 3D-AF-Surfer, we report structure classifications of AlphaFold2 models and discuss the correlation between confidence levels of AlphaFold2 models and intrinsic disordered regions.
Collapse
Affiliation(s)
- Tunde Aderinwale
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Vijay Bharadwaj
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Charles Christoffer
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Zicong Zhang
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | | | - Yuki Kagaya
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA.
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.
| |
Collapse
|
7
|
Guterres H, Park SJ, Cao Y, Im W. CHARMM-GUI Ligand Designer for Template-Based Virtual Ligand Design in a Binding Site. J Chem Inf Model 2021; 61:5336-5342. [PMID: 34757752 DOI: 10.1021/acs.jcim.1c01156] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Rational drug design involves a task of finding ligands that would bind to a specific target protein. This work presents CHARMM-GUI Ligand Designer that is an intuitive and interactive web-based tool to design virtual ligands that match the shape and chemical features of a given protein binding site. Ligand Designer provides ligand modification capabilities with 3D visualization that allow researchers to modify and redesign virtual ligands while viewing how the protein-ligand interactions are affected. Virtual ligands can also be parameterized for further molecular dynamics (MD) simulations and free energy calculations. Using 8 targets from 8 different protein classes in the directory of useful decoys, enhanced (DUD-E) data set, we show that Ligand Designer can produce similar ligands to the known active ligands in the crystal structures. Ligand Designer also produces stable protein-ligand complex structures when tested using short MD simulations. We expect that Ligand Designer can be a useful and user-friendly tool to design small molecules in any given potential ligand binding site on a protein of interest.
Collapse
Affiliation(s)
- Hugo Guterres
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Sang-Jun Park
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Yiwei Cao
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Wonpil Im
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| |
Collapse
|
8
|
Yang Z, Liu M, Wang B, Wang B. Classification of protein domains based on their three-dimensional shapes (CPD3DS). Synth Syst Biotechnol 2021; 6:224-230. [PMID: 34541344 PMCID: PMC8429105 DOI: 10.1016/j.synbio.2021.08.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 08/23/2021] [Accepted: 08/30/2021] [Indexed: 11/13/2022] Open
Abstract
Protein design has become a powerful method to expand the number of natural proteins and design customized proteins according to demands. Domain-based protein design spares the need to create novel elements from scratch, which makes it a more efficient strategy than scratch-based protein design in designing multi-domain proteins, protein complexes and biomaterials. As the surface shape plays a central role in domain-domain and protein-protein interactions, a global map of the surface shapes of all domains should be very beneficial for domain-based protein design. Therefore, in this study, we characterized the surface shapes of protein domains, collected from CATH and SCOP databases, with their 3D-Zernike descriptors (3DZDs). Then similarities of domain shape features were identified, and all domains were classified accordingly. The preferences of the combinations of domains between different clusters were analyzed in natural proteins from the Protein Data Bank. A user-friendly website, termed CPD3DS, was also developed for storage, retrieval, analyses and visualization of our results. This work not only provides an overall view of protein domain shapes by showing their variety and similarities, but also opens up a new avenue to understand the properties of protein structural domains, and design principles of protein architectures.
Collapse
Affiliation(s)
- Zhaochang Yang
- School of Life Science and Technology, University of Electronic Science and Technology of China, China
| | - Mingkang Liu
- School of Life Science and Technology, University of Electronic Science and Technology of China, China
| | - Bin Wang
- School of Information and Software Engineering, University of Electronic Science and Technology of China, China
| | - Beibei Wang
- School of Life Science and Technology, University of Electronic Science and Technology of China, China.,Centre for Informational Biology, University of Electronic Science and Technology of China, 2006 Xiyuan Road, Chengdu, Sichuan, 611731, China
| |
Collapse
|
9
|
Wardah W, Dehzangi A, Taherzadeh G, Rashid MA, Khan M, Tsunoda T, Sharma A. Predicting protein-peptide binding sites with a deep convolutional neural network. J Theor Biol 2020; 496:110278. [DOI: 10.1016/j.jtbi.2020.110278] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Revised: 04/05/2020] [Accepted: 04/08/2020] [Indexed: 10/24/2022]
|
10
|
Sankar S, Yamaguchi M, Kawabata S, Ponnuraj K. Streptococcus pneumoniae Surface Adhesin PfbA Exhibits Host Specificity by Binding to Human Serum Albumin but Not Bovine, Rabbit and Porcine Serum Albumins. Protein J 2019; 39:1-9. [DOI: 10.1007/s10930-019-09875-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
11
|
New insights in the opening mechanism of the heart-type fatty acid binding protein in its apo form (apo-FABP3). Struct Chem 2019. [DOI: 10.1007/s11224-019-01446-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
12
|
A global map of the protein shape universe. PLoS Comput Biol 2019; 15:e1006969. [PMID: 30978181 PMCID: PMC6481876 DOI: 10.1371/journal.pcbi.1006969] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Revised: 04/24/2019] [Accepted: 03/20/2019] [Indexed: 11/19/2022] Open
Abstract
Proteins are involved in almost all functions in a living cell, and functions of proteins are realized by their tertiary structures. Obtaining a global perspective of the variety and distribution of protein structures lays a foundation for our understanding of the building principle of protein structures. In light of the rapid accumulation of low-resolution structure data from electron tomography and cryo-electron microscopy, here we map and classify three-dimensional (3D) surface shapes of proteins into a similarity space. Surface shapes of proteins were represented with 3D Zernike descriptors, mathematical moment-based invariants, which have previously been demonstrated effective for biomolecular structure similarity search. In addition to single chains of proteins, we have also analyzed the shape space occupied by protein complexes. From the mapping, we have obtained various new insights into the relationship between shapes, main-chain folds, and complex formation. The unique view obtained from shape mapping opens up new ways to understand design principles, functions, and evolution of proteins. Proteins are the major molecules involved in almost all cellular processes. In this work, we present a novel mapping of protein shapes that represents the variety and the similarities of 3D shapes of proteins and their assemblies. This mapping provides various novel insights into protein shapes including determinant factors of protein 3D shapes, which enhance our understanding of the design principles of protein shapes. The mapping will also be a valuable resource for artificial protein design as well as references for classifying medium- to low-resolution protein structure images of determined by cryo-electron microscopy and tomography.
Collapse
|
13
|
Budowski-Tal I, Kolodny R, Mandel-Gutfreund Y. A Novel Geometry-Based Approach to Infer Protein Interface Similarity. Sci Rep 2018; 8:8192. [PMID: 29844500 PMCID: PMC5974305 DOI: 10.1038/s41598-018-26497-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Accepted: 05/10/2018] [Indexed: 11/21/2022] Open
Abstract
The protein interface is key to understand protein function, providing a vital insight on how proteins interact with each other and with other molecules. Over the years, many computational methods to compare protein structures were developed, yet evaluating interface similarity remains a very difficult task. Here, we present PatchBag – a geometry based method for efficient comparison of protein surfaces and interfaces. PatchBag is a Bag-Of-Words approach, which represents complex objects as vectors, enabling to search interface similarity in a highly efficient manner. Using a novel framework for evaluating interface similarity, we show that PatchBag performance is comparable to state-of-the-art alignment-based structural comparison methods. The great advantage of PatchBag is that it does not rely on sequence or fold information, thus enabling to detect similarities between interfaces in unrelated proteins. We propose that PatchBag can contribute to reveal novel evolutionary and functional relationships between protein interfaces.
Collapse
Affiliation(s)
- Inbal Budowski-Tal
- Faculty of Biology, Technion, Israel Institute of Technology, Haifa, 3200003, Israel.,Department of Computer Science, University of Haifa, Mount Carmel, Haifa, 3498838, Israel
| | - Rachel Kolodny
- Department of Computer Science, University of Haifa, Mount Carmel, Haifa, 3498838, Israel.
| | - Yael Mandel-Gutfreund
- Faculty of Biology, Technion, Israel Institute of Technology, Haifa, 3200003, Israel.
| |
Collapse
|
14
|
Abstract
Recent advances in high-throughput structure determination and computational protein structure prediction have significantly enriched the universe of protein structure. However, there is still a large gap between the number of available protein structures and that of proteins with annotated function in high accuracy. Computational structure-based protein function prediction has emerged to reduce this knowledge gap. The identification of a ligand binding site and its structure is critical to the determination of a protein's molecular function. We present a computational methodology for predicting small molecule ligand binding site and ligand structure using G-LoSA, our protein local structure alignment and similarity measurement tool. All the computational procedures described here can be easily implemented using G-LoSA Toolkit, a package of standalone software programs and preprocessed PDB structure libraries. G-LoSA and G-LoSA Toolkit are freely available to academic users at http://compbio.lehigh.edu/GLoSA . We also illustrate a case study to show the potential of our template-based approach harnessing G-LoSA for protein function prediction.
Collapse
Affiliation(s)
- Hui Sun Lee
- Department of Biological Sciences and Bioengineering Program, Lehigh University, Bethlehem, PA, 18015, USA.
| | - Wonpil Im
- Department of Biological Sciences and Bioengineering Program, Lehigh University, Bethlehem, PA, 18015, USA.
| |
Collapse
|
15
|
Shin WH, Kihara D. Virtual Ligand Screening Using PL-PatchSurfer2, a Molecular Surface-Based Protein-Ligand Docking Method. Methods Mol Biol 2018; 1762:105-121. [PMID: 29594770 DOI: 10.1007/978-1-4939-7756-7_7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Virtual screening is a computational technique for predicting a potent binding compound for a receptor protein from a ligand library. It has been a widely used in the drug discovery field to reduce the efforts of medicinal chemists to find hit compounds by experiments.Here, we introduce our novel structure-based virtual screening program, PL-PatchSurfer, which uses molecular surface representation with the three-dimensional Zernike descriptors, which is an effective mathematical representation for identifying physicochemical complementarities between local surfaces of a target protein and a ligand. The advantage of the surface-patch description is its tolerance on a receptor and compound structure variation. PL-PatchSurfer2 achieves higher accuracy on apo form and computationally modeled receptor structures than conventional structure-based virtual screening programs. Thus, PL-PatchSurfer2 opens up an opportunity for targets that do not have their crystal structures. The program is provided as a stand-alone program at http://kiharalab.org/plps2 . We also provide files for two ligand libraries, ChEMBL and ZINC Drug-like.
Collapse
Affiliation(s)
- Woong-Hee Shin
- Department of Biological Science, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Biological Science, Purdue University, West Lafayette, IN, USA. .,Department of Computer Science, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
16
|
Preto J, Gentile F, Winter P, Churchill C, Omar SI, Tuszynski JA. Molecular Dynamics and Related Computational Methods with Applications to Drug Discovery. SPRINGER PROCEEDINGS IN MATHEMATICS & STATISTICS 2018. [DOI: 10.1007/978-3-319-76599-0_14] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
17
|
Han X, Wei Q, Kihara D. Protein 3D Structure and Electron Microscopy Map Retrieval Using 3D-SURFER2.0 and EM-SURFER. ACTA ACUST UNITED AC 2017; 60:3.14.1-3.14.15. [PMID: 29220075 DOI: 10.1002/cpbi.37] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
With the rapid growth in the number of solved protein structures stored in the Protein Data Bank (PDB) and the Electron Microscopy Data Bank (EMDB), it is essential to develop tools to perform real-time structure similarity searches against the entire structure database. Since conventional structure alignment methods need to sample different orientations of proteins in the three-dimensional space, they are time consuming and unsuitable for rapid, real-time database searches. To this end, we have developed 3D-SURFER and EM-SURFER, which utilize 3D Zernike descriptors (3DZD) to conduct high-throughput protein structure comparison, visualization, and analysis. Taking an atomic structure or an electron microscopy map of a protein or a protein complex as input, the 3DZD of a query protein is computed and compared with the 3DZD of all other proteins in PDB or EMDB. In addition, local geometrical characteristics of a query protein can be analyzed using VisGrid and LIGSITECSC in 3D-SURFER. This article describes how to use 3D-SURFER and EM-SURFER to carry out protein surface shape similarity searches, local geometric feature analysis, and interpretation of the search results. © 2017 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Xusi Han
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana
| | - Qing Wei
- Department of Computer Science, Purdue University, West Lafayette, Indiana
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana.,Department of Computer Science, Purdue University, West Lafayette, Indiana
| |
Collapse
|
18
|
Simões T, Lopes D, Dias S, Fernandes F, Pereira J, Jorge J, Bajaj C, Gomes A. Geometric Detection Algorithms for Cavities on Protein Surfaces in Molecular Graphics: A Survey. COMPUTER GRAPHICS FORUM : JOURNAL OF THE EUROPEAN ASSOCIATION FOR COMPUTER GRAPHICS 2017; 36:643-683. [PMID: 29520122 PMCID: PMC5839519 DOI: 10.1111/cgf.13158] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Detecting and analyzing protein cavities provides significant information about active sites for biological processes (e.g., protein-protein or protein-ligand binding) in molecular graphics and modeling. Using the three-dimensional structure of a given protein (i.e., atom types and their locations in 3D) as retrieved from a PDB (Protein Data Bank) file, it is now computationally viable to determine a description of these cavities. Such cavities correspond to pockets, clefts, invaginations, voids, tunnels, channels, and grooves on the surface of a given protein. In this work, we survey the literature on protein cavity computation and classify algorithmic approaches into three categories: evolution-based, energy-based, and geometry-based. Our survey focuses on geometric algorithms, whose taxonomy is extended to include not only sphere-, grid-, and tessellation-based methods, but also surface-based, hybrid geometric, consensus, and time-varying methods. Finally, we detail those techniques that have been customized for GPU (Graphics Processing Unit) computing.
Collapse
Affiliation(s)
- Tiago Simões
- Instituto de Telecomunicações, Portugal
- Universidade da Beira Interior, Portugal
| | | | - Sérgio Dias
- Instituto de Telecomunicações, Portugal
- Universidade da Beira Interior, Portugal
| | | | - João Pereira
- INESC-ID Lisboa, Portugal
- Instituto Superior Técnico, Universidade de Lisboa, Portugal
| | - Joaquim Jorge
- INESC-ID Lisboa, Portugal
- Instituto Superior Técnico, Universidade de Lisboa, Portugal
| | | | - Abel Gomes
- Instituto de Telecomunicações, Portugal
- Universidade da Beira Interior, Portugal
| |
Collapse
|
19
|
Seliman AA, Altaf M, Onawole AT, Ahmad S, Ahmed MY, Al-Saadi AA, Altuwaijri S, Bhatia G, Singh J, Isab AA. Synthesis, X-ray structures and anticancer activity of gold(I)-carbene complexes with selenones as co-ligands and their molecular docking studies with thioredoxin reductase. J Organomet Chem 2017. [DOI: 10.1016/j.jorganchem.2017.07.034] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
|
20
|
Taherzadeh G, Zhou Y, Liew AWC, Yang Y. Structure-based prediction of protein– peptide binding regions using Random Forest. Bioinformatics 2017; 34:477-484. [DOI: 10.1093/bioinformatics/btx614] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2017] [Accepted: 09/25/2017] [Indexed: 11/12/2022] Open
Affiliation(s)
- Ghazaleh Taherzadeh
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, QLD, Australia
| | - Yaoqi Zhou
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, QLD, Australia
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD, Australia
| | - Alan Wee-Chung Liew
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, QLD, Australia
| | - Yuedong Yang
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, QLD, Australia
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD, Australia
- School of Data and Computer Science, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
21
|
Prediction of Local Quality of Protein Structure Models Considering Spatial Neighbors in Graphical Models. Sci Rep 2017; 7:40629. [PMID: 28074879 PMCID: PMC5225430 DOI: 10.1038/srep40629] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2016] [Accepted: 12/08/2016] [Indexed: 12/31/2022] Open
Abstract
Protein tertiary structure prediction methods have matured in recent years. However, some proteins defy accurate prediction due to factors such as inadequate template structures. While existing model quality assessment methods predict global model quality relatively well, there is substantial room for improvement in local quality assessment, i.e. assessment of the error at each residue position in a model. Local quality is a very important information for practical applications of structure models such as interpreting/designing site-directed mutagenesis of proteins. We have developed a novel local quality assessment method for protein tertiary structure models. The method, named Graph-based Model Quality assessment method (GMQ), explicitly considers the predicted quality of spatially neighboring residues using a graph representation of a query protein structure model. GMQ uses conditional random field as its core of the algorithm, and performs a binary prediction of the quality of each residue in a model, indicating if a residue position is likely to be within an error cutoff or not. The accuracy of GMQ was improved by considering larger graphs to include quality information of more surrounding residues. Moreover, we found that using different edge weights in graphs reflecting different secondary structures further improves the accuracy. GMQ showed competitive performance on a benchmark for quality assessment of structure models from the Critical Assessment of Techniques for Protein Structure Prediction (CASP).
Collapse
|
22
|
Faraggi E, Kouza M, Zhou Y, Kloczkowski A. Fast and Accurate Accessible Surface Area Prediction Without a Sequence Profile. Methods Mol Biol 2017; 1484:127-136. [PMID: 27787824 DOI: 10.1007/978-1-4939-6406-2_10] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
A fast accessible surface area (ASA) predictor is presented. In this new approach no residue mutation profiles generated by multiple sequence alignments are used as inputs. Instead, we use only single sequence information and global features such as single-residue and two-residue compositions of the chain. The resulting predictor is both highly more efficient than sequence alignment based predictors and of comparable accuracy to them. Introduction of the global inputs significantly helps achieve this comparable accuracy. The predictor, termed ASAquick, is found to perform similarly well for so-called easy and hard cases indicating generalizability and possible usability for de-novo protein structure prediction. The source code and a Linux executables for ASAquick are available from Research and Information Systems at http://mamiris.com and from the Battelle Center for Mathematical Medicine at http://mathmed.org .
Collapse
Affiliation(s)
- Eshel Faraggi
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46032, USA.,Research and Information Systems, LLC, Indianapolis, IN, USA
| | - Maksim Kouza
- Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | - Yaoqi Zhou
- Institute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| | - Andrzej Kloczkowski
- Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, 700 Children's Drive, Columbu, OH 43205, USA. .,Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA.
| |
Collapse
|
23
|
Zeng L, Shin WH, Zhu X, Park SH, Park C, Tao WA, Kihara D. Discovery of Nicotinamide Adenine Dinucleotide Binding Proteins in the Escherichia coli Proteome Using a Combined Energetic- and Structural-Bioinformatics-Based Approach. J Proteome Res 2016; 16:470-480. [PMID: 28152599 DOI: 10.1021/acs.jproteome.6b00624] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Protein-ligand interaction plays a critical role in regulating the biochemical functions of proteins. Discovering protein targets for ligands is vital to new drug development. Here, we present a strategy that combines experimental and computational approaches to identify ligand-binding proteins in a proteomic scale. For the experimental part, we coupled pulse proteolysis with filter-assisted sample preparation (FASP) and quantitative mass spectrometry. Under denaturing conditions, ligand binding affected protein stability, which resulted in altered protein abundance after pulse proteolysis. For the computational part, we used the software Patch-Surfer2.0. We applied the integrated approach to identify nicotinamide adenine dinucleotide (NAD)-binding proteins in the Escherichia coli proteome, which has over 4200 proteins. Pulse proteolysis and Patch-Surfer2.0 identified 78 and 36 potential NAD-binding proteins, respectively, including 12 proteins that were consistently detected by the two approaches. Interestingly, the 12 proteins included 8 that are not previously known as NAD binders. Further validation of these eight proteins showed that their binding affinities to NAD computed by AutoDock Vina are higher than their cognate ligands and also that their protein ratios in the pulse proteolysis are consistent with known NAD-binding proteins. These results strongly suggest that these eight proteins are indeed newly identified NAD binders.
Collapse
Affiliation(s)
| | | | - Xiaolei Zhu
- School of Life Science, Anhui University , Hefei, Anhui 230601, China
| | - Sung Hoon Park
- Research Institute of Food and Biotechnology, SPC Group , Seoul 06737, South Korea
| | | | | | | |
Collapse
|
24
|
Foster CA, West AH. Use of restrained molecular dynamics to predict the conformations of phosphorylated receiver domains in two-component signaling systems. Proteins 2016; 85:155-176. [PMID: 27802580 PMCID: PMC5242315 DOI: 10.1002/prot.25207] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2016] [Revised: 10/22/2016] [Accepted: 10/25/2016] [Indexed: 01/22/2023]
Abstract
Two‐component signaling (TCS) is the primary means by which bacteria, as well as certain plants and fungi, respond to external stimuli. Signal transduction involves stimulus‐dependent autophosphorylation of a sensor histidine kinase and phosphoryl transfer to the receiver domain of a downstream response regulator. Phosphorylation acts as an allosteric switch, inducing structural and functional changes in the pathway's components. Due to their transient nature, phosphorylated receiver domains are challenging to characterize structurally. In this work, we provide a methodology for simulating receiver domain phosphorylation to predict conformations that are nearly identical to experimental structures. Using restrained molecular dynamics, phosphorylated conformations of receiver domains can be reliably sampled on nanosecond timescales. These simulations also provide data on conformational dynamics that can be used to identify regions of functional significance related to phosphorylation. We first validated this approach on several well‐characterized receiver domains and then used it to compare the upstream and downstream components of the fungal Sln1 phosphorelay. Our results demonstrate that this technique provides structural insight, obtained in the absence of crystallographic or NMR information, regarding phosphorylation‐induced conformational changes in receiver domains that regulate the output of their associated signaling pathway. To our knowledge, this is the first time such a protocol has been described that can be broadly applied to TCS proteins for predictive purposes. Proteins 2016; 85:155–176. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Clay A Foster
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, Oklahoma
| | - Ann H West
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, Oklahoma
| |
Collapse
|
25
|
Saberi Fathi SM. A new definition and properties of the similarity value between two protein structures. J Biol Phys 2016; 42:621-636. [PMID: 27623799 DOI: 10.1007/s10867-016-9429-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 08/12/2016] [Indexed: 12/13/2022] Open
Abstract
Knowledge regarding the 3D structure of a protein provides useful information about the protein's functional properties. Particularly, structural similarity between proteins can be used as a good predictor of functional similarity. One method that uses the 3D geometrical structure of proteins in order to compare them is the similarity value (SV). In this paper, we introduce a new definition of the SV measure for comparing two proteins. To this end, we consider the mass of the protein's atoms and concentrate on the number of protein's atoms to be compared. This defines a new measure, called the weighted similarity value (WSV), adding physical properties to geometrical properties. We also show that our results are in good agreement with the results obtained by TM-SCORE and DALILITE. WSV can be of use in protein classification and in drug discovery.
Collapse
Affiliation(s)
- S M Saberi Fathi
- Department of Physics, Ferdowsi University of Mashhad, Mashhad, 9177948974, Iran.
| |
Collapse
|
26
|
Hsieh SR, Reddy PM, Chang CJ, Kumar A, Wu WC, Lin HY. Exploring the Behavior of Bovine Serum Albumin in Response to Changes in the Chemical Composition of Responsive Polymers: Experimental and Simulation Studies. Polymers (Basel) 2016; 8:E238. [PMID: 30979331 PMCID: PMC6432219 DOI: 10.3390/polym8060238] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Revised: 06/13/2016] [Accepted: 06/13/2016] [Indexed: 01/06/2023] Open
Abstract
Knowledge of the interactions between polymer and protein is very important to fabricate the potential materials for many bio-related applications. In this regard, the present work investigated the effect of copolymers on the conformation and thermal stability of bovine serum albumin (BSA) with the aid of biophysical techniques such as fluorescence spectroscopy, circular dichroism (CD) spectroscopy and differential scanning calorimetry (DSC). In comparison with that of copolymer PGA-1.5, our fluorescence spectroscopy results reveal that the copolymer PGA-1, which has a lower PEGMA/AA ratio, shows greater influence on the conformation of BSA. Copolymers induced unfolding of the polypeptide chain of BSA, which was confirmed from the loss in the negative ellipticity of CD spectra. DSC results showed that the addition of PGA-1 and PGA-1.5 (0.05% (w/v) decreased the transition temperature by 14.8 and 11.5 °C, respectively). The results from the present study on the behavior of protein in response to changes in the chemical composition of synthetic polymers are significant for various biological applications such as enzyme immobilization, protein separations, sensor development and stimuli-responsive systems.
Collapse
Affiliation(s)
- Shih-Rong Hsieh
- Department of Surgery, Taichung Veterans General Hospital, 1650 Taiwan Boulevard Section 4, Taichung 40705, Taiwan.
| | - P Madhusudhana Reddy
- Department of Chemical Engineering, Feng Chia University, 100, Wenhwa Road, Seatwen, Taichung 40724, Taiwan.
| | - Chi-Jung Chang
- Department of Chemical Engineering, Feng Chia University, 100, Wenhwa Road, Seatwen, Taichung 40724, Taiwan.
| | - Awanish Kumar
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Wan-Chi Wu
- Department of Chemical Engineering, Feng Chia University, 100, Wenhwa Road, Seatwen, Taichung 40724, Taiwan.
| | - Hui-Yi Lin
- School of Pharmacy, China Medical University, 91, Hsueh-Shih Road, Taichung 40402, Taiwan.
| |
Collapse
|
27
|
Nurdiansyah R, Rifa'i M, Widodo. A comparative analysis of serum albumin from different species to determine a natural source of albumin that might be useful for human therapy. J Taibah Univ Med Sci 2016. [DOI: 10.1016/j.jtumed.2016.04.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
|
28
|
Savane TS, Kumar S, Janakiraman VN, Kamalanathan AS, Vijayalakshmi MA. Molecular insight in the purification of immunoglobulin by pseudobiospecific ligand l-histidine and histidyl moieties in histidine ligand affinity chromatography (HLAC) by molecular docking. J Chromatogr B Analyt Technol Biomed Life Sci 2016; 1021:129-136. [DOI: 10.1016/j.jchromb.2015.09.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2015] [Accepted: 09/10/2015] [Indexed: 11/30/2022]
|
29
|
Taherzadeh G, Yang Y, Zhang T, Liew AW, Zhou Y. Sequence‐based prediction of protein–peptide binding sites using support vector machine. J Comput Chem 2016; 37:1223-9. [DOI: 10.1002/jcc.24314] [Citation(s) in RCA: 56] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2015] [Accepted: 01/06/2016] [Indexed: 11/06/2022]
Affiliation(s)
- Ghazaleh Taherzadeh
- School of Information and Communication TechnologyGriffith UniversityParklands DriveSouthport Queensland4215 Australia
| | - Yuedong Yang
- School of Information and Communication TechnologyGriffith UniversityParklands DriveSouthport Queensland4215 Australia
- Institute for Glycomics, Griffith UniversityParklands DrSouthport Queensland4215 Australia
| | - Tuo Zhang
- Weill Cornell Medical College1300 York AvenueNew York, New York10065
| | - Alan Wee‐Chung Liew
- School of Information and Communication TechnologyGriffith UniversityParklands DriveSouthport Queensland4215 Australia
| | - Yaoqi Zhou
- School of Information and Communication TechnologyGriffith UniversityParklands DriveSouthport Queensland4215 Australia
- Institute for Glycomics, Griffith UniversityParklands DrSouthport Queensland4215 Australia
| |
Collapse
|
30
|
Shin WH, Bures MG, Kihara D. PatchSurfers: Two methods for local molecular property-based binding ligand prediction. Methods 2016; 93:41-50. [PMID: 26427548 PMCID: PMC4718779 DOI: 10.1016/j.ymeth.2015.09.026] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2015] [Revised: 09/27/2015] [Accepted: 09/28/2015] [Indexed: 01/09/2023] Open
Abstract
Protein function prediction is an active area of research in computational biology. Function prediction can help biologists make hypotheses for characterization of genes and help interpret biological assays, and thus is a productive area for collaboration between experimental and computational biologists. Among various function prediction methods, predicting binding ligand molecules for a target protein is an important class because ligand binding events for a protein are usually closely intertwined with the proteins' biological function, and also because predicted binding ligands can often be directly tested by biochemical assays. Binding ligand prediction methods can be classified into two types: those which are based on protein-protein (or pocket-pocket) comparison, and those that compare a target pocket directly to ligands. Recently, our group proposed two computational binding ligand prediction methods, Patch-Surfer, which is a pocket-pocket comparison method, and PL-PatchSurfer, which compares a pocket to ligand molecules. The two programs apply surface patch-based descriptions to calculate similarity or complementarity between molecules. A surface patch is characterized by physicochemical properties such as shape, hydrophobicity, and electrostatic potentials. These properties on the surface are represented using three-dimensional Zernike descriptors (3DZD), which are based on a series expansion of a 3 dimensional function. Utilizing 3DZD for describing the physicochemical properties has two main advantages: (1) rotational invariance and (2) fast comparison. Here, we introduce Patch-Surfer and PL-PatchSurfer with an emphasis on PL-PatchSurfer, which is more recently developed. Illustrative examples of PL-PatchSurfer performance on binding ligand prediction as well as virtual drug screening are also provided.
Collapse
Affiliation(s)
- Woong-Hee Shin
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | - Mark Gregory Bures
- Discovery Chemistry Research and Technologies, Eli Lilly and Company, Indianapolis, IN 46285, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA; Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA.
| |
Collapse
|
31
|
Computing Discrete Fine-Grained Representations of Protein Surfaces. COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS 2016. [DOI: 10.1007/978-3-319-44332-4_14] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
32
|
Layers: A molecular surface peeling algorithm and its applications to analyze protein structures. Sci Rep 2015; 5:16141. [PMID: 26553411 PMCID: PMC4639851 DOI: 10.1038/srep16141] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Accepted: 10/01/2015] [Indexed: 11/08/2022] Open
Abstract
We present an algorithm 'Layers' to peel the atoms of proteins as layers. Using Layers we show an efficient way to transform protein structures into 2D pattern, named residue transition pattern (RTP), which is independent of molecular orientations. RTP explains the folding patterns of proteins and hence identification of similarity between proteins is simple and reliable using RTP than with the standard sequence or structure based methods. Moreover, Layers generates a fine-tunable coarse model for the molecular surface by using non-random sampling. The coarse model can be used for shape comparison, protein recognition and ligand design. Additionally, Layers can be used to develop biased initial configuration of molecules for protein folding simulations. We have developed a random forest classifier to predict the RTP of a given polypeptide sequence. Layers is a standalone application; however, it can be merged with other applications to reduce the computational load when working with large datasets of protein structures. Layers is available freely at http://www.csb.iitkgp.ernet.in/applications/mol_layers/main.
Collapse
|
33
|
Gowthaman R, Miller SA, Rogers S, Khowsathit J, Lan L, Bai N, Johnson DK, Liu C, Xu L, Anbanandam A, Aubé J, Roy A, Karanicolas J. DARC: Mapping Surface Topography by Ray-Casting for Effective Virtual Screening at Protein Interaction Sites. J Med Chem 2015; 59:4152-70. [PMID: 26126123 DOI: 10.1021/acs.jmedchem.5b00150] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Protein-protein interactions represent an exciting and challenging target class for therapeutic intervention using small molecules. Protein interaction sites are often devoid of the deep surface pockets presented by "traditional" drug targets, and crystal structures reveal that inhibitors typically engage these sites using very shallow binding modes. As a consequence, modern virtual screening tools developed to identify inhibitors of traditional drug targets do not perform as well when they are instead deployed at protein interaction sites. To address the need for novel inhibitors of important protein interactions, here we introduce an alternate docking strategy specifically designed for this regime. Our method, termed DARC (Docking Approach using Ray-Casting), matches the topography of a surface pocket "observed" from within the protein to the topography "observed" when viewing a potential ligand from the same vantage point. We applied DARC to carry out a virtual screen against the protein interaction site of human antiapoptotic protein Mcl-1 and found that four of the top-scoring 21 compounds showed clear inhibition in a biochemical assay. The Ki values for these compounds ranged from 1.2 to 21 μM, and each had ligand efficiency comparable to promising small-molecule inhibitors of other protein-protein interactions. These hit compounds do not resemble the natural (protein) binding partner of Mcl-1, nor do they resemble any known inhibitors of Mcl-1. Our results thus demonstrate the utility of DARC for identifying novel inhibitors of protein-protein interactions.
Collapse
Affiliation(s)
- Ragul Gowthaman
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Sven A Miller
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Steven Rogers
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Jittasak Khowsathit
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Lan Lan
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Nan Bai
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - David K Johnson
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Chunjing Liu
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Liang Xu
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Asokan Anbanandam
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Jeffrey Aubé
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Anuradha Roy
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - John Karanicolas
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| |
Collapse
|
34
|
Johnson DK, Karanicolas J. Selectivity by small-molecule inhibitors of protein interactions can be driven by protein surface fluctuations. PLoS Comput Biol 2015; 11:e1004081. [PMID: 25706586 PMCID: PMC4338137 DOI: 10.1371/journal.pcbi.1004081] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2014] [Accepted: 12/10/2014] [Indexed: 12/13/2022] Open
Abstract
Small-molecules that inhibit interactions between specific pairs of proteins have long represented a promising avenue for therapeutic intervention in a variety of settings. Structural studies have shown that in many cases, the inhibitor-bound protein adopts a conformation that is distinct from its unbound and its protein-bound conformations. This plasticity of the protein surface presents a major challenge in predicting which members of a protein family will be inhibited by a given ligand. Here, we use biased simulations of Bcl-2-family proteins to generate ensembles of low-energy conformations that contain surface pockets suitable for small molecule binding. We find that the resulting conformational ensembles include surface pockets that mimic those observed in inhibitor-bound crystal structures. Next, we find that the ensembles generated using different members of this protein family are overlapping but distinct, and that the activity of a given compound against a particular family member (ligand selectivity) can be predicted from whether the corresponding ensemble samples a complementary surface pocket. Finally, we find that each ensemble includes certain surface pockets that are not shared by any other family member: while no inhibitors have yet been identified to take advantage of these pockets, we expect that chemical scaffolds complementing these “distinct” pockets will prove highly selective for their targets. The opportunity to achieve target selectivity within a protein family by exploiting differences in surface fluctuations represents a new paradigm that may facilitate design of family-selective small-molecule inhibitors of protein-protein interactions. Despite intense interest and considerable effort, there are few examples of small molecules that directly inhibit protein-protein interactions. Crystal structures of early successes have highlighted the plasticity of the protein surface, as some inhibitor-bound proteins are captured in conformations that are distinct from both their unbound and their protein-bound conformations. The lack of a single well-defined structure presents a challenge for predicting the members of a protein family to which a given compound will show activity (ligand selectivity). Here we generate ensembles of conformations from simulation, and show that we can predict ligand selectivity based on which family members sample conformations complementary to the ligand. This approach may present a new avenue for designing highly-selective inhibitors of protein-protein interactions.
Collapse
Affiliation(s)
- David K. Johnson
- Center for Computational Biology, University of Kansas, Lawrence, Kansas, United States of America
| | - John Karanicolas
- Center for Computational Biology, University of Kansas, Lawrence, Kansas, United States of America
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, United States of America
- * E-mail:
| |
Collapse
|
35
|
Abstract
Moonlighting proteins perform multiple independent cellular functions within one polypeptide chain. Moonlighting proteins switch functions depending on various factors including the cell-type in which they are expressed, cellular location, oligomerization status and the binding of different ligands at different sites. Although an increasing number of moonlighting proteins have been experimentally identified in recent years, the quantity of known moonlighting proteins is insufficient to elucidate their overall landscape. Moreover, most moonlighting proteins have been identified as a serendipitous discovery. Hence, characterization of moonlighting proteins using bioinformatics approaches can have a significant impact on the overall understanding of protein function. In this work, we provide a short review of existing computational approaches for illuminating the functional diversity of moonlighting proteins.
Collapse
Affiliation(s)
- Ishita K Khan
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
- Department of Biological Science, Purdue University, West Lafayette, IN, 47907, USA
| |
Collapse
|
36
|
Zhu X, Xiong Y, Kihara D. Large-scale binding ligand prediction by improved patch-based method Patch-Surfer2.0. ACTA ACUST UNITED AC 2014; 31:707-13. [PMID: 25359888 DOI: 10.1093/bioinformatics/btu724] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Ligand binding is a key aspect of the function of many proteins. Thus, binding ligand prediction provides important insight in understanding the biological function of proteins. Binding ligand prediction is also useful for drug design and examining potential drug side effects. RESULTS We present a computational method named Patch-Surfer2.0, which predicts binding ligands for a protein pocket. By representing and comparing pockets at the level of small local surface patches that characterize physicochemical properties of the local regions, the method can identify binding pockets of the same ligand even if they do not share globally similar shapes. Properties of local patches are represented by an efficient mathematical representation, 3D Zernike Descriptor. Patch-Surfer2.0 has significant technical improvements over our previous prototype, which includes a new feature that captures approximate patch position with a geodesic distance histogram. Moreover, we constructed a large comprehensive database of ligand binding pockets that will be searched against by a query. The benchmark shows better performance of Patch-Surfer2.0 over existing methods. AVAILABILITY AND IMPLEMENTATION http://kiharalab.org/patchsurfer2.0/ CONTACT: dkihara@purdue.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiaolei Zhu
- Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA
| | - Yi Xiong
- Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA
| | - Daisuke Kihara
- Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA
| |
Collapse
|
37
|
Faraggi E, Zhou Y, Kloczkowski A. Accurate single-sequence prediction of solvent accessible surface area using local and global features. Proteins 2014; 82:3170-6. [PMID: 25204636 DOI: 10.1002/prot.24682] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2014] [Revised: 08/08/2014] [Accepted: 08/22/2014] [Indexed: 01/04/2023]
Abstract
We present a new approach for predicting the Accessible Surface Area (ASA) using a General Neural Network (GENN). The novelty of the new approach lies in not using residue mutation profiles generated by multiple sequence alignments as descriptive inputs. Instead we use solely sequential window information and global features such as single-residue and two-residue compositions of the chain. The resulting predictor is both highly more efficient than sequence alignment-based predictors and of comparable accuracy to them. Introduction of the global inputs significantly helps achieve this comparable accuracy. The predictor, termed ASAquick, is tested on predicting the ASA of globular proteins and found to perform similarly well for so-called easy and hard cases indicating generalizability and possible usability for de-novo protein structure prediction. The source code and a Linux executables for GENN and ASAquick are available from Research and Information Systems at http://mamiris.com, from the SPARKS Lab at http://sparks-lab.org, and from the Battelle Center for Mathematical Medicine at http://mathmed.org.
Collapse
Affiliation(s)
- Eshel Faraggi
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana, 46202; Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, Columbus, Ohio, 43215; Physics Division, Research and Information Systems, LLC, Carmel, Indiana, 46032
| | | | | |
Collapse
|
38
|
Saberi Fathi SM, Tuszynski JA. A simple method for finding a protein's ligand-binding pockets. BMC STRUCTURAL BIOLOGY 2014; 14:18. [PMID: 25038637 PMCID: PMC4112621 DOI: 10.1186/1472-6807-14-18] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/09/2014] [Accepted: 07/11/2014] [Indexed: 12/03/2022]
Abstract
BACKGROUND This paper provides a simple and rapid method for a protein-clustering strategy. The basic idea implemented here is to use computational geometry methods to predict and characterize ligand-binding pockets of a given protein structure. In addition to geometrical characteristics of the protein structure, we consider some simple biochemical properties that help recognize the best candidates for pockets in a protein's active site. RESULTS Our results are shown to produce good agreement with known empirical results. CONCLUSIONS The method presented in this paper is a low-cost rapid computational method that could be used to classify proteins and other biomolecules, and furthermore could be useful in reducing the cost and time of drug discovery.
Collapse
Affiliation(s)
| | - Jack A Tuszynski
- Department of Physics, University of Alberta, Edmonton, Alberta, Canada
| |
Collapse
|
39
|
Saberi Fathi SM, White DT, Tuszynski JA. Geometrical comparison of two protein structures using Wigner-D functions. Proteins 2014; 82:2756-69. [PMID: 25043646 DOI: 10.1002/prot.24640] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2013] [Revised: 05/20/2014] [Accepted: 06/18/2014] [Indexed: 12/13/2022]
Abstract
In this article, we develop a quantitative comparison method for two arbitrary protein structures. This method uses a root-mean-square deviation characterization and employs a series expansion of the protein's shape function in terms of the Wigner-D functions to define a new criterion, which is called a "similarity value." We further demonstrate that the expansion coefficients for the shape function obtained with the help of the Wigner-D functions correspond to structure factors. Our method addresses the common problem of comparing two proteins with different numbers of atoms. We illustrate it with a worked example.
Collapse
Affiliation(s)
- S M Saberi Fathi
- Department of Physics, Ferdowsi University of Mashhad, Mashhad, Iran
| | | | | |
Collapse
|
40
|
3D-SURFER 2.0: web platform for real-time search and characterization of protein surfaces. Methods Mol Biol 2014; 1137:105-17. [PMID: 24573477 DOI: 10.1007/978-1-4939-0366-5_8] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The increasing number of uncharacterized protein structures necessitates the development of computational approaches for function annotation using the protein tertiary structures. Protein structure database search is the basis of any structure-based functional elucidation of proteins. 3D-SURFER is a web platform for real-time protein surface comparison of a given protein structure against the entire PDB using 3D Zernike descriptors. It can smoothly navigate the protein structure space in real-time from one query structure to another. A major new feature of Release 2.0 is the ability to compare the protein surface of a single chain, a single domain, or a single complex against databases of protein chains, domains, complexes, or a combination of all three in the latest PDB. Additionally, two types of protein structures can now be compared: all-atom-surface and backbone-atom-surface. The server can also accept a batch job for a large number of database searches. Pockets in protein surfaces can be identified by VisGrid and LIGSITE (csc) . The server is available at http://kiharalab.org/3d-surfer/.
Collapse
|
41
|
Local functional descriptors for surface comparison based binding prediction. BMC Bioinformatics 2012; 13:314. [PMID: 23176080 PMCID: PMC3585919 DOI: 10.1186/1471-2105-13-314] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2012] [Accepted: 10/10/2012] [Indexed: 11/10/2022] Open
Abstract
Background Molecular recognition in proteins occurs due to appropriate arrangements of physical, chemical, and geometric properties of an atomic surface. Similar surface regions should create similar binding interfaces. Effective methods for comparing surface regions can be used in identifying similar regions, and to predict interactions without regard to the underlying structural scaffold that creates the surface. Results We present a new descriptor for protein functional surfaces and algorithms for using these descriptors to compare protein surface regions to identify ligand binding interfaces. Our approach uses descriptors of local regions of the surface, and assembles collections of matches to compare larger regions. Our approach uses a variety of physical, chemical, and geometric properties, adaptively weighting these properties as appropriate for different regions of the interface. Our approach builds a classifier based on a training corpus of examples of binding sites of the target ligand. The constructed classifiers can be applied to a query protein providing a probability for each position on the protein that the position is part of a binding interface. We demonstrate the effectiveness of the approach on a number of benchmarks, demonstrating performance that is comparable to the state-of-the-art, with an approach with more generality than these prior methods. Conclusions Local functional descriptors offer a new method for protein surface comparison that is sufficiently flexible to serve in a variety of applications.
Collapse
|
42
|
Lahti JL, Tang GW, Capriotti E, Liu T, Altman RB. Bioinformatics and variability in drug response: a protein structural perspective. J R Soc Interface 2012; 9:1409-37. [PMID: 22552919 DOI: 10.1098/rsif.2011.0843] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Marketed drugs frequently perform worse in clinical practice than in the clinical trials on which their approval is based. Many therapeutic compounds are ineffective for a large subpopulation of patients to whom they are prescribed; worse, a significant fraction of patients experience adverse effects more severe than anticipated. The unacceptable risk-benefit profile for many drugs mandates a paradigm shift towards personalized medicine. However, prior to adoption of patient-specific approaches, it is useful to understand the molecular details underlying variable drug response among diverse patient populations. Over the past decade, progress in structural genomics led to an explosion of available three-dimensional structures of drug target proteins while efforts in pharmacogenetics offered insights into polymorphisms correlated with differential therapeutic outcomes. Together these advances provide the opportunity to examine how altered protein structures arising from genetic differences affect protein-drug interactions and, ultimately, drug response. In this review, we first summarize structural characteristics of protein targets and common mechanisms of drug interactions. Next, we describe the impact of coding mutations on protein structures and drug response. Finally, we highlight tools for analysing protein structures and protein-drug interactions and discuss their application for understanding altered drug responses associated with protein structural variants.
Collapse
Affiliation(s)
- Jennifer L Lahti
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | | | | | | | | |
Collapse
|
43
|
Sael L, Kihara D. Constructing patch-based ligand-binding pocket database for predicting function of proteins. BMC Bioinformatics 2012; 13 Suppl 2:S7. [PMID: 22536870 PMCID: PMC3375630 DOI: 10.1186/1471-2105-13-s2-s7] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Background Many of solved tertiary structures of unknown functions do not have global sequence and structural similarities to proteins of known function. Often functional clues of unknown proteins can be obtained by predicting small ligand molecules that bind to the proteins. Methods In our previous work, we have developed an alignment free local surface-based pocket comparison method, named Patch-Surfer, which predicts ligand molecules that are likely to bind to a protein of interest. Given a query pocket in a protein, Patch-Surfer searches a database of known pockets and finds similar ones to the query. Here, we have extended the database of ligand binding pockets for Patch-Surfer to cover diverse types of binding ligands. Results and conclusion We selected 9393 representative pockets with 2707 different ligand types from the Protein Data Bank. We tested Patch-Surfer on the extended pocket database to predict binding ligand of 75 non-homologous proteins that bind one of seven different ligands. Patch-Surfer achieved the average enrichment factor at 0.1 percent of over 20.0. The results did not depend on the sequence similarity of the query protein to proteins in the database, indicating that Patch-Surfer can identify correct pockets even in the absence of known homologous structures in the database.
Collapse
Affiliation(s)
- Lee Sael
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
44
|
Sael L, Kihara D. Detecting local ligand-binding site similarity in nonhomologous proteins by surface patch comparison. Proteins 2012; 80:1177-95. [PMID: 22275074 DOI: 10.1002/prot.24018] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2011] [Revised: 11/27/2011] [Accepted: 12/13/2011] [Indexed: 11/06/2022]
Abstract
Functional elucidation of proteins is one of the essential tasks in biology. Function of a protein, specifically, small ligand molecules that bind to a protein, can be predicted by finding similar local surface regions in binding sites of known proteins. Here, we developed an alignment free local surface comparison method for predicting a ligand molecule which binds to a query protein. The algorithm, named Patch-Surfer, represents a binding pocket as a combination of segmented surface patches, each of which is characterized by its geometrical shape, the electrostatic potential, the hydrophobicity, and the concaveness. Representing a pocket by a set of patches is effective to absorb difference of global pocket shape while capturing local similarity of pockets. The shape and the physicochemical properties of surface patches are represented using the 3D Zernike descriptor, which is a series expansion of mathematical 3D function. Two pockets are compared using a modified weighted bipartite matching algorithm, which matches similar patches from the two pockets. Patch-Surfer was benchmarked on three datasets, which consist in total of 390 proteins that bind to one of 21 ligands. Patch-Surfer showed superior performance to existing methods including a global pocket comparison method, Pocket-Surfer, which we have previously introduced. Particularly, as intended, the accuracy showed large improvement for flexible ligand molecules, which bind to pockets in different conformations.
Collapse
Affiliation(s)
- Lee Sael
- Department of Computer Science, Purdue University, West Lafayette, Indiana 47907, USA
| | | |
Collapse
|
45
|
Sael L, Chitale M, Kihara D. Structure- and sequence-based function prediction for non-homologous proteins. ACTA ACUST UNITED AC 2012; 13:111-23. [PMID: 22270458 DOI: 10.1007/s10969-012-9126-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2011] [Accepted: 01/10/2012] [Indexed: 01/14/2023]
Abstract
The structural genomics projects have been accumulating an increasing number of protein structures, many of which remain functionally unknown. In parallel effort to experimental methods, computational methods are expected to make a significant contribution for functional elucidation of such proteins. However, conventional computational methods that transfer functions from homologous proteins do not help much for these uncharacterized protein structures because they do not have apparent structural or sequence similarity with the known proteins. Here, we briefly review two avenues of computational function prediction methods, i.e. structure-based methods and sequence-based methods. The focus is on our recent developments of local structure-based and sequence-based methods, which can effectively extract function information from distantly related proteins. Two structure-based methods, Pocket-Surfer and Patch-Surfer, identify similar known ligand binding sites for pocket regions in a query protein without using global protein fold similarity information. Two sequence-based methods, protein function prediction and extended similarity group, make use of weakly similar sequences that are conventionally discarded in homology based function annotation. Combined together with experimental methods we hope that computational methods will make leading contribution in functional elucidation of the protein structures.
Collapse
Affiliation(s)
- Lee Sael
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | | | | |
Collapse
|
46
|
Liu YS, Ramani K, Liu M. Computing the Inner Distances of Volumetric Models for Articulated Shape Description with a Visibility Graph. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2011; 33:2538-2544. [PMID: 21670484 DOI: 10.1109/tpami.2011.116] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
A new visibility graph-based algorithm is presented for computing the inner distances of a 3D shape represented by a volumetric model. The inner distance is defined as the length of the shortest path between landmark points within the shape. The inner distance is robust to articulation and can reflect the deformation of a shape structure well without an explicit decomposition. Our method is based on the visibility graph approach. To check the visibility between pairwise points, we propose a novel, fast, and robust visibility checking algorithm based on a clustering technique which operates directly on the volumetric model without any surface reconstruction procedure, where an octree is used for accelerating the computation. The inner distance can be used as a replacement for other distance measures to build a more accurate description for complex shapes, especially for those with articulated parts. The binary executable program for the Windows platform is available from https://engineering.purdue.edu/PRECISE/VMID.
Collapse
|
47
|
Li Y, Cortés J, Siméon T. Enhancing systematic protein-protein docking methods using ray casting: Application to ATTRACT. Proteins 2011; 79:3037-49. [DOI: 10.1002/prot.23127] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2010] [Revised: 06/20/2011] [Accepted: 07/12/2011] [Indexed: 11/10/2022]
|
48
|
Sael L, Kihara D. Binding ligand prediction for proteins using partial matching of local surface patches. Int J Mol Sci 2010; 11:5009-26. [PMID: 21614188 PMCID: PMC3100846 DOI: 10.3390/ijms11125009] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2010] [Revised: 12/02/2010] [Accepted: 12/03/2010] [Indexed: 11/25/2022] Open
Abstract
Functional elucidation of uncharacterized protein structures is an important task in bioinformatics. We report our new approach for structure-based function prediction which captures local surface features of ligand binding pockets. Function of proteins, specifically, binding ligands of proteins, can be predicted by finding similar local surface regions of known proteins. To enable partial comparison of binding sites in proteins, a weighted bipartite matching algorithm is used to match pairs of surface patches. The surface patches are encoded with the 3D Zernike descriptors. Unlike the existing methods which compare global characteristics of the protein fold or the global pocket shape, the local surface patch method can find functional similarity between non-homologous proteins and binding pockets for flexible ligand molecules. The proposed method improves prediction results over global pocket shape-based method which was previously developed by our group.
Collapse
Affiliation(s)
- Lee Sael
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA; E-Mail:
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA; E-Mail:
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- Markey Center for Structural Biology, Purdue University, West Lafayette, IN 47907, USA
| |
Collapse
|
49
|
Chikhi R, Sael L, Kihara D. Real-time ligand binding pocket database search using local surface descriptors. Proteins 2010; 78:2007-28. [PMID: 20455259 DOI: 10.1002/prot.22715] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Because of the increasing number of structures of unknown function accumulated by ongoing structural genomics projects, there is an urgent need for computational methods for characterizing protein tertiary structures. As functions of many of these proteins are not easily predicted by conventional sequence database searches, a legitimate strategy is to utilize structure information in function characterization. Of particular interest is prediction of ligand binding to a protein, as ligand molecule recognition is a major part of molecular function of proteins. Predicting whether a ligand molecule binds a protein is a complex problem due to the physical nature of protein-ligand interactions and the flexibility of both binding sites and ligand molecules. However, geometric and physicochemical complementarity is observed between the ligand and its binding site in many cases. Therefore, ligand molecules which bind to a local surface site in a protein can be predicted by finding similar local pockets of known binding ligands in the structure database. Here, we present two representations of ligand binding pockets and utilize them for ligand binding prediction by pocket shape comparison. These representations are based on mapping of surface properties of binding pockets, which are compactly described either by the two-dimensional pseudo-Zernike moments or the three-dimensional Zernike descriptors. These compact representations allow a fast real-time pocket searching against a database. Thorough benchmark studies employing two different datasets show that our representations are competitive with the other existing methods. Limitations and potentials of the shape-based methods as well as possible improvements are discussed.
Collapse
Affiliation(s)
- Rayan Chikhi
- Computer Science Department, Ecole Normale Supérieure de Cachan, 94235 Cachan cedex, Britanny, France
| | | | | |
Collapse
|
50
|
Volkamer A, Griewel A, Grombacher T, Rarey M. Analyzing the Topology of Active Sites: On the Prediction of Pockets and Subpockets. J Chem Inf Model 2010; 50:2041-52. [DOI: 10.1021/ci100241y] [Citation(s) in RCA: 122] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Andrea Volkamer
- Research Group for Computational Molecular Design, Bundesstr. 43, 20146 Hamburg, Germany, and Merck KGaA, Frankfurter Str. 250, 64293 Darmstadt, Germany
| | - Axel Griewel
- Research Group for Computational Molecular Design, Bundesstr. 43, 20146 Hamburg, Germany, and Merck KGaA, Frankfurter Str. 250, 64293 Darmstadt, Germany
| | - Thomas Grombacher
- Research Group for Computational Molecular Design, Bundesstr. 43, 20146 Hamburg, Germany, and Merck KGaA, Frankfurter Str. 250, 64293 Darmstadt, Germany
| | - Matthias Rarey
- Research Group for Computational Molecular Design, Bundesstr. 43, 20146 Hamburg, Germany, and Merck KGaA, Frankfurter Str. 250, 64293 Darmstadt, Germany
| |
Collapse
|