201
|
de Moraes FR, Neshich IAP, Mazoni I, Yano IH, Pereira JGC, Salim JA, Jardine JG, Neshich G. Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages. PLoS One 2014; 9:e87107. [PMID: 24489849 PMCID: PMC3904977 DOI: 10.1371/journal.pone.0087107] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2012] [Accepted: 12/22/2013] [Indexed: 11/18/2022] Open
Abstract
Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now integrated into the BlueStar STING suite of programs. Consequently, the prediction of protein-protein interfaces for all proteins available in the PDB is possible through STING_interfaces module, accessible at the following website: (http://www.cbi.cnptia.embrapa.br/SMS/predictions/index.html).
Collapse
Affiliation(s)
- Fábio R. de Moraes
- Biology Institute, University of Campinas, Campinas, São Paulo, Brazil
- Brazilian Agricultural Research Corporation (EMBRAPA), National Center for Agricultural Informatics, Campinas, São Paulo, Brazil
| | - Izabella A. P. Neshich
- Biology Institute, University of Campinas, Campinas, São Paulo, Brazil
- Brazilian Agricultural Research Corporation (EMBRAPA), National Center for Agricultural Informatics, Campinas, São Paulo, Brazil
| | - Ivan Mazoni
- Biology Institute, University of Campinas, Campinas, São Paulo, Brazil
- Brazilian Agricultural Research Corporation (EMBRAPA), National Center for Agricultural Informatics, Campinas, São Paulo, Brazil
| | - Inácio H. Yano
- Brazilian Agricultural Research Corporation (EMBRAPA), National Center for Agricultural Informatics, Campinas, São Paulo, Brazil
| | - José G. C. Pereira
- Biology Institute, University of Campinas, Campinas, São Paulo, Brazil
- Brazilian Agricultural Research Corporation (EMBRAPA), National Center for Agricultural Informatics, Campinas, São Paulo, Brazil
| | - José A. Salim
- School of Electrical and Computer Engineering, University of Campinas, Campinas, São Paulo, Brazil
| | - José G. Jardine
- Brazilian Agricultural Research Corporation (EMBRAPA), National Center for Agricultural Informatics, Campinas, São Paulo, Brazil
| | - Goran Neshich
- Brazilian Agricultural Research Corporation (EMBRAPA), National Center for Agricultural Informatics, Campinas, São Paulo, Brazil
- * E-mail:
| |
Collapse
|
202
|
GUIDOLIN DIEGO, AGNATI LUIGIF, TORTORELLA CINZIA, MARCOLI MANUELA, MAURA GUIDO, ALBERTIN GIOVANNA, FUXE KJELL. Neuroglobin as a regulator of mitochondrial-dependent apoptosis: A bioinformatics analysis. Int J Mol Med 2013; 33:111-6. [DOI: 10.3892/ijmm.2013.1564] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 11/01/2013] [Indexed: 11/05/2022] Open
|
203
|
Structural bioinformatics and protein docking analysis of the molecular chaperone-kinase interactions: towards allosteric inhibition of protein kinases by targeting the hsp90-cdc37 chaperone machinery. Pharmaceuticals (Basel) 2013; 6:1407-28. [PMID: 24287464 PMCID: PMC3854018 DOI: 10.3390/ph6111407] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Revised: 10/30/2013] [Accepted: 11/05/2013] [Indexed: 01/05/2023] Open
Abstract
A fundamental role of the Hsp90-Cdc37 chaperone system in mediating maturation of protein kinase clients and supporting kinase functional activity is essential for the integrity and viability of signaling pathways involved in cell cycle control and organism development. Despite significant advances in understanding structure and function of molecular chaperones, the molecular mechanisms and guiding principles of kinase recruitment to the chaperone system are lacking quantitative characterization. Structural and thermodynamic characterization of Hsp90-Cdc37 binding with protein kinase clients by modern experimental techniques is highly challenging, owing to a transient nature of chaperone-mediated interactions. In this work, we used experimentally-guided protein docking to probe the allosteric nature of the Hsp90-Cdc37 binding with the cyclin-dependent kinase 4 (Cdk4) kinase clients. The results of docking simulations suggest that the kinase recognition and recruitment to the chaperone system may be primarily determined by Cdc37 targeting of the N-terminal kinase lobe. The interactions of Hsp90 with the C-terminal kinase lobe may provide additional "molecular brakes" that can lock (or unlock) kinase from the system during client loading (release) stages. The results of this study support a central role of the Cdc37 chaperone in recognition and recruitment of the kinase clients. Structural analysis may have useful implications in developing strategies for allosteric inhibition of protein kinases by targeting the Hsp90-Cdc37 chaperone machinery.
Collapse
|
204
|
Li L, Huang Y, Xiao Y. How to use not-always-reliable binding site information in protein-protein docking prediction. PLoS One 2013; 8:e75936. [PMID: 24124522 PMCID: PMC3790831 DOI: 10.1371/journal.pone.0075936] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Accepted: 08/22/2013] [Indexed: 11/19/2022] Open
Abstract
In many protein-protein docking algorithms, binding site information is used to help predicting the protein complex structures. Using correct and accurate binding site information can increase protein-protein docking success rate significantly. On the other hand, using wrong binding sites information should lead to a failed prediction, or, at least decrease the success rate. Recently, various successful theoretical methods have been proposed to predict the binding sites of proteins. However, the predicted binding site information is not always reliable, sometimes wrong binding site information could be given. Hence there is a high risk to use the predicted binding site information in current docking algorithms. In this paper, a softly restricting method (SRM) is developed to solve this problem. By utilizing predicted binding site information in a proper way, the SRM algorithm is sensitive to the correct binding site information but insensitive to wrong information, which decreases the risk of using predicted binding site information. This SRM is tested on benchmark 3.0 using purely predicted binding site information. The result shows that when the predicted information is correct, SRM increases the success rate significantly; however, even if the predicted information is completely wrong, SRM only decreases success rate slightly, which indicates that the SRM is suitable for utilizing predicted binding site information.
Collapse
Affiliation(s)
- Lin Li
- Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
- Computational Biophysics and Bioinformatics, Department of Physics, Clemson University, South Carolina, United States of America
| | - Yanzhao Huang
- Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
- * E-mail: (YH); (YX)
| | - Yi Xiao
- Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
- * E-mail: (YH); (YX)
| |
Collapse
|
205
|
Sriwastava BK, Basu S, Maulik U, Plewczynski D. PPIcons: identification of protein-protein interaction sites in selected organisms. J Mol Model 2013; 19:4059-70. [PMID: 23729008 PMCID: PMC3744667 DOI: 10.1007/s00894-013-1886-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Accepted: 05/06/2013] [Indexed: 01/08/2023]
Abstract
The physico-chemical properties of interaction interfaces have a crucial role in characterization of protein-protein interactions (PPI). In silico prediction of participating amino acids helps to identify interface residues for further experimental verification using mutational analysis, or inhibition studies by screening library of ligands against given protein. Given the unbound structure of a protein and the fact that it forms a complex with another known protein, the objective of this work is to identify the residues that are involved in the interaction. We attempt to predict interaction sites in protein complexes using local composition of amino acids together with their physico-chemical characteristics. The local sequence segments (LSS) are dissected from the protein sequences using a sliding window of 21 amino acids. The list of LSSs is passed to the support vector machine (SVM) predictor, which identifies interacting residue pairs considering their inter-atom distances. We have analyzed three different model organisms of Escherichia coli, Saccharomyces Cerevisiae and Homo sapiens, where the numbers of considered hetero-complexes are equal to 40, 123 and 33 respectively. Moreover, the unified multi-organism PPI meta-predictor is also developed under the current work by combining the training databases of above organisms. The PPIcons interface residues prediction method is measured by the area under ROC curve (AUC) equal to 0.82, 0.75, 0.72 and 0.76 for the aforementioned organisms and the meta-predictor respectively.
Collapse
Affiliation(s)
- Brijesh K. Sriwastava
- Department of Computer Science and Engineering, Government College of Engineering and Leather Technology, Kolkata, 700098 India
| | - Subhadip Basu
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032 India
| | - Ujjwal Maulik
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032 India
| | - Dariusz Plewczynski
- Interdisciplinary Centre for Mathematical and Computational Modelling, University of Warsaw, 02-106 Warsaw, Poland
- Department of Physical Chemistry, Faculty of Pharmacy, Medical University of Warsaw, 02-097 Warsaw, Poland
| |
Collapse
|
206
|
Hwang H, Vreven T, Weng Z. Binding interface prediction by combining protein-protein docking results. Proteins 2013; 82:57-66. [PMID: 23836482 DOI: 10.1002/prot.24354] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2013] [Revised: 06/05/2013] [Accepted: 06/17/2013] [Indexed: 11/10/2022]
Abstract
We developed a method called residue contact frequency (RCF), which uses the complex structures generated by the protein-protein docking algorithm ZDOCK to predict interface residues. Unlike interface prediction algorithms that are based on monomers alone, RCF is binding partner specific. We evaluated the performance of RCF using the area under the precision-recall (PR) curve (AUC) on a large protein docking Benchmark. RCF (AUC = 0.44) performed as well as meta-PPISP (AUC = 0.43), which is one of the best monomer-based interface prediction methods. In addition, we test a support vector machine (SVM) to combine RCF with meta-PPISP and another monomer-based interface prediction algorithm Evolutionary Trace to further improve the performance. We found that the SVM that combined RCF and meta-PPISP achieved the best performance (AUC = 0.47). We used RCF to predict the binding interfaces of proteins that can bind to multiple partners and RCF was able to correctly predict interface residues that are unique for the respective binding partners. Furthermore, we found that residues that contributed greatly to binding affinity (hotspot residues) had significantly higher RCF than other residues.
Collapse
Affiliation(s)
- Howook Hwang
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
| | | | | |
Collapse
|
207
|
Sandler I, Medalia O, Aharoni A. Experimental analysis of co-evolution within protein complexes: the yeast exosome as a model. Proteins 2013; 81:1997-2006. [PMID: 23852635 DOI: 10.1002/prot.24360] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Revised: 06/12/2013] [Accepted: 06/25/2013] [Indexed: 11/07/2022]
Abstract
Extensive bioinformatics analysis suggests that the stability and function of protein complexes are maintained throughout evolution by coordinated changes (co-evolution) of complex subunits. Yet, relatively little is known regarding the actual dynamics of such processes and the functional implications of co-evolution within protein complexes, since most of the bioinformatics predictions were not analyzed experimentally. Here, we describe a systematic experimental approach that allows a step-by-step observation of the co-evolution process in protein complexes. The exosome complex, an essential complex exhibiting a 3'→5' RNA degradation activity, served as a model system. In this study, we show that exosome subunits diverged very early during fungal evolution. Interestingly, we found that despite significant differences in conservation between Rrp41 and Mtr3 both subunits exhibit similar divergence pattern and co-evolutionary behavior through fungi evolution. Activity analysis of mutated exosomes exposes another layer of co-evolution between the core subunits and RNA substrates. Overall, our approach allows the experimental analysis of co-evolution within protein complexes and together with bioinformatics analysis can significantly deepen our understanding of the evolution of these complexes.
Collapse
Affiliation(s)
- Inga Sandler
- Departments of Life Sciences, Ben-Gurion University of the Negev, Be'er Sheva, 84105, Israel
| | | | | |
Collapse
|
208
|
Nolan K, Kattamuri C, Luedeke DM, Deng A, Jagpal A, Zhang F, Linhardt R, Kenny AP, Zorn AM, Thompson TB. Structure of protein related to Dan and Cerberus: insights into the mechanism of bone morphogenetic protein antagonism. Structure 2013; 21:1417-29. [PMID: 23850456 PMCID: PMC3749838 DOI: 10.1016/j.str.2013.06.005] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2013] [Revised: 05/31/2013] [Accepted: 06/09/2013] [Indexed: 11/26/2022]
Abstract
The bone morphogenetic proteins (BMPs) are secreted ligands largely known for their functional roles in embryogenesis and tissue development. A number of structurally diverse extracellular antagonists inhibit BMP ligands to regulate signaling. The differential screening-selected gene aberrative in neuroblastoma (DAN) family of antagonists represents the largest group of BMP inhibitors; however, little is known of how they mechanistically inhibit BMP ligands. Here, we present the structure of the DAN family member, protein related to Dan and Cerberus (PRDC), solved by X-ray crystallography. The structure reveals a growth factor-like appearance with an unexpected dimerization mechanism that is formed through extensive β strand contacts. Using site-directed mutagenesis coupled with in vitro and in vivo activity assays, we identified a BMP-binding epitope on PRDC. We also determined that PRDC binds heparin with high affinity and that heparin binding to PRDC interferes with BMP antagonism. These results offer insight for how DAN family antagonists functionally inhibit BMP ligands.
Collapse
Affiliation(s)
- Kristof Nolan
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati Medical Sciences Building, Cincinnati, OH 45267, USA
| | - Chandramohan Kattamuri
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati Medical Sciences Building, Cincinnati, OH 45267, USA
| | - David M. Luedeke
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati Medical Sciences Building, Cincinnati, OH 45267, USA
| | - Andy Deng
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati Medical Sciences Building, Cincinnati, OH 45267, USA
| | - Amrita Jagpal
- Division of Developmental Biology, Cincinnati Children’s Research Foundation and Department of Pediatrics College of Medicine University of Cincinnati, 3333 Burnet Avenue, Cincinnati, OH 45229, USA
| | - Fuming Zhang
- Department of Chemical and Biological Engineering and Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, New York 12180, USA
| | - Robert Linhardt
- Department of Chemical and Biological Engineering and Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, New York 12180, USA
- Departments of Biology and Biomedical Engineering, Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, New York 12180, USA
| | - Alan P. Kenny
- Division of Developmental Biology, Cincinnati Children’s Research Foundation and Department of Pediatrics College of Medicine University of Cincinnati, 3333 Burnet Avenue, Cincinnati, OH 45229, USA
| | - Aaron M. Zorn
- Division of Developmental Biology, Cincinnati Children’s Research Foundation and Department of Pediatrics College of Medicine University of Cincinnati, 3333 Burnet Avenue, Cincinnati, OH 45229, USA
| | - Thomas B. Thompson
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati Medical Sciences Building, Cincinnati, OH 45267, USA
| |
Collapse
|
209
|
Mukherjee K, Abhipriya, Vidyarthi AS, Pandey DM. SVM based model generation for binding site prediction on helix turn helix motif type of transcription factors in eukaryotes. Bioinformation 2013; 9:500-5. [PMID: 23861565 PMCID: PMC3705624 DOI: 10.6026/97320630009500] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 05/17/2013] [Indexed: 12/02/2022] Open
Abstract
Support vector machine is a class of machine learning algorithms which uses a set of related supervised learning methods
for classification and regression. Nowadays this method is vividly applied to many detection problems related with secondary
structure, tumor cell and binding residue prediction. In this work, support vector machines (SVMs) have been trained on 90
sequences of transcription factors with HTH motif. Four sequence features were used as attribute for the prediction of interaction
site in HTH motif. A web page was also developed so that user can easily enter the protein sequence and receive the output as
interaction site predicted or not predicted. The generated model shows a very high amount of accuracy, sensitivity and specificity
which proves to be a good model for the selected case.
Collapse
Affiliation(s)
- Koel Mukherjee
- Department of Biotechnology, Birla Institute of Technology, Mesra, Ranchi-835 215, Jharkhand, India
| | | | | | | |
Collapse
|
210
|
Mirabello C, Pollastri G. Porter, PaleAle 4.0: high-accuracy prediction of protein secondary structure and relative solvent accessibility. Bioinformatics 2013; 29:2056-8. [DOI: 10.1093/bioinformatics/btt344] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
211
|
|
212
|
Baltoumas FA, Theodoropoulou MC, Hamodrakas SJ. Interactions of the α-subunits of heterotrimeric G-proteins with GPCRs, effectors and RGS proteins: A critical review and analysis of interacting surfaces, conformational shifts, structural diversity and electrostatic potentials. J Struct Biol 2013; 182:209-18. [DOI: 10.1016/j.jsb.2013.03.004] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2012] [Revised: 03/06/2013] [Accepted: 03/11/2013] [Indexed: 01/05/2023]
|
213
|
Defining the Escherichia coli SecA dimer interface residues through in vivo site-specific photo-cross-linking. J Bacteriol 2013; 195:2817-25. [PMID: 23585536 DOI: 10.1128/jb.02269-12] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The motor protein SecA is a core component of the bacterial general secretory (Sec) pathway and is essential for cell viability. Despite evidence showing that SecA exists in a dynamic monomer-dimer equilibrium favoring the dimeric form in solution and in the cytoplasm, there is considerable debate as to the quaternary structural organization of the SecA dimer. Here, a site-directed photo-cross-linking technique was utilized to identify residues on the Escherichia coli SecA (ecSecA) dimer interface in the cytosol of intact cells. The feasibility of this method was demonstrated with residue Leu6, which is essential for ecSecA dimerization based on our analytical ultracentrifugation studies of SecA L6A and shown to form the cross-linked SecA dimer in vivo with p-benzoyl-phenylalanine (pBpa) substituted at position 6. Subsequently, the amino terminus (residues 2 to 11) in the nucleotide binding domain (NBD), Phe263 in the preprotein binding domain (PBD), and Tyr794 and Arg805 in the intramolecular regulator of the ATPase 1 domain (IRA1) were identified to be involved in ecSecA dimerization. Furthermore, the incorporation of pBpa at position 805 did not form a cross-linked dimer in the SecA Δ2-11 context, indicating the possibility that the amino terminus may directly contact Arg805 or that the deletion of residues 2 to 11 alters the topology of the naturally occurring ecSecA dimer.
Collapse
|
214
|
Chang CM, Huang YW, Shih CH, Hwang JK. On the relationship between the sequence conservation and the packing density profiles of the protein complexes. Proteins 2013; 81:1192-9. [DOI: 10.1002/prot.24268] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2012] [Revised: 01/29/2013] [Accepted: 02/01/2013] [Indexed: 11/12/2022]
Affiliation(s)
- Chih-Min Chang
- Institute of Bioinformatics and Systems Biology; National Chiao Tung University; HsinChu 30050; Taiwan; Republic of China
| | - Yu-Wen Huang
- Institute of Bioinformatics and Systems Biology; National Chiao Tung University; HsinChu 30050; Taiwan; Republic of China
| | - Chien-Hua Shih
- Institute of Bioinformatics and Systems Biology; National Chiao Tung University; HsinChu 30050; Taiwan; Republic of China
| | - Jenn-Kang Hwang
- Institute of Bioinformatics and Systems Biology; National Chiao Tung University; HsinChu 30050; Taiwan; Republic of China
| |
Collapse
|
215
|
Boyen P, Neven F, van Dyck D, Valentim FL, van Dijk ADJ. Mining minimal motif pair sets maximally covering interactions in a protein-protein interaction network. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2013; 10:73-86. [PMID: 23702545 DOI: 10.1109/tcbb.2012.165] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Correlated motif covering (CMC) is the problem of finding a set of motif pairs, i.e., pairs of patterns, in the sequences of proteins from a protein-protein interaction network (PPI-network) that describe the interactions in the network as concisely as possible. In other words, a perfect solution for CMC would be a minimal set of motif pairs that describes the interaction behavior perfectly in the sense that two proteins from the network interact if and only if their sequences match a motif pair in the minimal set. In this paper, we introduce and formally define CMC and show that it is closely related to the red-blue set cover (RBSC) problem and its weighted version (WRBSC)--both well-known NP-hard problems for that there exist several algorithms with known approximation factor guarantees. We prove the hardness of approximation of CMC by providing an approximation factor preserving reduction from RBSC to CMC. We show the existence of a theoretical approximation algorithm for CMC by providing an approximation factor preserving reduction from CMC to WRBSC. We adapt the latter algorithm into a functional heuristic for CMC, called CMC-approx, and experimentally assess its performance and biological relevance. The implementation in Java can be found at >http://bioinformatics.uhasselt.be.
Collapse
Affiliation(s)
- Peter Boyen
- Hasselt University and Transnational University of Limburg, Agoralaan, Diepenbeek, Belgium.
| | | | | | | | | |
Collapse
|
216
|
Zhou W, Yan H. Alpha shape and Delaunay triangulation in studies of protein-related interactions. Brief Bioinform 2012. [PMID: 23193202 DOI: 10.1093/bib/bbs077] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
In recent years, more 3D protein structures have become available, which has made the analysis of large molecular structures much easier. There is a strong demand for geometric models for the study of protein-related interactions. Alpha shape and Delaunay triangulation are powerful tools to represent protein structures and have advantages in characterizing the surface curvature and atom contacts. This review presents state-of-the-art applications of alpha shape and Delaunay triangulation in the studies on protein-DNA, protein-protein, protein-ligand interactions and protein structure analysis.
Collapse
Affiliation(s)
- Weiqiang Zhou
- Department of Electronic Engineering, City University of Hong Kong, Tat Chee Avenue 83, Hong Kong.
| | | |
Collapse
|
217
|
Guo F, Li SC, Wang L, Zhu D. Protein-protein binding site identification by enumerating the configurations. BMC Bioinformatics 2012; 13:158. [PMID: 22768846 PMCID: PMC3478195 DOI: 10.1186/1471-2105-13-158] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 06/15/2012] [Indexed: 11/10/2022] Open
Abstract
Background The ability to predict protein-protein binding sites has a wide range of applications, including signal transduction studies, de novo drug design, structure identification and comparison of functional sites. The interface in a complex involves two structurally matched protein subunits, and the binding sites can be predicted by identifying structural matches at protein surfaces. Results We propose a method which enumerates “all” the configurations (or poses) between two proteins (3D coordinates of the two subunits in a complex) and evaluates each configuration by the interaction between its components using the Atomic Contact Energy function. The enumeration is achieved efficiently by exploring a set of rigid transformations. Our approach incorporates a surface identification technique and a method for avoiding clashes of two subunits when computing rigid transformations. When the optimal transformations according to the Atomic Contact Energy function are identified, the corresponding binding sites are given as predictions. Our results show that this approach consistently performs better than other methods in binding site identification. Conclusions Our method achieved a success rate higher than other methods, with the prediction quality improved in terms of both accuracy and coverage. Moreover, our method is being able to predict the configurations of two binding proteins, where most of other methods predict only the binding sites. The software package is available at
http://sites.google.com/site/guofeics/dobi for non-commercial use.
Collapse
Affiliation(s)
- Fei Guo
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| | | | | | | |
Collapse
|
218
|
Häfner AK, Cernescu M, Hofmann B, Ermisch M, Hörnig M, Metzner J, Schneider G, Brutschy B, Steinhilber D. Dimerization of human 5-lipoxygenase. Biol Chem 2012; 392:1097-111. [PMID: 22050225 DOI: 10.1515/bc.2011.200] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Human 5-lipoxygenase (5-LO) can form dimers as shown here via native gel electrophoresis, gel filtration chromatography and LILBID (laser induced liquid bead ion desorption) mass spectrometry. After glutathionylation of 5-LO by diamide/glutathione treatment, dimeric 5-LO was no longer detectable and 5-LO almost exclusively exists in the monomeric form which showed full catalytic activity. Incubation of 5-LO with diamide alone led to a disulfide-bridged dimer and to oligomer formation which displays a strongly reduced catalytic activity. The bioinformatic analysis of the 5-LO surface for putative protein-protein interaction domains and molecular modeling of the dimer interface suggests a head to tail orientation of the dimer which also explains the localization of previously reported ATP binding sites. This interface domain was confirmed by the observation that 5-LO dimer formation and inhibition of activity by diamide was largely prevented when four cysteines (C159S, C300S, C416S, C418S) in this domain were mutated to serines.
Collapse
Affiliation(s)
- Ann-Kathrin Häfner
- Institute of Pharmaceutical Chemistry/ZAFES, University of Frankfurt, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
219
|
Chen P, Wong L, Li J. Detection of outlier residues for improving interface prediction in protein heterocomplexes. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1155-1165. [PMID: 22529331 DOI: 10.1109/tcbb.2012.58] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Sequence-based understanding and identification of protein binding interfaces is a challenging research topic due to the complexity in protein systems and the imbalanced distribution between interface and noninterface residues. This paper presents an outlier detection idea to address the redundancy problem in protein interaction data. The cleaned training data are then used for improving the prediction performance. We use three novel measures to describe the extent a residue is considered as an outlier in comparison to the other residues: the distance of a residue instance from the center instance of all residue instances of the same class label (Dist), the probability of the class label of the residue instance (PCL), and the importance of within-class and between-class (IWB) residue instances. Outlier scores are computed by integrating the three factors; instances with a sufficiently large score are treated as outliers and removed. The data sets without outliers are taken as input for a support vector machine (SVM) ensemble. The proposed SVM ensemble trained on input data without outliers performs better than that with outliers. Our method is also more accurate than many literature methods on benchmark data sets. From our empirical studies, we found that some outlier interface residues are truly near to noninterface regions, and some outlier noninterface residues are close to interface regions.
Collapse
Affiliation(s)
- Peng Chen
- Institute of Intelligent Machines, Chinese Academy of Sciences, PO Box 1130, Hefei 230031, China.
| | | | | |
Collapse
|
220
|
Chen CT, Peng HP, Jian JW, Tsai KC, Chang JY, Yang EW, Chen JB, Ho SY, Hsu WL, Yang AS. Protein-protein interaction site predictions with three-dimensional probability distributions of interacting atoms on protein surfaces. PLoS One 2012; 7:e37706. [PMID: 22701576 PMCID: PMC3368894 DOI: 10.1371/journal.pone.0037706] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2011] [Accepted: 04/23/2012] [Indexed: 11/18/2022] Open
Abstract
Protein-protein interactions are key to many biological processes. Computational methodologies devised to predict protein-protein interaction (PPI) sites on protein surfaces are important tools in providing insights into the biological functions of proteins and in developing therapeutics targeting the protein-protein interaction sites. One of the general features of PPI sites is that the core regions from the two interacting protein surfaces are complementary to each other, similar to the interior of proteins in packing density and in the physicochemical nature of the amino acid composition. In this work, we simulated the physicochemical complementarities by constructing three-dimensional probability density maps of non-covalent interacting atoms on the protein surfaces. The interacting probabilities were derived from the interior of known structures. Machine learning algorithms were applied to learn the characteristic patterns of the probability density maps specific to the PPI sites. The trained predictors for PPI sites were cross-validated with the training cases (consisting of 432 proteins) and were tested on an independent dataset (consisting of 142 proteins). The residue-based Matthews correlation coefficient for the independent test set was 0.423; the accuracy, precision, sensitivity, specificity were 0.753, 0.519, 0.677, and 0.779 respectively. The benchmark results indicate that the optimized machine learning models are among the best predictors in identifying PPI sites on protein surfaces. In particular, the PPI site prediction accuracy increases with increasing size of the PPI site and with increasing hydrophobicity in amino acid composition of the PPI interface; the core interface regions are more likely to be recognized with high prediction confidence. The results indicate that the physicochemical complementarity patterns on protein surfaces are important determinants in PPIs, and a substantial portion of the PPI sites can be predicted correctly with the physicochemical complementarity features based on the non-covalent interaction data derived from protein interiors.
Collapse
Affiliation(s)
- Ching-Tai Chen
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
- Institute of Bioinformatics and Systems Biology, National Chiao-Tung University, Hsinchu, Taiwan
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Hung-Pin Peng
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
- Institute of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Jhih-Wei Jian
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
- Institute of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | | | - Jeng-Yih Chang
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | - Ei-Wen Yang
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Jun-Bo Chen
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
- Department of Computer Science, National Tsing-Hua University, Hsinchu, Taiwan
| | - Shinn-Ying Ho
- Institute of Bioinformatics and Systems Biology, National Chiao-Tung University, Hsinchu, Taiwan
| | - Wen-Lian Hsu
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
- * E-mail: (AY); (WH)
| | - An-Suei Yang
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
- * E-mail: (AY); (WH)
| |
Collapse
|
221
|
Shih CH, Chang CM, Lin YS, Lo WC, Hwang JK. Evolutionary information hidden in a single protein structure. Proteins 2012; 80:1647-57. [DOI: 10.1002/prot.24058] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2011] [Revised: 02/07/2012] [Accepted: 02/12/2012] [Indexed: 11/07/2022]
|
222
|
Jordan RA, EL-Manzalawy Y, Dobbs D, Honavar V. Predicting protein-protein interface residues using local surface structural similarity. BMC Bioinformatics 2012; 13:41. [PMID: 22424103 PMCID: PMC3386866 DOI: 10.1186/1471-2105-13-41] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2011] [Accepted: 03/18/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Identification of the residues in protein-protein interaction sites has a significant impact in problems such as drug discovery. Motivated by the observation that the set of interface residues of a protein tend to be conserved even among remote structural homologs, we introduce PrISE, a family of local structural similarity-based computational methods for predicting protein-protein interface residues. RESULTS We present a novel representation of the surface residues of a protein in the form of structural elements. Each structural element consists of a central residue and its surface neighbors. The PrISE family of interface prediction methods uses a representation of structural elements that captures the atomic composition and accessible surface area of the residues that make up each structural element. Each of the members of the PrISE methods identifies for each structural element in the query protein, a collection of similar structural elements in its repository of structural elements and weights them according to their similarity with the structural element of the query protein. PrISEL relies on the similarity between structural elements (i.e. local structural similarity). PrISEG relies on the similarity between protein surfaces (i.e. general structural similarity). PrISEC, combines local structural similarity and general structural similarity to predict interface residues. These predictors label the central residue of a structural element in a query protein as an interface residue if a weighted majority of the structural elements that are similar to it are interface residues, and as a non-interface residue otherwise. The results of our experiments using three representative benchmark datasets show that the PrISEC outperforms PrISEL and PrISEG; and that PrISEC is highly competitive with state-of-the-art structure-based methods for predicting protein-protein interface residues. Our comparison of PrISEC with PredUs, a recently developed method for predicting interface residues of a query protein based on the known interface residues of its (global) structural homologs, shows that performance superior or comparable to that of PredUs can be obtained using only local surface structural similarity. PrISEC is available as a Web server at http://prise.cs.iastate.edu/ CONCLUSIONS Local surface structural similarity based methods offer a simple, efficient, and effective approach to predict protein-protein interface residues.
Collapse
Affiliation(s)
- Rafael A Jordan
- Department of Computer Science, Iowa State University, Ames, IA 50011, USA
- Department of Systems and Computer Engineering, Pontificia Universidad Javeriana, Cali, Colombia
| | - Yasser EL-Manzalawy
- Department of Computer Science, Iowa State University, Ames, IA 50011, USA
- Department of Systems and Computer Engineering, Al-Azhar University, Cairo, Egypt
| | - Drena Dobbs
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50011, USA
- Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50011, USA
| | - Vasant Honavar
- Department of Computer Science, Iowa State University, Ames, IA 50011, USA
- Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50011, USA
| |
Collapse
|
223
|
Biesiada J, Porollo A, Velayutham P, Kouril M, Meller J. Survey of public domain software for docking simulations and virtual screening. Hum Genomics 2012; 5:497-505. [PMID: 21807604 PMCID: PMC3525969 DOI: 10.1186/1479-7364-5-5-497] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Progress in functional genomics and structural studies on biological macromolecules are generating a growing number of potential targets for therapeutics, adding to the importance of computational approaches for small molecule docking and virtual screening of candidate compounds. In this review, recent improvements in several public domain packages that are widely used in the context of drug development, including DOCK, AutoDock, AutoDock Vina and Screening for Ligands by Induced-fit Docking Efficiently (SLIDE) are surveyed. The authors also survey methods for the analysis and visualisation of docking simulations, as an important step in the overall assessment of the results. In order to illustrate the performance and limitations of current docking programs, the authors used the National Center for Toxicological Research (NCTR) oestrogen receptor benchmark set of 232 oestrogenic compounds with experimentally measured strength of binding to oestrogen receptor alpha. The methods tested here yielded a correlation coefficient of up to 0.6 between the predicted and observed binding affinities for active compounds in this benchmark.
Collapse
Affiliation(s)
- Jacek Biesiada
- Biomedical Informatics, Children's Hospital Research Foundation, Cincinnati, OH 45229, USA
| | | | | | | | | |
Collapse
|
224
|
Li B, Kihara D. Protein docking prediction using predicted protein-protein interface. BMC Bioinformatics 2012; 13:7. [PMID: 22233443 PMCID: PMC3287255 DOI: 10.1186/1471-2105-13-7] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2011] [Accepted: 01/10/2012] [Indexed: 11/10/2022] Open
Abstract
Background Many important cellular processes are carried out by protein complexes. To provide physical pictures of interacting proteins, many computational protein-protein prediction methods have been developed in the past. However, it is still difficult to identify the correct docking complex structure within top ranks among alternative conformations. Results We present a novel protein docking algorithm that utilizes imperfect protein-protein binding interface prediction for guiding protein docking. Since the accuracy of protein binding site prediction varies depending on cases, the challenge is to develop a method which does not deteriorate but improves docking results by using a binding site prediction which may not be 100% accurate. The algorithm, named PI-LZerD (using Predicted Interface with Local 3D Zernike descriptor-based Docking algorithm), is based on a pair wise protein docking prediction algorithm, LZerD, which we have developed earlier. PI-LZerD starts from performing docking prediction using the provided protein-protein binding interface prediction as constraints, which is followed by the second round of docking with updated docking interface information to further improve docking conformation. Benchmark results on bound and unbound cases show that PI-LZerD consistently improves the docking prediction accuracy as compared with docking without using binding site prediction or using the binding site prediction as post-filtering. Conclusion We have developed PI-LZerD, a pairwise docking algorithm, which uses imperfect protein-protein binding interface prediction to improve docking accuracy. PI-LZerD consistently showed better prediction accuracy over alternative methods in the series of benchmark experiments including docking using actual docking interface site predictions as well as unbound docking cases.
Collapse
Affiliation(s)
- Bin Li
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
225
|
Mullins JGL. Structural modelling pipelines in next generation sequencing projects. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2012; 89:117-67. [PMID: 23046884 DOI: 10.1016/b978-0-12-394287-6.00005-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Our capacity to reliably predict protein structure from sequence is steadily improving due to the increased numbers and better targeting of protein structures being experimentally determined by structural genomics projects, along with the development of better modeling methodologies. Template-based (homology) modeling and de novo modeling methods are being combined to fill in remaining gaps in template coverage, and powerful automated structural modeling pipelines are being applied to large data sets of protein sequences. The improved quality of 3D models of proteins has led to their routine use in assessing the functional impact of nonsynonymous single nucleotide polymorphisms (nsSNPs) in specific protein systems, with the development of approaches that may be applied in a predictive fashion to nsSNPs emerging from next-generation sequencing projects. The challenges encountered in deriving functionally meaningful deductions from structural modeling can be quite different for proteins of different protein functional classes. The specific challenges to the assessment of the structural and functional impact of nsSNPs in globular proteins such as binding and regulatory proteins, structural proteins, and enzymes are discussed, as well as membrane transport proteins and ion channels. The mapping of reliable predictions of the structural and functional impact of SNPs, generated from automated modeling pipelines, on to protein-protein interaction networks will facilitate new approaches to understanding complex polygenic disorders and predisposition to disease.
Collapse
Affiliation(s)
- Jonathan G L Mullins
- Genome and Structural Bioinformatics, Institute of Life Science, College of Medicine, Swansea University, Singleton Park, Swansea, Wales, UK.
| |
Collapse
|
226
|
Biesiada J, Porollo A, Meller J. On setting up and assessing docking simulations for virtual screening. Methods Mol Biol 2012; 928:1-16. [PMID: 22956129 DOI: 10.1007/978-1-62703-008-3_1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Small molecule docking and virtual screening of candidate compounds have become an integral part of drug discovery pipelines, complementing and streamlining experimental efforts in that regard. In this chapter, we describe specific software packages and protocols that can be used to efficiently set up a computational screening using a library of compounds and a docking program. We also discuss consensus- and clustering-based approaches that can be used to assess the results, and potentially re-rank the hits. While docking programs share many common features, they may require tailored implementation of virtual screening pipelines for specific computing platforms. Here, we primarily focus on solutions for several public domain packages that are widely used in the context of drug development.
Collapse
Affiliation(s)
- Jacek Biesiada
- Biomedical Informatics, Children's Hospital Research Foundation, Cincinnati, OH, USA
| | | | | |
Collapse
|
227
|
|
228
|
Partner-aware prediction of interacting residues in protein-protein complexes from sequence data. PLoS One 2011; 6:e29104. [PMID: 22194998 PMCID: PMC3237601 DOI: 10.1371/journal.pone.0029104] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2011] [Accepted: 11/21/2011] [Indexed: 12/22/2022] Open
Abstract
Computational prediction of residues that participate in protein-protein interactions is a difficult task, and state of the art methods have shown only limited success in this arena. One possible problem with these methods is that they try to predict interacting residues without incorporating information about the partner protein, although it is unclear how much partner information could enhance prediction performance. To address this issue, the two following comparisons are of crucial significance: (a) comparison between the predictability of inter-protein residue pairs, i.e., predicting exactly which residue pairs interact with each other given two protein sequences; this can be achieved by either combining conventional single-protein predictions or making predictions using a new model trained directly on the residue pairs, and the performance of these two approaches may be compared: (b) comparison between the predictability of the interacting residues in a single protein (irrespective of the partner residue or protein) from conventional methods and predictions converted from the pair-wise trained model. Using these two streams of training and validation procedures and employing similar two-stage neural networks, we showed that the models trained on pair-wise contacts outperformed the partner-unaware models in predicting both interacting pairs and interacting single-protein residues. Prediction performance decreased with the size of the conformational change upon complex formation; this trend is similar to docking, even though no structural information was used in our prediction. An example application that predicts two partner-specific interfaces of a protein was shown to be effective, highlighting the potential of the proposed approach. Finally, a preliminary attempt was made to score docking decoy poses using prediction of interacting residue pairs; this analysis produced an encouraging result.
Collapse
|
229
|
Häfner AK, Cernescu M, Hofmann B, Ermisch M, Hörnig M, Metzner J, Schneider G, Brutschy B, Steinhilber D. Dimerization of human 5-lipoxygenase. Biol Chem 2011. [PMID: 22050225 DOI: 10.1515/bc-2011-200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Human 5-lipoxygenase (5-LO) can form dimers as shown here via native gel electrophoresis, gel filtration chromatography and LILBID (laser induced liquid bead ion desorption) mass spectrometry. After glutathionylation of 5-LO by diamide/glutathione treatment, dimeric 5-LO was no longer detectable and 5-LO almost exclusively exists in the monomeric form which showed full catalytic activity. Incubation of 5-LO with diamide alone led to a disulfide-bridged dimer and to oligomer formation which displays a strongly reduced catalytic activity. The bioinformatic analysis of the 5-LO surface for putative protein-protein interaction domains and molecular modeling of the dimer interface suggests a head to tail orientation of the dimer which also explains the localization of previously reported ATP binding sites. This interface domain was confirmed by the observation that 5-LO dimer formation and inhibition of activity by diamide was largely prevented when four cysteines (C159S, C300S, C416S, C418S) in this domain were mutated to serines.
Collapse
Affiliation(s)
- Ann-Kathrin Häfner
- Institute of Pharmaceutical Chemistry/ZAFES, University of Frankfurt, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
230
|
Zellner H, Staudigel M, Trenner T, Bittkowski M, Wolowski V, Icking C, Merkl R. Prescont: Predicting protein-protein interfaces utilizing four residue properties. Proteins 2011; 80:154-68. [DOI: 10.1002/prot.23172] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2011] [Revised: 08/18/2011] [Accepted: 08/29/2011] [Indexed: 12/26/2022]
|
231
|
Qiu Z, Wang X. Prediction of protein-protein interaction sites using patch-based residue characterization. J Theor Biol 2011; 293:143-50. [PMID: 22037062 DOI: 10.1016/j.jtbi.2011.10.021] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2011] [Revised: 09/13/2011] [Accepted: 10/15/2011] [Indexed: 10/15/2022]
Abstract
Identifying protein-protein interaction sites provides important clues to the function of a protein and is becoming increasingly relevant in topics such as systems biology and drug discovery. Using a patch-based model for residue characterization, we trained random forest classifiers for residue-based interface prediction, which was followed by a clustering procedure to produce patches for patch-based interface prediction. For residue-based interface prediction, our method achieves a specificity rate of 0.7 and a sensitivity rate of 0.78. For patch-based interface prediction, a success rate of 0.80 is achieved. Based on same datasets, we also compare it with several published methods. The results show that our method is a successful predictor for residue-based and patch-based interface prediction.
Collapse
Affiliation(s)
- Zhijun Qiu
- The State Key Laboratory of Structural Analysis of Industrial Equipment, Dalian University of Technology, 2 Ling-Gong Road, Dalian 116024, China
| | | |
Collapse
|
232
|
Jain S, Kim HG, Lacbawan F, Meliciani I, Wenzel W, Kurth I, Sharma J, Schoeneman M, Ten S, Layman LC, Jacobson-Dickman E. Unique phenotype in a patient with CHARGE syndrome. INTERNATIONAL JOURNAL OF PEDIATRIC ENDOCRINOLOGY 2011; 2011:11. [PMID: 21995344 PMCID: PMC3216247 DOI: 10.1186/1687-9856-2011-11] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2011] [Accepted: 10/13/2011] [Indexed: 12/24/2022]
Abstract
CHARGE is a phenotypically heterogeneous autosomal dominant disorder recognized as a cohesive syndrome since the identification of CHD7 as a genetic etiology. Classic features include: Coloboma, Heart defects, Atresia choanae, Retarded growth and development, Genitourinary abnormalities, and Ear anomalies and/or deafness. With greater accessibility to genetic analysis, a wider spectrum of features are emerging, and overlap with disorders such as DiGeorge syndrome, Kallmann syndrome, and Hypoparathyroidism Sensorineural Deafness and Renal Disease syndrome, is increasingly evident. We present a patient with a unique manifestation of CHARGE syndrome, including primary hypoparathyroidism and a limb anomaly; to our knowledge, he is also the first CHARGE subject reported with bilateral multicystic dysplastic kidneys. Furthermore, with structural modeling and murine expression studies, we characterize a putative CHD7 G744S missense mutation. Our report continues to expand the CHARGE phenotype and highlights that stringent fulfillment of conventional criteria should not strictly guide genetic analysis.
Collapse
Affiliation(s)
- Shobhit Jain
- State University of New York Downstate Medical Center, Children's Hospital at Downstate, Department of Pediatrics, Division of Pediatric Endocrinology, Brooklyn, NY 11203 USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
233
|
La D, Kihara D. A novel method for protein-protein interaction site prediction using phylogenetic substitution models. Proteins 2011; 80:126-41. [PMID: 21989996 DOI: 10.1002/prot.23169] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2011] [Revised: 07/07/2011] [Accepted: 08/17/2011] [Indexed: 11/10/2022]
Abstract
Protein-protein binding events mediate many critical biological functions in the cell. Typically, functionally important sites in proteins can be well identified by considering sequence conservation. However, protein-protein interaction sites exhibit higher sequence variation than other functional regions, such as catalytic sites of enzymes. Consequently, the mutational behavior leading to weak sequence conservation poses significant challenges to the protein-protein interaction site prediction. Here, we present a phylogenetic framework to capture critical sequence variations that favor the selection of residues essential for protein-protein binding. Through the comprehensive analysis of diverse protein families, we show that protein binding interfaces exhibit distinct amino acid substitution as compared with other surface residues. On the basis of this analysis, we have developed a novel method, BindML, which utilizes the substitution models to predict protein-protein binding sites of protein with unknown interacting partners. BindML estimates the likelihood that a phylogenetic tree of a local surface region in a query protein structure follows the substitution patterns of protein binding interface and nonbinding surfaces. BindML is shown to perform well compared to alternative methods for protein binding interface prediction. The methodology developed in this study is very versatile in the sense that it can be generally applied for predicting other types of functional sites, such as DNA, RNA, and membrane binding sites in proteins.
Collapse
Affiliation(s)
- David La
- Department of Biological Sciences, College of Science, Purdue University, West Lafayette, Indiana 47907, USA
| | | |
Collapse
|
234
|
Computational prediction of heme-binding residues by exploiting residue interaction network. PLoS One 2011; 6:e25560. [PMID: 21991319 PMCID: PMC3184988 DOI: 10.1371/journal.pone.0025560] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2011] [Accepted: 09/05/2011] [Indexed: 11/29/2022] Open
Abstract
Computational identification of heme-binding residues is beneficial for predicting and designing novel heme proteins. Here we proposed a novel method for heme-binding residue prediction by exploiting topological properties of these residues in the residue interaction networks derived from three-dimensional structures. Comprehensive analysis showed that key residues located in heme-binding regions are generally associated with the nodes with higher degree, closeness and betweenness, but lower clustering coefficient in the network. HemeNet, a support vector machine (SVM) based predictor, was developed to identify heme-binding residues by combining topological features with existing sequence and structural features. The results showed that incorporation of network-based features significantly improved the prediction performance. We also compared the residue interaction networks of heme proteins before and after heme binding and found that the topological features can well characterize the heme-binding sites of apo structures as well as those of holo structures, which led to reliable performance improvement as we applied HemeNet to predicting the binding residues of proteins in the heme-free state. HemeNet web server is freely accessible at http://mleg.cse.sc.edu/hemeNet/.
Collapse
|
235
|
Mashiach-Farkash E, Nussinov R, Wolfson HJ. SymmRef: a flexible refinement method for symmetric multimers. Proteins 2011; 79:2607-23. [PMID: 21721046 PMCID: PMC3155011 DOI: 10.1002/prot.23082] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2010] [Revised: 05/02/2011] [Accepted: 05/04/2011] [Indexed: 11/11/2022]
Abstract
Symmetric protein complexes are abundant in the living cell. Predicting their atomic structure can shed light on the mechanism of many important biological processes. Symmetric docking methods aim to predict the structure of these complexes given the unbound structure of a single monomer, or its model. Symmetry constraints reduce the search-space of these methods and make the prediction easier compared to asymmetric protein-protein docking. However, the challenge of modeling the conformational changes that the monomer might undergo is a major obstacle. In this article, we present SymmRef, a novel method for refinement and reranking of symmetric docking solutions. The method models backbone and side-chain movements and optimizes the rigid-body orientations of the monomers. The backbone movements are modeled by normal modes minimization and the conformations of the side-chains are modeled by selecting optimal rotamers. Since solved structures of symmetric multimers show asymmetric side-chain conformations, we do not use symmetry constraints in the side-chain optimization procedure. The refined models are re-ranked according to an energy score. We tested the method on a benchmark of unbound docking challenges. The results show that the method significantly improves the accuracy and the ranking of symmetric rigid docking solutions. SymmRef is available for download at http:// bioinfo3d.cs.tau.ac.il/SymmRef/download.html.
Collapse
Affiliation(s)
- Efrat Mashiach-Farkash
- Blavatnik School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Ruth Nussinov
- Basic Research Program, SAIC-Frederick, Inc., Center for Cancer Research Nanobiology Program, NCI - Frederick, Frederick, MD 21702, USA
- Department of Human Genetics and Molecular Medicine, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Haim J. Wolfson
- Blavatnik School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| |
Collapse
|
236
|
Segura J, Jones PF, Fernandez-Fuentes N. Improving the prediction of protein binding sites by combining heterogeneous data and Voronoi diagrams. BMC Bioinformatics 2011; 12:352. [PMID: 21861881 PMCID: PMC3171731 DOI: 10.1186/1471-2105-12-352] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2011] [Accepted: 08/23/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Protein binding site prediction by computational means can yield valuable information that complements and guides experimental approaches to determine the structure of protein complexes. Predictions become even more relevant and timely given the current resolution of protein interaction maps, where there is a very large and still expanding gap between the available information on: (i) which proteins interact and (ii) how proteins interact. Proteins interact through exposed residues that present differential physicochemical properties, and these can be exploited to identify protein interfaces. RESULTS Here we present VORFFIP, a novel method for protein binding site prediction. The method makes use of broad set of heterogeneous data and defined of residue environment, by means of Voronoi Diagrams that are integrated by a two-steps Random Forest ensemble classifier. Four sets of residue features (structural, energy terms, sequence conservation, and crystallographic B-factors) used in different combinations together with three definitions of residue environment (Voronoi Diagrams, sequence sliding window, and Euclidian distance) have been analyzed in order to maximize the performance of the method. CONCLUSIONS The integration of different forms information such as structural features, energy term, evolutionary conservation and crystallographic B-factors, improves the performance of binding site prediction. Including the information of neighbouring residues also improves the prediction of protein interfaces. Among the different approaches that can be used to define the environment of exposed residues, Voronoi Diagrams provide the most accurate description. Finally, VORFFIP compares favourably to other methods reported in the recent literature.
Collapse
Affiliation(s)
- Joan Segura
- Leeds Institute of Molecular Medicine, Section of Experimental Therapeutics, University of Leeds, Leeds, LS9 7TF, UK
| | | | | |
Collapse
|
237
|
Xue LC, Dobbs D, Honavar V. HomPPI: a class of sequence homology based protein-protein interface prediction methods. BMC Bioinformatics 2011; 12:244. [PMID: 21682895 PMCID: PMC3213298 DOI: 10.1186/1471-2105-12-244] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2010] [Accepted: 06/17/2011] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Although homology-based methods are among the most widely used methods for predicting the structure and function of proteins, the question as to whether interface sequence conservation can be effectively exploited in predicting protein-protein interfaces has been a subject of debate. RESULTS We studied more than 300,000 pair-wise alignments of protein sequences from structurally characterized protein complexes, including both obligate and transient complexes. We identified sequence similarity criteria required for accurate homology-based inference of interface residues in a query protein sequence.Based on these analyses, we developed HomPPI, a class of sequence homology-based methods for predicting protein-protein interface residues. We present two variants of HomPPI: (i) NPS-HomPPI (Non partner-specific HomPPI), which can be used to predict interface residues of a query protein in the absence of knowledge of the interaction partner; and (ii) PS-HomPPI (Partner-specific HomPPI), which can be used to predict the interface residues of a query protein with a specific target protein.Our experiments on a benchmark dataset of obligate homodimeric complexes show that NPS-HomPPI can reliably predict protein-protein interface residues in a given protein, with an average correlation coefficient (CC) of 0.76, sensitivity of 0.83, and specificity of 0.78, when sequence homologs of the query protein can be reliably identified. NPS-HomPPI also reliably predicts the interface residues of intrinsically disordered proteins. Our experiments suggest that NPS-HomPPI is competitive with several state-of-the-art interface prediction servers including those that exploit the structure of the query proteins. The partner-specific classifier, PS-HomPPI can, on a large dataset of transient complexes, predict the interface residues of a query protein with a specific target, with a CC of 0.65, sensitivity of 0.69, and specificity of 0.70, when homologs of both the query and the target can be reliably identified. The HomPPI web server is available at http://homppi.cs.iastate.edu/. CONCLUSIONS Sequence homology-based methods offer a class of computationally efficient and reliable approaches for predicting the protein-protein interface residues that participate in either obligate or transient interactions. For query proteins involved in transient interactions, the reliability of interface residue prediction can be improved by exploiting knowledge of putative interaction partners.
Collapse
Affiliation(s)
- Li C Xue
- Department of Computer Science, Iowa State University, Ames, IA 50011, USA.
| | | | | |
Collapse
|
238
|
Acuner Ozbabacan SE, Engin HB, Gursoy A, Keskin O. Transient protein-protein interactions. Protein Eng Des Sel 2011; 24:635-48. [DOI: 10.1093/protein/gzr025] [Citation(s) in RCA: 170] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
|
239
|
Emig D, Sander O, Mayr G, Albrecht M. Structure collisions between interacting proteins. PLoS One 2011; 6:e19581. [PMID: 21655095 PMCID: PMC3107212 DOI: 10.1371/journal.pone.0019581] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2010] [Accepted: 04/12/2011] [Indexed: 11/24/2022] Open
Abstract
Protein-protein interactions take place at defined binding interfaces. One protein may bind two or more proteins at different interfaces at the same time. So far it has been commonly accepted that non-overlapping interfaces allow a given protein to bind other proteins simultaneously while no collisions occur between the binding protein structures. To test this assumption, we performed a comprehensive analysis of structural protein interactions to detect potential collisions. Our results did not indicate cases of biologically relevant collisions in the Protein Data Bank of protein structures. However, we discovered a number of collisions that originate from alternative protein conformations or quaternary structures due to different experimental conditions.
Collapse
Affiliation(s)
- Dorothea Emig
- Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrücken, Germany
| | - Oliver Sander
- Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrücken, Germany
| | - Gabriele Mayr
- Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrücken, Germany
| | - Mario Albrecht
- Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrücken, Germany
- * E-mail:
| |
Collapse
|
240
|
Abstract
Some bacterial species are able to utilize extracellular mineral forms of iron and manganese as respiratory electron acceptors. In Shewanella oneidensis this involves decaheme cytochromes that are located on the bacterial cell surface at the termini of trans-outer-membrane electron transfer conduits. The cell surface cytochromes can potentially play multiple roles in mediating electron transfer directly to insoluble electron sinks, catalyzing electron exchange with flavin electron shuttles or participating in extracellular intercytochrome electron exchange along "nanowire" appendages. We present a 3.2-Å crystal structure of one of these decaheme cytochromes, MtrF, that allows the spatial organization of the 10 hemes to be visualized for the first time. The hemes are organized across four domains in a unique crossed conformation, in which a staggered 65-Å octaheme chain transects the length of the protein and is bisected by a planar 45-Å tetraheme chain that connects two extended Greek key split β-barrel domains. The structure provides molecular insight into how reduction of insoluble substrate (e.g., minerals), soluble substrates (e.g., flavins), and cytochrome redox partners might be possible in tandem at different termini of a trifurcated electron transport chain on the cell surface.
Collapse
|
241
|
Fernández‐Recio J. Prediction of protein binding sites and hot spots. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2011. [DOI: 10.1002/wcms.45] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
242
|
Guidolin D, Ciruela F, Genedani S, Guescini M, Tortorella C, Albertin G, Fuxe K, Agnati LF. Bioinformatics and mathematical modelling in the study of receptor–receptor interactions and receptor oligomerization. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2011; 1808:1267-83. [DOI: 10.1016/j.bbamem.2010.09.022] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2010] [Revised: 08/31/2010] [Accepted: 09/26/2010] [Indexed: 10/19/2022]
|
243
|
Cheng H, Mohammed F, Nam G, Chen Y, Qi J, Garner LI, Allen RL, Yan J, Willcox BE, Gao GF. Crystal structure of leukocyte Ig-like receptor LILRB4 (ILT3/LIR-5/CD85k): a myeloid inhibitory receptor involved in immune tolerance. J Biol Chem 2011; 286:18013-25. [PMID: 21454581 DOI: 10.1074/jbc.m111.221028] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The myeloid inhibitory receptor LILRB4 (also called ILT3, LIR-5, CD85k), a member of the leukocyte immunoglobulin-like receptors (LILRs/LIRs), is an important mediator of immune tolerance. Up-regulated on tolerogenic dendritic cells, it has been shown to modulate immune responses via induction of T cell anergy and differentiation of CD8(+) T suppressor cells and may play a role in establishing immune tolerance in cancer. Consequently, characterizing the molecular mechanisms involved in LILRB4 function and in particular its structure and ligands is a key aim but has remained elusive to date. Here we describe the production, crystallization, and structure of the LILRB4 ectodomain to 1.7 Å using an expression strategy involving engineering of an additional disulfide bond in the D2 domain to enhance protein stability. LILRB4 comprises two immunoglobulin domains similar in structure to other LILRs; however, the D2 domain, which is most closely related to the D4 domains of other family members, contains 3(10) helices not previously observed. At the D1-D2 interface, reduced interdomain contacts resulted in an obtuse interdomain angle of ∼107°. Comparison with MHC class I binding Group 1 LILRs suggests LILRB4 is both conformationally and electrostatically unsuited to MHC ligation, consistent with LILRB4 status as a Group 2 LILR likely to bind novel non-MHC class I ligands. Finally, examination of the LILRB4 surface highlighted distinctive surface patches on the D1 domain and D1D2 hinge region, which may be involved in ligand binding. These findings will facilitate our attempts to precisely define the role of LILRB4 in the regulation of immune tolerance.
Collapse
Affiliation(s)
- Hao Cheng
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
244
|
de Vries SJ, Bonvin AMJJ. CPORT: a consensus interface predictor and its performance in prediction-driven docking with HADDOCK. PLoS One 2011; 6:e17695. [PMID: 21464987 PMCID: PMC3064578 DOI: 10.1371/journal.pone.0017695] [Citation(s) in RCA: 233] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2010] [Accepted: 02/08/2011] [Indexed: 11/19/2022] Open
Abstract
Background Macromolecular complexes are the molecular machines of the cell. Knowledge at the atomic level is essential to understand and influence their function. However, their number is huge and a significant fraction is extremely difficult to study using classical structural methods such as NMR and X-ray crystallography. Therefore, the importance of large-scale computational approaches in structural biology is evident. This study combines two of these computational approaches, interface prediction and docking, to obtain atomic-level structures of protein-protein complexes, starting from their unbound components. Methodology/Principal Findings Here we combine six interface prediction web servers into a consensus method called CPORT (Consensus Prediction Of interface Residues in Transient complexes). We show that CPORT gives more stable and reliable predictions than each of the individual predictors on its own. A protocol was developed to integrate CPORT predictions into our data-driven docking program HADDOCK. For cases where experimental information is limited, this prediction-driven docking protocol presents an alternative to ab initio docking, the docking of complexes without the use of any information. Prediction-driven docking was performed on a large and diverse set of protein-protein complexes in a blind manner. Our results indicate that the performance of the HADDOCK-CPORT combination is competitive with ZDOCK-ZRANK, a state-of-the-art ab initio docking/scoring combination. Finally, the original interface predictions could be further improved by interface post-prediction (contact analysis of the docking solutions). Conclusions/Significance The current study shows that blind, prediction-driven docking using CPORT and HADDOCK is competitive with ab initio docking methods. This is encouraging since prediction-driven docking represents the absolute bottom line for data-driven docking: any additional biological knowledge will greatly improve the results obtained by prediction-driven docking alone. Finally, the fact that original interface predictions could be further improved by interface post-prediction suggests that prediction-driven docking has not yet been pushed to the limit. A web server for CPORT is freely available at http://haddock.chem.uu.nl/services/CPORT.
Collapse
Affiliation(s)
- Sjoerd J de Vries
- Faculty of Science, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, The Netherlands.
| | | |
Collapse
|
245
|
Hamer R, Luo Q, Armitage JP, Reinert G, Deane CM. i-Patch: interprotein contact prediction using local network information. Proteins 2011; 78:2781-97. [PMID: 20635422 DOI: 10.1002/prot.22792] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Biological processes are commonly controlled by precise protein-protein interactions. These connections rely on specific amino acids at the binding interfaces. Here we predict the binding residues of such interprotein complexes. We have developed a suite of methods, i-Patch, which predict the interprotein contact sites by considering the two proteins as a network, with residues as nodes and contacts as edges. i-Patch starts with two proteins, A and B, which are assumed to interact, but for which the structure of the complex is not available. However, we assume that for each protein, we have a reference structure and a multiple sequence alignment of homologues. i-Patch then uses the propensities of patches of residues to interact, to predict interprotein contact sites. i-Patch outperforms several other tested algorithms for prediction of interprotein contact sites. It gives 59% precision with 20% recall on a blind test set of 31 protein pairs. Combining the i-Patch scores with an existing correlated mutation algorithm, McBASC, using a logistic model gave little improvement. Results from a case study, on bacterial chemotaxis protein complexes, demonstrate that our predictions can identify contact residues, as well as suggesting unknown interfaces in multiprotein complexes.
Collapse
Affiliation(s)
- Rebecca Hamer
- Oxford Centre for Integrative Systems Biology, Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| | | | | | | | | |
Collapse
|
246
|
de Vries SJ, Melquiond ASJ, Kastritis PL, Karaca E, Bordogna A, van Dijk M, Rodrigues JPGLM, Bonvin AMJJ. Strengths and weaknesses of data-driven docking in critical assessment of prediction of interactions. Proteins 2011; 78:3242-9. [PMID: 20718048 DOI: 10.1002/prot.22814] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The recent CAPRI rounds have introduced new docking challenges in the form of protein-RNA complexes, multiple alternative interfaces, and an unprecedented number of targets for which homology modeling was required. We present here the performance of HADDOCK and its web server in the CAPRI experiment and discuss the strengths and weaknesses of data-driven docking. HADDOCK was successful for 6 out of 9 complexes (6 out of 11 targets) and accurately predicted the individual interfaces for two more complexes. The HADDOCK server, which is the first allowing the simultaneous docking of generic multi-body complexes, was successful in 4 out of 7 complexes for which it participated. In the scoring experiment, we predicted the highest number of targets of any group. The main weakness of data-driven docking revealed from these last CAPRI results is its vulnerability for incorrect experimental data related to the interface or the stoichiometry of the complex. At the same time, the use of experimental and/or predicted information is also the strength of our approach as evidenced for those targets for which accurate experimental information was available (e.g., the 10 three-stars predictions for T40!). Even when the models show a wrong orientation, the individual interfaces are generally well predicted with an average coverage of 60% ± 26% over all targets. This makes data-driven docking particularly valuable in a biological context to guide experimental studies like, for example, targeted mutagenesis.
Collapse
Affiliation(s)
- Sjoerd J de Vries
- NMR Research Group, Bijvoet Center for Biomolecular Research, Utrecht University, 3584 CH Utrecht, The Netherlands
| | | | | | | | | | | | | | | |
Collapse
|
247
|
Xu N, Kim HG, Bhagavath B, Cho SG, Lee JH, Ha K, Meliciani I, Wenzel W, Podolsky RH, Chorich LP, Stackhouse KA, Grove AMH, Odom LN, Ozata M, Bick DP, Sherins RJ, Kim SH, Cameron RS, Layman LC. Nasal embryonic LHRH factor (NELF) mutations in patients with normosmic hypogonadotropic hypogonadism and Kallmann syndrome. Fertil Steril 2011; 95:1613-20.e1-7. [PMID: 21300340 DOI: 10.1016/j.fertnstert.2011.01.010] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2010] [Revised: 12/28/2010] [Accepted: 01/03/2011] [Indexed: 01/09/2023]
Abstract
OBJECTIVE To determine if mutations in NELF, a gene isolated from migratory GnRH neurons, cause normosmic idiopathic hypogonadotropic hypogonadism (IHH) and Kallmann syndrome (KS). DESIGN Molecular analysis correlated with phenotype. SETTING Academic medical center. PATIENT(S) A total of 168 IHH/KS patients as well as unrelated control subjects were studied for NELF mutations. INTERVENTION(S) NELF coding regions/splice junctions were subjected to polymerase chain reaction (PCR)-based DNA sequencing. Eleven additional IHH/KS genes were sequenced in three patients with NELF mutations. MAIN OUTCOME MEASURE(S) Mutations were confirmed by sorting intolerant from tolerant, reverse-transcription (RT)-PCR, and Western blot analysis. RESULT(S) Three novel NELF mutations absent in 372 ethnically matched control subjects were identified in 3/168 (1.8%) IHH/KS patients. One IHH patient had compound heterozygous NELF mutations (c.629-21G>C and c.629-23C>G), and he did not have mutations in 11 other known IHH/KS genes. Two unrelated KS patients had heterozygous NELF mutations and mutation in a second gene: NELF/KAL1 (c.757G>A; p.Ala253Thr of NELF and c.488_490delGTT; p.Cys163del of KAL1) and NELF/TACR3 (c.1160-13C>T of NELF and c.824G>A; p.Trp275X of TACR3). In vitro evidence of these NELF mutations included reduced protein expression and splicing defects. CONCLUSION(S) Our findings suggest that NELF is associated with normosmic IHH and KS, either singly or in combination with a mutation in another gene.
Collapse
Affiliation(s)
- Ning Xu
- Section of Reproductive Endocrinology, Infertility, and Genetics, Department of Obstetrics and Gynecology, Medical College of Georgia, Augusta, Georgia 30912, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
248
|
Monji H, Koizumi S, Ozaki T, Ohkawa T. Interaction site prediction by structural similarity to neighboring clusters in protein-protein interaction networks. BMC Bioinformatics 2011; 12 Suppl 1:S39. [PMID: 21342570 PMCID: PMC3044295 DOI: 10.1186/1471-2105-12-s1-s39] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Recently, revealing the function of proteins with protein-protein interaction (PPI) networks is regarded as one of important issues in bioinformatics. With the development of experimental methods such as the yeast two-hybrid method, the data of protein interaction have been increasing extremely. Many databases dealing with these data comprehensively have been constructed and applied to analyzing PPI networks. However, few research on prediction interaction sites using both PPI networks and the 3D protein structures complementarily has explored. RESULTS We propose a method of predicting interaction sites in proteins with unknown function by using both of PPI networks and protein structures. For a protein with unknown function as a target, several clusters are extracted from the neighboring proteins based on their structural similarity. Then, interaction sites are predicted by extracting similar sites from the group of a protein cluster and the target protein. Moreover, the proposed method can improve the prediction accuracy by introducing repetitive prediction process. CONCLUSIONS The proposed method has been applied to small scale dataset, then the effectiveness of the method has been confirmed. The challenge will now be to apply the method to large-scale datasets.
Collapse
Affiliation(s)
- Hiroyuki Monji
- Graduate School of System Informatics, Kobe University, Rokkodai, Nada, Kobe 657-8501, Japan.
| | | | | | | |
Collapse
|
249
|
Carl N, Konc J, Vehar B, Janezic D. Protein-protein binding site prediction by local structural alignment. J Chem Inf Model 2011; 50:1906-13. [PMID: 20919700 DOI: 10.1021/ci100265x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Generalization of an earlier algorithm has led to the development of new local structural alignment algorithms for prediction of protein-protein binding sites. The algorithms use maximum cliques on protein graphs to define structurally similar protein regions. The search for structural neighbors in the new algorithms has been extended to all the proteins in the PDB and the query protein is compared to more than 60,000 proteins or over 300,000 single-chain structures. The resulting structural similarities are combined and used to predict the protein binding sites. This study shows that the location of protein binding sites can be predicted by comparing only local structural similarities irrespective of general protein folds.
Collapse
Affiliation(s)
- Nejc Carl
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia
| | | | | | | |
Collapse
|
250
|
Xue LC, Jordan RA, El-Manzalawy Y, Dobbs D, Honavar V. Ranking Docked Models of Protein-Protein Complexes Using Predicted Partner-Specific Protein-Protein Interfaces: A Preliminary Study. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2011; 2011:441-445. [PMID: 25905110 DOI: 10.1145/2147805.2147866] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Computational protein-protein docking is a valuable tool for determining the conformation of complexes formed by interacting proteins. Selecting near-native conformations from the large number of possible models generated by docking software presents a significant challenge in practice. We introduce a novel method for ranking docked conformations based on the degree of overlap between the interface residues of a docked conformation formed by a pair of proteins with the set of predicted interface residues between them. Our approach relies on a method, called PS-HomPPI, for reliably predicting protein-protein interface residues by taking into account information derived from both interacting proteins. PS-HomPPI infers the residues of a query protein that are likely to interact with a partner protein based on known interface residues of the homo-interologs of the query-partner protein pair, i.e., pairs of interacting proteins that are homologous to the query protein and partner protein. Our results on Docking Benchmark 3.0 show that the quality of the ranking of docked conformations using our method is consistently superior to that produced using ClusPro cluster-size-based and energy-based criteria for 61 out of the 64 docking complexes for which PS-HomPPI produces interface predictions. An implementation of our method for ranking docked models is freely available at: http://einstein.cs.iastate.edu/DockRank/.
Collapse
Affiliation(s)
- Li C Xue
- Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, 50011, USA
| | - Rafael A Jordan
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, 50011, USA
| | | | - Drena Dobbs
- Department of Computer Science, Pontificia Universidad Javeriana, Cali, Colombia
| | - Vasant Honavar
- Department of Systems and Computer Engineering, AI-Azhar University, Cairo, Egypt
| |
Collapse
|