1
|
PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry. Structure 2021; 29:1303-1311.e3. [PMID: 34520740 PMCID: PMC8575123 DOI: 10.1016/j.str.2021.07.012] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 03/22/2021] [Accepted: 07/23/2021] [Indexed: 11/21/2022]
Abstract
An accurate understanding of biomolecular mechanisms and diseases requires information on protein quaternary structure (QS). A critical challenge in inferring QS information from crystallography data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we employ QS conservation across homologs to infer the biological relevance of hetero-oligomers. We compare the structures and compositions of hetero-oligomers, which allow us to annotate 7,810 complexes as physiologically relevant, 1,060 as likely errors, and 1,432 with comparative information on subunit stoichiometry and composition. Excluding immunoglobulins, these annotations encompass over 51% of hetero-oligomers in the PDB. We curate a dataset of 577 hetero-oligomeric complexes to benchmark these annotations, which reveals an accuracy >94%. When homology information is not available, we compare QS across repositories (PDB, PISA, and EPPIC) to derive confidence estimates. This work provides high-quality annotations along with a large benchmark dataset of hetero-assemblies.
Collapse
|
2
|
Gaber A, Pavšič M. Modeling and Structure Determination of Homo-Oligomeric Proteins: An Overview of Challenges and Current Approaches. Int J Mol Sci 2021; 22:9081. [PMID: 34445785 PMCID: PMC8396596 DOI: 10.3390/ijms22169081] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 08/20/2021] [Accepted: 08/20/2021] [Indexed: 12/12/2022] Open
Abstract
Protein homo-oligomerization is a very common phenomenon, and approximately half of proteins form homo-oligomeric assemblies composed of identical subunits. The vast majority of such assemblies possess internal symmetry which can be either exploited to help or poses challenges during structure determination. Moreover, aspects of symmetry are critical in the modeling of protein homo-oligomers either by docking or by homology-based approaches. Here, we first provide a brief overview of the nature of protein homo-oligomerization. Next, we describe how the symmetry of homo-oligomers is addressed by crystallographic and non-crystallographic symmetry operations, and how biologically relevant intermolecular interactions can be deciphered from the ordered array of molecules within protein crystals. Additionally, we describe the most important aspects of protein homo-oligomerization in structure determination by NMR. Finally, we give an overview of approaches aimed at modeling homo-oligomers using computational methods that specifically address their internal symmetry and allow the incorporation of other experimental data as spatial restraints to achieve higher model reliability.
Collapse
|
3
|
Biological vs. Crystallographic Protein Interfaces: An Overview of Computational Approaches for Their Classification. CRYSTALS 2020. [DOI: 10.3390/cryst10020114] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Complexes between proteins are at the basis of almost every process in cells. Their study, from a structural perspective, has a pivotal role in understanding biological functions and, importantly, in drug development. X-ray crystallography represents the broadest source for the experimental structural characterization of protein-protein complexes. Correctly identifying the biologically relevant interface from the crystallographic ones is, however, not trivial and can be prone to errors. Over the past two decades, computational methodologies have been developed to study the differences of those interfaces and automatically classify them as biological or crystallographic. Overall, protein-protein interfaces show differences in terms of composition, energetics and evolutionary conservation between biological and crystallographic ones. Based on those observations, a number of computational methods have been developed for this classification problem, which can be grouped into three main categories: Energy-, empirical knowledge- and machine learning-based approaches. In this review, we give a comprehensive overview of the training datasets and methods so far implemented, providing useful links and a brief description of each method.
Collapse
|
4
|
Accurate Classification of Biological and non-Biological Interfaces in Protein Crystal Structures using Subtle Covariation Signals. Sci Rep 2019; 9:12603. [PMID: 31471543 PMCID: PMC6717244 DOI: 10.1038/s41598-019-48913-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2017] [Accepted: 08/14/2019] [Indexed: 11/08/2022] Open
Abstract
Proteins often work as oligomers or multimers in vivo. Therefore, elucidating their oligomeric or multimeric form (quaternary structure) is crucially important to ascertain their function. X-ray crystal structures of numerous proteins have been accumulated, providing information related to their biological units. Extracting information of biological units from protein crystal structures represents a meaningful task for modern biology. Nevertheless, although many methods have been proposed for identifying biological units appearing in protein crystal structures, it is difficult to distinguish biological protein-protein interfaces from crystallographic ones. Therefore, our simple but highly accurate classifier was developed to infer biological units in protein crystal structures using large amounts of protein sequence information and a modern contact prediction method to exploit covariation signals (CSs) in proteins. We demonstrate that our proposed method is promising even for weak signals of biological interfaces. We also discuss the relation between classification accuracy and conservation of biological units, and illustrate how the selection of sequences included in multiple sequence alignments as sources for obtaining CSs affects the results. With increased amounts of sequence data, the proposed method is expected to become increasingly useful.
Collapse
|
5
|
Principles and characteristics of biological assemblies in experimentally determined protein structures. Curr Opin Struct Biol 2019; 55:34-49. [PMID: 30965224 DOI: 10.1016/j.sbi.2019.03.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 03/01/2019] [Indexed: 12/27/2022]
Abstract
More than half of all structures in the PDB are assemblies of two or more proteins, including both homooligomers and heterooligomers. Structural information on these assemblies comes from X-ray crystallography, NMR, and cryo-EM spectroscopy. The correct assembly in an X-ray structure is often ambiguous, and computational methods have been developed to identify the most likely biologically relevant assembly based on physical properties of assemblies and sequence conservation in interfaces. Taking advantage of the large number of structures now available, some of the most recent methods have relied on similarity of interfaces and assemblies across structures of homologous proteins.
Collapse
|
6
|
Swanson J, Audie J. An unexpected way forward: towards a more accurate and rigorous protein-protein binding affinity scoring function by eliminating terms from an already simple scoring function. J Biomol Struct Dyn 2016; 36:83-97. [PMID: 27989231 DOI: 10.1080/07391102.2016.1268974] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
A fundamental and unsolved problem in biophysical chemistry is the development of a computationally simple, physically intuitive, and generally applicable method for accurately predicting and physically explaining protein-protein binding affinities from protein-protein interaction (PPI) complex coordinates. Here, we propose that the simplification of a previously described six-term PPI scoring function to a four term function results in a simple expression of all physically and statistically meaningful terms that can be used to accurately predict and explain binding affinities for a well-defined subset of PPIs that are characterized by (1) crystallographic coordinates, (2) rigid-body association, (3) normal interface size, and hydrophobicity and hydrophilicity, and (4) high quality experimental binding affinity measurements. We further propose that the four-term scoring function could be regarded as a core expression for future development into a more general PPI scoring function. Our work has clear implications for PPI modeling and structure-based drug design.
Collapse
Affiliation(s)
- Jon Swanson
- a ChemModeling LLC , Suite 101, 500 Huber Park Ct, Weldon Spring , MO 63304 , USA
| | - Joseph Audie
- b CMD Bioscience , 5 Science Park , New Haven , CT 06511 , USA.,c Department of Chemistry , Sacred Heart University , 5151 Park Ave, Fairfield , CT 06825 , USA
| |
Collapse
|
7
|
Soner S, Ozbek P, Garzon JI, Ben-Tal N, Haliloglu T. DynaFace: Discrimination between Obligatory and Non-obligatory Protein-Protein Interactions Based on the Complex's Dynamics. PLoS Comput Biol 2015; 11:e1004461. [PMID: 26506003 PMCID: PMC4623975 DOI: 10.1371/journal.pcbi.1004461] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Accepted: 07/08/2015] [Indexed: 12/31/2022] Open
Abstract
Protein-protein interfaces have been evolutionarily-designed to enable transduction between the interacting proteins. Thus, we hypothesize that analysis of the dynamics of the complex can reveal details about the nature of the interaction, and in particular whether it is obligatory, i.e., persists throughout the entire lifetime of the proteins, or not. Indeed, normal mode analysis, using the Gaussian network model, shows that for the most part obligatory and non-obligatory complexes differ in their decomposition into dynamic domains, i.e., the mobile elements of the protein complex. The dynamic domains of obligatory complexes often mix segments from the interacting chains, and the hinges between them do not overlap with the interface between the chains. In contrast, in non-obligatory complexes the interface often hinges between dynamic domains, held together through few anchor residues on one side of the interface that interact with their counterpart grooves in the other end. In automatic analysis, 117 of 139 obligatory (84.2%) and 203 of 246 non-obligatory (82.5%) complexes are correctly classified by our method: DynaFace. We further use DynaFace to predict obligatory and non-obligatory interactions among a set of 300 putative protein complexes. DynaFace is available at: http://safir.prc.boun.edu.tr/dynaface.
Collapse
Affiliation(s)
- Seren Soner
- Department of Computer Engineering and Polymer Research Center, Bogazici University, Istanbul, Turkey
| | - Pemra Ozbek
- Department of Bioengineering, Marmara University, Istanbul, Turkey
| | - Jose Ignacio Garzon
- Departments of Biochemistry and Molecular Biophysics and Systems Biology and Howard Hughes Medical Institute, Columbia University, New York, New York, United States of America
| | - Nir Ben-Tal
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Turkan Haliloglu
- Department of Chemical Engineering and Polymer Research Center, Bogazici University, Istanbul, Turkey
- * E-mail:
| |
Collapse
|
8
|
A structural dissection of large protein-protein crystal packing contacts. Sci Rep 2015; 5:14214. [PMID: 26370141 PMCID: PMC4572935 DOI: 10.1038/srep14214] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2014] [Accepted: 08/21/2015] [Indexed: 01/31/2023] Open
Abstract
With the rapid increase in crystal structures of protein-protein complexes deposited in the Protein Data Bank (PDB), more and more crystal contacts have been shown to have similar or even larger interface areas than biological interfaces. However, little attention has been paid to these large crystal packing contacts and their structural principles remain unknown. To address this issue, we used a comparative feature analysis to analyze the geometric and physicochemical properties of large crystal packing contacts by comparing two types of specific protein-protein interactions (PPIs), weak transient complexes and permanent homodimers. Our results show that although large crystal packing contacts have a similar interface area and contact size as permanent homodimers, they tend to be more planar, loosely packed and less hydrophobic than permanent homodimers and cannot form a central core region that is fully buried during interaction. However, the properties of large crystal packing contacts, except for the interface area and contact size, more closely resemble those of weak transient complexes. The large overlap between biological and large crystal packing contacts indicates that interface properties are not efficient indicators for classification of biological interfaces from large crystal packing contacts and finding other specific features urgently needed.
Collapse
|
9
|
Luo J, Guo Y, Fu Y, Wang Y, Li W, Li M. Effective discrimination between biologically relevant contacts and crystal packing contacts using new determinants. Proteins 2014; 82:3090-100. [PMID: 25142782 DOI: 10.1002/prot.24670] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Revised: 06/20/2014] [Accepted: 08/11/2014] [Indexed: 12/24/2022]
Abstract
In the structural models determined by X-ray crystallography, contacts between molecules can be divided into two categories: biologically relevant contacts and crystal packing contacts. With the growth in the number and quality of available large crystal packing contacts structures, distinguishing crystal packing contacts from biologically relevant contacts remains a difficult task, which can lead to wrong interpretation of structural models. In this study, we performed a systematic analysis on the biologically relevant contacts and crystal packing contacts. The analysis results reveal that biologically contacts are more tightly packed than crystal packing contacts. This property of biologically contacts may contribute to the formation of their interfacial core region. Meanwhile, the differences between the core and surface region of biologically contacts in amino acid composition and evolutionary measure are more dramatic than crystal packing contacts and these differences appear to be useful in distinguishing these two categories of contacts. On the basis of the features derived from our analysis, we developed a random forest model to classify biological relevant contacts and crystal packing contacts. Our method can achieve a high receiver operating curve of 0.923 in the 5-fold cross-validation and accuracies of 91.4% and 91.7% for two different test sets. Moreover, in a comparison study, our model outperforms other existing methods, such as DiMoVo, Pita, Pisa, and Eppic. We believe that this study will provide useful help in the validation of oligomeric proteins and protein complexes. The model and all data used in this paper are freely available at http://cic.scu.edu.cn/bioinformatics/bio-cry.zip.
Collapse
Affiliation(s)
- Jiesi Luo
- College of Chemistry and State Key Laboratory of Biotherapy, Sichuan University, Chengdu, Sichuan, 610064, People's Republic of China
| | | | | | | | | | | |
Collapse
|
10
|
Sudarshan S, Kodathala SB, Mahadik AC, Mehta I, Beck BW. Protein-protein interface detection using the energy centrality relationship (ECR) characteristic of proteins. PLoS One 2014; 9:e97115. [PMID: 24830938 PMCID: PMC4022497 DOI: 10.1371/journal.pone.0097115] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2014] [Accepted: 04/14/2014] [Indexed: 01/17/2023] Open
Abstract
Specific protein interactions are responsible for most biological functions. Distinguishing Functionally Linked Interfaces of Proteins (FLIPs), from Functionally uncorrelated Contacts (FunCs), is therefore important to characterizing these interactions. To achieve this goal, we have created a database of protein structures called FLIPdb, containing proteins belonging to various functional sub-categories. Here, we use geometric features coupled with Kortemme and Baker's computational alanine scanning method to calculate the energetic sensitivity of each amino acid at the interface to substitution, identify hotspots, and identify other factors that may contribute towards an interface being FLIP or FunC. Using Principal Component Analysis and K-means clustering on a training set of 160 interfaces, we could distinguish FLIPs from FunCs with an accuracy of 76%. When these methods were applied to two test sets of 18 and 170 interfaces, we achieved similar accuracies of 78% and 80%. We have identified that FLIP interfaces have a stronger central organizing tendency than FunCs, due, we suggest, to greater specificity. We also observe that certain functional sub-categories, such as enzymes, antibody-heavy-light, antibody-antigen, and enzyme-inhibitors form distinct sub-clusters. The antibody-antigen and enzyme-inhibitors interfaces have patterns of physical characteristics similar to those of FunCs, which is in agreement with the fact that the selection pressures of these interfaces is differently evolutionarily driven. As such, our ECR model also successfully describes the impact of evolution and natural selection on protein-protein interfaces. Finally, we indicate how our ECR method may be of use in reducing the false positive rate of docking calculations.
Collapse
Affiliation(s)
- Sanjana Sudarshan
- Department of Biology, Texas Woman's University, Denton, Texas, United States of America
| | - Sasi B. Kodathala
- Department of Biology, Texas Woman's University, Denton, Texas, United States of America
| | - Amruta C. Mahadik
- Department of Biology, Texas Woman's University, Denton, Texas, United States of America
| | - Isha Mehta
- Department of Biology, Texas Woman's University, Denton, Texas, United States of America
| | - Brian W. Beck
- Department of Biology, Texas Woman's University, Denton, Texas, United States of America
- Department of Mathematics and Computer Science, Texas Woman's University, Denton, Texas, United States of America
- Department of Chemistry and Biochemistry, Texas Woman's University, Denton, Texas, United States of America
- * E-mail:
| |
Collapse
|
11
|
Duarte JM, Srebniak A, Schärer MA, Capitani G. Protein interface classification by evolutionary analysis. BMC Bioinformatics 2012; 13:334. [PMID: 23259833 PMCID: PMC3556496 DOI: 10.1186/1471-2105-13-334] [Citation(s) in RCA: 112] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2012] [Accepted: 12/15/2012] [Indexed: 01/01/2023] Open
Abstract
Background Distinguishing biologically relevant interfaces from lattice contacts in protein crystals is a fundamental problem in structural biology. Despite efforts towards the computational prediction of interface character, many issues are still unresolved. Results We present here a protein-protein interface classifier that relies on evolutionary data to detect the biological character of interfaces. The classifier uses a simple geometric measure, number of core residues, and two evolutionary indicators based on the sequence entropy of homolog sequences. Both aim at detecting differential selection pressure between interface core and rim or rest of surface. The core residues, defined as fully buried residues (>95% burial), appear to be fundamental determinants of biological interfaces: their number is in itself a powerful discriminator of interface character and together with the evolutionary measures it is able to clearly distinguish evolved biological contacts from crystal ones. We demonstrate that this definition of core residues leads to distinctively better results than earlier definitions from the literature. The stringent selection and quality filtering of structural and sequence data was key to the success of the method. Most importantly we demonstrate that a more conservative selection of homolog sequences - with relatively high sequence identities to the query - is able to produce a clearer signal than previous attempts. Conclusions An evolutionary approach like the one presented here is key to the advancement of the field, which so far was missing an effective method exploiting the evolutionary character of protein interfaces. Its coverage and performance will only improve over time thanks to the incessant growth of sequence databases. Currently our method reaches an accuracy of 89% in classifying interfaces of the Ponstingl 2003 datasets and it lends itself to a variety of useful applications in structural biology and bioinformatics. We made the corresponding software implementation available to the community as an easy-to-use graphical web interface at http://www.eppic-web.org.
Collapse
Affiliation(s)
- Jose M Duarte
- Paul Scherrer Institut, Villigen, CH-5232, Switzerland
| | | | | | | |
Collapse
|
12
|
Chae MH, Krull F, Lorenzen S, Knapp EW. Predicting protein complex geometries with a neural network. Proteins 2010; 78:1026-39. [PMID: 19938153 DOI: 10.1002/prot.22626] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
A major challenge of the protein docking problem is to define scoring functions that can distinguish near-native protein complex geometries from a large number of non-native geometries (decoys) generated with noncomplexed protein structures (unbound docking). In this study, we have constructed a neural network that employs the information from atom-pair distance distributions of a large number of decoys to predict protein complex geometries. We found that docking prediction can be significantly improved using two different types of polar hydrogen atoms. To train the neural network, 2000 near-native decoys of even distance distribution were used for each of the 185 considered protein complexes. The neural network normalizes the information from different protein complexes using an additional protein complex identity input neuron for each complex. The parameters of the neural network were determined such that they mimic a scoring funnel in the neighborhood of the native complex structure. The neural network approach avoids the reference state problem, which occurs in deriving knowledge-based energy functions for scoring. We show that a distance-dependent atom pair potential performs much better than a simple atom-pair contact potential. We have compared the performance of our scoring function with other empirical and knowledge-based scoring functions such as ZDOCK 3.0, ZRANK, ITScore-PP, EMPIRE, and RosettaDock. In spite of the simplicity of the method and its functional form, our neural network-based scoring function achieves a reasonable performance in rigid-body unbound docking of proteins. Proteins 2010. (c) 2009 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Myong-Ho Chae
- Department of Biology, University of Science, Unjong-District, Pyongyang, DPR Korea
| | | | | | | |
Collapse
|
13
|
Aleksandrov A, Simonson T. Molecular dynamics simulations show that conformational selection governs the binding preferences of imatinib for several tyrosine kinases. J Biol Chem 2010; 285:13807-15. [PMID: 20200154 DOI: 10.1074/jbc.m110.109660] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Tyrosine kinases transmit cellular signals through a complex mechanism, involving their phosphorylation and switching between inactive and active conformations. The cancer drug imatinib binds tightly to several homologous kinases, including Abl, but weakly to others, including Src. Imatinib specifically targets the inactive, so-called "DFG-out" conformation of Abl, which differs from the preferred, "DFG-in" conformation of Src in the orientation of a conserved Asp-Phe-Gly (DFG) activation loop. However, recent x-ray structures showed that Src can also adopt the DFG-out conformation and uses it to bind imatinib. The Src/Abl-binding free energy difference can thus be decomposed into two contributions. Contribution i measures the different protein-imatinib interactions when either kinase is in its DFG-out conformation. Contribution ii depends on the ability of imatinib to select or induce this conformation, i.e. on the relative stabilities of the DFG-out and DFG-in conformations of each kinase. Neither contribution has been measured experimentally. We use molecular dynamics simulations to show that contribution i is very small, 0.2 +/- 0.6 kcal/mol; imatinib interactions are very similar in the two kinases, including long range electrostatic interactions with the imatinib positive charge. Contribution ii, deduced using the experimental binding free energy difference, is much larger, 4.4 +/- 0.9 kcal/mol. Thus, conformational selection, easy in Abl, difficult in Src, underpins imatinib specificity. Contribution ii has a simple interpretation; it closely approximates the stability difference between the DFG-out and DFG-in conformations of apo-Src. Additional calculations show that conformational selection also governs the relative binding of imatinib to the kinases c-Kit and Lck. These results should help clarify the current framework for engineering kinase signaling.
Collapse
Affiliation(s)
- Alexey Aleksandrov
- Department of Biology, Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, 91128 Palaiseau, France
| | | |
Collapse
|
14
|
Liu Q, Li J. Propensity vectors of low-ASA residue pairs in the distinction of protein interactions. Proteins 2009; 78:589-602. [DOI: 10.1002/prot.22583] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
15
|
Shang X, Wang L, Niu W, Meng G, Fu X, Ni B, Lin Z, Yang Z, Chen X, Wu Y. Rational optimization of tumor epitopes using in silico
analysis-assisted substitution of TCR contact residues. Eur J Immunol 2009; 39:2248-58. [DOI: 10.1002/eji.200939338] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
16
|
Tsuchiya Y, Nakamura H, Kinoshita K. Discrimination between biological interfaces and crystal-packing contacts. Adv Appl Bioinform Chem 2008; 1:99-113. [PMID: 21918609 PMCID: PMC3169932 DOI: 10.2147/aabc.s4255] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
A discrimination method between biologically relevant interfaces and artificial crystal-packing contacts in crystal structures was constructed. The method evaluates protein-protein interfaces in terms of complementarities for hydrophobicity, electrostatic potential and shape on the protein surfaces, and chooses the most probable biological interfaces among all possible contacts in the crystal. The method uses a discriminator named as "COMP", which is a linear combination of the complementarities for the above three surface features and does not correlate with the contact area. The discrimination of homo-dimer interfaces from symmetry-related crystal-packing contacts based on the COMP value achieved the modest success rate. Subsequent detailed review of the discrimination results raised the success rate to about 88.8%. In addition, our discrimination method yielded some clues for understanding the interaction patterns in several examples in the PDB. Thus, the COMP discriminator can also be used as an indicator of the "biological-ness" of protein-protein interfaces.
Collapse
Affiliation(s)
- Yuko Tsuchiya
- Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo, 108-8639, Japan
| | | | | |
Collapse
|
17
|
Launay G, Simonson T. Homology modelling of protein-protein complexes: a simple method and its possibilities and limitations. BMC Bioinformatics 2008; 9:427. [PMID: 18844985 PMCID: PMC2586029 DOI: 10.1186/1471-2105-9-427] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2008] [Accepted: 10/09/2008] [Indexed: 11/21/2022] Open
Abstract
Background Structure-based computational methods are needed to help identify and characterize protein-protein complexes and their function. For individual proteins, the most successful technique is homology modelling. We investigate a simple extension of this technique to protein-protein complexes. We consider a large set of complexes of known structures, involving pairs of single-domain proteins. The complexes are compared with each other to establish their sequence and structural similarities and the relation between the two. Compared to earlier studies, a simpler dataset, a simpler structural alignment procedure, and an additional energy criterion are used. Next, we compare the Xray structures to models obtained by threading the native sequence onto other, homologous complexes. An elementary requirement for a successful energy function is to rank the native structure above any threaded structure. We use the DFIREβ energy function, whose quality and complexity are typical of the models used today. Finally, we compare near-native models to distinctly non-native models. Results If weakly stable complexes are excluded (defined by a binding energy cutoff), as well as a few unusual complexes, a simple homology principle holds: complexes that share more than 35% sequence identity share similar structures and interaction modes; this principle was less clearcut in earlier studies. The energy function was then tested for its ability to identify experimental structures among sets of decoys, produced by a simple threading procedure. On average, the experimental structure is ranked above 92% of the alternate structures. Thus, discrimination of the native structure is good but not perfect. The discrimination of near-native structures is fair. Typically, a single, alternate, non-native binding mode exists that has a native-like energy. Some of the associated failures may correspond to genuine, alternate binding modes and/or native complexes that are artefacts of the crystal environment. In other cases, additional model filtering with more sophisticated tools is needed. Conclusion The results suggest that the simple modelling procedure applied here could help identify and characterize protein-protein complexes. The next step is to apply it on a genomic scale.
Collapse
Affiliation(s)
- Guillaume Launay
- Laboratoire de Biochimie (UMR CNRS 7654), Department of Biology, Ecole Polytechnique, 91128, Palaiseau, France.
| | | |
Collapse
|
18
|
Xu Q, Canutescu AA, Wang G, Shapovalov M, Obradovic Z, Dunbrack RL. Statistical analysis of interface similarity in crystals of homologous proteins. J Mol Biol 2008; 381:487-507. [PMID: 18599072 DOI: 10.1016/j.jmb.2008.06.002] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2008] [Revised: 05/30/2008] [Accepted: 06/02/2008] [Indexed: 11/27/2022]
Abstract
Many proteins function as homo-oligomers and are regulated via their oligomeric state. For some proteins, the stoichiometry of homo-oligomeric states under various conditions has been studied using gel filtration or analytical ultracentrifugation experiments. The interfaces involved in these assemblies may be identified using cross-linking and mass spectrometry, solution-state NMR, and other experiments. However, for most proteins, the actual interfaces that are involved in oligomerization are inferred from X-ray crystallographic structures using assumptions about interface surface areas and physical properties. Examination of interfaces across different Protein Data Bank (PDB) entries in a protein family reveals several important features. First, similarities in space group, asymmetric unit size, and cell dimensions and angles (within 1%) do not guarantee that two crystals are actually the same crystal form, containing similar relative orientations and interactions within the crystal. Conversely, two crystals in different space groups may be quite similar in terms of all the interfaces within each crystal. Second, NMR structures and an existing benchmark of PDB crystallographic entries consisting of 126 dimers as well as larger structures and 132 monomers were used to determine whether the existence or lack of common interfaces across multiple crystal forms can be used to predict whether a protein is an oligomer or not. Monomeric proteins tend to have common interfaces across only a minority of crystal forms, whereas higher-order structures exhibit common interfaces across a majority of available crystal forms. The data can be used to estimate the probability that an interface is biological if two or more crystal forms are available. Finally, the Protein Interfaces, Surfaces, and Assemblies (PISA) database available from the European Bioinformatics Institute is more consistent in identifying interfaces observed in many crystal forms compared with the PDB and the European Bioinformatics Institute's Protein Quaternary Server (PQS). The PDB, in particular, is missing highly likely biological interfaces in its biological unit files for about 10% of PDB entries.
Collapse
Affiliation(s)
- Qifang Xu
- Institute for Cancer Research, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA 19111, USA
| | | | | | | | | | | |
Collapse
|
19
|
Sawaya MR, Kudryashov DS, Pashkov I, Adisetiyo H, Reisler E, Yeates TO. Multiple crystal structures of actin dimers and their implications for interactions in the actin filament. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2008; 64:454-65. [PMID: 18391412 PMCID: PMC2631129 DOI: 10.1107/s0907444908003351] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/11/2007] [Accepted: 01/30/2008] [Indexed: 01/04/2023]
Abstract
Multiple crystal structures are reported of cross-linked actin dimers. Interactions that are conserved across crystal structures suggest detailed interactions that are likely to be present in F-actin filaments. The structure of actin in its monomeric form is known at high resolution, while the structure of filamentous F-actin is only understood at considerably lower resolution. Knowing precisely how the monomers of actin fit together would lead to a deeper understanding of the dynamic behavior of the actin filament. Here, a series of crystal structures of actin dimers are reported which were prepared by cross-linking in either the longitudinal or the lateral direction in the filament state. Laterally cross-linked dimers, comprised of monomers belonging to different protofilaments, are found to adopt configurations in crystals that are not related to the native structure of filamentous actin. In contrast, multiple structures of longitudinal dimers consistently reveal the same interface between monomers within a single protofilament. The reappearance of the same longitudinal interface in multiple crystal structures adds weight to arguments that the interface visualized is similar to that in actin filaments. Highly conserved atomic interactions involving residues 199–205 and 287–291 are highlighted.
Collapse
Affiliation(s)
- Michael R Sawaya
- UCLA Department of Chemistry and Biochemistry, Los Angeles, CA 90095-1569, USA
| | | | | | | | | | | |
Collapse
|
20
|
Bernauer J, Bahadur RP, Rodier F, Janin J, Poupon A. DiMoVo: a Voronoi tessellation-based method for discriminating crystallographic and biological protein-protein interactions. Bioinformatics 2008; 24:652-8. [PMID: 18204058 DOI: 10.1093/bioinformatics/btn022] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Knowledge of the oligomeric state of a protein is often essential for understanding its function and mechanism. Within a protein crystal, each protein monomer is in contact with many others, forming many small interfaces and a few larger ones that are biologically significant if the protein is a homodimer in solution, but not if the protein is monomeric. Telling such 'crystal dimers' from real ones remains a difficult task. RESULTS It has already been demonstrated that the interfaces of native and non-native protein-protein complexes can be distinguished using a combination of parameters computed with a method on the Voronoi tessellation. We show in this article that the same parameters highlight significant differences between the interfaces of biological and crystal dimers. Using these parameters as descriptors in machine learning methods leads to accurate classification of specific and non-specific protein-protein interfaces. AVAILABILITY Software is available at http://fifi.ibbmc.u-psud.fr/DiMoVo.
Collapse
Affiliation(s)
- Julie Bernauer
- Department of Structural Biology, Fairchild Science Building, Stanford University School of Medicine, Stanford, CA 94305-5126, USA
| | | | | | | | | |
Collapse
|
21
|
Bai H, Ma W, Liu S, Lai L. Dynamic property is a key determinant for protein-protein interactions. Proteins 2007; 70:1323-31. [PMID: 17876833 DOI: 10.1002/prot.21625] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Dynamic property is highly correlated with the biological functions of macromolecules, such as the activity and specificity of enzymes and the allosteric regulation in the signal transduction process. Applications of the dynamic property to protein function researches have been discussed and encouraging progresses have been achieved, for example, in enzyme activity and protein-protein docking studies. However, how the global dynamic property contributes to protein-protein interaction was still unclear. We have studied the dynamic property in protein-protein interactions based on Gaussian Network Model and applied it to classify biological and nonbiological protein-protein complexes in crystal structures. The global motion correlation between residues from the two protomers was found to be remarkably different for biological and nonbiological complexes. This correlation has been used to discriminate biological and nonbiological complexes in crystal and gave a classification rate of 86.9% in the cross-validation test. The innovation of this feature is that it is a global dynamic property which does not rely directly on the interfacial properties of the complex. In addition, the correlation of the global motions was found to be weakly correlated with the dissociation rate constant of protein complexes. We suggest that the dynamic property is a key determinant for protein-protein interaction, which can be used to discriminate native and crystal complexes and potentially be applied in protein-protein dynamic rate constants estimations.
Collapse
Affiliation(s)
- Hongjun Bai
- Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | | | | | | |
Collapse
|
22
|
Liu S, Liu S, Zhu X, Liang H, Cao A, Chang Z, Lai L. Nonnatural protein-protein interaction-pair design by key residues grafting. Proc Natl Acad Sci U S A 2007; 104:5330-5. [PMID: 17372228 PMCID: PMC1838465 DOI: 10.1073/pnas.0606198104] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Protein-protein interface design is one of the most exciting fields in protein science; however, designing nonnatural protein-protein interaction pairs remains difficult. In this article we report a de novo design of a nonnatural protein-protein interaction pair by scanning the Protein Data Bank for suitable scaffold proteins that can be used for grafting key interaction residues and can form stable complexes with the target protein after additional mutations. Using our design algorithm, an unrelated protein, rat PLCdelta(1)-PH (pleckstrin homology domain of phospholipase C-delta1), was successfully designed to bind the human erythropoietin receptor (EPOR) after grafting the key interaction residues of human erythropoietin binding to EPOR. The designed mutants of rat PLCdelta(1)-PH were expressed and purified to test their binding affinities with EPOR. A designed triple mutation of PLCdelta(1)-PH (ERPH1) was found to bind EPOR with high affinity (K(D) of 24 nM and an IC(50) of 5.7 microM) both in vitro and in a cell-based assay, respectively, although the WT PLCdelta(1)-PH did not show any detectable binding under the assay conditions. The in vitro binding affinities of the PLCdelta(1)-PH mutants correlate qualitatively to the computational binding affinities, validating the design and the protein-protein interaction model. The successful practice of finding a proper protein scaffold and making it bind with EPOR demonstrates a prospective application in protein engineering targeting protein-protein interfaces.
Collapse
Affiliation(s)
- Sen Liu
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Center for Theoretical Biology, Peking University, Beijing 100871, China; and
| | - Shiyong Liu
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Center for Theoretical Biology, Peking University, Beijing 100871, China; and
| | - Xiaolei Zhu
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Huanhuan Liang
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Aoneng Cao
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Zhijie Chang
- Department of Biological Sciences and Biotechnology, School of Medicine, Tsinghua University, Beijing 100084, China
| | - Luhua Lai
- *Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Center for Theoretical Biology, Peking University, Beijing 100871, China; and
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|