951
|
David A, Razali R, Wass MN, Sternberg MJE. Protein-protein interaction sites are hot spots for disease-associated nonsynonymous SNPs. Hum Mutat 2011; 33:359-63. [PMID: 22072597 DOI: 10.1002/humu.21656] [Citation(s) in RCA: 118] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2011] [Accepted: 10/31/2011] [Indexed: 11/08/2022]
Abstract
Many nonsynonymous single nucleotide polymorphisms (nsSNPs) are disease causing due to effects at protein-protein interfaces. We have integrated a database of the three-dimensional (3D) structures of human protein/protein complexes and the humsavar database of nsSNPs. We analyzed the location of nsSNPS in terms of their location in the protein core, at protein-protein interfaces, and on the surface when not at an interface. Disease-causing nsSNPs that do not occur in the protein core are preferentially located at protein-protein interfaces rather than surface noninterface regions when compared to random segregation. The disruption of the protein-protein interaction can be explained by a range of structural effects including the loss of an electrostatic salt bridge, the destabilization due to reduction of the hydrophobic effect, the formation of a steric clash, and the introduction of a proline altering the main-chain conformation.
Collapse
Affiliation(s)
- Alessia David
- Centre for Integrative Systems Biology and Bioinformatics, Division of Molecular Biosciences, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | | | | | | |
Collapse
|
952
|
Abstract
BACKGROUND Protein side-chain packing problem has remained one of the key open problems in bioinformatics. The three main components of protein side-chain prediction methods are a rotamer library, an energy function and a search algorithm. Rotamer libraries summarize the existing knowledge of the experimentally determined structures quantitatively. Depending on how much contextual information is encoded, there are backbone-independent rotamer libraries and backbone-dependent rotamer libraries. Backbone-independent libraries only encode sequential information, whereas backbone-dependent libraries encode both sequential and locally structural information. However, side-chain conformations are determined by spatially local information, rather than sequentially local information. Since in the side-chain prediction problem, the backbone structure is given, spatially local information should ideally be encoded into the rotamer libraries. METHODS In this paper, we propose a new type of backbone-dependent rotamer library, which encodes structural information of all the spatially neighboring residues. We call it protein-dependent rotamer libraries. Given any rotamer library and a protein backbone structure, we first model the protein structure as a Markov random field. Then the marginal distributions are estimated by the inference algorithms, without doing global optimization or search. The rotamers from the given library are then re-ranked and associated with the updated probabilities. RESULTS Experimental results demonstrate that the proposed protein-dependent libraries significantly outperform the widely used backbone-dependent libraries in terms of the side-chain prediction accuracy and the rotamer ranking ability. Furthermore, without global optimization/search, the side-chain prediction power of the protein-dependent library is still comparable to the global-search-based side-chain prediction methods.
Collapse
Affiliation(s)
- Md Shariful Islam Bhuyan
- Mathematical and Computer Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, 23955, KSA
| | | |
Collapse
|
953
|
Koehl P, Orland H, Delarue M. Adapting Poisson-Boltzmann to the self-consistent mean field theory: application to protein side-chain modeling. J Chem Phys 2011; 135:055104. [PMID: 21823735 DOI: 10.1063/1.3621831] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We present an extension of the self-consistent mean field theory for protein side-chain modeling in which solvation effects are included based on the Poisson-Boltzmann (PB) theory. In this approach, the protein is represented with multiple copies of its side chains. Each copy is assigned a weight that is refined iteratively based on the mean field energy generated by the rest of the protein, until self-consistency is reached. At each cycle, the variational free energy of the multi-copy system is computed; this free energy includes the internal energy of the protein that accounts for vdW and electrostatics interactions and a solvation free energy term that is computed using the PB equation. The method converges in only a few cycles and takes only minutes of central processing unit time on a commodity personal computer. The predicted conformation of each residue is then set to be its copy with the highest weight after convergence. We have tested this method on a database of hundred highly refined NMR structures to circumvent the problems of crystal packing inherent to x-ray structures. The use of the PB-derived solvation free energy significantly improves prediction accuracy for surface side chains. For example, the prediction accuracies for χ(1) for surface cysteine, serine, and threonine residues improve from 68%, 35%, and 43% to 80%, 53%, and 57%, respectively. A comparison with other side-chain prediction algorithms demonstrates that our approach is consistently better in predicting the conformations of exposed side chains.
Collapse
Affiliation(s)
- Patrice Koehl
- Department of Biological Sciences, National University of Singapore, Singapore.
| | | | | |
Collapse
|
954
|
Levit A, Yarnitzky T, Wiener A, Meidan R, Niv MY. Modeling of human prokineticin receptors: interactions with novel small-molecule binders and potential off-target drugs. PLoS One 2011; 6:e27990. [PMID: 22132188 PMCID: PMC3221691 DOI: 10.1371/journal.pone.0027990] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2011] [Accepted: 10/29/2011] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND AND MOTIVATION The Prokineticin receptor (PKR) 1 and 2 subtypes are novel members of family A GPCRs, which exhibit an unusually high degree of sequence similarity. Prokineticins (PKs), their cognate ligands, are small secreted proteins of ∼80 amino acids; however, non-peptidic low-molecular weight antagonists have also been identified. PKs and their receptors play important roles under various physiological conditions such as maintaining circadian rhythm and pain perception, as well as regulating angiogenesis and modulating immunity. Identifying binding sites for known antagonists and for additional potential binders will facilitate studying and regulating these novel receptors. Blocking PKRs may serve as a therapeutic tool for various diseases, including acute pain, inflammation and cancer. METHODS AND RESULTS Ligand-based pharmacophore models were derived from known antagonists, and virtual screening performed on the DrugBank dataset identified potential human PKR (hPKR) ligands with novel scaffolds. Interestingly, these included several HIV protease inhibitors for which endothelial cell dysfunction is a documented side effect. Our results suggest that the side effects might be due to inhibition of the PKR signaling pathway. Docking of known binders to a 3D homology model of hPKR1 is in agreement with the well-established canonical TM-bundle binding site of family A GPCRs. Furthermore, the docking results highlight residues that may form specific contacts with the ligands. These contacts provide structural explanation for the importance of several chemical features that were obtained from the structure-activity analysis of known binders. With the exception of a single loop residue that might be perused in the future for obtaining subtype-specific regulation, the results suggest an identical TM-bundle binding site for hPKR1 and hPKR2. In addition, analysis of the intracellular regions highlights variable regions that may provide subtype specificity.
Collapse
Affiliation(s)
- Anat Levit
- Institute of Biochemistry, Food Science and Nutrition, Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
- Department of Animal Sciences, Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Talia Yarnitzky
- Institute of Biochemistry, Food Science and Nutrition, Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Ayana Wiener
- Institute of Biochemistry, Food Science and Nutrition, Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Rina Meidan
- Department of Animal Sciences, Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Masha Y. Niv
- Institute of Biochemistry, Food Science and Nutrition, Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
- The Fritz Haber Center for Molecular Dynamics, The Hebrew University of Jerusalem, Jerusalem, Israel
- * E-mail:
| |
Collapse
|
955
|
Tian F, Lv Y, Yang L. Structure-based prediction of protein–protein binding affinity with consideration of allosteric effect. Amino Acids 2011; 43:531-43. [DOI: 10.1007/s00726-011-1101-1] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2011] [Accepted: 09/21/2011] [Indexed: 11/28/2022]
|
956
|
Xu D, Zhang Y. Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization. Biophys J 2011; 101:2525-34. [PMID: 22098752 DOI: 10.1016/j.bpj.2011.10.024] [Citation(s) in RCA: 750] [Impact Index Per Article: 53.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2011] [Revised: 09/20/2011] [Accepted: 10/21/2011] [Indexed: 11/15/2022] Open
Abstract
Most protein structural prediction algorithms assemble structures as reduced models that represent amino acids by a reduced number of atoms to speed up the conformational search. Building accurate full-atom models from these reduced models is a necessary step toward a detailed function analysis. However, it is difficult to ensure that the atomic models retain the desired global topology while maintaining a sound local atomic geometry because the reduced models often have unphysical local distortions. To address this issue, we developed a new program, called ModRefiner, to construct and refine protein structures from Cα traces based on a two-step, atomic-level energy minimization. The main-chain structures are first constructed from initial Cα traces and the side-chain rotamers are then refined together with the backbone atoms with the use of a composite physics- and knowledge-based force field. We tested the method by performing an atomic structure refinement of 261 proteins with the initial models constructed from both ab initio and template-based structure assemblies. Compared with other state-of-art programs, ModRefiner shows improvements in both global and local structures, which have more accurate side-chain positions, better hydrogen-bonding networks, and fewer atomic overlaps. ModRefiner is freely available at http://zhanglab.ccmb.med.umich.edu/ModRefiner.
Collapse
Affiliation(s)
- Dong Xu
- Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | | |
Collapse
|
957
|
Nagata K, Randall A, Baldi P. SIDEpro: a novel machine learning approach for the fast and accurate prediction of side-chain conformations. Proteins 2011; 80:142-53. [PMID: 22072531 DOI: 10.1002/prot.23170] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2011] [Revised: 07/07/2011] [Accepted: 07/31/2011] [Indexed: 11/10/2022]
Abstract
Accurate protein side-chain conformation prediction is crucial for protein modeling and existing methods for the task are widely used; however, faster and more accurate methods are still required. Here we present a new machine learning approach to the problem where an energy function for each rotamer in a structure is computed additively over pairs of contacting atoms. A family of 156 neural networks indexed by amino acid and contacting atom types is used to compute these rotamer energies as a function of atomic contact distances. Although direct energy targets are not available for training, the neural networks can still be optimized by converting the energies to probabilities and optimizing these probabilities using Markov Chain Monte Carlo methods. The resulting predictor SIDEpro makes predictions by initially setting the rotamer probabilities for each residue from a backbone-dependent rotamer library, then iteratively updating these probabilities using the trained neural networks. After convergences of the probabilities, the side-chains are set to the highest probability rotamer. Finally, a post processing clash reduction step is applied to the models. SIDEpro represents a significant improvement in speed and a modest, but statistically significant, improvement in accuracy when compared with the state-of-the-art for rapid side-chain prediction method SCWRL4 on the following datasets: (1) 379 protein test set of SCWRL4; (2) 94 proteins from CASP9; (3) a set of seven large protein-only complexes; and (4) a ribosome with and without the RNA. Using the SCWRL4 test set, SIDEpro's accuracy (χ(1) 86.14%, χ(1+2) 74.15%) is slightly better than SCWRL4-FRM (χ(1) 85.43%, χ(1+2) 73.47%) and it is 7.0 times faster. On the same test set SIDEpro is clearly more accurate than SCWRL4-rigid rotamer model (RRM) (χ(1) 84.15%, χ(1+2) 71.24%) and 2.4 times faster. Evaluation on the additional test sets yield similar accuracy results with SIDEpro being slightly more accurate than SCWRL4-flexible rotamer model (FRM) and clearly more accurate than SCWRL4-RRM; however, the gap in CPU time is much more significant when the methods are applied to large protein complexes. SIDEpro is part of the SCRATCH suite of predictors and available from: http://scratch.proteomics.ics.uci.edu/.
Collapse
Affiliation(s)
- Ken Nagata
- Department of Computer Science, University of California, Irvine, California, USA
| | | | | |
Collapse
|
958
|
Jaroszewski L, Li Z, Cai XH, Weber C, Godzik A. FFAS server: novel features and applications. Nucleic Acids Res 2011; 39:W38-44. [PMID: 21715387 PMCID: PMC3125803 DOI: 10.1093/nar/gkr441] [Citation(s) in RCA: 120] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
The Fold and Function Assignment System (FFAS) server [Jaroszewski et al. (2005) FFAS03: a server for profile–profile sequence alignments. Nucleic Acids Research, 33, W284–W288] implements the algorithm for protein profile–profile alignment introduced originally in [Rychlewski et al. (2000) Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Science: a Publication of the Protein Society, 9, 232–241]. Here, we present updates, changes and novel functionality added to the server since 2005 and discuss its new applications. The sequence database used to calculate sequence profiles was enriched by adding sets of publicly available metagenomic sequences. The profile of a user’s protein can now be compared with ∼20 additional profile databases, including several complete proteomes, human proteins involved in genetic diseases and a database of microbial virulence factors. A newly developed interface uses a system of tabs, allowing the user to navigate multiple results pages, and also includes novel functionality, such as a dotplot graph viewer, modeling tools, an improved 3D alignment viewer and links to the database of structural similarities. The FFAS server was also optimized for speed: running times were reduced by an order of magnitude. The FFAS server, http://ffas.godziklab.org, has no log-in requirement, albeit there is an option to register and store results in individual, password-protected directories. Source code and Linux executables for the FFAS program are available for download from the FFAS server.
Collapse
Affiliation(s)
- Lukasz Jaroszewski
- Bioinformatics and Systems Biology Program, Sanford Burnham Medical Research Institute, 10901 N. Torrey Pines Road, La Jolla, CA 92037, USA
| | | | | | | | | |
Collapse
|
959
|
Tamamis P, Pierou P, Mytidou C, Floudas CA, Morikis D, Archontis G. Design of a modified mouse protein with ligand binding properties of its human analog by molecular dynamics simulations: the case of C3 inhibition by compstatin. Proteins 2011; 79:3166-79. [PMID: 21989937 PMCID: PMC3193182 DOI: 10.1002/prot.23149] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2011] [Revised: 07/05/2011] [Accepted: 07/25/2011] [Indexed: 01/26/2023]
Abstract
The peptide compstatin and its derivatives inhibit the complement-component protein C3 in primate mammals and are potential therapeutic agents against the unregulated activation of complement in humans, but are inactive against C3 from lower mammals. Recent molecular dynamics (MD) simulations showed that the most potent compstatin analog comprised entirely of natural amino acids (W4A9) had a smaller affinity for rat C3, due to reproducible changes in the rat protein structure with respect to the human protein, which eliminated or weakened specific protein-ligand interactions seen in the human C3:W4A9 complex. Here, we study by MD simulations three W4A9 complexes with the mouse C3 protein, and two "transgenic" mouse derivatives, containing a small number (6-9) of human C3 substitutions. The mouse complex experiences the conformational changes and affinity reduction of the rat complex. In the "transgenic" complexes, the conformation remains closer to that of the human complex, the protein-ligand interactions are improved, and the affinity for compstatin becomes "human-like." The present work creates new avenues for a compstatin-sensitive animal model. A similar strategy, involving the comparison of a series of complexes by MD simulations, could be used to design "transgenic" sequences in other systems.
Collapse
Affiliation(s)
- Phanourios Tamamis
- Department of Physics, University of Cyprus, PO20537, CY1678, Nicosia, Cyprus
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, USA
- Department of Bioengineering, University of California, Riverside, California 92521, USA
| | - Panayiota Pierou
- Department of Physics, University of Cyprus, PO20537, CY1678, Nicosia, Cyprus
| | - Chrystalla Mytidou
- Department of Physics, University of Cyprus, PO20537, CY1678, Nicosia, Cyprus
| | | | - Dimitrios Morikis
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, USA
| | - Georgios Archontis
- Department of Physics, University of Cyprus, PO20537, CY1678, Nicosia, Cyprus
| |
Collapse
|
960
|
Segura J, Oliva B, Fernandez-Fuentes N. CAPS-DB: a structural classification of helix-capping motifs. Nucleic Acids Res 2011; 40:D479-85. [PMID: 22021380 PMCID: PMC3245141 DOI: 10.1093/nar/gkr879] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
The regions of the polypeptide chain immediately preceding or following an α-helix are known as Nt- and Ct cappings, respectively. Cappings play a central role stabilizing α-helices due to lack of intrahelical hydrogen bonds in the first and last turn. Sequence patterns of amino acid type preferences have been derived for cappings but the structural motifs associated to them are still unclassified. CAPS-DB is a database of clusters of structural patterns of different capping types. The clustering algorithm is based in the geometry and the (ϕ–ψ)-space conformation of these regions. CAPS-DB is a relational database that allows the user to search, browse, inspect and retrieve structural data associated to cappings. The contents of CAPS-DB might be of interest to a wide range of scientist covering different areas such as protein design and engineering, structural biology and bioinformatics. The database is accessible at: http://www.bioinsilico.org/CAPSDB.
Collapse
Affiliation(s)
- Joan Segura
- Leeds Institute of Molecular Medicine, Section of Experimental Therapeutics, University of Leeds, St James's University Hospital, Leeds LS9 7TF, UK
| | | | | |
Collapse
|
961
|
Sethi A, Goldstein B, Gnanakaran S. Quantifying intramolecular binding in multivalent interactions: a structure-based synergistic study on Grb2-Sos1 complex. PLoS Comput Biol 2011; 7:e1002192. [PMID: 22022247 PMCID: PMC3192808 DOI: 10.1371/journal.pcbi.1002192] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2011] [Accepted: 07/27/2011] [Indexed: 01/27/2023] Open
Abstract
Numerous signaling proteins use multivalent binding to increase the specificity and affinity of their interactions within the cell. Enhancement arises because the effective binding constant for multivalent binding is larger than the binding constants for each individual interaction. We seek to gain both qualitative and quantitative understanding of the multivalent interactions of an adaptor protein, growth factor receptor bound protein-2 (Grb2), containing two SH3 domains interacting with the nucleotide exchange factor son-of-sevenless 1 (Sos1) containing multiple polyproline motifs separated by flexible unstructured regions. Grb2 mediates the recruitment of Sos1 from the cytosol to the plasma membrane where it activates Ras by inducing the exchange of GDP for GTP. First, using a combination of evolutionary information and binding energy calculations, we predict an additional polyproline motif in Sos1 that binds to the SH3 domains of Grb2. This gives rise to a total of five polyproline motifs in Sos1 that are capable of binding to the two SH3 domains of Grb2. Then, using a hybrid method combining molecular dynamics simulations and polymer models, we estimate the enhancement in local concentration of a polyproline motif on Sos1 near an unbound SH3 domain of Grb2 when its other SH3 domain is bound to a different polyproline motif on Sos1. We show that the local concentration of the Sos1 motifs that a Grb2 SH3 domain experiences is approximately 1000 times greater than the cellular concentration of Sos1. Finally, we calculate the intramolecular equilibrium constants for the crosslinking of Grb2 on Sos1 and use thermodynamic modeling to calculate the stoichiometry. With these equilibrium constants, we are able to predict the distribution of complexes that form at physiological concentrations. We believe this is the first systematic analysis that combines sequence, structure, and thermodynamic analyses to determine the stoichiometry of the complexes that are dominant in the cellular environment.
Collapse
Affiliation(s)
- Anurag Sethi
- Theoretical Biology and Biophysics, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Byron Goldstein
- Theoretical Biology and Biophysics, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - S. Gnanakaran
- Theoretical Biology and Biophysics, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- * E-mail:
| |
Collapse
|
962
|
Wymore T, Chen BY, Nicholas HB, Ropelewski AJ, Brooks CL. A Mechanism for Evolving Novel Plant Sesquiterpene Synthase Function. Mol Inform 2011; 30:896-906. [DOI: 10.1002/minf.201100087] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2011] [Accepted: 09/11/2011] [Indexed: 11/06/2022]
|
963
|
Zeng J, Roberts KE, Zhou P, Donald BR. A Bayesian approach for determining protein side-chain rotamer conformations using unassigned NOE data. J Comput Biol 2011; 18:1661-79. [PMID: 21970619 DOI: 10.1089/cmb.2011.0172] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A major bottleneck in protein structure determination via nuclear magnetic resonance (NMR) is the lengthy and laborious process of assigning resonances and nuclear Overhauser effect (NOE) cross peaks. Recent studies have shown that accurate backbone folds can be determined using sparse NMR data, such as residual dipolar couplings (RDCs) or backbone chemical shifts. This opens a question of whether we can also determine the accurate protein side-chain conformations using sparse or unassigned NMR data. We attack this question by using unassigned nuclear Overhauser effect spectroscopy (NOESY) data, which records the through-space dipolar interactions between protons nearby in three-dimensional (3D) space. We propose a Bayesian approach with a Markov random field (MRF) model to integrate the likelihood function derived from observed experimental data, with prior information (i.e., empirical molecular mechanics energies) about the protein structures. We unify the side-chain structure prediction problem with the side-chain structure determination problem using unassigned NMR data, and apply the deterministic dead-end elimination (DEE) and A* search algorithms to provably find the global optimum solution that maximizes the posterior probability. We employ a Hausdorff-based measure to derive the likelihood of a rotamer or a pairwise rotamer interaction from unassigned NOESY data. In addition, we apply a systematic and rigorous approach to estimate the experimental noise in NMR data, which also determines the weighting factor of the data term in the scoring function derived from the Bayesian framework. We tested our approach on real NMR data of three proteins: the FF Domain 2 of human transcription elongation factor CA150 (FF2), the B1 domain of Protein G (GB1), and human ubiquitin. The promising results indicate that our algorithm can be applied in high-resolution protein structure determination. Since our approach does not require any NOE assignment, it can accelerate the NMR structure determination process.
Collapse
Affiliation(s)
- Jianyang Zeng
- Department of Computer Science, Duke University, Durham, NC 27708, USA
| | | | | | | |
Collapse
|
964
|
Hu C, Koehl P, Max N. PackHelix: a tool for helix-sheet packing during protein structure prediction. Proteins 2011; 79:2828-43. [PMID: 21905109 PMCID: PMC3172692 DOI: 10.1002/prot.23108] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2011] [Revised: 04/18/2011] [Accepted: 05/13/2011] [Indexed: 11/09/2022]
Abstract
The three-dimensional structure of a protein is organized around the packing of its secondary structure elements. Predicting the topology and constructing the geometry of structural motifs involving α-helices and/or β-strands are therefore key steps for accurate prediction of protein structure. While many efforts have focused on how to pack helices and on how to sample exhaustively the topologies and geometries of multiple strands forming a β-sheet in a protein, there has been little progress on generating native-like packings of helices on sheets. We describe a method that can generate the packing of multiple helices on a given β-sheet for αβα sandwich type protein folds. This method mines the results of a statistical analysis of the conformations of αβ(2) motifs in protein structures to provide input values for the geometric attributes of the packing of a helix on a sheet. It then proceeds with a geometric builder that generates multiple arrangements of the helices on the sheet of interest by sampling through these values and performing consistency checks that guarantee proper loop geometry between the helices and the strands, minimal number of collisions between the helices, and proper formation of a hydrophobic core. The method is implemented as a module of ProteinShop. Our results show that it produces structures that are within 4-6 Å RMSD of the native one, regardless of the number of helices that need to be packed, though this number may increase if the protein has several helices between two consecutive strands in the sequence that pack on the sheet formed by these two strands.
Collapse
Affiliation(s)
- Chengcheng Hu
- Department of Computer Science, University of California, Davis, CA 95616
| | - Patrice Koehl
- Department of Computer Science and Genome Center, University of California, Davis, CA 95616
| | - Nelson Max
- Department of Computer Science, University of California, Davis, CA 95616
| |
Collapse
|
965
|
Characterization of PDZ domain–peptide interactions using an integrated protocol of QM/MM, PB/SA, and CFEA analyses. J Comput Aided Mol Des 2011; 25:947-58. [DOI: 10.1007/s10822-011-9474-5] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2011] [Accepted: 09/13/2011] [Indexed: 01/04/2023]
|
966
|
Structure-based characterization of the binding of peptide to the human endophilin-1 Src homology 3 domain using position-dependent noncovalent potential analysis. J Mol Model 2011; 18:2153-61. [DOI: 10.1007/s00894-011-1197-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2011] [Accepted: 07/20/2011] [Indexed: 02/05/2023]
|
967
|
Fernandez FJ, Garces F, López-Estepa M, Aguilar J, Baldomà L, Coll M, Badia J, Vega MC. The UlaG protein family defines novel structural and functional motifs grafted on an ancient RNase fold. BMC Evol Biol 2011; 11:273. [PMID: 21943130 PMCID: PMC3219644 DOI: 10.1186/1471-2148-11-273] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2011] [Accepted: 09/26/2011] [Indexed: 12/13/2022] Open
Abstract
Background Bacterial populations are highly successful at colonizing new habitats and adapting to changing environmental conditions, partly due to their capacity to evolve novel virulence and metabolic pathways in response to stress conditions and to shuffle them by horizontal gene transfer (HGT). A common theme in the evolution of new functions consists of gene duplication followed by functional divergence. UlaG, a unique manganese-dependent metallo-β-lactamase (MBL) enzyme involved in L-ascorbate metabolism by commensal and symbiotic enterobacteria, provides a model for the study of the emergence of new catalytic activities from the modification of an ancient fold. Furthermore, UlaG is the founding member of the so-called UlaG-like (UlaGL) protein family, a recently established and poorly characterized family comprising divalent (and perhaps trivalent) metal-binding MBLs that catalyze transformations on phosphorylated sugars and nucleotides. Results Here we combined protein structure-guided and sequence-only molecular phylogenetic analyses to dissect the molecular evolution of UlaG and to study its phylogenomic distribution, its relatedness with present-day UlaGL protein sequences and functional conservation. Phylogenetic analyses indicate that UlaGL sequences are present in Bacteria and Archaea, with bona fide orthologs found mainly in mammalian and plant-associated Gram-negative and Gram-positive bacteria. The incongruence between the UlaGL tree and known species trees indicates exchange by HGT and suggests that the UlaGL-encoding genes provided a growth advantage under changing conditions. Our search for more distantly related protein sequences aided by structural homology has uncovered that UlaGL sequences have a common evolutionary origin with present-day RNA processing and metabolizing MBL enzymes widespread in Bacteria, Archaea, and Eukarya. This observation suggests an ancient origin for the UlaGL family within the broader trunk of the MBL superfamily by duplication, neofunctionalization and fixation. Conclusions Our results suggest that the forerunner of UlaG was present as an RNA metabolizing enzyme in the last common ancestor, and that the modern descendants of that ancestral gene have a wide phylogenetic distribution and functional roles. We propose that the UlaGL family evolved new metabolic roles among bacterial and possibly archeal phyla in the setting of a close association with metazoans, such as in the mammalian gastrointestinal tract or in animal and plant pathogens, as well as in environmental settings. Accordingly, the major evolutionary forces shaping the UlaGL family include vertical inheritance and lineage-specific duplication and acquisition of novel metabolic functions, followed by HGT and numerous lineage-specific gene loss events.
Collapse
Affiliation(s)
- Francisco J Fernandez
- Structural and Quantitative Biology Department, Centro de Investigaciones Biológicas (CIB-CSIC), Madrid, Spain.
| | | | | | | | | | | | | | | |
Collapse
|
968
|
Abstract
MOTIVATION Modeling of side chain conformations constitutes an indispensable effort in protein structure modeling, protein-protein docking and protein design. Thanks to an intensive attention to this field, many of the existing programs can achieve reasonably good and comparable prediction accuracy. Moreover, in our previous work on CIS-RR, we argued that the prediction with few atomic clashes can complement the current existing methods for subsequent analysis and refinement of protein structures. However, these recent efforts to enhance the quality of predicted side chains have been accompanied by a significant increase of computational cost. RESULTS In this study, by mainly focusing on improving the speed of side chain conformation prediction, we present a RApid Side-chain Predictor, called RASP. To achieve a much faster speed with a comparable accuracy to the best existing methods, we not only employ the clash elimination strategy of CIS-RR, but also carefully optimize energy terms and integrate different search algorithms. In comprehensive benchmark testings, RASP is over one order of magnitude faster (~ 40 times over CIS-RR) than the recently developed methods, while achieving comparable or even better accuracy.
Collapse
Affiliation(s)
- Zhichao Miao
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | | | | |
Collapse
|
969
|
Liao WWP, Arthur JW. Predicting peptide binding affinities to MHC molecules using a modified semi-empirical scoring function. PLoS One 2011; 6:e25055. [PMID: 21966412 PMCID: PMC3178607 DOI: 10.1371/journal.pone.0025055] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2011] [Accepted: 08/23/2011] [Indexed: 12/19/2022] Open
Abstract
The Major Histocompatibility Complex (MHC) plays an important role in the human immune system. The MHC is involved in the antigen presentation system assisting T cells to identify foreign or pathogenic proteins. However, an MHC molecule binding a self-peptide may incorrectly trigger an immune response and cause an autoimmune disease, such as multiple sclerosis. Understanding the molecular mechanism of this process will greatly assist in determining the aetiology of various diseases and in the design of effective drugs. In the present study, we have used the Fresno semi-empirical scoring function and modify the approach to the prediction of peptide-MHC binding by using open-source and public domain software. We apply the method to HLA class II alleles DR15, DR1, and DR4, and the HLA class I allele HLA A2. Our analysis shows that using a large set of binding data and multiple crystal structures improves the predictive capability of the method. The performance of the method is also shown to be correlated to the structural similarity of the crystal structures used. We have exposed some of the obstacles faced by structure-based prediction methods and proposed possible solutions to those obstacles. It is envisaged that these obstacles need to be addressed before the performance of structure-based methods can be on par with the sequence-based methods.
Collapse
Affiliation(s)
- Webber W. P. Liao
- Sydney Medical School, University of Sydney, Sydney, New South Wales, Australia
| | - Jonathan W. Arthur
- Sydney Medical School, University of Sydney, Sydney, New South Wales, Australia
- Children's Medical Research Institute, Sydney, New South Wales, Australia
- * E-mail:
| |
Collapse
|
970
|
Bi J, Song R, Yang H, Li B, Fan J, Liu Z, Long C. Stepwise identification of HLA-A*0201-restricted CD8+ T-cell epitope peptides from herpes simplex virus type 1 genome boosted by a StepRank scheme. Biopolymers 2011; 96:328-39. [PMID: 21072852 DOI: 10.1002/bip.21564] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Identification of immunodominant epitopes is the first step in the rational design of peptide vaccines aimed at T-cell immunity. To date, however, it is yet a great challenge for accurately predicting the potent epitope peptides from a pool of large-scale candidates with an efficient manner. In this study, a method that we named StepRank has been developed for the reliable and rapid prediction of binding capabilities/affinities between proteins and genome-wide peptides. In this procedure, instead of single strategy used in most traditional epitope identification algorithms, four steps with different purposes and thus different computational demands are employed in turn to screen the large-scale peptide candidates that are normally generated from, for example, pathogenic genome. The steps 1 and 2 aim at qualitative exclusion of typical nonbinders by using empirical rule and linear statistical approach, while the steps 3 and 4 focus on quantitative examination and prediction of the interaction energy profile and binding affinity of peptide to target protein via quantitative structure-activity relationship (QSAR) and structure-based free energy analysis. We exemplify this method through its application to binding predictions of the peptide segments derived from the 76 known open-reading frames (ORFs) of herpes simplex virus type 1 (HSV-1) genome with or without affinity to human major histocompatibility complex class I (MHC I) molecule HLA-A*0201, and find that the predictive results are well compatible with the classical anchor residue theory and perfectly match for the extended motif pattern of MHC I-binding peptides. The putative epitopes are further confirmed by comparisons with 11 experimentally measured HLA-A*0201-restrcited peptides from the HSV-1 glycoproteins D and K. We expect that this well-designed scheme can be applied in the computational screening of other viral genomes as well.
Collapse
Affiliation(s)
- Jianjun Bi
- Department of Dermatology, General Hospital of Guangzhou Military Command of PLA, Guangzhou, China
| | | | | | | | | | | | | |
Collapse
|
971
|
Terakawa T, Takada S. Multiscale ensemble modeling of intrinsically disordered proteins: p53 N-terminal domain. Biophys J 2011; 101:1450-8. [PMID: 21943426 DOI: 10.1016/j.bpj.2011.08.003] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2011] [Revised: 07/29/2011] [Accepted: 08/01/2011] [Indexed: 11/29/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) are ubiquitous and play key roles in transcriptional regulations and other cellular processes. To characterize diverse structural ensembles of IDPs, combinations of NMR and computational modeling showed some promise, but they need further improvements. Here, for accurate and efficient modeling of IDPs, we propose a systematic multiscale computational method. We first perform all-atom replica-exchange molecular dynamics (MD) simulations of a few fragments selected from a target IDP. These results together with generic knowledge-based local potentials are fed into the iterative Boltzmann inversion method to obtain an accurate coarse-grained potential. Then coarse-grained MD simulations provide the IDP ensemble. We tested the new method for the disordered N-terminal domain of p53 showing that the method reproduced the residual dipolar coupling and x-ray scattering profile very accurately. Further local structure analyses revealed that, guided by all-atom MD ensemble of fragments, the p53 N-terminal domain ensemble was biased to kinked structures in the AD1 region and biased to extended conformers in a proline-rich region and these biases contributed to improvement of the reproduction of the experiments.
Collapse
Affiliation(s)
- Tsuyoshi Terakawa
- Department of Biophysics Graduate School of Science, Kyoto University, Kyoto, Japan
| | | |
Collapse
|
972
|
Li N, Hou T, Ding B, Wang W. Characterization of PDZ domain-peptide interaction interface based on energetic patterns. Proteins 2011; 79:3208-20. [PMID: 21928318 DOI: 10.1002/prot.23157] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2010] [Revised: 07/12/2011] [Accepted: 07/14/2011] [Indexed: 11/12/2022]
Abstract
PDZ domain is one of the abundant modular domains that recognize short peptide sequences to mediate protein-protein interactions. To decipher the binding specificity of PDZ domain, we analyzed the interactions between 11 mouse PDZ domains and 217 [corrected] peptides using a method called MIEC-SVM, which energetically characterizes the domain-peptide interaction using molecular interaction energy components (MIECs) and predicts binding specificity using support vector machine (SVM). Cross-validation and leave-one-domain-out test showed that the MIEC-SVM using all 44 PDZ-peptide residue pairs at the interaction interface outperformed the sequence-based methods in the literature. A further feature (residue pair) selection procedure illustrated that 16 residue pairs were uninformative to the binding specificity, even though they contributed significantly (~50%) to the binding energy. If only using the 28 informative residue pairs, the performance of the MIEC-SVM on predicting the PDZ binding specificity was significantly improved. This analysis suggests that the informative and uninformative residue interactions between the PDZ domain and the peptide may represent those contributing to binding specificity and affinity, respectively. We performed additional structural and energetic analyses to shed light on understanding how the PDZ-peptide recognition is established. The success of the MIEC-SVM method on PDZ domains in this study and SH3 domains in our previous studies illustrates its generality on characterizing protein-peptide interactions and understanding protein recognition from a structural and energetic viewpoint.
Collapse
Affiliation(s)
- Nan Li
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California 92093-0359, USA
| | | | | | | |
Collapse
|
973
|
A systematical comparison of DFT methods in reproducing the interaction energies of halide series with protein moieties. J Mol Model 2011; 18:2079-98. [DOI: 10.1007/s00894-011-1232-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2011] [Accepted: 08/24/2011] [Indexed: 10/17/2022]
|
974
|
Structure-based discovery of prescription drugs that interact with the norepinephrine transporter, NET. Proc Natl Acad Sci U S A 2011; 108:15810-5. [PMID: 21885739 DOI: 10.1073/pnas.1106030108] [Citation(s) in RCA: 101] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The norepinephrine transporter (NET) transports norepinephrine from the synapse into presynaptic neurons, where norepinephrine regulates signaling pathways associated with cardiovascular effects and behavioral traits via binding to various receptors (e.g., β2-adrenergic receptor). NET is a known target for a variety of prescription drugs, including antidepressants and psychostimulants, and may mediate off-target effects of other prescription drugs. Here, we identify prescription drugs that bind NET, using virtual ligand screening followed by experimental validation of predicted ligands. We began by constructing a comparative structural model of NET based on its alignment to the atomic structure of a prokaryotic NET homolog, the leucine transporter LeuT. The modeled binding site was validated by confirming that known NET ligands can be docked favorably compared to nonbinding molecules. We then computationally screened 6,436 drugs from the Kyoto Encyclopedia of Genes and Genomes (KEGG DRUG) against the NET model. Ten of the 18 high-scoring drugs tested experimentally were found to be NET inhibitors; five of these were chemically novel ligands of NET. These results may rationalize the efficacy of several sympathetic (tuaminoheptane) and antidepressant (tranylcypromine) drugs, as well as side effects of diabetes (phenformin) and Alzheimer's (talsaclidine) drugs. The observations highlight the utility of virtual screening against a comparative model, even when the target shares less than 30% sequence identity with its template structure and no known ligands in the primary binding site.
Collapse
|
975
|
Song Y. Exploring conformational changes coupled to ionization states using a hybrid Rosetta-MCCE protocol. Proteins 2011; 79:3356-63. [DOI: 10.1002/prot.23146] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2011] [Revised: 07/09/2011] [Accepted: 07/24/2011] [Indexed: 11/08/2022]
|
976
|
Liang S, Zheng D, Zhang C, Standley DM. Fast and accurate prediction of protein side-chain conformations. ACTA ACUST UNITED AC 2011; 27:2913-4. [PMID: 21873640 PMCID: PMC3187653 DOI: 10.1093/bioinformatics/btr482] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Summary: We developed a fast and accurate side-chain modeling program [Optimized Side Chain Atomic eneRgy (OSCAR)-star] based on orientation-dependent energy functions and a rigid rotamer model. The average computing time was 18 s per protein for 218 test proteins with higher prediction accuracy (1.1% increase for χ1 and 0.8% increase for χ1+2) than the best performing program developed by other groups. We show that the energy functions, which were calibrated to tolerate the discrete errors of rigid rotamers, are appropriate for protein loop selection, especially for decoys without extensive structural refinement. Availability: OSCAR-star and the 218 test proteins are available for download at http://sysimm.ifrec.osaka-u.ac.jp/OSCAR Contact:standley@ifrec.osaka-u.ac.jp Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Shide Liang
- Systems Immunology Lab, Immunology Frontier Research Center, Osaka University, Suita, Osaka 565-0871, Japan
| | | | | | | |
Collapse
|
977
|
Singh N, Pydi SP, Upadhyaya J, Chelikani P. Structural basis of activation of bitter taste receptor T2R1 and comparison with Class A G-protein-coupled receptors (GPCRs). J Biol Chem 2011; 286:36032-36041. [PMID: 21852241 DOI: 10.1074/jbc.m111.246983] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The human bitter taste receptors (T2Rs) are non-Class A members of the G-protein-coupled receptor (GPCR) superfamily, with very limited structural information. Amino acid sequence analysis reveals that most of the important motifs present in the transmembrane helices (TM1-TM7) of the well studied Class A GPCRs are absent in T2Rs, raising fundamental questions regarding the mechanisms of activation and how T2Rs recognize bitter ligands with diverse chemical structures. In this study, the bitter receptor T2R1 was used to systematically investigate the role of 15 transmembrane amino acids in T2Rs, including 13 highly conserved residues, by amino acid replacements guided by molecular modeling. Functional analysis of the mutants by calcium imaging analysis revealed that replacement of Asn-66(2.65) and the highly conserved Asn-24(1.50) resulted in greater than 90% loss of agonist-induced signaling. Our results show that Asn-24(1.50) plays a crucial role in receptor activation by mediating an hydrogen bond network connecting TM1-TM2-TM7, whereas Asn-66(2.65) is essential for binding to the agonist dextromethorphan. The interhelical hydrogen bond between Asn-24(1.50) and Arg-55(2.54) restrains T2R receptor activity because loss of this bond in I27A and R55A mutants results in hyperactive receptor. The conserved amino acids Leu-197(5.50), Ser-200(5.53), and Leu-201(5.54) form a putative LXXSL motif which performs predominantly a structural role by stabilizing the helical conformation of TM5 at the cytoplasmic end. This study provides for the first time mechanistic insights into the roles of the conserved transmembrane residues in T2Rs and allows comparison of the activation mechanisms of T2Rs with the Class A GPCRs.
Collapse
Affiliation(s)
- Nisha Singh
- Department of Oral Biology, University of Manitoba, Winnipeg, Manitoba R3E 0W2, Canada
| | - Sai Prasad Pydi
- Department of Oral Biology, University of Manitoba, Winnipeg, Manitoba R3E 0W2, Canada
| | - Jasbir Upadhyaya
- Department of Oral Biology, University of Manitoba, Winnipeg, Manitoba R3E 0W2, Canada
| | - Prashen Chelikani
- Department of Oral Biology, University of Manitoba, Winnipeg, Manitoba R3E 0W2, Canada.
| |
Collapse
|
978
|
Abstract
Thiol peroxidases comprise glutathione peroxidases (GPx) and peroxiredoxins (Prx). The enzymes of both families reduce hydroperoxides with thiols by enzyme-substitution mechanisms. H(2)O(2) and organic hydroperoxides are reduced by all thiol peroxidases, most efficiently by SecGPxs, whereas fast peroxynitrite reduction is more common in Prxs. Reduction of lipid hydroperoxides is the domain of monomeric GPx4-type enzymes and of some Prxs. The catalysis starts with oxidation of an active-site selenocysteine (U(P)) or cysteine (C(P)). Activation of Cys (Sec) for hydroperoxide reduction in the GPx family is achieved by a typical tetrad composed of Cys (Sec), Asn, Gln, and Trp, whereas a triad of Cys Thr (or Ser) and Arg is the signature of Prx. In many of the CysGPxs and Prxs, a second Cys (C(R)) is required. In these 2-CysGPxs and 2-CysPrxs, the C(P) oxidized to a sulfenic acid forms an intra- or intermolecular disulfide (typical 2-CysPrx) with C(R), before a stepwise regeneration of ground-state enzyme by redoxin-type proteins can proceed. In SecGPxs and sporadically in Prxs, GSH is used as the reductant. Diversity combined with structural variability predestines thiol peroxidases for redox regulation via ROOH sensing and direct or indirect transduction of oxidant signals to specific protein targets.
Collapse
Affiliation(s)
- Leopold Flohé
- Otto-von-Guericke-Universität and MOLISA GmbH, Magdeburg, Germany.
| | | | | | | |
Collapse
|
979
|
Pantazes RJ, Grisewood MJ, Maranas CD. Recent advances in computational protein design. Curr Opin Struct Biol 2011; 21:467-72. [PMID: 21600758 DOI: 10.1016/j.sbi.2011.04.005] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2011] [Accepted: 04/28/2011] [Indexed: 11/30/2022]
Affiliation(s)
- Robert J Pantazes
- The Pennsylvania State University, Department of Chemical Engineering, 112 Fenske Lab, University Park, PA 16802, USA
| | | | | |
Collapse
|
980
|
Grahnen JA, Kubelka J, Liberles DA. Fast Side Chain Replacement in Proteins Using a Coarse-Grained Approach for Evaluating the Effects of Mutation During Evolution. J Mol Evol 2011; 73:23-33. [DOI: 10.1007/s00239-011-9454-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2010] [Accepted: 07/14/2011] [Indexed: 11/28/2022]
|
981
|
A mutation in VPS35, encoding a subunit of the retromer complex, causes late-onset Parkinson disease. Am J Hum Genet 2011; 89:168-75. [PMID: 21763483 DOI: 10.1016/j.ajhg.2011.06.008] [Citation(s) in RCA: 671] [Impact Index Per Article: 47.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2011] [Revised: 06/15/2011] [Accepted: 06/21/2011] [Indexed: 11/24/2022] Open
Abstract
To identify rare causal variants in late-onset Parkinson disease (PD), we investigated an Austrian family with 16 affected individuals by exome sequencing. We found a missense mutation, c.1858G>A (p.Asp620Asn), in the VPS35 gene in all seven affected family members who are alive. By screening additional PD cases, we saw the same variant cosegregating with the disease in an autosomal-dominant mode with high but incomplete penetrance in two further families with five and ten affected members, respectively. The mean age of onset in the affected individuals was 53 years. Genotyping showed that the shared haplotype extends across 65 kilobases around VPS35. Screening the entire VPS35 coding sequence in an additional 860 cases and 1014 controls revealed six further nonsynonymous missense variants. Three were only present in cases, two were only present in controls, and one was present in cases and controls. The familial mutation p.Asp620Asn and a further variant, c.1570C>T (p.Arg524Trp), detected in a sporadic PD case were predicted to be damaging by sequence-based and molecular-dynamics analyses. VPS35 is a component of the retromer complex and mediates retrograde transport between endosomes and the trans-Golgi network, and it has recently been found to be involved in Alzheimer disease.
Collapse
|
982
|
Park H, Ko J, Joo K, Lee J, Seok C, Lee J. Refinement of protein termini in template-based modeling using conformational space annealing. Proteins 2011; 79:2725-34. [PMID: 21755541 DOI: 10.1002/prot.23101] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2011] [Revised: 05/17/2011] [Accepted: 05/27/2011] [Indexed: 02/05/2023]
Abstract
The rapid increase in the number of experimentally determined protein structures in recent years enables us to obtain more reliable protein tertiary structure models than ever by template-based modeling. However, refinement of template-based models beyond the limit available from the best templates is still needed for understanding protein function in atomic detail. In this work, we develop a new method for protein terminus modeling that can be applied to refinement of models with unreliable terminus structures. The energy function for terminus modeling consists of both physics-based and knowledge-based potential terms with carefully optimized relative weights. Effective sampling of both the framework and terminus is performed using the conformational space annealing technique. This method has been tested on a set of termini derived from a nonredundant structure database and two sets of termini from the CASP8 targets. The performance of the terminus modeling method is significantly improved over our previous method that does not employ terminus refinement. It is also comparable or superior to the best server methods tested in CASP8. The success of the current approach suggests that similar strategy may be applied to other types of refinement problems such as loop modeling or secondary structure rearrangement.
Collapse
Affiliation(s)
- Hahnbeom Park
- Department of Chemistry, Seoul National University, Seoul 151-747, Republic of Korea
| | | | | | | | | | | |
Collapse
|
983
|
Bi J, Yang H, Yan H, Song R, Fan J. Knowledge-based virtual screening of HLA-A*0201-restricted CD8+ T-cell epitope peptides from herpes simplex virus genome. J Theor Biol 2011; 281:133-9. [DOI: 10.1016/j.jtbi.2011.04.018] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2011] [Revised: 04/13/2011] [Accepted: 04/15/2011] [Indexed: 11/28/2022]
|
984
|
Ni Z, Zhou P, Jin X, Lin XF. Integrating In Silico and In vitro Approaches to Dissect the Stereoselectivity of Bacillus subtilis Lipase A toward Ketoprofen Vinyl Ester. Chem Biol Drug Des 2011; 78:301-8. [PMID: 21477088 DOI: 10.1111/j.1747-0285.2011.01097.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Zhong Ni
- Department of Chemistry, Zhejiang University, Hangzhou 310027, China
| | | | | | | |
Collapse
|
985
|
Mazier S, Quick M, Shi L. Conserved tyrosine in the first transmembrane segment of solute:sodium symporters is involved in Na+-coupled substrate co-transport. J Biol Chem 2011; 286:29347-29355. [PMID: 21705334 DOI: 10.1074/jbc.m111.263327] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Solute:sodium symporters (SSSs) transport vital molecules across the plasma membrane of all living organisms. vSGLT, the Na(+)/galactose transporter of Vibrio parahemeolyticus, is the only SSS for which high resolution structural information is available, revealing a LeuT-like fold and a Na(+)-binding site analogous to the Na2 site of LeuT. Whereas the core transmembrane segments (TMs) of SSSs share high structural similarity with other transporters of LeuT-like fold, TM1 does not correspond to any TM in those structural homologs and was only resolved for the backbone atoms in the initial vSGLT structure (Protein Data Bank code 3DH4). To assess the role of TM1 in Na(+)-coupled substrate symport by the SSSs, here we have studied the role of a conserved residue in TM1 by computational modeling in conjunction with radiotracer transport and binding studies. Based on our sequence alignment and much topological data for homologous PutP, the Na(+)/proline transporter, we have simulated a series of vSGLT models with shifted TM1 residue assignments. We show that in two converged vSGLT models that retained the original TM1 backbone conformation, a conserved residue, Tyr-19, is associated with the Na(+) binding interaction network. In silico and in vitro mutagenesis of homologous Tyr-14 in PutP revealed the involvement of this conserved residue in Na(+)-dependent substrate binding and transport. Thus, our combined computational and experimental data provide the first clues about the importance of a conserved residue in TM1, a unique TM in the proteins with LeuT-like fold, in the Na(+)-coupled symport mechanism of SSSs.
Collapse
Affiliation(s)
- Sonia Mazier
- Department of Physiology and Biophysics, Weill Cornell Medical College, New York, New York 10065
| | - Matthias Quick
- Center for Molecular Recognition & Department of Psychiatry, Columbia University College of Physicians and Surgeons, New York, New York 10032, and the New York State Psychiatric Institute, Division of Molecular Therapeutics, New York, New York 10032.
| | - Lei Shi
- Department of Physiology and Biophysics, Weill Cornell Medical College, New York, New York 10065; HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medical College, New York, New York 10065,.
| |
Collapse
|
986
|
Verschueren E, Vanhee P, van der Sloot AM, Serrano L, Rousseau F, Schymkowitz J. Protein design with fragment databases. Curr Opin Struct Biol 2011; 21:452-9. [PMID: 21684149 DOI: 10.1016/j.sbi.2011.05.002] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2011] [Accepted: 05/25/2011] [Indexed: 11/25/2022]
Abstract
Structure-based computational methods are popular tools for designing proteins and interactions between proteins because they provide the necessary insight and details required for rational engineering. Here, we first argue that large-scale databases of fragments contain a discrete but complete set of building blocks that can be used to design structures. We show that these structural alphabets can be saturated to provide conformational ensembles that sample the native structure space around energetic minima. Second, we show that catalogs of interaction patterns hold the key to overcome the lack of scaffolds when computationally designing protein interactions. Finally, we illustrate the power of database-driven computational protein design methods by recent successful applications and discuss what challenges remain to push this field forward.
Collapse
Affiliation(s)
- Erik Verschueren
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG) and UPF, Barcelona, Spain
| | | | | | | | | | | |
Collapse
|
987
|
Bradshaw NJ, Soares DC, Carlyle BC, Ogawa F, Davidson-Smith H, Christie S, Mackie S, Thomson PA, Porteous DJ, Millar JK. PKA phosphorylation of NDE1 is DISC1/PDE4 dependent and modulates its interaction with LIS1 and NDEL1. J Neurosci 2011; 31:9043-54. [PMID: 21677187 PMCID: PMC3610090 DOI: 10.1523/jneurosci.5410-10.2011] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2010] [Revised: 03/21/2011] [Accepted: 04/12/2011] [Indexed: 02/01/2023] Open
Abstract
Nuclear distribution factor E-homolog 1 (NDE1), Lissencephaly 1 (LIS1), and NDE-like 1 (NDEL1) together participate in essential neurodevelopmental processes, including neuronal precursor proliferation and differentiation, neuronal migration, and neurite outgrowth. NDE1/LIS1/NDEL1 interacts with Disrupted in Schizophrenia 1 (DISC1) and the cAMP-hydrolyzing enzyme phosphodiesterase 4 (PDE4). DISC1, PDE4, NDE1, and NDEL1 have each been implicated as genetic risk factors for major mental illness. Here, we demonstrate that DISC1 and PDE4 modulate NDE1 phosphorylation by cAMP-dependent protein kinase A (PKA) and identify a novel PKA substrate site on NDE1 at threonine-131 (T131). Homology modeling predicts that phosphorylation at T131 modulates NDE1-LIS1 and NDE1-NDEL1 interactions, which we confirm experimentally. DISC1-PDE4 interaction thus modulates organization of the NDE1/NDEL1/LIS1 complex. T131-phosphorylated NDE1 is present at the postsynaptic density, in proximal axons, within the nucleus, and at the centrosome where it becomes substantially enriched during mitosis. Mutation of the NDE1 T131 site to mimic PKA phosphorylation inhibits neurite outgrowth. Thus PKA-dependent phosphorylation of the NDE1/LIS1/NDEL1 complex is DISC1-PDE4 modulated and likely to regulate its neural functions.
Collapse
Affiliation(s)
- Nicholas J. Bradshaw
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Dinesh C. Soares
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Becky C. Carlyle
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Fumiaki Ogawa
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Hazel Davidson-Smith
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Sheila Christie
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Shaun Mackie
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Pippa A. Thomson
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - David J. Porteous
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - J. Kirsty Millar
- Medical Genetics Section, Molecular Medicine Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| |
Collapse
|
988
|
Shapovalov MV, Dunbrack RL. A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. Structure 2011; 19:844-58. [PMID: 21645855 PMCID: PMC3118414 DOI: 10.1016/j.str.2011.03.019] [Citation(s) in RCA: 702] [Impact Index Per Article: 50.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2010] [Revised: 03/19/2011] [Accepted: 03/22/2011] [Indexed: 11/15/2022]
Abstract
Rotamer libraries are used in protein structure determination, prediction, and design. The backbone-dependent rotamer library consists of rotamer frequencies, mean dihedral angles, and variances as a function of the backbone dihedral angles. Structure prediction and design methods that employ backbone flexibility would strongly benefit from smoothly varying probabilities and angles. A new version of the backbone-dependent rotamer library has been developed using adaptive kernel density estimates for the rotamer frequencies and adaptive kernel regression for the mean dihedral angles and variances. This formulation allows for evaluation of the rotamer probabilities, mean angles, and variances as a smooth and continuous function of phi and psi. Continuous probability density estimates for the nonrotameric degrees of freedom of amides, carboxylates, and aromatic side chains have been modeled as a function of the backbone dihedrals and rotamers of the remaining degrees of freedom. New backbone-dependent rotamer libraries at varying levels of smoothing are available from http://dunbrack.fccc.edu.
Collapse
Affiliation(s)
- Maxim V Shapovalov
- Institute for Cancer Research, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA 19111, USA
| | | |
Collapse
|
989
|
Liang S, Zhou Y, Grishin N, Standley DM. Protein side chain modeling with orientation-dependent atomic force fields derived by series expansions. J Comput Chem 2011; 32:1680-6. [PMID: 21374632 PMCID: PMC3072444 DOI: 10.1002/jcc.21747] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2010] [Revised: 12/10/2010] [Accepted: 12/11/2010] [Indexed: 11/09/2022]
Abstract
We describe the development of new force fields for protein side chain modeling called optimized side chain atomic energy (OSCAR). The distance-dependent energy functions (OSCAR-d) and side-chain dihedral angle potential energy functions were represented as power and Fourier series, respectively. The resulting 802 adjustable parameters were optimized by discriminating the native side chain conformations from non-native conformations, using a training set of 12,000 side chains for each residue type. In the course of optimization, for every residue, its side chain was replaced by varying rotamers, whereas conformations for all other residues were kept as they appeared in the crystal structure. Then, the OSCAR-d were multiplied by an orientation-dependent function to yield OSCAR-o. A total of 1087 parameters of the orientation-dependent energy functions (OSCAR-o) were optimized by maximizing the energy gap between the native conformation and subrotamers calculated as low energy by OSCAR-d. When OSCAR-o with optimized parameters were used to model side chain conformations simultaneously for 218 recently released protein structures, the prediction accuracies were 88.8% for χ(1) , 79.7% for χ(1 + 2) , 1.24 Å overall root mean square deviation (RMSD), and 0.62 Å RMSD for core residues, respectively, compared with the next-best performing side-chain modeling program which achieved 86.6% for χ(1) , 75.7% for χ(1 + 2) , 1.40 Å overall RMSD, and 0.86 Å RMSD for core residues, respectively. The continuous energy functions obtained in this study are suitable for gradient-based optimization techniques for protein structure refinement. A program with built-in OSCAR for protein side chain prediction is available for download at http://sysimm.ifrec.osaka-u.ac.jp/OSCAR/.
Collapse
Affiliation(s)
- Shide Liang
- Systems Immunology Lab, Immunology Frontier Research Center, Osaka University, Suita, Osaka 565-0871, Japan.
| | | | | | | |
Collapse
|
990
|
Song Y, Tyka M, Leaver-Fay A, Thompson J, Baker D. Structure-guided forcefield optimization. Proteins 2011; 79:1898-909. [PMID: 21488100 PMCID: PMC3457920 DOI: 10.1002/prot.23013] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2010] [Revised: 01/06/2011] [Accepted: 01/20/2011] [Indexed: 11/06/2022]
Abstract
Accurate modeling of biomolecular systems requires accurate forcefields. Widely used molecular mechanics (MM) forcefields obtain parameters from experimental data and quantum chemistry calculations on small molecules but do not have a clear way to take advantage of the information in high-resolution macromolecular structures. In contrast, knowledge-based methods largely ignore the physical chemistry of interatomic interactions, and instead derive parameters almost exclusively from macromolecular structures. This can involve considerable double counting of the same physical interactions. Here, we describe a method for forcefield improvement that combines the strengths of the two approaches. We use this method to improve the Rosetta all-atom forcefield, in which the total energy is expressed as the sum of terms representing different physical interactions as in MM forcefields and the parameters are tuned to reproduce the properties of macromolecular structures. To resolve inaccuracies resulting from possible double counting of interactions, we compare distribution functions from low-energy modeled structures to those from crystal structures. The structural and physical bases of the deviations between the modeled and reference structures are identified and used to guide forcefield improvements. We describe improvements resolving double counting between backbone hydrogen bond interactions and Lennard-Jones interactions in helices; between sidechain-backbone hydrogen bonds and the backbone torsion potential; and between the sidechain torsion potential and Lennard-Jones interactions. Discrepancies between computed and observed distributions are also used to guide the incorporation of an explicit Cα-hydrogen bond in β sheets. The method can be used generally to integrate different sources of information for forcefield improvement.
Collapse
Affiliation(s)
- Yifan Song
- Department of Biochemistry, University of WashingtonSeattle, Washington 98195
| | - Michael Tyka
- Department of Biochemistry, University of WashingtonSeattle, Washington 98195
| | - Andrew Leaver-Fay
- Department of Biochemistry, University of WashingtonSeattle, Washington 98195
| | - James Thompson
- Department of Biochemistry, University of WashingtonSeattle, Washington 98195
| | - David Baker
- Department of Biochemistry, University of WashingtonSeattle, Washington 98195
- Howard Hughes Medical Institute, University of WashingtonBox 357370, Seattle, Washington 98195
| |
Collapse
|
991
|
Kim HJ, di Luccio E, Kong ANT, Kim JS. Nrf2-mediated induction of phase 2 detoxifying enzymes by glyceollins derived from soybean exposed to Aspergillus sojae. Biotechnol J 2011; 6:525-36. [PMID: 21538894 DOI: 10.1002/biot.201100010] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2023]
Abstract
Numerous antioxidants have been reported to cause transcriptional activation of several antioxidant enzymes through binding antioxidant-response element on their promoter region. We, therefore, attempted to examine whether glyceollins, which share common structural features with many phase 2 enzyme inducers and antioxidant activity, could induce detoxifying/antioxidant enzymes. Glyceollins induced NAD(P)H:quinone oxidoreductase activity in a dose-dependent manner in both mouse hepatoma Hepa1c1c7 and its mutant BPRc1 cells. The compounds also increased the expression of some representative antioxidant enzymes, such as heme oxygenase 1,gamma-glutamylcysteine synthase, and glutathione reductase, by promoting nuclear translocation of the NF-E2-related factor-2 (Nrf2). Furthermore, phosphorylation of Akt and antioxidant response element-mediated reporter gene expression were enhanced by glyceollins but suppressed by LY294002, an inhibitor of phosphoinositide 3-kinases (PI3K). This suggests that glyceollins may cause Nrf2-mediated phase 2 enzyme induction through activation of the PI3K signaling pathway as well as interaction with Keap1. Our molecular docking simulations also suggest that the glyceollin isomers tightly bind into the binding pocket around Cys151, preventing Nrf2 from docking to Keap1. In conclusion, the current data suggest that glyceollins induced phase 2 detoxifying enzymes likely through promoting nuclear translocation of Nrf2, which is known to be regulated by phosphorylation of Nrf2 and/or disrupting Keap1-Nrf2 complex formation.
Collapse
Affiliation(s)
- Hyo Jung Kim
- School of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
| | | | | | | |
Collapse
|
992
|
Bazzoli A, Tettamanzi AGB, Zhang Y. Computational protein design and large-scale assessment by I-TASSER structure assembly simulations. J Mol Biol 2011; 407:764-76. [PMID: 21329699 PMCID: PMC3070181 DOI: 10.1016/j.jmb.2011.02.017] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2010] [Revised: 01/30/2011] [Accepted: 02/05/2011] [Indexed: 10/18/2022]
Abstract
Protein design aims at designing new protein molecules of desired structure and functionality. One of the major obstacles to large-scale protein design are the extensive time and manpower requirements for experimental validation of designed sequences. Recent advances in protein structure prediction have provided potentials for an automated assessment of the designed sequences via folding simulations. We present a new protocol for protein design and validation. The sequence space is initially searched by Monte Carlo sampling guided by a public atomic potential, with candidate sequences selected by the clustering of sequence decoys. The designed sequences are then assessed by I-TASSER folding simulations, which generate full-length atomic structural models by the iterative assembly of threading fragments. The protocol is tested on 52 nonhomologous single-domain proteins, with an average sequence identity of 24% between the designed sequences and the native sequences. Despite this low sequence identity, three-dimensional models predicted for the first designed sequence have an RMSD of <2 Å to the target structure in 62% of cases. This percentage increases to 77% if we consider the three-dimensional models from the top 10 designed sequences. Such a striking consistency between the target structure and the structural prediction from nonhomologous sequences, despite the fact that the design and folding algorithms adopt completely different force fields, indicates that the design algorithm captures the features essential to the global fold of the target. On average, the designed sequences have a free energy that is 0.39 kcal/(mol residue) lower than in the native sequences, potentially affording a greater stability to synthesized target folds.
Collapse
Affiliation(s)
- Andrea Bazzoli
- Center for Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218, USA
| | - Andrea G. B. Tettamanzi
- Dipartimento di Tecnologie dell'Informazione, Università degli Studi di Milano, via Bramante, 65, I-26013 Crema (CR), Italy
| | - Yang Zhang
- Center for Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218, USA
- Department of Biological Chemistry, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218, USA
| |
Collapse
|
993
|
Cao Y, Koh X, Dong L, Du X, Wu A, Ding X, Deng H, Shu Y, Chen J, Jiang T. Rapid estimation of binding activity of influenza virus hemagglutinin to human and avian receptors. PLoS One 2011; 6:e18664. [PMID: 21533248 PMCID: PMC3076431 DOI: 10.1371/journal.pone.0018664] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2010] [Accepted: 03/09/2011] [Indexed: 11/19/2022] Open
Abstract
A critical step for avian influenza viruses to infect human hosts and cause epidemics or pandemics is acquisition of the ability of the viral hemagglutinin (HA) to bind to human receptors. However, current global influenza surveillance does not monitor HA binding specificity due to a lack of rapid and reliable assays. Here we report a computational method that uses an effective scoring function to quantify HA-receptor binding activities with high accuracy and speed. Application of this method reveals receptor specificity changes and its temporal relationship with antigenicity changes during the evolution of human H3N2 viruses. The method predicts that two amino acid differences at 222 and 225 between HAs of A/Fujian/411/02 and A/Panama/2007/99 viruses account for their differences in binding to both avian and human receptors; this prediction was verified experimentally. The new computational method could provide an urgently needed tool for rapid and large-scale analysis of HA receptor specificities for global influenza surveillance.
Collapse
Affiliation(s)
- Yang Cao
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Graduate University of Chinese Academy of Sciences, Beijing, China
| | - Xiaoying Koh
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Libo Dong
- State Key Laboratory for Molecular Virology and Genetic Engineering, National Institute for Viral Infectious Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Xiangjun Du
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Graduate University of Chinese Academy of Sciences, Beijing, China
| | - Aiping Wu
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Graduate University of Chinese Academy of Sciences, Beijing, China
| | - Xilai Ding
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Center for Infection and Immunity, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Hongyu Deng
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Center for Infection and Immunity, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Yuelong Shu
- State Key Laboratory for Molecular Virology and Genetic Engineering, National Institute for Viral Infectious Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
- * E-mail: (TJ); (JC); (YS)
| | - Jianzhu Chen
- Center for Infection and Immunity, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Department of Biology, Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- * E-mail: (TJ); (JC); (YS)
| | - Taijiao Jiang
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- * E-mail: (TJ); (JC); (YS)
| |
Collapse
|
994
|
Hijikata A, Yura K, Noguti T, Go M. Revisiting gap locations in amino acid sequence alignments and a proposal for a method to improve them by introducing solvent accessibility. Proteins 2011; 79:1868-77. [PMID: 21465562 PMCID: PMC3110861 DOI: 10.1002/prot.23011] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2010] [Revised: 01/23/2011] [Accepted: 01/28/2011] [Indexed: 12/27/2022]
Abstract
In comparative modeling, the quality of amino acid sequence alignment still constitutes a major bottleneck in the generation of high quality models of protein three-dimensional (3D) structures. Substantial efforts have been made to improve alignment quality by revising the substitution matrix, introducing multiple sequences, replacing dynamic programming with hidden Markov models, and incorporating 3D structure information. Improvements in the gap penalty have not been a major focus, however, following the development of the affine gap penalty and of the secondary structure dependent gap penalty. We revisited the correlation between protein 3D structure and gap location in a large protein 3D structure data set, and found that the frequency of gap locations approximated to an exponential function of the solvent accessibility of the inserted residues. The nonlinearity of the gap frequency as a function of accessibility corresponded well to the relationship between residue mutation pattern and residue accessibility. By introducing this relationship into the gap penalty calculation for pairwise alignment between template and target amino acid sequences, we were able to obtain a sequence alignment much closer to the structural alignment. The quality of the alignments was substantially improved on a pair of sequences with identity in the “twilight zone” between 20 and 40%. The relocation of gaps by our new method made a significant improvement in comparative modeling, exemplified here by the Bacillus subtilis yitF protein. The method was implemented in a computer program, ALAdeGAP (ALignment with Accessibility dependent GAp Penalty), which is available at http://cib.cf.ocha.ac.jp/target_protein/. Proteins 2011; © 2011 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Atsushi Hijikata
- Division of Biological Science, Graduate School of Science, Nagoya University, Furo, Chikusa, Nagoya 464-8602, Japan
| | | | | | | |
Collapse
|
995
|
Wu F, Liu Y, Zhu Z, Huang H, Ding B, Wu J, Shi Y. The 1.9Å crystal structure of Prp20p from Saccharomyces cerevisiae and its binding properties to Gsp1p and histones. J Struct Biol 2011; 174:213-22. [DOI: 10.1016/j.jsb.2010.11.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2010] [Revised: 11/11/2010] [Accepted: 11/15/2010] [Indexed: 12/01/2022]
|
996
|
Ni Z, Jin X, Zhou P, Wu Q, Lin XF. A Combination of Computational and Experimental Approaches to Investigate the Binding Behavior of B.sub Lipase A Mutants with Substrate pNPP. Mol Inform 2011; 30:359-67. [PMID: 27466952 DOI: 10.1002/minf.201000110] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2010] [Accepted: 01/29/2011] [Indexed: 11/09/2022]
Affiliation(s)
- Zhong Ni
- Department of Chemistry, Zhejiang University, Hangzhou 31002, P. R. China tel: +86 571 87952618, fax: +86 571 87951588
| | - Xin Jin
- Department of Chemistry, Zhejiang University, Hangzhou 31002, P. R. China tel: +86 571 87952618, fax: +86 571 87951588
| | - Peng Zhou
- Department of Chemistry, Zhejiang University, Hangzhou 31002, P. R. China tel: +86 571 87952618, fax: +86 571 87951588
| | - Qi Wu
- Department of Chemistry, Zhejiang University, Hangzhou 31002, P. R. China tel: +86 571 87952618, fax: +86 571 87951588
| | - Xian-Fu Lin
- Department of Chemistry, Zhejiang University, Hangzhou 31002, P. R. China tel: +86 571 87952618, fax: +86 571 87951588.
| |
Collapse
|
997
|
Lau AY, Roux B. The hidden energetics of ligand binding and activation in a glutamate receptor. Nat Struct Mol Biol 2011; 18:283-7. [PMID: 21317895 PMCID: PMC3075596 DOI: 10.1038/nsmb.2010] [Citation(s) in RCA: 98] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2010] [Accepted: 11/30/2010] [Indexed: 02/07/2023]
Abstract
Ionotropic glutamate receptors (iGluRs) are ligand-gated ion channels that mediate most excitatory synaptic transmission in the central nervous system. The free energy of neurotransmitter binding to the ligand-binding domains (LBDs) of iGluRs is converted into useful work to drive receptor activation. We have computed the principal thermodynamic contributions from ligand docking and ligand-induced closure of LBDs for nine ligands of GluA2 using all-atom molecular dynamics free energy simulations. We have validated the results by comparison with experimentally measured apparent affinities to the isolated LBD. Features in the free energy landscapes that govern closure of LBDs are key determinants of binding free energies. An analysis of accessible LBD conformations transposed into the context of an intact GluA2 receptor revealed that the relative displacement of specific diagonal subunits in the tetrameric structure may be key to the action of partial agonists.
Collapse
Affiliation(s)
- Albert Y Lau
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
| | | |
Collapse
|
998
|
Brylinski M, Skolnick J. Comprehensive structural and functional characterization of the human kinome by protein structure modeling and ligand virtual screening. J Chem Inf Model 2011; 50:1839-54. [PMID: 20853887 DOI: 10.1021/ci100235n] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The growing interest in the identification of kinase inhibitors, promising therapeutics in the treatment of many diseases, has created a demand for the structural characterization of the entire human kinome. At the outset of the drug development process, the lead-finding stage, approaches that enrich the screening library with bioactive compounds are needed. Here, protein structure based methods can play an important role, but despite structural genomics efforts, it is unlikely that the three-dimensional structures of the entire kinome will be available soon. Therefore, at the proteome level, structure-based approaches must rely on predicted models, with a key issue being their utility in virtual ligand screening. In this study, we employ the recently developed FINDSITE/Q-Dock ligand homology modeling approach, which is well-suited for proteome-scale applications using predicted structures, to provide extensive structural and functional characterization of the human kinome. Specifically, we construct structure models for the human kinome; these are subsequently subject to virtual screening against a library of more than 2 million compounds. To rank the compounds, we employ a hierarchical approach that combines ligand- and structure-based filters. Modeling accuracy is carefully validated using available experimental data with particularly encouraging results found for the ability to identify, without prior knowledge, specific kinase inhibitors. More generally, the modeling procedure results in a large number of predicted molecular interactions between kinases and small ligands that should be of practical use in the development of novel inhibitors. The data set is freely available to the academic community via a user-friendly Web interface at http://cssb.biology.gatech.edu/kinomelhm/ as well as at the ZINC Web site ( http://zinc.docking.org/applications/2010Apr/Brylinski-2010.tar.gz ).
Collapse
Affiliation(s)
- Michal Brylinski
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia 30318, USA
| | | |
Collapse
|
999
|
Ng AH, Snow CD. Polarizable protein packing. J Comput Chem 2011; 32:1334-44. [DOI: 10.1002/jcc.21714] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2009] [Revised: 10/12/2010] [Accepted: 10/17/2010] [Indexed: 11/11/2022]
|
1000
|
Cao Y, Song L, Miao Z, Hu Y, Tian L, Jiang T. Improved side-chain modeling by coupling clash-detection guided iterative search with rotamer relaxation. ACTA ACUST UNITED AC 2011; 27:785-90. [PMID: 21216772 DOI: 10.1093/bioinformatics/btr009] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Side-chain modeling has seen wide applications in computational structure biology. Most of the popular side-chain modeling programs explore the conformation space using discrete rigid rotamers for speed and efficiency. However, in the tightly packed environments of protein interiors, these methods will inherently lead to atomic clashes and hinder the prediction accuracy. RESULTS We present a side-chain modeling method (CIS-RR), which couples a novel clash-detection guided iterative search (CIS) algorithm with continuous torsion space optimization of rotamers (RR). Benchmark testing shows that compared with the existing popular side-chain modeling methods, CIS-RR removes atomic clashes much more effectively and achieves comparable or even better prediction accuracy while having comparable computational cost. We believe that CIS-RR could be a useful method for accurate side-chain modeling. AVAILABILITY CIS-RR is available to non-commercial users at our website: http://jianglab.ibp.ac.cn/lims/cisrr/cisrr.html.
Collapse
Affiliation(s)
- Yang Cao
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | | | | | | | | | | |
Collapse
|