51
|
Structural and Computational Biology in the Design of Immunogenic Vaccine Antigens. J Immunol Res 2015; 2015:156241. [PMID: 26526043 PMCID: PMC4615220 DOI: 10.1155/2015/156241] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2015] [Accepted: 08/02/2015] [Indexed: 01/08/2023] Open
Abstract
Vaccination is historically one of the most important medical interventions for the prevention of infectious disease. Previously, vaccines were typically made of rather crude mixtures of inactivated or attenuated causative agents. However, over the last 10–20 years, several important technological and computational advances have enabled major progress in the discovery and design of potently immunogenic recombinant protein vaccine antigens. Here we discuss three key breakthrough approaches that have potentiated structural and computational vaccine design. Firstly, genomic sciences gave birth to the field of reverse vaccinology, which has enabled the rapid computational identification of potential vaccine antigens. Secondly, major advances in structural biology, experimental epitope mapping, and computational epitope prediction have yielded molecular insights into the immunogenic determinants defining protective antigens, enabling their rational optimization. Thirdly, and most recently, computational approaches have been used to convert this wealth of structural and immunological information into the design of improved vaccine antigens. This review aims to illustrate the growing power of combining sequencing, structural and computational approaches, and we discuss how this may drive the design of novel immunogens suitable for future vaccines urgently needed to increase the global prevention of infectious disease.
Collapse
|
52
|
Cao R, Bhattacharya D, Adhikari B, Li J, Cheng J. Massive integration of diverse protein quality assessment methods to improve template based modeling in CASP11. Proteins 2015; 84 Suppl 1:247-59. [PMID: 26369671 DOI: 10.1002/prot.24924] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2015] [Revised: 08/21/2015] [Accepted: 09/10/2015] [Indexed: 12/28/2022]
Abstract
Model evaluation and selection is an important step and a big challenge in template-based protein structure prediction. Individual model quality assessment methods designed for recognizing some specific properties of protein structures often fail to consistently select good models from a model pool because of their limitations. Therefore, combining multiple complimentary quality assessment methods is useful for improving model ranking and consequently tertiary structure prediction. Here, we report the performance and analysis of our human tertiary structure predictor (MULTICOM) based on the massive integration of 14 diverse complementary quality assessment methods that was successfully benchmarked in the 11th Critical Assessment of Techniques of Protein Structure prediction (CASP11). The predictions of MULTICOM for 39 template-based domains were rigorously assessed by six scoring metrics covering global topology of Cα trace, local all-atom fitness, side chain quality, and physical reasonableness of the model. The results show that the massive integration of complementary, diverse single-model and multi-model quality assessment methods can effectively leverage the strength of single-model methods in distinguishing quality variation among similar good models and the advantage of multi-model quality assessment methods of identifying reasonable average-quality models. The overall excellent performance of the MULTICOM predictor demonstrates that integrating a large number of model quality assessment methods in conjunction with model clustering is a useful approach to improve the accuracy, diversity, and consequently robustness of template-based protein structure prediction. Proteins 2016; 84(Suppl 1):247-259. © 2015 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Renzhi Cao
- Department of Computer Science, University of Missouri, Columbia, Missouri, 65211
| | | | - Badri Adhikari
- Department of Computer Science, University of Missouri, Columbia, Missouri, 65211
| | - Jilong Li
- Department of Computer Science, University of Missouri, Columbia, Missouri, 65211
| | - Jianlin Cheng
- Department of Computer Science, University of Missouri, Columbia, Missouri, 65211. .,Informatics Institute, University of Missouri, Columbia, Missouri, 65211.
| |
Collapse
|
53
|
Olson MA, Zabetakis D, Legler PM, Turner KB, Anderson GP, Goldman ER. Can template-based protein models guide the design of sequence fitness for enhanced thermal stability of single domain antibodies? Protein Eng Des Sel 2015; 28:395-402. [PMID: 26374895 DOI: 10.1093/protein/gzv047] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2015] [Accepted: 08/14/2015] [Indexed: 12/18/2022] Open
Abstract
We investigate the practical use of comparative (template-based) protein models in replica-exchange simulations of single-domain antibody (sdAb) chains to evaluate if the models can correctly predict in rank order the thermal susceptibility to unfold relative to experimental melting temperatures. The baseline model system is the recently determined crystallographic structure of a llama sdAb (denoted as A3), which exhibits an unusually high thermal stability. An evaluation of the simulation results for the A3 comparative model and crystal structure shows that, despite the overall low Cα root-mean-square deviation between the two structures, the model contains misfolded regions that yields a thermal profile of unraveling at a lower temperature. Yet comparison of the simulations of four different comparative models for sdAb A3, C8, A3C8 and E9, where A3C8 is a design of swapping the sequence of the complementarity determining regions of C8 onto the A3 framework, discriminated among the sequences to detect the highest and lowest experimental melting transition temperatures. Further structural analysis of A3 for selected alanine substitutions by a combined computational and experimental study found unexpectedly that the comparative model performed admirably in recognizing substitution 'hot spots' when using a support-vector machine algorithm.
Collapse
Affiliation(s)
- Mark A Olson
- Department of Cell Biology and Biochemistry, Molecular and Translational Sciences Division, USAMRIID, Frederick, MD, USA
| | - Dan Zabetakis
- Center for Bio/Molecular Science and Engineering, Naval Research Laboratory, 4555 Overlook Avenue SW, Washington, DC, USA
| | - Patricia M Legler
- Center for Bio/Molecular Science and Engineering, Naval Research Laboratory, 4555 Overlook Avenue SW, Washington, DC, USA
| | - Kendrick B Turner
- Center for Bio/Molecular Science and Engineering, Naval Research Laboratory, 4555 Overlook Avenue SW, Washington, DC, USA
| | - George P Anderson
- Center for Bio/Molecular Science and Engineering, Naval Research Laboratory, 4555 Overlook Avenue SW, Washington, DC, USA
| | - Ellen R Goldman
- Center for Bio/Molecular Science and Engineering, Naval Research Laboratory, 4555 Overlook Avenue SW, Washington, DC, USA
| |
Collapse
|
54
|
Joo K, Joung I, Lee SY, Kim JY, Cheng Q, Manavalan B, Joung JY, Heo S, Lee J, Nam M, Lee IH, Lee SJ, Lee J. Template based protein structure modeling by global optimization in CASP11. Proteins 2015; 84 Suppl 1:221-32. [PMID: 26329522 DOI: 10.1002/prot.24917] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Revised: 08/04/2015] [Accepted: 08/21/2015] [Indexed: 11/11/2022]
Abstract
For the template-based modeling (TBM) of CASP11 targets, we have developed three new protein modeling protocols (nns for server prediction and LEE and LEER for human prediction) by improving upon our previous CASP protocols (CASP7 through CASP10). We applied the powerful global optimization method of conformational space annealing to three stages of optimization, including multiple sequence-structure alignment, three-dimensional (3D) chain building, and side-chain remodeling. For more successful fold recognition, a new alignment method called CRFalign was developed. It can incorporate sensitive positional and environmental dependence in alignment scores as well as strong nonlinear correlations among various features. Modifications and adjustments were made to the form of the energy function and weight parameters pertaining to the chain building procedure. For the side-chain remodeling step, residue-type dependence was introduced to the cutoff value that determines the entry of a rotamer to the side-chain modeling library. The improved performance of the nns server method is attributed to successful fold recognition achieved by combining several methods including CRFalign and to the current modeling formulation that can incorporate native-like structural aspects present in multiple templates. The LEE protocol is identical to the nns one except that CASP11-released server models are used as templates. The success of LEE in utilizing CASP11 server models indicates that proper template screening and template clustering assisted by appropriate cluster ranking promises a new direction to enhance protein 3D modeling. Proteins 2016; 84(Suppl 1):221-232. © 2015 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Keehyoung Joo
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea.,Center for Advanced Computation, Korea Institute for Advanced Study, Seoul, 130-722, Korea
| | - InSuk Joung
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, 130-722, Korea
| | - Sun Young Lee
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea
| | - Jong Yun Kim
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea
| | - Qianyi Cheng
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, 130-722, Korea
| | - Balachandran Manavalan
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, 130-722, Korea
| | - Jong Young Joung
- School of Computational Sciences, Korea Institute for Advanced Study, Seoul, 130-722, Korea
| | - Seungryong Heo
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea
| | - Juyong Lee
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, 20852
| | - Mikyung Nam
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea
| | - In-Ho Lee
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea.,Korea Research Institute of Standards and Science (KRISS), Seoul, 305-600, Korea
| | - Sung Jong Lee
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea.,Department of Physics, University of Suwon, Hwaseong-Si, Gyeonggi-Do, 445-743, Korea
| | - Jooyoung Lee
- Center for in Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea. .,Center for Advanced Computation, Korea Institute for Advanced Study, Seoul, 130-722, Korea. .,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, 130-722, Korea.
| |
Collapse
|
55
|
Feig M, Mirjalili V. Protein structure refinement via molecular-dynamics simulations: What works and what does not? Proteins 2015; 84 Suppl 1:282-92. [PMID: 26234208 DOI: 10.1002/prot.24871] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Revised: 07/15/2015] [Accepted: 07/29/2015] [Indexed: 12/26/2022]
Abstract
Protein structure refinement during CASP11 by the Feig group was described. Molecular dynamics simulations were used in combination with an improved selection and averaging protocol. On average, modest refinement was achieved with some targets improved significantly. Analysis of the CASP submission from our group focused on refinement success versus amount of sampling, refinement of different secondary structure elements and whether refinement varied as a function of which group provided initial models. The refinement of local stereochemical features was examined via the MolProbity score and an updated protocol was developed that can generate high-quality structures with very low MolProbity scores for most starting structures with modest computational effort. Proteins 2016; 84(Suppl 1):282-292. © 2015 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, 48824. .,Department of Chemistry, Michigan State University, East Lansing, Michigan, 48824.
| | - Vahid Mirjalili
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, 48824.,Department of Mechanical Engineering, Michigan State University, East Lansing, Michigan, 48824
| |
Collapse
|
56
|
Messih MA, Lepore R, Tramontano A. LoopIng: a template-based tool for predicting the structure of protein loops. Bioinformatics 2015; 31:3767-72. [PMID: 26249814 PMCID: PMC4653384 DOI: 10.1093/bioinformatics/btv438] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Accepted: 07/21/2015] [Indexed: 12/31/2022] Open
Abstract
Motivation: Predicting the structure of protein loops is very challenging, mainly because they are not necessarily subject to strong evolutionary pressure. This implies that, unlike the rest of the protein, standard homology modeling techniques are not very effective in modeling their structure. However, loops are often involved in protein function, hence inferring their structure is important for predicting protein structure as well as function. Results: We describe a method, LoopIng, based on the Random Forest automated learning technique, which, given a target loop, selects a structural template for it from a database of loop candidates. Compared to the most recently available methods, LoopIng is able to achieve similar accuracy for short loops (4–10 residues) and significant enhancements for long loops (11–20 residues). The quality of the predictions is robust to errors that unavoidably affect the stem regions when these are modeled. The method returns a confidence score for the predicted template loops and has the advantage of being very fast (on average: 1 min/loop). Availability and implementation:www.biocomputing.it/looping Contact:anna.tramontano@uniroma1.it Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Rosalba Lepore
- Department of Physics, Sapienza University, 00185 Rome, Italy and
| | - Anna Tramontano
- Department of Physics, Sapienza University, 00185 Rome, Italy and Istituto Pasteur-Fondazione Cenci Bolognetti, Viale Regina Elena 291, 00161 Rome, Italy
| |
Collapse
|
57
|
Yu J, Picord G, Tuffery P, Guerois R. HHalign-Kbest: exploring sub-optimal alignments for remote homology comparative modeling. Bioinformatics 2015; 31:3850-2. [PMID: 26231431 DOI: 10.1093/bioinformatics/btv441] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 07/21/2015] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The HHsearch algorithm, implementing a hidden Markov model (HMM)-HMM alignment method, has shown excellent alignment performance in the so-called twilight zone (target-template sequence identity with ∼20%). However, an optimal alignment by HHsearch may contain small to large errors, leading to poor structure prediction if these errors are located in important structural elements. RESULTS HHalign-Kbest server runs a full pipeline, from the generation of suboptimal HMM-HMM alignments to the evaluation of the best structural models. In the HHsearch framework, it implements a novel algorithm capable of generating k-best HMM-HMM suboptimal alignments rather than only the optimal one. For large proteins, a directed acyclic graph-based implementation reduces drastically the memory usage. Improved alignments were systematically generated among the top k suboptimal alignments. To recognize them, corresponding structural models were systematically generated and evaluated with Qmean score. The method was benchmarked over 420 targets from the SCOP30 database. In the range of HHsearch probability of 20-99%, average quality of the models (TM-score) raised by 4.1-16.3% and 8.0-21.0% considering the top 1 and top 10 best models, respectively. AVAILABILITY AND IMPLEMENTATION http://bioserv.rpbs.univ-paris-diderot.fr/services/HHalign-Kbest/ (source code and server). CONTACT guerois@cea.fr. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jinchao Yu
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, University Paris-Saclay, CEA-Saclay, 91191 Gif-sur-Yvette
| | - Geraldine Picord
- INSERM U973, MTi, F-75205 Paris, Université Paris Diderot, Sorbonne Paris Cité F-75205 Paris and Ressource Parisienne en Bioinformatique Structurale, F-75205 Paris, France
| | - Pierre Tuffery
- INSERM U973, MTi, F-75205 Paris, Université Paris Diderot, Sorbonne Paris Cité F-75205 Paris and Ressource Parisienne en Bioinformatique Structurale, F-75205 Paris, France
| | - Raphael Guerois
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, University Paris-Saclay, CEA-Saclay, 91191 Gif-sur-Yvette
| |
Collapse
|
58
|
Shultis D, Dodge G, Zhang Y. Crystal structure of designed PX domain from cytokine-independent survival kinase and implications on evolution-based protein engineering. J Struct Biol 2015; 191:197-206. [PMID: 26073968 DOI: 10.1016/j.jsb.2015.06.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Revised: 05/13/2015] [Accepted: 06/10/2015] [Indexed: 01/03/2023]
Abstract
The Phox homology domain (PX domain) is a phosphoinositide-binding structural domain that is critical in mediating protein and cell membrane association and has been found in more than 100 eukaryotic proteins. The abundance of PX domains in nature offers an opportunity to redesign the protein using EvoDesign, a computational approach to design new sequences based on structure profiles of multiple evolutionarily related proteins. In this study, we report the X-ray crystallographic structure of a designed PX domain from the cytokine-independent survival kinase (CISK), which has been implicated as functioning in parallel with PKB/Akt in cell survival and insulin responses. Detailed data analysis of the designed CISK-PX protein demonstrates positive impacts of knowledge-based secondary structure and solvation predictions and structure-based sequence profiles on the efficiency of the evolutionary-based protein design method. The structure of the designed CISK-PX domain is close to the wild-type (1.54 Å in Cα RMSD), which was accurately predicted by I-TASSER based fragment assembly simulations (1.32 Å in Cα RMSD). This study represents the first successfully designed conditional peripheral membrane protein fold and has important implications in the examination and experimental validation of the evolution-based protein design approaches.
Collapse
Affiliation(s)
- David Shultis
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Gregory Dodge
- Department of Biological Chemistry, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA; Department of Biological Chemistry, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.
| |
Collapse
|
59
|
Tong J, Sadreyev RI, Pei J, Kinch LN, Grishin NV. Using homology relations within a database markedly boosts protein sequence similarity search. Proc Natl Acad Sci U S A 2015; 112:7003-8. [PMID: 26038555 PMCID: PMC4460465 DOI: 10.1073/pnas.1424324112] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Inference of homology from protein sequences provides an essential tool for analyzing protein structure, function, and evolution. Current sequence-based homology search methods are still unable to detect many similarities evident from protein spatial structures. In computer science a search engine can be improved by considering networks of known relationships within the search database. Here, we apply this idea to protein-sequence-based homology search and show that it dramatically enhances the search accuracy. Our new method, COMPADRE (COmparison of Multiple Protein sequence Alignments using Database RElationships) assesses the relationship between the query sequence and a hit in the database by considering the similarity between the query and hit's known homologs. This approach increases detection quality, boosting the precision rate from 18% to 83% at half-coverage of all database homologs. The increased precision rate allows detection of a large fraction of protein structural relationships, thus providing structure and function predictions for previously uncharacterized proteins. Our results suggest that this general approach is applicable to a wide variety of methods for detection of biological similarities. The web server is available at prodata.swmed.edu/compadre.
Collapse
Affiliation(s)
- Jing Tong
- Department of Molecular Biophysics, University of Texas Southwestern Medical Center, Dallas, TX 75390-9050
| | - Ruslan I Sadreyev
- Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114; Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114
| | - Jimin Pei
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX 75390-9050
| | - Lisa N Kinch
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX 75390-9050
| | - Nick V Grishin
- Department of Molecular Biophysics, University of Texas Southwestern Medical Center, Dallas, TX 75390-9050; Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX 75390-9050
| |
Collapse
|
60
|
Krupa P, Mozolewska MA, Joo K, Lee J, Czaplewski C, Liwo A. Prediction of Protein Structure by Template-Based Modeling Combined with the UNRES Force Field. J Chem Inf Model 2015; 55:1271-81. [DOI: 10.1021/acs.jcim.5b00117] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Paweł Krupa
- Faculty of Chemistry, University of Gdańsk, Wita Stwosza 63, 80-308 Gdańsk, Poland
| | | | | | | | - Cezary Czaplewski
- Faculty of Chemistry, University of Gdańsk, Wita Stwosza 63, 80-308 Gdańsk, Poland
| | - Adam Liwo
- Faculty of Chemistry, University of Gdańsk, Wita Stwosza 63, 80-308 Gdańsk, Poland
| |
Collapse
|
61
|
Yang J, Zhang Y. I-TASSER server: new development for protein structure and function predictions. Nucleic Acids Res 2015; 43:W174-81. [PMID: 25883148 PMCID: PMC4489253 DOI: 10.1093/nar/gkv342] [Citation(s) in RCA: 1641] [Impact Index Per Article: 182.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2015] [Accepted: 04/06/2015] [Indexed: 12/11/2022] Open
Abstract
The I-TASSER server (http://zhanglab.ccmb.med.umich.edu/I-TASSER) is an online resource for automated protein structure prediction and structure-based function annotation. In I-TASSER, structural templates are first recognized from the PDB using multiple threading alignment approaches. Full-length structure models are then constructed by iterative fragment assembly simulations. The functional insights are finally derived by matching the predicted structure models with known proteins in the function databases. Although the server has been widely used for various biological and biomedical investigations, numerous comments and suggestions have been reported from the user community. In this article, we summarize recent developments on the I-TASSER server, which were designed to address the requirements from the user community and to increase the accuracy of modeling predictions. Focuses have been made on the introduction of new methods for atomic-level structure refinement, local structure quality estimation and biological function annotations. We expect that these new developments will improve the quality of the I-TASSER server and further facilitate its use by the community for high-resolution structure and function prediction.
Collapse
Affiliation(s)
- Jianyi Yang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218, USA School of Mathematical Sciences and LPMC, Nankai University, Tianjin, 300071, PR China
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218, USA Department of Biological Chemistry, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218, USA
| |
Collapse
|
62
|
McGuffin LJ, Atkins JD, Salehe BR, Shuid AN, Roche DB. IntFOLD: an integrated server for modelling protein structures and functions from amino acid sequences. Nucleic Acids Res 2015; 43:W169-73. [PMID: 25820431 PMCID: PMC4489238 DOI: 10.1093/nar/gkv236] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 03/08/2015] [Indexed: 11/13/2022] Open
Abstract
IntFOLD is an independent web server that integrates our leading methods for structure and function prediction. The server provides a simple unified interface that aims to make complex protein modelling data more accessible to life scientists. The server web interface is designed to be intuitive and integrates a complex set of quantitative data, so that 3D modelling results can be viewed on a single page and interpreted by non-expert modellers at a glance. The only required input to the server is an amino acid sequence for the target protein. Here we describe major performance and user interface updates to the server, which comprises an integrated pipeline of methods for: tertiary structure prediction, global and local 3D model quality assessment, disorder prediction, structural domain prediction, function prediction and modelling of protein-ligand interactions. The server has been independently validated during numerous CASP (Critical Assessment of Techniques for Protein Structure Prediction) experiments, as well as being continuously evaluated by the CAMEO (Continuous Automated Model Evaluation) project. The IntFOLD server is available at: http://www.reading.ac.uk/bioinf/IntFOLD/.
Collapse
Affiliation(s)
- Liam J McGuffin
- School of Biological Sciences, University of Reading, Reading, RG6 6AS, UK
| | - Jennifer D Atkins
- School of Biological Sciences, University of Reading, Reading, RG6 6AS, UK
| | - Bajuna R Salehe
- School of Biological Sciences, University of Reading, Reading, RG6 6AS, UK
| | - Ahmad N Shuid
- School of Biological Sciences, University of Reading, Reading, RG6 6AS, UK
| | - Daniel B Roche
- Institut de Biologie Computationnelle, LIRMM, CNRS, Université de Montpellier, Montpellier 34095, France Centre de Recherches de Biochimie Macromoléculaire, CNRS- UMR 5237, Montpellier 34293, France
| |
Collapse
|
63
|
Zuckerman DM, Boucher LE, Xie K, Engelhardt H, Bosch J, Hoiczyk E. The bactofilin cytoskeleton protein BacM of Myxococcus xanthus forms an extended β-sheet structure likely mediated by hydrophobic interactions. PLoS One 2015; 10:e0121074. [PMID: 25803609 PMCID: PMC4372379 DOI: 10.1371/journal.pone.0121074] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2014] [Accepted: 01/29/2015] [Indexed: 11/18/2022] Open
Abstract
Bactofilins are novel cytoskeleton proteins that are widespread in Gram-negative bacteria. Myxococcus xanthus, an important predatory soil bacterium, possesses four bactofilins of which one, BacM (Mxan_7475) plays an important role in cell shape maintenance. Electron and fluorescence light microscopy, as well as studies using over-expressed, purified BacM, indicate that this protein polymerizes in vivo and in vitro into ~3 nm wide filaments that further associate into higher ordered fibers of about 10 nm. Here we use a multipronged approach combining secondary structure determination, molecular modeling, biochemistry, and genetics to identify and characterize critical molecular elements that enable BacM to polymerize. Our results indicate that the bactofilin-determining domain DUF583 folds into an extended β-sheet structure, and we hypothesize a left-handed β-helix with polymerization into 3 nm filaments primarily via patches of hydrophobic amino acid residues. These patches form the interface allowing head-to-tail polymerization during filament formation. Biochemical analyses of these processes show that folding and polymerization occur across a wide variety of conditions and even in the presence of chaotropic agents such as one molar urea. Together, these data suggest that bactofilins are comprised of a structure unique to cytoskeleton proteins, which enables robust polymerization.
Collapse
Affiliation(s)
- David M. Zuckerman
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Lauren E. Boucher
- Department of Biochemistry and Molecular Biology, Johns Hopkins Bloomberg School of Public Health and Johns Hopkins Malaria Research Institute, Baltimore, Maryland, United States of America
| | - Kefang Xie
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Harald Engelhardt
- Department of Structural Biology, Max Planck Institute of Biochemistry, Am Klopferspitz 18, Martinsried, Germany
| | - Jürgen Bosch
- Department of Biochemistry and Molecular Biology, Johns Hopkins Bloomberg School of Public Health and Johns Hopkins Malaria Research Institute, Baltimore, Maryland, United States of America
| | - Egbert Hoiczyk
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
- * E-mail:
| |
Collapse
|
64
|
Xun S, Jiang F, Wu YD. Significant Refinement of Protein Structure Models Using a Residue-Specific Force Field. J Chem Theory Comput 2015; 11:1949-56. [PMID: 26574396 DOI: 10.1021/acs.jctc.5b00029] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
An important application of all-atom explicit-solvent molecular dynamics (MD) simulations is the refinement of protein structures from low-resolution experiments or template-based modeling. A critical requirement is that the native structure is stable with the force field. We have applied a recently developed residue-specific force field, RSFF1, to a set of 30 refinement targets from recent CASP experiments. Starting from their experimental structures, 1.0 μs unrestrained simulations at 298 K retain most of the native structures quite well except for a few flexible terminals and long internal loops. Starting from each homology model, a 150 ns MD simulation at 380 K generates the best RMSD improvement of 0.85 Å on average. The structural improvements roughly correlate with the RMSD of the initial homology models, indicating possible consistent structure refinement. Finally, targets TR614 and TR624 have been subjected to long-time replica-exchange MD simulations. Significant structural improvements are generated, with RMSD of 1.91 and 1.36 Å with respect to their crystal structures. Thus, it is possible to achieve realistic refinement of protein structure models to near-experimental accuracy, using accurate force field with sufficient conformational sampling.
Collapse
Affiliation(s)
- Sangni Xun
- Laboratory of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School , Shenzhen, 518055, China
| | - Fan Jiang
- Laboratory of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School , Shenzhen, 518055, China
| | - Yun-Dong Wu
- Laboratory of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School , Shenzhen, 518055, China.,College of Chemistry and Molecular Engineering, Peking University , Beijing, 100871, China
| |
Collapse
|
65
|
Wiech EM, Cheng HP, Singh SM. Molecular modeling and computational analyses suggests that the Sinorhizobium meliloti periplasmic regulator protein ExoR adopts a superhelical fold and is controlled by a unique mechanism of proteolysis. Protein Sci 2015; 24:319-27. [PMID: 25492513 PMCID: PMC4353358 DOI: 10.1002/pro.2616] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2014] [Revised: 11/26/2014] [Accepted: 12/01/2014] [Indexed: 12/12/2022]
Abstract
The Sinorhizobium meliloti periplasmic ExoR protein and the ExoS/ChvI two-component system form a regulatory mechanism that directly controls the transformation of free-living to host-invading cells. In the absence of crystal structures, understanding the molecular mechanism of interaction between ExoR and the ExoS sensor, which is believed to drive the key regulatory step in the invasion process, remains a major challenge. In this study, we present a theoretical structural model of the active form of ExoR protein, ExoRm , generated using computational methods. Our model suggests that ExoR possesses a super-helical fold comprising 12 α-helices forming six Sel1-like repeats, including two that were unidentified in previous studies. This fold is highly conducive to mediating protein-protein interactions and this is corroborated by the identification of putative protein binding sites on the surface of the ExoRm protein. Our studies reveal two novel insights: (a) an extended conformation of the third Sel1-like repeat that might be important for ExoR regulatory function and (b) a buried proteolytic site that implies a unique proteolytic mechanism. This study provides new and interesting insights into the structure of S. meliloti ExoR, lays the groundwork for elaborating the molecular mechanism of ExoRm cleavage, ExoRm -ExoS interactions, and studies of ExoR homologs in other bacterial host interactions.
Collapse
Affiliation(s)
- Eliza M Wiech
- Department of Biology, The Graduate Center of the City University of New YorkNew York, New York, 10016
- Department of Biology, Brooklyn College, The City University of New YorkBrooklyn, New York, 11210
| | - Hai-Ping Cheng
- Department of Biology, The Graduate Center of the City University of New YorkNew York, New York, 10016
- Biological Sciences Department, Lehman College, The City University of New YorkBronx, New York, 10468
| | - Shaneen M Singh
- Department of Biology, The Graduate Center of the City University of New YorkNew York, New York, 10016
- Department of Biology, Brooklyn College, The City University of New YorkBrooklyn, New York, 11210
| |
Collapse
|
66
|
Feig M, Harada R, Mori T, Yu I, Takahashi K, Sugita Y. Complete atomistic model of a bacterial cytoplasm for integrating physics, biochemistry, and systems biology. J Mol Graph Model 2015; 58:1-9. [PMID: 25765281 DOI: 10.1016/j.jmgm.2015.02.004] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2014] [Revised: 02/18/2015] [Accepted: 02/22/2015] [Indexed: 01/10/2023]
Abstract
A model for the cytoplasm of Mycoplasma genitalium is presented that integrates data from a variety of sources into a physically and biochemically consistent model. Based on gene annotations, core genes expected to be present in the cytoplasm were determined and a metabolic reaction network was reconstructed. The set of cytoplasmic genes and metabolites from the predicted reactions were assembled into a comprehensive atomistic model consisting of proteins with predicted structures, RNA, protein/RNA complexes, metabolites, ions, and solvent. The resulting model bridges between atomistic and cellular scales, between physical and biochemical aspects, and between structural and systems views of cellular systems and is meant as a starting point for a variety of simulation studies.
Collapse
Affiliation(s)
- Michael Feig
- Department of Biochemistry & Molecular Biology, Michigan State University, East Lansing, MI 48824, United States; Department of Chemistry, Michigan State University, East Lansing, MI 48824, United States; Quantitative Biology Center, RIKEN, International Medical Device Alliance (IMDA) 6F, 1-6-5 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.
| | - Ryuhei Harada
- Advanced Institute for Computational Science, RIKEN, 7-1-26 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Quantitative Biology Center, RIKEN, International Medical Device Alliance (IMDA) 6F, 1-6-5 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan
| | - Takaharu Mori
- Quantitative Biology Center, RIKEN, International Medical Device Alliance (IMDA) 6F, 1-6-5 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Theoretical Molecular Science Laboratory and iTHES, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| | - Isseki Yu
- Theoretical Molecular Science Laboratory and iTHES, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| | - Koichi Takahashi
- Quantitative Biology Center, RIKEN, Laboratory for Biochemical Simulation, Suita, Osaka 565-0874, Japan; Institute for Advanced Biosciences, Keio University, Fujisawa 252-8520, Japan
| | - Yuji Sugita
- Advanced Institute for Computational Science, RIKEN, 7-1-26 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Quantitative Biology Center, RIKEN, International Medical Device Alliance (IMDA) 6F, 1-6-5 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Theoretical Molecular Science Laboratory and iTHES, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| |
Collapse
|
67
|
Simoncini D, Nakata H, Ogata K, Nakamura S, Zhang KY. Quality Assessment of Predicted Protein Models Using Energies Calculated by the Fragment Molecular Orbital Method. Mol Inform 2015; 34:97-104. [PMID: 27490032 DOI: 10.1002/minf.201400108] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Accepted: 10/13/2014] [Indexed: 12/12/2022]
Abstract
Protein structure prediction directly from sequences is a very challenging problem in computational biology. One of the most successful approaches employs stochastic conformational sampling to search an empirically derived energy function landscape for the global energy minimum state. Due to the errors in the empirically derived energy function, the lowest energy conformation may not be the best model. We have evaluated the use of energy calculated by the fragment molecular orbital method (FMO energy) to assess the quality of predicted models and its ability to identify the best model among an ensemble of predicted models. The fragment molecular orbital method implemented in GAMESS was used to calculate the FMO energy of predicted models. When tested on eight protein targets, we found that the model ranking based on FMO energies is better than that based on empirically derived energies when there is sufficient diversity among these models. This model diversity can be estimated prior to the FMO energy calculations. Our result demonstrates that the FMO energy calculated by the fragment molecular orbital method is a practical and promising measure for the assessment of protein model quality and the selection of the best protein model among many generated.
Collapse
Affiliation(s)
- David Simoncini
- Structural Bioinformatics Team, Division of Structural and Synthetic Biology, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Yokohama, Kanagawa 230-0045, Japan phone: +81(0)45-503-9560/fax: +81(0)45-503-9559.,Present address: Mathématiques et Informatique Appliquées de Toulouse, Unité de Recherche 875, Institut National de la Recherche Agronomique, F-31320 Castanet-Tolosan, France
| | - Hiroya Nakata
- RIKEN Research Cluster for Innovation, 2-1 Hirosawa, Wako, Saitama 351-0198 Japan phone/fax: +81(0)48-467-9477/+81(0)48-467-8503.,Department of Biomolecular Engineering, Tokyo Institute of Technology, 4259 Nagatsutacho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan.,Japan Society for the Promotion of Science, Kojimachi Business Center Building, 5-3-1 Kojimachi, Chiyoda-ku, Tokyo 102-0083, Japan
| | - Koji Ogata
- RIKEN Research Cluster for Innovation, 2-1 Hirosawa, Wako, Saitama 351-0198 Japan phone/fax: +81(0)48-467-9477/+81(0)48-467-8503
| | - Shinichiro Nakamura
- RIKEN Research Cluster for Innovation, 2-1 Hirosawa, Wako, Saitama 351-0198 Japan phone/fax: +81(0)48-467-9477/+81(0)48-467-8503.
| | - Kam Yj Zhang
- Structural Bioinformatics Team, Division of Structural and Synthetic Biology, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Yokohama, Kanagawa 230-0045, Japan phone: +81(0)45-503-9560/fax: +81(0)45-503-9559.
| |
Collapse
|
68
|
Tong J, Pei J, Otwinowski Z, Grishin NV. Refinement by shifting secondary structure elements improves sequence alignments. Proteins 2015; 83:411-27. [PMID: 25546158 DOI: 10.1002/prot.24746] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2014] [Revised: 11/25/2014] [Accepted: 12/10/2014] [Indexed: 01/09/2023]
Abstract
Constructing a model of a query protein based on its alignment to a homolog with experimentally determined spatial structure (the template) is still the most reliable approach to structure prediction. Alignment errors are the main bottleneck for homology modeling when the query is distantly related to the template. Alignment methods often misalign secondary structural elements by a few residues. Therefore, better alignment solutions can be found within a limited set of local shifts of secondary structures. We present a refinement method to improve pairwise sequence alignments by evaluating alignment variants generated by local shifts of template-defined secondary structures. Our method SFESA is based on a novel scoring function that combines the profile-based sequence score and the structure score derived from residue contacts in a template. Such a combined score frequently selects a better alignment variant among a set of candidate alignments generated by local shifts and leads to overall increase in alignment accuracy. Evaluation of several benchmarks shows that our refinement method significantly improves alignments made by automatic methods such as PROMALS, HHpred and CNFpred. The web server is available at http://prodata.swmed.edu/sfesa.
Collapse
Affiliation(s)
- Jing Tong
- Department of Biophysics, University of Texas Southwestern Medical Center at Dallas, Dallas, Texas, 75390; Department of Biochemistry, University of Texas Southwestern Medical Center at Dallas, Dallas, Texas, 75390
| | | | | | | |
Collapse
|
69
|
Secondary and Tertiary Structure Prediction of Proteins: A Bioinformatic Approach. COMPLEX SYSTEM MODELLING AND CONTROL THROUGH INTELLIGENT SOFT COMPUTATIONS 2015. [DOI: 10.1007/978-3-319-12883-2_19] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
70
|
Du H, Brender JR, Zhang J, Zhang Y. Protein structure prediction provides comparable performance to crystallographic structures in docking-based virtual screening. Methods 2015; 71:77-84. [PMID: 25220914 PMCID: PMC4431978 DOI: 10.1016/j.ymeth.2014.08.017] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Revised: 08/14/2014] [Accepted: 08/31/2014] [Indexed: 11/26/2022] Open
Abstract
Structure based virtual screening has largely been limited to protein targets for which either an experimental structure is available or a strongly homologous template exists so that a high-resolution model can be constructed. The performance of state of the art protein structure predictions in virtual screening in systems where only weakly homologous templates are available is largely untested. Using the challenging DUD database of structural decoys, we show here that even using templates with only weak sequence homology (<30% sequence identity) structural models can be constructed by I-TASSER which achieve comparable enrichment rates to using the experimental bound crystal structure in the majority of the cases studied. For 65% of the targets, the I-TASSER models, which are constructed essentially in the apo conformations, reached 70% of the virtual screening performance of using the holo-crystal structures. A correlation was observed between the success of I-TASSER in modeling the global fold and local structures in the binding pockets of the proteins versus the relative success in virtual screening. The virtual screening performance can be further improved by the recognition of chemical features of the ligand compounds. These results suggest that the combination of structure-based docking and advanced protein structure modeling methods should be a valuable approach to the large-scale drug screening and discovery studies, especially for the proteins lacking crystallographic structures.
Collapse
Affiliation(s)
- Hongying Du
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA; Department of Public Health, Lanzhou University, Lanzhou 730000, China
| | - Jeffrey R Brender
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Jian Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.
| |
Collapse
|
71
|
Berman HM, Gabanyi MJ, Groom CR, Johnson JE, Murshudov GN, Nicholls RA, Reddy V, Schwede T, Zimmerman MD, Westbrook J, Minor W. Data to knowledge: how to get meaning from your result. IUCRJ 2015; 2:45-58. [PMID: 25610627 PMCID: PMC4285880 DOI: 10.1107/s2052252514023306] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2014] [Accepted: 10/22/2014] [Indexed: 05/19/2023]
Abstract
Structural and functional studies require the development of sophisticated 'Big Data' technologies and software to increase the knowledge derived and ensure reproducibility of the data. This paper presents summaries of the Structural Biology Knowledge Base, the VIPERdb Virus Structure Database, evaluation of homology modeling by the Protein Model Portal, the ProSMART tool for conformation-independent structure comparison, the LabDB 'super' laboratory information management system and the Cambridge Structural Database. These techniques and technologies represent important tools for the transformation of crystallographic data into knowledge and information, in an effort to address the problem of non-reproducibility of experimental results.
Collapse
Affiliation(s)
- Helen M. Berman
- Center for Integrative Proteomics Research, Department of Chemistry and Chemical Biology, Rutgers, State University of New Jersey, Piscataway, NJ 08854, USA
| | - Margaret J. Gabanyi
- Center for Integrative Proteomics Research, Department of Chemistry and Chemical Biology, Rutgers, State University of New Jersey, Piscataway, NJ 08854, USA
| | - Colin R. Groom
- Cambridge Crystallographic Data Centre, 12 Union Road, Cambridge CB2 1EZ, England
| | - John E. Johnson
- Department of Integrative Structural and Computational Biology, Scripps Research Institute, La Jolla, CA 92037, USA
| | - Garib N. Murshudov
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, England
| | - Robert A. Nicholls
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, England
| | - Vijay Reddy
- Department of Integrative Structural and Computational Biology, Scripps Research Institute, La Jolla, CA 92037, USA
| | - Torsten Schwede
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, 4056 Basel, Switzerland
- SIB-Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Matthew D. Zimmerman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22908, USA
| | - John Westbrook
- Center for Integrative Proteomics Research, Department of Chemistry and Chemical Biology, Rutgers, State University of New Jersey, Piscataway, NJ 08854, USA
| | - Wladek Minor
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22908, USA
| |
Collapse
|
72
|
Zhou J, Grigoryan G. Rapid search for tertiary fragments reveals protein sequence-structure relationships. Protein Sci 2014; 24:508-24. [PMID: 25420575 DOI: 10.1002/pro.2610] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2014] [Accepted: 11/21/2014] [Indexed: 12/31/2022]
Abstract
Finding backbone substructures from the Protein Data Bank that match an arbitrary query structural motif, composed of multiple disjoint segments, is a problem of growing relevance in structure prediction and protein design. Although numerous protein structure search approaches have been proposed, methods that address this specific task without additional restrictions and on practical time scales are generally lacking. Here, we propose a solution, dubbed MASTER, that is both rapid, enabling searches over the Protein Data Bank in a matter of seconds, and provably correct, finding all matches below a user-specified root-mean-square deviation cutoff. We show that despite the potentially exponential time complexity of the problem, running times in practice are modest even for queries with many segments. The ability to explore naturally plausible structural and sequence variations around a given motif has the potential to synthesize its design principles in an automated manner; so we go on to illustrate the utility of MASTER to protein structural biology. We demonstrate its capacity to rapidly establish structure-sequence relationships, uncover the native designability landscapes of tertiary structural motifs, identify structural signatures of binding, and automatically rewire protein topologies. Given the broad utility of protein tertiary fragment searches, we hope that providing MASTER in an open-source format will enable novel advances in understanding, predicting, and designing protein structure.
Collapse
Affiliation(s)
- Jianfu Zhou
- Department of Computer Science, Dartmouth College, Hanover, New Hampshire, 03755
| | | |
Collapse
|
73
|
Three-dimensional protein structure prediction: Methods and computational strategies. Comput Biol Chem 2014; 53PB:251-276. [DOI: 10.1016/j.compbiolchem.2014.10.001] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Revised: 10/03/2014] [Accepted: 10/07/2014] [Indexed: 01/01/2023]
|
74
|
Tyagi A, Tuknait A, Anand P, Gupta S, Sharma M, Mathur D, Joshi A, Singh S, Gautam A, Raghava GPS. CancerPPD: a database of anticancer peptides and proteins. Nucleic Acids Res 2014; 43:D837-43. [PMID: 25270878 PMCID: PMC4384006 DOI: 10.1093/nar/gku892] [Citation(s) in RCA: 240] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
CancerPPD (http://crdd.osdd.net/raghava/cancerppd/) is a repository of experimentally verified anticancer peptides (ACPs) and anticancer proteins. Data were manually collected from published research articles, patents and from other databases. The current release of CancerPPD consists of 3491 ACP and 121 anticancer protein entries. Each entry provides comprehensive information related to a peptide like its source of origin, nature of the peptide, anticancer activity, N- and C-terminal modifications, conformation, etc. Additionally, CancerPPD provides the information of around 249 types of cancer cell lines and 16 different assays used for testing the ACPs. In addition to natural peptides, CancerPPD contains peptides having non-natural, chemically modified residues and D-amino acids. Besides this primary information, CancerPPD stores predicted tertiary structures as well as peptide sequences in SMILES format. Tertiary structures of peptides were predicted using the state-of-art method, PEPstr and secondary structural states were assigned using DSSP. In order to assist users, a number of web-based tools have been integrated, these include keyword search, data browsing, sequence and structural similarity search. We believe that CancerPPD will be very useful in designing peptide-based anticancer therapeutics.
Collapse
Affiliation(s)
- Atul Tyagi
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| | - Abhishek Tuknait
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| | - Priya Anand
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| | - Sudheer Gupta
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| | - Minakshi Sharma
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| | - Deepika Mathur
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| | - Anshika Joshi
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| | - Sandeep Singh
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| | - Ankur Gautam
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| | - Gajendra P S Raghava
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh 160036, Punjab, India
| |
Collapse
|
75
|
Gallo Cassarino T, Bordoli L, Schwede T. Assessment of ligand binding site predictions in CASP10. Proteins 2014; 82 Suppl 2:154-63. [PMID: 24339001 DOI: 10.1002/prot.24495] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2013] [Revised: 12/04/2013] [Accepted: 12/09/2013] [Indexed: 12/27/2022]
Abstract
The identification of amino acid residues in proteins involved in binding small molecule ligands is an important step for their functional characterization, as the function of a protein often depends on specific interactions with other molecules. The accuracy of computational methods aiming to predict such binding residues was evaluated within the "function prediction (prediction of binding sites, FN)" category of the critical assessment of protein structure prediction (CASP) experiment. In the last edition of the experiment (CASP10), 17 research groups participated in this category, and their predictions were evaluated on 13 prediction targets containing biologically relevant ligands. The results of this experiment indicate that several methods achieved an overall good performance, showing the usefulness of such methods in predicting ligand binding residues. As in previous years, methods based on a homology transfer approach were dominating. In comparison to CASP9, a larger fraction of the top predictors are automated servers. However, due to the small number of targets and the characteristics of the prediction format, the differences observed among the first ten methods were not statistically significant and it was also not possible to analyze differences in accuracy for different ligand types or overall structure, difficulty. To overcome these limitations and to allow for a more detailed evaluation, in future editions of CASP, methods in the FN category will no longer be evaluated on the "normal" CASP targets, but assessed continuously by CAMEO (continuous automated model evaluation) based on weekly prereleased sequences from the PDB.
Collapse
Affiliation(s)
- Tiziano Gallo Cassarino
- Biozentrum, University of Basel, Basel, Switzerland; SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | | | | |
Collapse
|
76
|
Lhor M, Salesse C. Retinol dehydrogenases: membrane-bound enzymes for the visual function. Biochem Cell Biol 2014; 92:510-23. [PMID: 25357265 DOI: 10.1139/bcb-2014-0082] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Retinoid metabolism is important for many physiological functions, such as differenciation, growth, and vision. In the visual context, after the absorption of light in rod photoreceptors by the visual pigment rhodopsin, 11-cis retinal is isomerized to all-trans retinal. This retinoid subsequently undergoes a series of modifications during the visual cycle through a cascade of reactions occurring in photoreceptors and in the retinal pigment epithelium. Retinol dehydrogenases (RDHs) are enzymes responsible for crucial steps of this visual cycle. They belong to a large family of proteins designated as short-chain dehydrogenases/reductases. The structure of these RDHs has been predicted using modern bioinformatics tools, which allowed to propose models with similar structures including a common Rossman fold. These enzymes undergo oxidoreduction reactions, whose direction is dictated by the preference and concentration of their individual cofactor (NAD(H)/NADP(H)). This review presents the current state of knowledge on functional and structural features of RDHs involved in the visual cycle as well as knockout models. RDHs are described as integral or peripheral enzymes. A topology model of the membrane binding of these RDHs via their N- and (or) C-terminal domain has been proposed on the basis of their individual properties. Membrane binding is a crucial issue for these enzymes because of the high hydrophobicity of their retinoid substrates.
Collapse
Affiliation(s)
- Mustapha Lhor
- a CUO-Recherche, Centre de recherche du CHU de Québec, Hôpital du Saint Sacrement, Département d'ophtalmologie, Faculté de médicine, Université Laval, Québec, QC G1S 4L8, Canada
| | | |
Collapse
|
77
|
Modelling the structure of full-length Epstein–Barr virus nuclear antigen 1. Virus Genes 2014; 49:358-72. [DOI: 10.1007/s11262-014-1101-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Accepted: 06/27/2014] [Indexed: 12/27/2022]
|
78
|
Kryshtafovych A, Moult J, Bales P, Bazan JF, Biasini M, Burgin A, Chen C, Cochran FV, Craig TK, Das R, Fass D, Garcia-Doval C, Herzberg O, Lorimer D, Luecke H, Ma X, Nelson DC, van Raaij MJ, Rohwer F, Segall A, Seguritan V, Zeth K, Schwede T. Challenging the state of the art in protein structure prediction: Highlights of experimental target structures for the 10th Critical Assessment of Techniques for Protein Structure Prediction Experiment CASP10. Proteins 2014; 82 Suppl 2:26-42. [PMID: 24318984 PMCID: PMC4072496 DOI: 10.1002/prot.24489] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2013] [Revised: 11/01/2013] [Accepted: 11/09/2013] [Indexed: 11/12/2022]
Abstract
For the last two decades, CASP has assessed the state of the art in techniques for protein structure prediction and identified areas which required further development. CASP would not have been possible without the prediction targets provided by the experimental structural biology community. In the latest experiment, CASP10, more than 100 structures were suggested as prediction targets, some of which appeared to be extraordinarily difficult for modeling. In this article, authors of some of the most challenging targets discuss which specific scientific question motivated the experimental structure determination of the target protein, which structural features were especially interesting from a structural or functional perspective, and to what extent these features were correctly reproduced in the predictions submitted to CASP10. Specifically, the following targets will be presented: the acid-gated urea channel, a difficult to predict transmembrane protein from the important human pathogen Helicobacter pylori; the structure of human interleukin (IL)-34, a recently discovered helical cytokine; the structure of a functionally uncharacterized enzyme OrfY from Thermoproteus tenax formed by a gene duplication and a novel fold; an ORFan domain of mimivirus sulfhydryl oxidase R596; the fiber protein gene product 17 from bacteriophage T7; the bacteriophage CBA-120 tailspike protein; a virus coat protein from metagenomic samples of the marine environment; and finally, an unprecedented class of structure prediction targets based on engineered disulfide-rich small proteins.
Collapse
Affiliation(s)
- Andriy Kryshtafovych
- Genome Center, University of California, Davis, 451 Health Sciences Drive, Davis, California 95616,
| | - John Moult
- Institute for Bioscience and Biotechnology Research, Department of Cell Biology and Molecular genetics, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA;
| | - Patrick Bales
- Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA;
| | - J. Fernando Bazan
- (1) Departments of Protein Engineering and (2) Structural Biology, Genentech, 1 DNA Way, South San Francisco, CA 94080, (3) Present address: 44th & Aspen Life Sciences, 924 4th St. N., Stillwater, MN 55082,
| | - Marco Biasini
- (1) Biozentrum, University of Basel, Klingelbergstrasse 50, 4056 Basel, Switzerland; (2) SIB Swiss Institute of Bioinformatics, Klingelbergstrasse 50, 4056 Basel, Switzerland;
| | - Alex Burgin
- Broad Institute, 5 Cambridge Center, Cambridge, MA 02142, USA;
| | - Chen Chen
- Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA;
| | - Frank V. Cochran
- Department of Biochemistry, Stanford University, Stanford, California, 94305, USA;
| | | | - Rhiju Das
- (1) Department of Biochemistry, Stanford University, Stanford, California, 94305, USA; (2) Department of Physics, Stanford University, Stanford, California, 94305, USA,
| | - Deborah Fass
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 76100 Israel, Tel: +972-8-934-3214; Fax: +972-8-934-4136;
| | - Carmela Garcia-Doval
- Centro Nactional de Biotecnologia (CNB-CSIC), calle Darwin 3, E-28049 Madrid, Spain.
| | - Osnat Herzberg
- (1) Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA; (2) Department of Chemistry and Biochemistry, University of Maryland, College Park;
| | - Donald Lorimer
- Emerald Bio, 7869 NE Day Rd W, Bainbridge Isle, WA 98110, USA;
| | - Hartmut Luecke
- Center for Biomembrane Systems and Depts. of Biochemistry, Biophysics & Computer Science, 3205 McGaugh Hall, University of California, Irvine, CA 92697-3900, USA;
| | - Xiaolei Ma
- (1) Departments of Protein Engineering and (2) Structural Biology, Genentech, 1 DNA Way, South San Francisco, CA 94080 (3) Present address: Novartis Institutes for Biomedical Research, 4560 Horton St., Emeryville, CA 94608, USA;
| | - Daniel C. Nelson
- (1) Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA; (2) Department of Veterinary Medicine, University of Maryland, College Park,
| | - Mark J. van Raaij
- Centro Nactional de Biotecnologia (CNB-CSIC), calle Darwin 3, E-28049 Madrid, Spain.
| | - Forest Rohwer
- Department of Biology, San Diego State University, San Diego, CA 92182, USA;
| | - Anca Segall
- Department of Biology, San Diego State University, San Diego, CA 92182, USA;
| | - Victor Seguritan
- Department of Biology, San Diego State University, San Diego, CA 9218
| | - Kornelius Zeth
- Unidad de Biofisica (CSIC-UPV/EHU), Barrio Sarriena s/n 48940, Leioa, Vizcaya, SPAIN, and IKERBASQUE, Basque Foundation for Science, Bilbao, Spain;
| | - Torsten Schwede
- (1) Biozentrum, University of Basel, Klingelbergstrasse 50, 4056 Basel, Switzerland; (2) SIB Swiss Institute of Bioinformatics, Klingelbergstrasse 50, 4056 Basel, Switzerland;
| |
Collapse
|
79
|
Moult J, Fidelis K, Kryshtafovych A, Schwede T, Tramontano A. Critical assessment of methods of protein structure prediction (CASP)--round x. Proteins 2014. [PMID: 24344053 DOI: 10.1002/prot.24452.critical] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/16/2023]
Abstract
This article is an introduction to the special issue of the journal PROTEINS, dedicated to the tenth Critical Assessment of Structure Prediction (CASP) experiment to assess the state of the art in protein structure modeling. The article describes the conduct of the experiment, the categories of prediction included, and outlines the evaluation and assessment procedures. The 10 CASP experiments span almost 20 years of progress in the field of protein structure modeling, and there have been enormous advances in methods and model accuracy in that period. Notable in this round is the first sustained improvement of models with refinement methods, using molecular dynamics. For the first time, we tested the ability of modeling methods to make use of sparse experimental three-dimensional contact information, such as may be obtained from new experimental techniques, with encouraging results. On the other hand, new contact prediction methods, though holding considerable promise, have yet to make an impact in CASP testing. The nature of CASP targets has been changing in recent CASPs, reflecting shifts in experimental structural biology, with more irregular structures, more multi-domain and multi-subunit structures, and less standard versions of known folds. When allowance is made for these factors, we continue to see steady progress in the overall accuracy of models, particularly resulting from improvement of non-template regions.
Collapse
Affiliation(s)
- John Moult
- Institute for Bioscience and Biotechnology Research and Department of Cell Biology and Molecular Genetics, University of Maryland, Rockville, Maryland, 20850
| | | | | | | | | |
Collapse
|
80
|
Moult J, Fidelis K, Kryshtafovych A, Schwede T, Tramontano A. Critical assessment of methods of protein structure prediction (CASP)--round x. Proteins 2014; 82 Suppl 2:1-6. [PMID: 24344053 PMCID: PMC4394854 DOI: 10.1002/prot.24452] [Citation(s) in RCA: 312] [Impact Index Per Article: 31.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2013] [Accepted: 10/21/2013] [Indexed: 12/28/2022]
Abstract
This article is an introduction to the special issue of the journal PROTEINS, dedicated to the tenth Critical Assessment of Structure Prediction (CASP) experiment to assess the state of the art in protein structure modeling. The article describes the conduct of the experiment, the categories of prediction included, and outlines the evaluation and assessment procedures. The 10 CASP experiments span almost 20 years of progress in the field of protein structure modeling, and there have been enormous advances in methods and model accuracy in that period. Notable in this round is the first sustained improvement of models with refinement methods, using molecular dynamics. For the first time, we tested the ability of modeling methods to make use of sparse experimental three-dimensional contact information, such as may be obtained from new experimental techniques, with encouraging results. On the other hand, new contact prediction methods, though holding considerable promise, have yet to make an impact in CASP testing. The nature of CASP targets has been changing in recent CASPs, reflecting shifts in experimental structural biology, with more irregular structures, more multi-domain and multi-subunit structures, and less standard versions of known folds. When allowance is made for these factors, we continue to see steady progress in the overall accuracy of models, particularly resulting from improvement of non-template regions.
Collapse
Affiliation(s)
- John Moult
- Institute for Bioscience and Biotechnology Research, and Department of Cell Biology and Molecular Genetics, University of Maryland, Rockville, Maryland 20850
| | | | | | - Torsten Schwede
- University of Basel, Biozentrum & SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Anna Tramontano
- Department of Physics and Istituto Pasteur-Fondazione Cenci Bolognetti, Sapienza University of Rome, 00185 Rome, Italy
| |
Collapse
|
81
|
Abstract
This article is an introduction to the special issue of the journal PROTEINS, dedicated to the tenth Critical Assessment of Structure Prediction (CASP) experiment to assess the state of the art in protein structure modeling. The article describes the conduct of the experiment, the categories of prediction included, and outlines the evaluation and assessment procedures. The 10 CASP experiments span almost 20 years of progress in the field of protein structure modeling, and there have been enormous advances in methods and model accuracy in that period. Notable in this round is the first sustained improvement of models with refinement methods, using molecular dynamics. For the first time, we tested the ability of modeling methods to make use of sparse experimental three-dimensional contact information, such as may be obtained from new experimental techniques, with encouraging results. On the other hand, new contact prediction methods, though holding considerable promise, have yet to make an impact in CASP testing. The nature of CASP targets has been changing in recent CASPs, reflecting shifts in experimental structural biology, with more irregular structures, more multi-domain and multi-subunit structures, and less standard versions of known folds. When allowance is made for these factors, we continue to see steady progress in the overall accuracy of models, particularly resulting from improvement of non-template regions.
Collapse
|
82
|
Kryshtafovych A, Fidelis K, Moult J. CASP10 results compared to those of previous CASP experiments. Proteins 2013; 82 Suppl 2:164-74. [PMID: 24150928 DOI: 10.1002/prot.24448] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2013] [Revised: 10/04/2013] [Accepted: 10/04/2013] [Indexed: 11/11/2022]
Abstract
We compare results of the community efforts in modeling protein structures in the tenth CASP experiment, with those in earlier CASPs particularly in CASP5, a decade ago. There is a substantial improvement in template based model accuracy as reflected in more successful modeling of regions of structure not easily derived from a single experimental structure template, most likely reflecting intensive work within the modeling community in developing methods that make use of multiple templates, as well as the increased number of experimental structures available. Deriving structural information not obvious from a template is the most demanding as well as one of the most useful tasks that modeling can perform. Thus this is gratifying progress. By contrast, overall backbone accuracy of models appears little changed in the last decade. This puzzling result is explained by two factors--increased database size in some ways makes it harder to choose the best available templates, and the increased intrinsic difficulty of CASP targets as experimental work has progressed to larger and more unusual structures. There is no detectable recent improvement in template-free modeling, but again, this may reflect the changing nature of CASP targets.
Collapse
|