1
|
Toth JM, DePietro PJ, Haas J, McLaughlin WA. ResiRole: residue-level functional site predictions to gauge the accuracies of protein structure prediction techniques. Bioinformatics 2021; 37:351-359. [PMID: 32780798 PMCID: PMC8058773 DOI: 10.1093/bioinformatics/btaa712] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Revised: 07/31/2020] [Accepted: 08/05/2020] [Indexed: 11/25/2022] Open
Abstract
Motivation Methods to assess the quality of protein structure models are needed for user applications. To aid with the selection of structure models and further inform the development of structure prediction techniques, we describe the ResiRole method for the assessment of the quality of structure models. Results Structure prediction techniques are ranked according to the results of round-robin, head-to-head comparisons using difference scores. Each difference score was defined as the absolute value of the cumulative probability for a functional site prediction made with the FEATURE program for the reference structure minus that for the structure model. Overall, the difference scores correlate well with other model quality metrics; and based on benchmarking studies with NaïveBLAST, they are found to detect additional local structural similarities between the structure models and reference structures. Availabilityand implementation Automated analyses of models addressed in CAMEO are available via the ResiRole server, URL http://protein.som.geisinger.edu/ResiRole/. Interactive analyses with user-provided models and reference structures are also enabled. Code is available at github.com/wamclaughlin/ResiRole. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Joshua M Toth
- Department of Medical Education, Geisinger Commonwealth School of Medicine, Scranton, PA 18510, USA
| | - Paul J DePietro
- Department of Medical Education, Geisinger Commonwealth School of Medicine, Scranton, PA 18510, USA
| | - Juergen Haas
- Biozentrum, University of Basel and SIB Swiss Institute of Bioinformatics, CH-4056 Basel, Switzerland
| | - William A McLaughlin
- Department of Medical Education, Geisinger Commonwealth School of Medicine, Scranton, PA 18510, USA
| |
Collapse
|
2
|
Methods for the Refinement of Protein Structure 3D Models. Int J Mol Sci 2019; 20:ijms20092301. [PMID: 31075942 PMCID: PMC6539982 DOI: 10.3390/ijms20092301] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 04/24/2019] [Accepted: 05/07/2019] [Indexed: 12/25/2022] Open
Abstract
The refinement of predicted 3D protein models is crucial in bringing them closer towards experimental accuracy for further computational studies. Refinement approaches can be divided into two main stages: The sampling and scoring stages. Sampling strategies, such as the popular Molecular Dynamics (MD)-based protocols, aim to generate improved 3D models. However, generating 3D models that are closer to the native structure than the initial model remains challenging, as structural deviations from the native basin can be encountered due to force-field inaccuracies. Therefore, different restraint strategies have been applied in order to avoid deviations away from the native structure. For example, the accurate prediction of local errors and/or contacts in the initial models can be used to guide restraints. MD-based protocols, using physics-based force fields and smart restraints, have made significant progress towards a more consistent refinement of 3D models. The scoring stage, including energy functions and Model Quality Assessment Programs (MQAPs) are also used to discriminate near-native conformations from non-native conformations. Nevertheless, there are often very small differences among generated 3D models in refinement pipelines, which makes model discrimination and selection problematic. For this reason, the identification of the most native-like conformations remains a major challenge.
Collapse
|
3
|
Fouladi M, Sarhadi S, Tohidkia M, Fahimi F, Samadi N, Sadeghi J, Barar J, Omidi Y. Selection of a fully human single domain antibody specific to Helicobacter pylori urease. Appl Microbiol Biotechnol 2019; 103:3407-3420. [DOI: 10.1007/s00253-019-09674-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Revised: 01/29/2019] [Accepted: 01/30/2019] [Indexed: 12/14/2022]
|
4
|
Keasar C, McGuffin LJ, Wallner B, Chopra G, Adhikari B, Bhattacharya D, Blake L, Bortot LO, Cao R, Dhanasekaran BK, Dimas I, Faccioli RA, Faraggi E, Ganzynkowicz R, Ghosh S, Ghosh S, Giełdoń A, Golon L, He Y, Heo L, Hou J, Khan M, Khatib F, Khoury GA, Kieslich C, Kim DE, Krupa P, Lee GR, Li H, Li J, Lipska A, Liwo A, Maghrabi AHA, Mirdita M, Mirzaei S, Mozolewska MA, Onel M, Ovchinnikov S, Shah A, Shah U, Sidi T, Sieradzan AK, Ślusarz M, Ślusarz R, Smadbeck J, Tamamis P, Trieber N, Wirecki T, Yin Y, Zhang Y, Bacardit J, Baranowski M, Chapman N, Cooper S, Defelicibus A, Flatten J, Koepnick B, Popović Z, Zaborowski B, Baker D, Cheng J, Czaplewski C, Delbem ACB, Floudas C, Kloczkowski A, Ołdziej S, Levitt M, Scheraga H, Seok C, Söding J, Vishveshwara S, Xu D, Crivelli SN. An analysis and evaluation of the WeFold collaborative for protein structure prediction and its pipelines in CASP11 and CASP12. Sci Rep 2018; 8:9939. [PMID: 29967418 PMCID: PMC6028396 DOI: 10.1038/s41598-018-26812-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 05/17/2018] [Indexed: 01/14/2023] Open
Abstract
Every two years groups worldwide participate in the Critical Assessment of Protein Structure Prediction (CASP) experiment to blindly test the strengths and weaknesses of their computational methods. CASP has significantly advanced the field but many hurdles still remain, which may require new ideas and collaborations. In 2012 a web-based effort called WeFold, was initiated to promote collaboration within the CASP community and attract researchers from other fields to contribute new ideas to CASP. Members of the WeFold coopetition (cooperation and competition) participated in CASP as individual teams, but also shared components of their methods to create hybrid pipelines and actively contributed to this effort. We assert that the scale and diversity of integrative prediction pipelines could not have been achieved by any individual lab or even by any collaboration among a few partners. The models contributed by the participating groups and generated by the pipelines are publicly available at the WeFold website providing a wealth of data that remains to be tapped. Here, we analyze the results of the 2014 and 2016 pipelines showing improvements according to the CASP assessment as well as areas that require further adjustments and research.
Collapse
Affiliation(s)
- Chen Keasar
- Department of Computer Science, Ben Gurion University of the Negev, Be'er sheva, Israel
| | - Liam J McGuffin
- Biomedical Sciences Division, School of Biological Sciences, University of Reading, Reading, RG6 6AS, UK
| | - Björn Wallner
- Division of Bioinformatics, Department of Physics, Chemistry, and Biology, Linköping University, Linköping, Sweden
| | - Gaurav Chopra
- Department of Chemistry, College of Science, Purdue University, West Lafayette, IN, USA
- Purdue Institute for Drug Discovery, Purdue University, West Lafayette, IN, USA
- Purdue Center for Cancer Research, Purdue University, West Lafayette, IN, USA
- Purdue Institute for Inflammation, Immunology and Infectious Disease, Purdue University, West Lafayette, IN, USA
- Purdue Institute for Integrative Neuroscience, Purdue University, West Lafayette, IN, USA
| | - Badri Adhikari
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | - Debswapna Bhattacharya
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- Department of Computer Science and Software Engineering, Auburn University, Auburn, AL, USA
| | - Lauren Blake
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Leandro Oliveira Bortot
- Laboratory of Biological Physics, Faculty of Pharmaceutical Sciences at Ribeirão Preto, University of São Paulo, São Paulo, Brazil
| | - Renzhi Cao
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | - B K Dhanasekaran
- Molecular Biophysics Unit and IISC Mathematics Initiative, Indian Institute of Science, Bangalore, India
| | - Itzhel Dimas
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Eshel Faraggi
- Research and Information Systems, LLC, Carmel, IN, USA
- Department of Biochemistry and Molecular Biology, IU School of Medicine, Indianapolis, IN, USA
- Batelle Center for Mathematical Medicine, The Research Institute at Nationwide Children's Hospital, Columbus, OH, USA
| | | | - Sambit Ghosh
- Molecular Biophysics Unit and IISC Mathematics Initiative, Indian Institute of Science, Bangalore, India
| | - Soma Ghosh
- Molecular Biophysics Unit and IISC Mathematics Initiative, Indian Institute of Science, Bangalore, India
| | - Artur Giełdoń
- Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
| | - Lukasz Golon
- Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
| | - Yi He
- School of Engineering, University of California, Merced, CA, USA
| | - Lim Heo
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Jie Hou
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | - Main Khan
- Department of Computer and Information Science, University of Massachusetts Dartmouth, MA, USA
| | - Firas Khatib
- Department of Computer and Information Science, University of Massachusetts Dartmouth, MA, USA
| | - George A Khoury
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
| | - Chris Kieslich
- Texas A&M Energy Institute, Texas A&M University, College Station, TX, USA
| | - David E Kim
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Pawel Krupa
- Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
| | - Gyu Rie Lee
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Hongbo Li
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- School of Computer Science and Information Technology, NorthEast Normal University, Changchun, China
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Jilong Li
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | | | - Adam Liwo
- Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
| | - Ali Hassan A Maghrabi
- Biomedical Sciences Division, School of Biological Sciences, University of Reading, Reading, RG6 6AS, UK
| | - Milot Mirdita
- Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Shokoufeh Mirzaei
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- California State Polytechnic University, Pomona, CA, USA
| | | | - Melis Onel
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, USA
| | - Sergey Ovchinnikov
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Anand Shah
- Department of Computer and Information Science, University of Massachusetts Dartmouth, MA, USA
| | - Utkarsh Shah
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, USA
| | - Tomer Sidi
- Department of Computer Science, Ben Gurion University of the Negev, Be'er sheva, Israel
| | | | | | - Rafal Ślusarz
- Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
| | - James Smadbeck
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
| | - Phanourios Tamamis
- Texas A&M Energy Institute, Texas A&M University, College Station, TX, USA
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, USA
| | - Nicholas Trieber
- Department of Computer and Information Science, University of Massachusetts Dartmouth, MA, USA
| | - Tomasz Wirecki
- Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
| | - Yanping Yin
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Jaume Bacardit
- Interdisciplinary Computing and Complex BioSystems (ICOS) research group, School of Computing, Newcastle University, Newcastle-upon-Tyne, UK
| | - Maciej Baranowski
- Intercollegiate Faculty of Biotechnology, University of Gdańsk and Medical University of Gdańsk, Gdańsk, Poland
| | - Nicholas Chapman
- Center for Game Science, Department of Computer Science & Engineering, University of Washington, Seattle, WA, USA
| | - Seth Cooper
- College of Computer and Information Science, Northeastern University, Boston, MA, USA
| | - Alexandre Defelicibus
- Institute of Mathematical and Computer Sciences, University of São Paulo, São Paulo, Brazil
| | - Jeff Flatten
- Center for Game Science, Department of Computer Science & Engineering, University of Washington, Seattle, WA, USA
| | - Brian Koepnick
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Zoran Popović
- Center for Game Science, Department of Computer Science & Engineering, University of Washington, Seattle, WA, USA
| | | | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
- Center for Game Science, Department of Computer Science & Engineering, University of Washington, Seattle, WA, USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | | | | | | | | | - Stanislaw Ołdziej
- Intercollegiate Faculty of Biotechnology, University of Gdańsk and Medical University of Gdańsk, Gdańsk, Poland
| | - Michael Levitt
- Department of Structural Biology, School of Medicine, Stanford University, Stanford, CA, USA
| | - Harold Scheraga
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, USA
| | - Chaok Seok
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Johannes Söding
- Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Saraswathi Vishveshwara
- Molecular Biophysics Unit and IISC Mathematics Initiative, Indian Institute of Science, Bangalore, India
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Silvia N Crivelli
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
- Department of Computer Science, University of California, Davis, CA, USA.
| |
Collapse
|
5
|
Orr AA, Gonzalez-Rivera JC, Wilson M, Bhikha PR, Wang D, Contreras LM, Tamamis P. A high-throughput and rapid computational method for screening of RNA post-transcriptional modifications that can be recognized by target proteins. Methods 2018; 143:34-47. [DOI: 10.1016/j.ymeth.2018.01.015] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Revised: 01/14/2018] [Accepted: 01/26/2018] [Indexed: 12/25/2022] Open
|
6
|
Phage display-derived antibody fragments against conserved regions of VacA toxin of Helicobacter pylori. Appl Microbiol Biotechnol 2018; 102:6899-6913. [PMID: 29862446 DOI: 10.1007/s00253-018-9068-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Revised: 05/01/2018] [Accepted: 05/02/2018] [Indexed: 12/12/2022]
Abstract
Infection with Helicobacter pylori may result in the emergence of gastric adenocarcinoma. Among various toxins assisting pathogenesis of H. pylori, the vacuolating cytotoxin A (VacA) is one of the most potent toxins known as the major cause of the peptic ulcer and gastric adenocarcinoma. To isolate single-chain variable fragments (scFvs) against two conserved regions of VacA, we capitalized on the phage display technology and a solution-phase biopanning (SPB). Characterization of scFvs was carried out by enzyme-linked immunosorbent assay (ELISA), immunoblotting, and surface plasmon resonance (SPR). Bioinformatics analyses were also performed in order to characterize the structural and functional properties of the isolated scFvs and the interaction(s) between the isolated antibodies (Ab)-antigen (Ag). After four rounds of biopanning, the positive colonies detected by scFv ELISA were harvested to extract the plasmids and perform sequencing. Of several colonies, three colonies showed high affinity to the VacA1 and two colonies for the VacA2. Further complementary examinations (e.g., sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE), western blot, SPR, and flow cytometry) displayed the high affinity and specificity of the isolated scFvs to the VacA. Docking results revealed the interaction of the complementarity-determining regions (CDRs) with the VacA peptide. In conclusion, for the first time, we report on the isolation of several scFvs against conserved residues of VacA toxin with high affinity and specificity, which may be used as novel diagnostic/therapeutic tool in the H. pylori infection.
Collapse
|
7
|
Heo L, Feig M. PREFMD: a web server for protein structure refinement via molecular dynamics simulations. Bioinformatics 2018; 34:1063-1065. [PMID: 29126101 PMCID: PMC5860225 DOI: 10.1093/bioinformatics/btx726] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2017] [Revised: 10/04/2017] [Accepted: 11/07/2017] [Indexed: 11/13/2022] Open
Abstract
Summary Refinement of protein structure models is a long-standing problem in structural bioinformatics. Molecular dynamics-based methods have emerged as an avenue to achieve consistent refinement. The PREFMD web server implements an optimized protocol based on the method successfully tested in CASP11. Validation with recent CASP refinement targets shows consistent and more significant improvement in global structure accuracy over other state-of-the-art servers. Availability and implementation PREFMD is freely available as a web server at http://feiglab.org/prefmd. Scripts for running PREFMD as a stand-alone package are available at https://github.com/feiglab/prefmd.git. Contact feig@msu.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lim Heo
- Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| | - Michael Feig
- Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
8
|
Structural and functional modeling of viral protein 5 of Infectious Bursal Disease Virus. Virus Res 2018; 247:55-60. [PMID: 29427596 DOI: 10.1016/j.virusres.2018.01.017] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 01/28/2018] [Accepted: 01/31/2018] [Indexed: 11/22/2022]
Abstract
Infectious Bursal Disease (IBD) is an acute, highly contagious and immunosuppressive disease of young chicken. The causative virus (IBDV) is a bi-segmented, double-stranded RNA virus. The virus encodes five major proteins, viral protein (VP) 1-5. VPs 1-3 have been characterized crystallographically. Albeit a rise in the number of studies reporting successful heterologous expression of VP5 in recent times, challenging the notion that rapid death of host cells overexpressing VP5 disallows obtaining sufficiently pure preparations of the protein for crystallographic studies, the structure of VP5 remains unknown and its function controversial. Our study describes the first 3D model of IBD VP5 obtained through an elaborate computational workflow. Based on the results of the study, IBD VP5 can be predicted to be a structural analog of the leucine-rich repeat (LRR) family of proteins. Functional implications arising from structural similarity of VP5 with host Toll-like receptor (Tlr) 3 also satisfy the previously reported opposing roles of the protein in first abolishing and later inducing host-cell apoptosis.
Collapse
|
9
|
Hong SH, Joung I, Flores-Canales JC, Manavalan B, Cheng Q, Heo S, Kim JY, Lee SY, Nam M, Joo K, Lee IH, Lee SJ, Lee J. Protein structure modeling and refinement by global optimization in CASP12. Proteins 2017; 86 Suppl 1:122-135. [PMID: 29159837 DOI: 10.1002/prot.25426] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Revised: 11/10/2017] [Accepted: 11/16/2017] [Indexed: 11/09/2022]
Abstract
For protein structure modeling in the CASP12 experiment, we have developed a new protocol based on our previous CASP11 approach. The global optimization method of conformational space annealing (CSA) was applied to 3 stages of modeling: multiple sequence-structure alignment, three-dimensional (3D) chain building, and side-chain re-modeling. For better template selection and model selection, we updated our model quality assessment (QA) method with the newly developed SVMQA (support vector machine for quality assessment). For 3D chain building, we updated our energy function by including restraints generated from predicted residue-residue contacts. New energy terms for the predicted secondary structure and predicted solvent accessible surface area were also introduced. For difficult targets, we proposed a new method, LEEab, where the template term played a less significant role than it did in LEE, complemented by increased contributions from other terms such as the predicted contact term. For TBM (template-based modeling) targets, LEE performed better than LEEab, but for FM targets, LEEab was better. For model refinement, we modified our CASP11 molecular dynamics (MD) based protocol by using explicit solvents and tuning down restraint weights. Refinement results from MD simulations that used a new augmented statistical energy term in the force field were quite promising. Finally, when using inaccurate information (such as the predicted contacts), it was important to use the Lorentzian function for which the maximal penalty arising from wrong information is always bounded.
Collapse
Affiliation(s)
- Seung Hwan Hong
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, South Korea
| | - InSuk Joung
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, South Korea
| | - Jose C Flores-Canales
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, South Korea
| | - Balachandran Manavalan
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, South Korea
| | - Qianyi Cheng
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, South Korea
| | - Seungryong Heo
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea
| | - Jong Yun Kim
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea
| | - Sun Young Lee
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea
| | - Mikyung Nam
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea
| | - Keehyoung Joo
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea.,Center for Advanced Computation, Korea Institute for Advanced Study, Seoul, South Korea
| | - In-Ho Lee
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea.,Korea Research Institute of Standards and Science (KRISS), Daejeon, South Korea
| | - Sung Jong Lee
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea.,The Research Institute for Basic Sciences, Changwon National University, Changwon-Si, Gyeongsangnam-do, South Korea
| | - Jooyoung Lee
- Center for In Silico Protein Science, Korea Institute for Advanced Study, Seoul, South Korea.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul, South Korea.,Center for Advanced Computation, Korea Institute for Advanced Study, Seoul, South Korea
| |
Collapse
|
10
|
Hovan L, Oleinikovas V, Yalinca H, Kryshtafovych A, Saladino G, Gervasio FL. Assessment of the model refinement category in CASP12. Proteins 2017; 86 Suppl 1:152-167. [PMID: 29071750 DOI: 10.1002/prot.25409] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Revised: 10/03/2017] [Accepted: 10/24/2017] [Indexed: 01/07/2023]
Abstract
We here report on the assessment of the model refinement predictions submitted to the 12th Experiment on the Critical Assessment of Protein Structure Prediction (CASP12). This is the fifth refinement experiment since CASP8 (2008) and, as with the previous experiments, the predictors were invited to refine selected server models received in the regular (nonrefinement) stage of the CASP experiment. We assessed the submitted models using a combination of standard CASP measures. The coefficients for the linear combination of Z-scores (the CASP12 score) have been obtained by a machine learning algorithm trained on the results of visual inspection. We identified eight groups that improve both the backbone conformation and the side chain positioning for the majority of targets. Albeit the top methods adopted distinctively different approaches, their overall performance was almost indistinguishable, with each of them excelling in different scores or target subsets. What is more, there were a few novel approaches that, while doing worse than average in most cases, provided the best refinements for a few targets, showing significant latitude for further innovation in the field.
Collapse
Affiliation(s)
- Ladislav Hovan
- Department of Chemistry, University College London, WC1E 6BT, United Kingdom
| | | | - Havva Yalinca
- Department of Chemistry, University College London, WC1E 6BT, United Kingdom
| | | | - Giorgio Saladino
- Department of Chemistry, University College London, WC1E 6BT, United Kingdom
| | - Francesco Luigi Gervasio
- Department of Chemistry, University College London, WC1E 6BT, United Kingdom.,Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, United Kingdom
| |
Collapse
|
11
|
Cheng Q, Joung I, Lee J. A Simple and Efficient Protein Structure Refinement Method. J Chem Theory Comput 2017; 13:5146-5162. [PMID: 28800396 DOI: 10.1021/acs.jctc.7b00470] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Improving the quality of a given protein structure can serve as the ultimate solution for accurate protein structure prediction, and seeking such a method is currently a challenge in computational structural biology. In order to promote and encourage much needed such efforts, CASP (Critical Assessment of Structure Prediction) has been providing an ideal computational experimental platform, where it was reported only recently (since CASP10) that systematic protein structure refinement is possible by carrying out extensive (approximately millisecond) MD simulations with proper restraints generated from the given structure. Using an explicit solvent model and much reduced positional and distance restraints than previously exercised, we propose a refinement protocol that combines a series of short (5 ns) MD simulations with energy minimization procedures. Testing and benchmarking on 54 CASP8-10 refinement targets and 34 CASP11 refinement targets shows quite promising results. Using only a small fraction of MD simulation steps (nanosecond versus millisecond), systematic protein structure refinement was demonstrated in this work, indicating that refinement of a given model can be achieved using a few hours of desktop computing.
Collapse
Affiliation(s)
- Qianyi Cheng
- Center for In Silico Protein Science and School of Computational Sciences, Korea Institute for Advanced Study , Seoul 02455, Korea
| | - InSuk Joung
- Center for In Silico Protein Science and School of Computational Sciences, Korea Institute for Advanced Study , Seoul 02455, Korea
| | - Jooyoung Lee
- Center for In Silico Protein Science and School of Computational Sciences, Korea Institute for Advanced Study , Seoul 02455, Korea
| |
Collapse
|
12
|
Feig M. Computational protein structure refinement: Almost there, yet still so far to go. WILEY INTERDISCIPLINARY REVIEWS. COMPUTATIONAL MOLECULAR SCIENCE 2017; 7:e1307. [PMID: 30613211 PMCID: PMC6319934 DOI: 10.1002/wcms.1307] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Protein structures are essential in modern biology yet experimental methods are far from being able to catch up with the rapid increase in available genomic data. Computational protein structure prediction methods aim to fill the gap while the role of protein structure refinement is to take approximate initial template-based models and bring them closer to the true native structure. Current methods for computational structure refinement rely on molecular dynamics simulations, related sampling methods, or iterative structure optimization protocols. The best methods are able to achieve moderate degrees of refinement but consistent refinement that can reach near-experimental accuracy remains elusive. Key issues revolve around the accuracy of the energy function, the inability to reliably rank multiple models, and the use of restraints that keep sampling close to the native state but also limit the degree of possible refinement. A different aspect is the question of what exactly the target of high-resolution refinement should be as experimental structures are affected by experimental conditions and different biological questions require varying levels of accuracy. While improvement of the global protein structure is a difficult problem, high-resolution refinement methods that improves local structural quality such as favorable stereochemistry and the avoidance of atomic clashes are much more successful.
Collapse
Affiliation(s)
- Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, 603 Wilson Rd., Room 218 BCH, East Lansing, MI, USA, ; 517-432-7439
| |
Collapse
|
13
|
Khoury GA, Smadbeck J, Kieslich CA, Koskosidis AJ, Guzman YA, Tamamis P, Floudas CA. Princeton_TIGRESS 2.0: High refinement consistency and net gains through support vector machines and molecular dynamics in double-blind predictions during the CASP11 experiment. Proteins 2017; 85:1078-1098. [PMID: 28241391 DOI: 10.1002/prot.25274] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Revised: 02/01/2017] [Accepted: 02/14/2017] [Indexed: 12/28/2022]
Abstract
Protein structure refinement is the challenging problem of operating on any protein structure prediction to improve its accuracy with respect to the native structure in a blind fashion. Although many approaches have been developed and tested during the last four CASP experiments, a majority of the methods continue to degrade models rather than improve them. Princeton_TIGRESS (Khoury et al., Proteins 2014;82:794-814) was developed previously and utilizes separate sampling and selection stages involving Monte Carlo and molecular dynamics simulations and classification using an SVM predictor. The initial implementation was shown to consistently refine protein structures 76% of the time in our own internal benchmarking on CASP 7-10 targets. In this work, we improved the sampling and selection stages and tested the method in blind predictions during CASP11. We added a decomposition of physics-based and hybrid energy functions, as well as a coordinate-free representation of the protein structure through distance-binning Cα-Cα distances to capture fine-grained movements. We performed parameter estimation to optimize the adjustable SVM parameters to maximize precision while balancing sensitivity and specificity across all cross-validated data sets, finding enrichment in our ability to select models from the populations of similar decoys generated for targets in CASPs 7-10. The MD stage was enhanced such that larger structures could be further refined. Among refinement methods that are currently implemented as web-servers, Princeton_TIGRESS 2.0 demonstrated the most consistent and most substantial net refinement in blind predictions during CASP11. The enhanced refinement protocol Princeton_TIGRESS 2.0 is freely available as a web server at http://atlas.engr.tamu.edu/refinement/. Proteins 2017; 85:1078-1098. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- George A Khoury
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey
| | - James Smadbeck
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey
| | - Chris A Kieslich
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas.,Texas A&M Energy Institute, Texas A&M University, College Station, Texas
| | - Alexandra J Koskosidis
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas.,Texas A&M Energy Institute, Texas A&M University, College Station, Texas
| | - Yannis A Guzman
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey.,Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas.,Texas A&M Energy Institute, Texas A&M University, College Station, Texas
| | - Phanourios Tamamis
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas.,Texas A&M Energy Institute, Texas A&M University, College Station, Texas
| | - Christodoulos A Floudas
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas.,Texas A&M Energy Institute, Texas A&M University, College Station, Texas
| |
Collapse
|
14
|
Ghaffar A, Tariq A. In-silico analysis of Pasteurella multocida to identify common epitopes between fowl, goat and buffalo. Gene 2016; 580:58-66. [DOI: 10.1016/j.gene.2016.01.020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2015] [Revised: 12/16/2015] [Accepted: 01/04/2016] [Indexed: 10/22/2022]
|
15
|
Kumar A, Campitelli P, Thorpe MF, Ozkan SB. Partial unfolding and refolding for structure refinement: A unified approach of geometric simulations and molecular dynamics. Proteins 2015; 83:2279-92. [PMID: 26476100 DOI: 10.1002/prot.24947] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Revised: 09/11/2015] [Accepted: 09/29/2015] [Indexed: 12/26/2022]
Abstract
The most successful protein structure prediction methods to date have been template-based modeling (TBM) or homology modeling, which predicts protein structure based on experimental structures. These high accuracy predictions sometimes retain structural errors due to incorrect templates or a lack of accurate templates in the case of low sequence similarity, making these structures inadequate in drug-design studies or molecular dynamics simulations. We have developed a new physics based approach to the protein refinement problem by mimicking the mechanism of chaperons that rehabilitate misfolded proteins. The template structure is unfolded by selectively (targeted) pulling on different portions of the protein using the geometric based technique FRODA, and then refolded using hierarchically restrained replica exchange molecular dynamics simulations (hr-REMD). FRODA unfolding is used to create a diverse set of topologies for surveying near native-like structures from a template and to provide a set of persistent contacts to be employed during re-folding. We have tested our approach on 13 previous CASP targets and observed that this method of folding an ensemble of partially unfolded structures, through the hierarchical addition of contact restraints (that is, first local and then nonlocal interactions), leads to a refolding of the structure along with refinement in most cases (12/13). Although this approach yields refined models through advancement in sampling, the task of blind selection of the best refined models still needs to be solved. Overall, the method can be useful for improved sampling for low resolution models where certain of the portions of the structure are incorrectly modeled.
Collapse
Affiliation(s)
- Avishek Kumar
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona
| | - Paul Campitelli
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona
| | - M F Thorpe
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona.,Rudolf Peierls Center for Theoretical Physics, University of Oxford, Oxford, OX1 3NP, United Kingdom
| | - S Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona
| |
Collapse
|
16
|
Joung I, Lee SY, Cheng Q, Kim JY, Joo K, Lee SJ, Lee J. Template-free modeling by LEE and LEER in CASP11. Proteins 2015; 84 Suppl 1:118-30. [DOI: 10.1002/prot.24944] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Revised: 08/26/2015] [Accepted: 10/11/2015] [Indexed: 12/25/2022]
Affiliation(s)
- InSuk Joung
- Center for In Silico Protein Science, Korea Institute for Advanced Study; Seoul 130-722 Korea
- School of Computational Sciences; Korea Institute for Advanced Study; Seoul 130-722 Korea
| | - Sun Young Lee
- Center for In Silico Protein Science, Korea Institute for Advanced Study; Seoul 130-722 Korea
| | - Qianyi Cheng
- Center for In Silico Protein Science, Korea Institute for Advanced Study; Seoul 130-722 Korea
- School of Computational Sciences; Korea Institute for Advanced Study; Seoul 130-722 Korea
| | - Jong Yun Kim
- Center for In Silico Protein Science, Korea Institute for Advanced Study; Seoul 130-722 Korea
| | - Keehyoung Joo
- Center for In Silico Protein Science, Korea Institute for Advanced Study; Seoul 130-722 Korea
- Center for Advanced Computation, Korea Institute for Advanced Study; Seoul 130-722 Korea
| | - Sung Jong Lee
- Center for In Silico Protein Science, Korea Institute for Advanced Study; Seoul 130-722 Korea
- Department of Physics; University of Suwon; Hwaseong-Si Gyeonggi-Do 445-743 Korea
| | - Jooyoung Lee
- Center for In Silico Protein Science, Korea Institute for Advanced Study; Seoul 130-722 Korea
- School of Computational Sciences; Korea Institute for Advanced Study; Seoul 130-722 Korea
- Center for Advanced Computation, Korea Institute for Advanced Study; Seoul 130-722 Korea
| |
Collapse
|
17
|
Park H, DiMaio F, Baker D. CASP11 refinement experiments with ROSETTA. Proteins 2015; 84 Suppl 1:314-22. [PMID: 26205421 PMCID: PMC4724349 DOI: 10.1002/prot.24862] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Revised: 07/19/2015] [Accepted: 07/21/2015] [Indexed: 12/28/2022]
Abstract
We report new Rosetta-based approaches to tackling the major issues that confound protein structure refinement, and the testing of these approaches in the CASP11 experiment. Automated refinement protocols were developed that integrate a range of sampling methods using parallel computation and multiobjective optimization. In CASP11, we used a more aggressive large-scale structure rebuilding approach for poor starting models, and a less aggressive local rebuilding plus core refinement approach for starting models likely to be closer to the native structure. The more incorrectly modeled a structure was predicted to be, the more it was allowed to vary during refinement. The CASP11 experiment revealed strengths and weaknesses of the approaches: the high-resolution strategy incorporating local rebuilding with core refinement consistently improved starting structures, while the low-resolution strategy incorporating the reconstruction of large parts of the structures improved starting models in some cases but often considerably worsened them, largely because of model selection issues. Overall, the results suggest the high-resolution refinement protocol is a promising method orthogonal to other approaches, while the low-resolution refinement method clearly requires further development. Proteins 2016; 84(Suppl 1):314-322. © 2015 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Hahnbeom Park
- Department of Biochemistry, University of Washington, Seattle, Washington, 98195.,Institute for Protein Design, University of Washington, Seattle, Washington, 98195
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, Washington, 98195.,Institute for Protein Design, University of Washington, Seattle, Washington, 98195
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, Washington, 98195. .,Institute for Protein Design, University of Washington, Seattle, Washington, 98195. .,Howard Hughes Medical Institute, University of Washington, Seattle, Washington, 98195.
| |
Collapse
|
18
|
Tamamis P, Terzaki K, Kassinopoulos M, Mastrogiannis L, Mossou E, Forsyth VT, Mitchell EP, Mitraki A, Archontis G. Self-Assembly of an Aspartate-Rich Sequence from the Adenovirus Fiber Shaft: Insights from Molecular Dynamics Simulations and Experiments. J Phys Chem B 2014; 118:1765-74. [DOI: 10.1021/jp409988n] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Affiliation(s)
- Phanourios Tamamis
- Department
of Physics, University of Cyprus, 75 Kallipoleos Street, CY1678 Nicosia, Cyprus
| | - Konstantina Terzaki
- Department
of Materials Science and Technology, University of Crete, P.O. Box 2208, GR-71003 Heraklion, Crete, Greece
- Institute for Electronic Structure and Laser, FORTH, P.O. Box 1527, GR-71110 Heraklion, Greece
| | - Michalis Kassinopoulos
- Department
of Physics, University of Cyprus, 75 Kallipoleos Street, CY1678 Nicosia, Cyprus
| | - Lefteris Mastrogiannis
- Department
of Materials Science and Technology, University of Crete, P.O. Box 2208, GR-71003 Heraklion, Crete, Greece
- Institute for Electronic Structure and Laser, FORTH, P.O. Box 1527, GR-71110 Heraklion, Greece
| | - Estelle Mossou
- EPSAM/ISTM, Keele University, Keele, Staffordshire ST5
5BG, United Kingdom
- Partnership
for Structural Biology, Institut Laue Langevin, 6 rue Jules Horowitz, 38042 Grenoble Cedex 9, France
| | - V. Trevor Forsyth
- EPSAM/ISTM, Keele University, Keele, Staffordshire ST5
5BG, United Kingdom
- Partnership
for Structural Biology, Institut Laue Langevin, 6 rue Jules Horowitz, 38042 Grenoble Cedex 9, France
| | - Edward P. Mitchell
- Partnership
for Structural Biology, Institut Laue Langevin, 6 rue Jules Horowitz, 38042 Grenoble Cedex 9, France
- European Synchrotron Radiation Facility, 6 rue Jules Horowitz, 38043 Grenoble Cedex 9, France
| | - Anna Mitraki
- Department
of Materials Science and Technology, University of Crete, P.O. Box 2208, GR-71003 Heraklion, Crete, Greece
- Institute for Electronic Structure and Laser, FORTH, P.O. Box 1527, GR-71110 Heraklion, Greece
| | - Georgios Archontis
- Department
of Physics, University of Cyprus, 75 Kallipoleos Street, CY1678 Nicosia, Cyprus
| |
Collapse
|