Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Yang Y, Zhou Y. Specific interactions for ab initio folding of protein terminal regions with secondary structures. Proteins 2008;72:793-803. [PMID: 18260109 DOI: 10.1002/prot.21968] [Citation(s) in RCA: 186] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Cited by Other Article(s)

Rosignoli S, Lustrino E, Di Silverio I, Paiardini A. Making Use of Averaging Methods in MODELLER for Protein Structure Prediction. Int J Mol Sci 2024;25:1731. [PMID: 38339009 PMCID: PMC10855553 DOI: 10.3390/ijms25031731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 01/23/2024] [Accepted: 01/29/2024] [Indexed: 02/12/2024] Open

Abstract

Recent advances in protein structure prediction, driven by AlphaFold 2 and machine learning, demonstrate proficiency in static structures but encounter challenges in capturing essential dynamic features crucial for understanding biological function. In this context, homology-based modeling emerges as a cost-effective and computationally efficient alternative. The MODELLER (version 10.5, accessed on 30 November 2023) algorithm can be harnessed for this purpose since it computes intermediate models during simulated annealing, enabling the exploration of attainable configurational states and energies while minimizing its objective function. There have been a few attempts to date to improve the models generated by its algorithm, and in particular, there is no literature regarding the implementation of an averaging procedure involving the intermediate models in the MODELLER algorithm. In this study, we examined MODELLER's output using 225 target-template pairs, extracting the best representatives of intermediate models. Applying an averaging procedure to the selected intermediate structures based on statistical potentials, we aimed to determine: (1) whether averaging improves the quality of structural models during the building phase; (2) if ranking by statistical potentials reliably selects the best models, leading to improved final model quality; (3) whether using a single template versus multiple templates affects the averaging approach; (4) whether the "ensemble" nature of the MODELLER building phase can be harnessed to capture low-energy conformations in holo structures modeling. Our findings indicate that while improvements typically fall short of a few decimal points in the model evaluation metric, a notable fraction of configurations exhibit slightly higher similarity to the native structure than MODELLER's proposed final model. 
The averaging-building procedure proves particularly beneficial in (1) regions of low sequence identity between the target and template(s), the most challenging aspect of homology modeling; (2) holo protein conformations generation, an area in which MODELLER and related tools usually fall short of the expected performance.
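The averaging idea described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' protocol: the function name `average_top_models`, the assumption that all models are already superimposed with identical atom ordering, and the convention that a lower statistical-potential score is better are all assumptions made for the example.

```python
from typing import List, Sequence, Tuple

Coord = Tuple[float, float, float]


def average_top_models(
    models: Sequence[Sequence[Coord]],
    scores: Sequence[float],
    k: int,
) -> List[Coord]:
    """Average the coordinates of the k best-scoring models.

    Assumes (hypothetically) that models are pre-superimposed, share the
    same atom order, and that a lower score means a better model.
    """
    if len(models) != len(scores):
        raise ValueError("one score per model is required")
    if not 1 <= k <= len(models):
        raise ValueError("k must be between 1 and the number of models")

    # Indices of the k models with the lowest (best) scores.
    best = sorted(range(len(models)), key=lambda i: scores[i])[:k]

    n_atoms = len(models[best[0]])
    if any(len(models[i]) != n_atoms for i in best):
        raise ValueError("selected models must have the same number of atoms")

    # Per-atom arithmetic mean of x, y, z over the selected models.
    averaged = []
    for atom in range(n_atoms):
        averaged.append(
            tuple(sum(models[i][atom][axis] for i in best) / k for axis in range(3))
        )
    return averaged
```

Plain coordinate averaging can produce distorted bond lengths and angles, so an averaged structure would normally need a further refinement step before evaluation.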

Teruel N, Borges VM, Najmanovich R. Surfaces: a software to quantify and visualize interactions within and between proteins and ligands. Bioinformatics 2023;39:btad608. [PMID: 37788107 PMCID: PMC10568369 DOI: 10.1093/bioinformatics/btad608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 08/23/2023] [Accepted: 09/29/2023] [Indexed: 10/05/2023] Open

Ahmadi N, Aghasadeghi M, Hamidi-Fard M, Motevalli F, Bahramali G. Reverse Vaccinology and Immunoinformatic Approach for Designing a Bivalent Vaccine Candidate Against Hepatitis A and Hepatitis B Viruses. Mol Biotechnol 2023:10.1007/s12033-023-00867-z. [PMID: 37715882 DOI: 10.1007/s12033-023-00867-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 08/21/2023] [Indexed: 09/18/2023]

Abstract

Hepatitis A and B are two crucial viral infections that still dramatically affect public health worldwide. Hepatitis A Virus (HAV) is the main cause of acute hepatitis, whereas Hepatitis B Virus (HBV) leads to the chronic form of the disease, possibly cirrhosis or liver failure. Therefore, vaccination has always been considered the most effective preventive method against pathogens. At this moment, we aimed at the immunoinformatic analysis of HAV-Viral Protein 1 (VP1) as the major capsid protein to come up with the most conserved immunogenic truncated protein to be fused by HBV surface antigen (HBs Ag) to achieve a bivalent vaccine against HAV and HBV using an AAY linker. Various computational approaches were employed to predict highly conserved regions and the most immunogenic B-cell and T-cell epitopes of HAV-VP1 capsid protein in both humans and BALB/c. Moreover, the predicted fusion protein was analyzed regarding primary and secondary structures and also homology validation. Afterward, the three-dimensional structure of vaccine constructs docked with various toll-like receptors (TLR) 2, 4 and 7. According to the bioinformatics tools, the region of 99-259 amino acids of VP1 was selected with high immunogenicity and conserved epitopes. T-cell epitope prediction showed that this region contains 32 antigenic peptides for Human leukocyte antigen (HLA) class I and 20 antigenic peptides in terms of HLA class II which are almost fully conserved in the Iranian population. The vaccine design includes 5 linear and 4 conformational B-cell lymphocyte (BCL) epitopes to induce humoral immune responses. The designed VP1-AAY-HBsAg fusion protein has the potency to be constructed and expressed to achieve a bivalent vaccine candidate, especially in the Iranian population. 
These findings led us to claim that the designed vaccine candidate provides potential pathways for creating an exploratory vaccine against Hepatitis A and Hepatitis B Viruses with high confidence for the identified strains.

Jung Y, Geng C, Bonvin AMJJ, Xue LC, Honavar VG. MetaScore: A Novel Machine-Learning-Based Approach to Improve Traditional Scoring Functions for Scoring Protein-Protein Docking Conformations. Biomolecules 2023;13:121. [PMID: 36671507 PMCID: PMC9855734 DOI: 10.3390/biom13010121] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 12/22/2022] [Accepted: 12/26/2022] [Indexed: 01/11/2023] Open

Abstract

Protein-protein interactions play a ubiquitous role in biological function. Knowledge of the three-dimensional (3D) structures of the complexes they form is essential for understanding the structural basis of those interactions and how they orchestrate key cellular processes. Computational docking has become an indispensable alternative to the expensive and time-consuming experimental approaches for determining the 3D structures of protein complexes. Despite recent progress, identifying near-native models from a large set of conformations sampled by docking (the so-called scoring problem) still has considerable room for improvement. We present MetaScore, a new machine-learning-based approach to improve the scoring of docked conformations. MetaScore utilizes a random forest (RF) classifier trained to distinguish near-native from non-native conformations using their protein-protein interfacial features. The features include physicochemical properties, energy terms, interaction-propensity-based features, geometric properties, interface topology features, evolutionary conservation, and also scores produced by traditional scoring functions (SFs). MetaScore scores docked conformations by simply averaging the score produced by the RF classifier with that produced by any traditional SF. We demonstrate that (i) MetaScore consistently outperforms each of the nine traditional SFs included in this work in terms of success rate and hit rate evaluated over conformations ranked among the top 10; (ii) an ensemble method, MetaScore-Ensemble, that combines 10 variants of MetaScore obtained by combining the RF score with each of the traditional SFs outperforms each of the MetaScore variants. We conclude that the performance of traditional SFs can be improved upon by using machine learning to judiciously leverage protein-protein interfacial features and by using ensemble methods to combine multiple scoring functions.
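The score-averaging step described in this abstract might look roughly like the sketch below. It is illustrative only: the abstract does not say how the two scores are put on a common scale, so the min-max normalization, the `lower_is_better` flag, and the function names are assumptions introduced here, not details of MetaScore itself.

```python
from typing import List, Sequence


def min_max(values: Sequence[float]) -> List[float]:
    """Rescale values to [0, 1]; a constant input maps to 0.5 (assumed convention)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.5] * len(values)
    return [(v - lo) / (hi - lo) for v in values]


def combined_scores(
    rf_probs: Sequence[float],
    sf_scores: Sequence[float],
    lower_is_better: bool = True,
) -> List[float]:
    """Average a classifier probability with a normalized traditional score.

    rf_probs: per-conformation probability of being near-native, in [0, 1].
    sf_scores: raw scores from a traditional scoring function.
    """
    if len(rf_probs) != len(sf_scores):
        raise ValueError("both score lists must describe the same conformations")
    sf_norm = min_max(sf_scores)
    if lower_is_better:
        # Flip so that 1.0 always means "best" before averaging.
        sf_norm = [1.0 - s for s in sf_norm]
    return [(p + s) / 2.0 for p, s in zip(rf_probs, sf_norm)]


def rank_conformations(scores: Sequence[float]) -> List[int]:
    """Return conformation indices ordered from best to worst combined score."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

Calling `rank_conformations(combined_scores(rf_probs, sf_scores))` then yields the order in which the top-ranked conformations would be inspected.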

An X, Zhang W, Rong C, Liu S. Understanding Ramachandran plot for dipeptide: A density functional theory and information‐theoretic approach study. J CHIN CHEM SOC-TAIP 2022. [DOI: 10.1002/jccs.202200444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Mufassirin MMM, Newton MAH, Sattar A. Artificial intelligence for template-free protein structure prediction: a comprehensive review. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10350-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Ochoa R, Lunardelli VAS, Rosa DS, Laio A, Cossio P. Multiple-Allele MHC Class II Epitope Engineering by a Molecular Dynamics-Based Evolution Protocol. Front Immunol 2022;13:862851. [PMID: 35572587 PMCID: PMC9094701 DOI: 10.3389/fimmu.2022.862851] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Accepted: 03/28/2022] [Indexed: 11/13/2022] Open

Akhter N, Kabir KL, Chennupati G, Vangara R, Alexandrov BS, Djidjev H, Shehu A. Improved Protein Decoy Selection via Non-Negative Matrix Factorization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022;19:1670-1682. [PMID: 33400654 DOI: 10.1109/tcbb.2020.3049088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Wang WF, Xie XY, Huang Y, Li YK, Liu H, Chen XL, Wang HL. Identification of a Novel Antimicrobial Peptide From the Ancient Marine Arthropod Chinese Horseshoe Crab, Tachypleus tridentatus. Front Immunol 2022;13:794779. [PMID: 35401525 PMCID: PMC8984021 DOI: 10.3389/fimmu.2022.794779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 02/24/2022] [Indexed: 12/02/2022] Open

Radusky LG, Serrano L. pyFoldX: enabling biomolecular analysis and engineering along structural ensembles. Bioinformatics 2022;38:2353-2355. [PMID: 35176149 PMCID: PMC9004634 DOI: 10.1093/bioinformatics/btac072] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 12/19/2021] [Accepted: 02/09/2022] [Indexed: 02/03/2023] Open

Yamamori Y, Tomii K. Application of Homology Modeling by Enhanced Profile-Profile Alignment and Flexible-Fitting Simulation to Cryo-EM Based Structure Determination. Int J Mol Sci 2022;23:ijms23041977. [PMID: 35216093 PMCID: PMC8879198 DOI: 10.3390/ijms23041977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 02/07/2022] [Accepted: 02/09/2022] [Indexed: 12/03/2022] Open

Ochoa R, Soler MA, Gladich I, Battisti A, Minovski N, Rodriguez A, Fortuna S, Cossio P, Laio A. Computational Evolution Protocol for Peptide Design. Methods Mol Biol 2022;2405:335-359. [PMID: 35298821 DOI: 10.1007/978-1-0716-1855-4_16] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Protein-Protein Docking: Past, Present, and Future. Protein J 2021;41:1-26. [PMID: 34787783 DOI: 10.1007/s10930-021-10031-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/01/2021] [Indexed: 10/19/2022]

Redesigning an antibody H3 loop by virtual screening of a small library of human germline-derived sequences. Sci Rep 2021;11:21362. [PMID: 34725391 PMCID: PMC8560851 DOI: 10.1038/s41598-021-00669-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 10/05/2021] [Indexed: 01/01/2023] Open

Abstract

The design of superior biologic therapeutics, including antibodies and engineered proteins, involves optimizing their specific ability to bind to disease-related molecular targets. Previously, we developed and applied the Assisted Design of Antibody and Protein Therapeutics (ADAPT) platform for virtual affinity maturation of antibodies (Vivcharuk et al. in PLoS One 12(7):e0181490, 10.1371/journal.pone.0181490, 2017). However, ADAPT is limited to point mutations of hot-spot residues in existing CDR loops. In this study, we explore the possibility of wholesale replacement of the entire H3 loop with no restriction to maintain the parental loop length. This complements other currently published studies that sample replacements for the CDR loops L1, L2, L3, H1 and H2. Given the immense sequence space theoretically available to H3, we focused on the virtual grafting of over 5000 human germline-derived H3 sequences from the IGMT/LIGM database increasing the diversity of the sequence space when compared to using crystalized H3 loop sequences. H3 loop conformations are generated and scored to identify optimized H3 sequences. Experimental testing of high-ranking H3 sequences grafted into the framework of the bH1 antibody against human VEGF-A led to the discovery of multiple hits, some of which had similar or better affinities relative to the parental antibody. In over 75% of the tested designs, the re-designed H3 loop contributed favorably to overall binding affinity. The hits also demonstrated good developability attributes such as high thermal stability and no aggregation. Crystal structures of select re-designed H3 variants were solved and indicated that although some deviations from predicted structures were seen in the more solvent accessible regions of the H3 loop, they did not significantly affect predicted affinity scores.

Jeon S, Blazyte A, Yoon C, Ryu H, Jeon Y, Bhak Y, Bolser D, Manica A, Shin ES, Cho YS, Kim BC, Ryoo N, Choi H, Bhak J. Regional TMPRSS2 V197M Allele Frequencies Are Correlated with COVID-19 Case Fatality Rates. Mol Cells 2021;44:680-687. [PMID: 34588322 PMCID: PMC8490206 DOI: 10.14348/molcells.2021.2249] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 06/14/2021] [Accepted: 07/10/2021] [Indexed: 02/08/2023] Open

Affiliation(s)

Sungwon Jeon Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
Asta Blazyte Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
Changhan Yoon Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
Hyojung Ryu Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
Yeonsu Jeon Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
Youngjune Bhak Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
Dan Bolser Geromics, Ltd., Cambridge CB1 3NF, UK
Andrea Manica Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK
Eun-Seok Shin Division of Cardiology, Department of Internal Medicine, Ulsan Medical Center, Ulsan 44686, Korea Personal Genomics Institute (PGI), Genome Research Foundation (GRF), Cheongju 28160, Korea
Yun Sung Cho Clinomics, Inc., Ulsan 44919, Korea
Byung Chul Kim Clinomics, Inc., Ulsan 44919, Korea
Namhee Ryoo Department of Laboratory Medicine, Keimyung University School of Medicine, Daegu 42601, Korea
Hansol Choi Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
Jong Bhak Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea Geromics, Ltd., Cambridge CB1 3NF, UK Personal Genomics Institute (PGI), Genome Research Foundation (GRF), Cheongju 28160, Korea Clinomics, Inc., Ulsan 44919, Korea

Liu X, Luo Y, Li P, Song S, Peng J. Deep geometric representations for modeling effects of mutations on protein-protein binding affinity. PLoS Comput Biol 2021;17:e1009284. [PMID: 34347784 PMCID: PMC8366979 DOI: 10.1371/journal.pcbi.1009284] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 08/16/2021] [Accepted: 07/17/2021] [Indexed: 11/19/2022] Open

Abstract

Modeling the impact of amino acid mutations on protein-protein interaction plays a crucial role in protein engineering and drug design. In this study, we develop GeoPPI, a novel structure-based deep-learning framework to predict the change of binding affinity upon mutations. Based on the three-dimensional structure of a protein, GeoPPI first learns a geometric representation that encodes topology features of the protein structure via a self-supervised learning scheme. These representations are then used as features for training gradient-boosting trees to predict the changes of protein-protein binding affinity upon mutations. We find that GeoPPI is able to learn meaningful features that characterize interactions between atoms in protein structures. In addition, through extensive experiments, we show that GeoPPI achieves new state-of-the-art performance in predicting the binding affinity changes upon both single- and multi-point mutations on six benchmark datasets. Moreover, we show that GeoPPI can accurately estimate the difference of binding affinities between a few recently identified SARS-CoV-2 antibodies and the receptor-binding domain (RBD) of the S protein. These results demonstrate the potential of GeoPPI as a powerful and useful computational tool in protein design and engineering. Our code and datasets are available at: https://github.com/Liuxg16/GeoPPI.

Estimating the binding affinities of protein-protein interactions (PPIs) is crucial to understand protein function and design new functional proteins. Since the experimental measurement in wet-labs is labor-intensive and time-consuming, fast and accurate in silico approaches have received much attention. Although considerable efforts have been made in this direction, predicting the effects of mutations on the protein-protein binding affinity is still a challenging research problem. In this work, we introduce GeoPPI, a novel computational approach that uses deep geometric representations of protein complexes to predict the effects of mutations on the binding affinity. The geometric representations are first learned via a self-supervised learning scheme and then integrated with gradient-boosting trees to accomplish the prediction. We find that the learned representations encode meaningful patterns underlying the interactions between atoms in protein structures. Also, extensive tests on major benchmark datasets show that GeoPPI has made an important improvement over the existing methods in predicting the effects of mutations on the binding affinity.

Pearce R, Zhang Y. Toward the solution of the protein structure prediction problem. J Biol Chem 2021;297:100870. [PMID: 34119522 PMCID: PMC8254035 DOI: 10.1016/j.jbc.2021.100870] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 06/07/2021] [Accepted: 06/09/2021] [Indexed: 11/20/2022] Open

Bordbar A, Amanlou M, Pooshang Bagheri K, Ready PD, Ebrahimi S, Shahbaz Mohammadi H, Ghafari SM, Parvizi P. Cloning, high-level gene expression and bioinformatics analysis of SP15 and LeIF from Leishmania major and Iranian Phlebotomus papatasi saliva as single and novel fusion proteins: a potential vaccine candidate against leishmaniasis. Trans R Soc Trop Med Hyg 2021;115:699-713. [PMID: 33155034 DOI: 10.1093/trstmh/traa119] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Revised: 08/09/2020] [Accepted: 10/16/2020] [Indexed: 11/13/2022] Open

Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14. Sci Rep 2021;11:10943. [PMID: 34035363 PMCID: PMC8149836 DOI: 10.1038/s41598-021-90303-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 05/10/2021] [Indexed: 11/28/2022] Open

Heo L, Park S, Seok C. GalaxyWater-wKGB: Prediction of Water Positions on Protein Structure Using wKGB Statistical Potential. J Chem Inf Model 2021;61:2283-2293. [PMID: 33938216 DOI: 10.1021/acs.jcim.0c01434] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Abstract

Proteins fold and function in water, and protein-water interactions play important roles in protein structure and function. In computational studies on protein structure and interaction, the effect of water is considered either implicitly or explicitly. Implicit water models are frequently used in protein structure prediction and docking because they are computationally much more efficient than explicit water models, which are often employed in molecular dynamics (MD) simulations. However, implicit water models that treat water as a continuous solvent medium cannot account for specific atomistic protein-water interactions that are critical for structure formation and interactions with other molecules. Various methods for predicting water molecules that form specific atomistic interactions with proteins have been developed. Methods involving MD simulations or the integral equation theory tend to produce more accurate results at a higher computational cost than simple geometry- or energy-based methods. Here, we present a novel method for predicting water positions on a protein surface called GalaxyWater-wKGB, which is based on a statistical potential, a water knowledge-based potential based on the generalized Born model (wKGB). This method is accurate and rapid because it does not require conformational sampling or iterative computation owing to the effective statistical treatment employed to derive the potential. The statistical potential describes specific protein atom-water interactions more accurately than conventional potentials by considering the dependence on the degree of solvent accessibility of protein atoms as well as on protein atom-water distances and orientations. The introduction of solvent accessibility allows effective consideration of competing nonspecific protein-water and intraprotein interactions. 
When tested on high-resolution protein crystal structures, this method could recover similar or larger fractions of crystallographic water 180 times faster than the sophisticated integral equation theory, 3D-RISM. A web service of this water prediction method is freely available at http://galaxy.seoklab.org/wkgb.

Postic G, Janel N, Moroy G. Representations of protein structure for exploring the conformational space: A speed-accuracy trade-off. Comput Struct Biotechnol J 2021;19:2618-2625. [PMID: 34025948 PMCID: PMC8120936 DOI: 10.1016/j.csbj.2021.04.049] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 04/19/2021] [Accepted: 04/20/2021] [Indexed: 11/25/2022] Open

Abstract

• We compare ten structural representations, either atomistic or coarse-grained.
• Thus, ten distance-dependent statistical potentials of mean force (PMF) were built.
• The Cβ-only and Cα + Cβ representations provide the best speed–accuracy trade-off.
• Including glycines through Cα, in a Cβ-only representation, yields a higher accuracy.
• We generalize the conclusions to the total information gain (TIG) scoring function.

The recent breakthrough in the field of protein structure prediction shows the relevance of using knowledge-based scoring functions in combination with a low-resolution 3D representation of protein macromolecules. The choice of not using all atoms is barely supported by any data in the literature, and is mostly motivated by empirical and practical reasons, such as the computational cost of assessing the numerous folds of the protein conformational space. Here, we present a comprehensive study, carried out on a large and balanced benchmark of predicted protein structures, to see how different types of structural representations rank in either accuracy or calculation speed, and which ones offer the best compromise between these two criteria. We tested ten representations, including low-resolution, high-resolution, and coarse-grained approaches. We also investigated the generalization of the findings to formalisms other than the widely-used “potential of mean force” (PMF) method. Thus, we observed that representing protein structures by their β carbons, combined or not with Cα, provides the best speed–accuracy trade-off, when using a “total information gain” scoring function. For statistical PMFs, using MARTINI backbone and side-chain beads is the best option. Finally, we also demonstrated the necessity of training the reference state on all atom types, and of including the Cα atoms of glycine residues, in a Cβ-based representation.
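For readers unfamiliar with distance-dependent statistical potentials of mean force, the usual inverse-Boltzmann construction can be sketched as below. This is a generic textbook form, not the authors' implementation: the function name, the pseudocount smoothing, and the use of pooled counts as the reference state are assumptions made for illustration.

```python
import math
from typing import List, Sequence


def pmf_from_counts(
    pair_counts: Sequence[float],
    reference_counts: Sequence[float],
    kT: float = 1.0,
    pseudocount: float = 1.0,
) -> List[float]:
    """Inverse-Boltzmann energies per distance bin: E(r) = -kT * ln(f_obs(r) / f_ref(r)).

    pair_counts: observed counts per distance bin for one atom-type pair.
    reference_counts: counts per bin for the reference state (assumed here to
        be pooled over all atom types).
    pseudocount: hypothetical smoothing term that avoids log(0) in empty bins.
    """
    if len(pair_counts) != len(reference_counts):
        raise ValueError("both histograms must use the same distance bins")
    n_bins = len(pair_counts)
    obs_total = sum(pair_counts) + pseudocount * n_bins
    ref_total = sum(reference_counts) + pseudocount * n_bins

    energies = []
    for obs, ref in zip(pair_counts, reference_counts):
        f_obs = (obs + pseudocount) / obs_total  # observed frequency in this bin
        f_ref = (ref + pseudocount) / ref_total  # reference frequency in this bin
        energies.append(-kT * math.log(f_obs / f_ref))
    return energies
```

Bins where a pair is observed more often than the reference state receive negative (favorable) energies, and under-represented bins receive positive ones.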

Ochoa R, Laskowski RA, Thornton JM, Cossio P. Impact of Structural Observables From Simulations to Predict the Effect of Single-Point Mutations in MHC Class II Peptide Binders. Front Mol Biosci 2021;8:636562. [PMID: 34222328 PMCID: PMC8253603 DOI: 10.3389/fmolb.2021.636562] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Accepted: 02/15/2021] [Indexed: 11/23/2022] Open

Dixit H, Kumar C S, Chaudhary R, Thaker D, Gadewal N, Dasgupta D. Role of Phosphorylation and Hyperphosphorylation of Tau in Its Interaction with βα Dimeric Tubulin Studied from a Bioinformatics Perspective. Avicenna J Med Biotechnol 2021;13:24-34. [PMID: 33680370 PMCID: PMC7903436 DOI: 10.18502/ajmb.v13i1.4579] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

Sadat SM, Aghadadeghi MR, Yousefi M, Khodaei A, Sadat Larijani M, Bahramali G. Bioinformatics Analysis of SARS-CoV-2 to Approach an Effective Vaccine Candidate Against COVID-19. Mol Biotechnol 2021;63:389-409. [PMID: 33625681 PMCID: PMC7902242 DOI: 10.1007/s12033-021-00303-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/21/2021] [Indexed: 02/07/2023]

Guest JD, Vreven T, Zhou J, Moal I, Jeliazkov JR, Gray JJ, Weng Z, Pierce BG. An expanded benchmark for antibody-antigen docking and affinity prediction reveals insights into antibody recognition determinants. Structure 2021;29:606-621.e5. [PMID: 33539768 DOI: 10.1016/j.str.2021.01.005] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Revised: 11/15/2020] [Accepted: 01/11/2021] [Indexed: 01/04/2023]

Hernandez R, Facelli JC. Understanding protein structural changes for oncogenic missense variants. Heliyon 2021;7:e06013. [PMID: 33553733 PMCID: PMC7846930 DOI: 10.1016/j.heliyon.2021.e06013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2020] [Revised: 08/20/2020] [Accepted: 01/15/2021] [Indexed: 12/31/2022] Open

Akhter N, Chennupati G, Djidjev H, Shehu A. Decoy selection for protein structure prediction via extreme gradient boosting and ranking. BMC Bioinformatics 2020;21:189. [PMID: 33297949 PMCID: PMC7724862 DOI: 10.1186/s12859-020-3523-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 04/29/2020] [Indexed: 11/10/2022] Open

Bhattacharya S, Sah PP, Banerjee A, Ray S. Structural impact due to PPQEE deletion in multiple cancer associated protein - Integrin αV: An In silico exploration. Biosystems 2020;198:104216. [DOI: 10.1016/j.biosystems.2020.104216] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 07/22/2020] [Accepted: 07/27/2020] [Indexed: 12/12/2022]

Amalgamation of 3D structure and sequence information for protein-protein interaction prediction. Sci Rep 2020;10:19171. [PMID: 33154416 PMCID: PMC7645622 DOI: 10.1038/s41598-020-75467-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Accepted: 09/17/2020] [Indexed: 11/08/2022] Open

Abstract

Protein is the primary building block of living organisms. It interacts with other proteins and is then involved in various biological processes. Protein-protein interactions (PPIs) help in predicting and hence help in understanding the functionality of the proteins, causes and growth of diseases, and designing new drugs. However, there is a vast gap between the available protein sequences and the identification of protein-protein interactions. To bridge this gap, researchers proposed several computational methods to reveal the interactions between proteins. These methods merely depend on sequence-based information of proteins. With the advancement of technology, different types of information related to proteins are available such as 3D structure information. Nowadays, deep learning techniques are adopted successfully in various domains, including bioinformatics. So, current work focuses on the utilization of different modalities, such as 3D structures and sequence-based information of proteins, and deep learning algorithms to predict PPIs. The proposed approach is divided into several phases. We first get several illustrations of proteins using their 3D coordinates information, and three attributes, such as hydropathy index, isoelectric point, and charge of amino acids. Amino acids are the building blocks of proteins. A pre-trained ResNet50 model, a subclass of a convolutional neural network, is utilized to extract features from these representations of proteins. Autocovariance and conjoint triad are two widely used sequence-based methods to encode proteins, which are used here as another modality of protein sequences. A stacked autoencoder is utilized to get the compact form of sequence-based information. Finally, the features obtained from different modalities are concatenated in pairs and fed into the classifier to predict labels for protein pairs. 
We have experimented on the human PPIs dataset and Saccharomyces cerevisiae PPIs dataset and compared our results with the state-of-the-art deep-learning-based classifiers. The results achieved by the proposed method are superior to those obtained by the existing methods. Extensive experimentations on different datasets indicate that our approach to learning and combining features from two different modalities is useful in PPI prediction.
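
The conjoint triad encoding mentioned in this abstract can be sketched as follows. This is only an illustrative sketch, not the authors' code: the seven residue classes are the grouping commonly used for this encoding and are an assumption here (the abstract does not list them), and the function name `conjoint_triad` is hypothetical. Each protein becomes a 343-dimensional vector of triad frequencies, and a protein pair is represented by concatenating the two vectors.

```python
from itertools import product

# Assumed residue grouping (seven classes); the abstract does not specify it.
CLASSES = ["AGV", "ILFP", "YMTS", "HNQW", "RK", "DE", "C"]
RESIDUE_TO_CLASS = {aa: i for i, group in enumerate(CLASSES) for aa in group}

# Map every ordered triple of classes to a position in the feature vector.
TRIAD_INDEX = {t: k for k, t in enumerate(product(range(7), repeat=3))}


def conjoint_triad(sequence):
    """Return a 343-dimensional vector of class-triad frequencies."""
    # Residues outside the 20 standard amino acids are dropped (a simplification).
    classes = [RESIDUE_TO_CLASS[aa] for aa in sequence.upper() if aa in RESIDUE_TO_CLASS]
    counts = [0] * len(TRIAD_INDEX)
    for i in range(len(classes) - 2):
        counts[TRIAD_INDEX[tuple(classes[i:i + 3])]] += 1
    total = sum(counts)
    return [c / total if total else 0.0 for c in counts]


def pair_features(seq_a, seq_b):
    """Concatenate the encodings of two proteins into one pair vector."""
    return conjoint_triad(seq_a) + conjoint_triad(seq_b)
```

Dividing by the total number of triads is one simple normalization; other schemes (for example, scaling by the largest count) would fit the same outline.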

Chen X, Song S, Ji J, Tang Z, Todo Y. Incorporating a multiobjective knowledge-based energy function into differential evolution for protein structure prediction. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.06.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Chang RL, Stanley JA, Robinson MC, Sher JW, Li Z, Chan YA, Omdahl AR, Wattiez R, Godzik A, Matallana-Surget S. Protein structure, amino acid composition and sequence determine proteome vulnerability to oxidation-induced damage. EMBO J 2020;39:e104523. [PMID: 33073387 PMCID: PMC7705453 DOI: 10.15252/embj.2020104523] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 09/16/2020] [Accepted: 09/22/2020] [Indexed: 02/05/2023] Open

Kulandaisamy A, Zaucha J, Frishman D, Gromiha MM. MPTherm-pred: Analysis and Prediction of Thermal Stability Changes upon Mutations in Transmembrane Proteins. J Mol Biol 2020;433:166646. [PMID: 32920050 DOI: 10.1016/j.jmb.2020.09.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Revised: 09/04/2020] [Accepted: 09/04/2020] [Indexed: 01/06/2023]

Abstract

The stability of membrane proteins differs from globular proteins due to the presence of nonpolar membrane-spanning regions. Using a dataset of 929 membrane protein mutations whose effects on thermal stability (ΔT_m) were experimentally determined, we found that the average ΔT_m due to 190 stabilizing and 232 destabilizing mutations occurring in membrane-spanning regions are 2.43(3.1) °C and -5.48(5.5) °C, respectively. The ΔT_m values for mutations occurring in solvent-exposed regions are 2.56(2.82) °C and -6.8(7.2) °C. We have systematically analyzed the factors influencing the stability of mutants and observed that changes in hydrophobicity, number of contacts between Cα atoms and frequency of aliphatic residues are important determinants of the stability change induced by mutations occurring in membrane-spanning regions. We have developed structure- and sequence-based machine learning predictors of ΔT_m due to mutations specifically for membrane proteins. They showed a correlation and mean absolute error (MAE) of 0.72 and 2.85 °C, respectively, between experimental and predicted ΔT_m for mutations in membrane-spanning regions on 10-fold group-wise cross-validation. The average correlation and MAE for mutations in aqueous regions are 0.73 and 3.7 °C, respectively. These MAE values are about 50% lower than standard deviations from the mean ΔT_m values. The reliability of the method was affirmed on a test set of mutations occurring in evolutionary independent protein sequences. The developed MPTherm-pred server for predicting thermal stability changes upon mutations in membrane proteins is available at https://web.iitm.ac.in/bioinfo2/mpthermpred/. Our results provide insights into factors influencing the stability of membrane proteins and can aid in designing mutants that are more resistant to thermal stress.
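
The two evaluation metrics quoted in this abstract, correlation and mean absolute error (MAE), can be computed as in the sketch below. It assumes that "correlation" means the Pearson coefficient, which the abstract does not state, and the function names are hypothetical.

```python
import math


def mean_absolute_error(observed, predicted):
    """Average absolute difference between paired values (same units as the input)."""
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def pearson_correlation(observed, predicted):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)
```

For experimental and predicted ΔT_m values held in two lists of equal length, `mean_absolute_error` returns the MAE in °C and `pearson_correlation` returns a value between -1 and 1.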

Postic G, Janel N, Tufféry P, Moroy G. An information gain-based approach for evaluating protein structure models. Comput Struct Biotechnol J 2020;18:2228-2236. [PMID: 32837711 PMCID: PMC7431362 DOI: 10.1016/j.csbj.2020.08.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2020] [Revised: 08/06/2020] [Accepted: 08/07/2020] [Indexed: 12/23/2022] Open

Pei J, Song LF, Merz KM. Pair Potentials as Machine Learning Features. J Chem Theory Comput 2020;16:5385-5400. [PMID: 32559380 DOI: 10.1021/acs.jctc.9b01246] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Abstract

Atom pairwise potential functions make up an essential part of many scoring functions for protein decoy detection. With the development of machine learning (ML) tools, there are multiple ways to combine potential functions to create novel ML models and methods. Potential function parameters can be easily extracted; however, it is usually hard to directly obtain the calculated atom pairwise energies from scoring functions. Amber, as one of the most popular suites of modeling programs, has an extensive history and library of force field potential functions. In this work, we directly used the force field parameters in ff94 and ff14SB from Amber and encoded them to calculate atom pairwise energies for different interactions. Two sets of structures (single amino acid set and a dipeptide set) were used to evaluate the performance of our encoded Amber potentials. From the comparison results between energy terms obtained from our encoding and Amber, we find energy difference within ±0.06 kcal/mol for all tested structures. Previously we have shown that the Random Forest (RF) model can help to emphasize more important atom pairwise interactions and ignore insignificant ones [Pei, J.; Zheng, Z.; Merz, K. M. J. Chem. Inf. Model. 2019, 59, 1919-1929]. Here, as an example of combining ML methods with traditional potential functions, we followed the same work flow to combine the RF models with force field potential functions from Amber. To determine the performance of our RF models with force field potential functions, 224 different protein native-decoy systems were used as our training and testing sets. We find that the RF models with ff94 and ff14SB force field parameters outperformed all other scoring functions (RF models with KECSA2, RWplus, DFIRE, dDFIRE, and GOAP) considered in this work for native structure detection, and they performed similarly in detecting the best decoy. 
Through inclusion of best decoy to decoy comparisons in building our RF models, we were able to generate models that outperformed the score functions tested herein both on accuracy and best decoy detection, again showing the performance and flexibility of our RF models to tackle this problem. Finally, the importance of the RF algorithm and force field parameters were also tested and the comparison results suggest that both the RF algorithm and force field potentials are important with the ML scoring function achieving its best performance only by combining them together. All code and data used in this work are available at https://github.com/JunPei000/FFENCODER_for_Protein_Folding_Pose_Selection.

Tanemura KA, Pei J, Merz KM. Refinement of pairwise potentials via logistic regression to score protein-protein interactions. Proteins 2020;88:1559-1568. [PMID: 32729132 DOI: 10.1002/prot.25973] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Revised: 05/17/2020] [Accepted: 06/14/2020] [Indexed: 12/20/2022]

Bi J, Chen S, Zhao X, Nie Y, Xu Y. Computation-aided engineering of starch-debranching pullulanase from Bacillus thermoleovorans for enhanced thermostability. Appl Microbiol Biotechnol 2020;104:7551-7562. [PMID: 32632476 DOI: 10.1007/s00253-020-10764-z] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 06/17/2020] [Accepted: 06/30/2020] [Indexed: 12/26/2022]

Prediction of Protein Tertiary Structure via Regularized Template Classification Techniques. Molecules 2020;25:molecules25112467. [PMID: 32466409 PMCID: PMC7321371 DOI: 10.3390/molecules25112467] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 05/21/2020] [Accepted: 05/22/2020] [Indexed: 11/24/2022] Open

Chen J, Siu SWI. Machine Learning Approaches for Quality Assessment of Protein Structures. Biomolecules 2020;10:biom10040626. [PMID: 32316682 PMCID: PMC7226485 DOI: 10.3390/biom10040626] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 04/07/2020] [Accepted: 04/09/2020] [Indexed: 11/16/2022] Open

Bordbar A, Bagheri KP, Ebrahimi S, Parvizi P. Bioinformatics analyses of immunogenic T-cell epitopes of LeIF and PpSP15 proteins from Leishmania major and sand fly saliva used as model antigens for the design of a multi-epitope vaccine to control leishmaniasis. Infect Genet Evol 2020;80:104189. [PMID: 31931259 DOI: 10.1016/j.meegid.2020.104189] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 01/05/2020] [Accepted: 01/08/2020] [Indexed: 11/17/2022]

Abstract

Leishmaniasis is caused by protozoan parasites belonging to 20 Leishmania species. This infectious disease is transmitted by bites of infected phlebotomine sandflies, and is widespread in 97 countries throughout the world. No preventive or effective vaccine has been developed yet. In this study, diverse computational methods were integrated to calculate evolutionary divergence, immunogenicity, IFN-γ production, epitope conservancy, and population coverage of protein fusion models of LeIF-SP15 namely SaLeish. Immunogenicity of LeIF of Leishmania species and SP15 of sandfly saliva has not been investigated in-silico in fusion form. A complete set of 9-mer MHC class I and 15-mer MHC class II peptides were identified with a high affinity for the antigenic epitopes of SaLeish inducing specific responses of CD8⁺ and CD4⁺ T cells from BALB/c and human. Our preferred approach was determining truncated fragment of SaLeish rather than a whole length bearing the capacity to trigger specific immune response. Phylogenetic analysis showed that LeIF protein is under balancing selection and is conserved between different Leishmania species. Selected SaLeish model contained 19 and 35 antigenic peptides for MHC class I and II, respectively, with strong binding affinity to both highly frequent HLA-I and HLA-II alleles. Analysis of class I CTL epitopes showed that promiscuous peptides of KSLKADIRK, MSCIPHCKY, LQAGVIVAV, and YQYYGFVAM have greater affinity to interact with HLA-A*01:01, HLA-A*02 (03, 06), HLA-A*30:02, HLA-B*40:01, and HLA-B*52:01 molecules. Population coverage with a range of 78-85% confirmed SaLeish-Model4 as an appropriate vaccine candidate among Persian, South Asia, Europe, and North America population. Also, predicted antigenic epitopes of AKPEIRTFSNVLIKY, TRVQDDLRKLQAGVI, and VALFSATMPEEVLEL corresponding to MHC class II were found to provide strong ability to produce IFNγ toward TH(1)-biased-DTH responses. 
Findings of the current investigation warrant the future experimental assessment of promising SaLeish prophylaxis vaccine that is capable to enhance both innate and specific cellular immune responses.
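
The peptide sets described in this abstract (9-mers for MHC class I, 15-mers for MHC class II) are contiguous windows of the antigen sequence. A minimal sketch of enumerating such windows is shown below. The function name is hypothetical, and the binding-affinity prediction that would rank these candidates is not shown.

```python
def candidate_peptides(sequence, length):
    """Return every contiguous window of the given length, in sequence order."""
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Toy input built around one epitope named in the abstract; the flanking
# residues are made up for illustration.
toy_sequence = "MAKSLKADIRKGG"
class_i_candidates = candidate_peptides(toy_sequence, 9)
class_ii_candidates = candidate_peptides(toy_sequence, 15)  # empty: sequence is too short
```

A sequence of length n yields n - k + 1 candidates of length k, and none when it is shorter than k.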

Residual Participation and Thermodynamic Stability Due to Molecular Interactions in IL11, IL11Rα and Gp130 from Homo sapiens: An In Silico Outlook for IL11 as a Therapeutic Remedy. Int J Pept Res Ther 2019. [DOI: 10.1007/s10989-019-09996-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Cai Y, Li X, Sun Z, Lu Y, Zhao H, Hanson J, Paliwal K, Litfin T, Zhou Y, Yang Y. SPOT-Fold: Fragment-Free Protein Structure Prediction Guided by Predicted Backbone Structure and Contact Map. J Comput Chem 2019;41:745-750. [PMID: 31845383 DOI: 10.1002/jcc.26132] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 10/07/2019] [Accepted: 12/01/2019] [Indexed: 02/01/2023]

Zhang T, Hu G, Yang Y, Wang J, Zhou Y. All-Atom Knowledge-Based Potential for RNA Structure Discrimination Based on the Distance-Scaled Finite Ideal-Gas Reference State. J Comput Biol 2019;27:856-867. [PMID: 31638408 DOI: 10.1089/cmb.2019.0251] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Akhter N, Chennupati G, Kabir KL, Djidjev H, Shehu A. Unsupervised and Supervised Learning over the Energy Landscape for Protein Decoy Selection. Biomolecules 2019;9:E607. [PMID: 31615116 PMCID: PMC6843838 DOI: 10.3390/biom9100607] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Revised: 10/03/2019] [Accepted: 10/04/2019] [Indexed: 11/17/2022] Open

Abstract

The energy landscape that organizes microstates of a molecular system and governs the underlying molecular dynamics exposes the relationship between molecular form/structure, changes to form, and biological activity or function in the cell. However, several challenges stand in the way of leveraging energy landscapes for relating structure and structural dynamics to function. Energy landscapes are high-dimensional, multi-modal, and often overly-rugged. Deep wells or basins in them do not always correspond to stable structural states but are instead the result of inherent inaccuracies in semi-empirical molecular energy functions. Due to these challenges, energetics is typically ignored in computational approaches addressing long-standing central questions in computational biology, such as protein decoy selection. In the latter, the goal is to determine over a possibly large number of computationally-generated three-dimensional structures of a protein those structures that are biologically-active/native. In recent work, we have recast our attention on the protein energy landscape and its role in helping us to advance decoy selection. Here, we summarize some of our successes so far in this direction via unsupervised learning. More importantly, we further advance the argument that the energy landscape holds valuable information to aid and advance the state of protein decoy selection via novel machine learning methodologies that leverage supervised learning. Our focus in this article is on decoy selection for the purpose of a rigorous, quantitative evaluation of how leveraging protein energy landscapes advances an important problem in protein modeling. However, the ideas and concepts presented here are generally useful to make discoveries in studies aiming to relate molecular structure and structural dynamics to function.

Larijani MS, Sadat SM, Bolhassani A, Pouriayevali MH, Bahramali G, Ramezani A. In Silico Design and Immunologic Evaluation of HIV-1 p24-Nef Fusion Protein to Approach a Therapeutic Vaccine Candidate. Curr HIV Res 2019;16:322-337. [PMID: 30605062 PMCID: PMC6446525 DOI: 10.2174/1570162x17666190102151717] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2018] [Revised: 12/04/2018] [Accepted: 12/27/2018] [Indexed: 01/24/2023]

Abstract

Background:

Acquired immune deficiency syndrome (HIV/AIDS) has been a major global health concern for over 38 years. No safe and effective preventive or therapeutic vaccine has been developed although many products have been investigated. Computational methods have facilitated vaccine developments in recent decades. Among HIV-1 proteins, p24 and Nef are two suitable targets to provoke the cellular immune response. However, the fusion form of these two proteins has not been analyzed in silico yet.

Objective:

This study aimed at the evaluation of possible fusion forms of p24 and Nef in order to achieve a potential therapeutic subunit vaccine against HIV-1.

Method:

In this study, various computational approaches have been applied to predict the most effective fusion form of p24-Nef including CTL (Cytotoxic T lymphocytes) response, immunogenicity, conservation and population coverage. Moreover, binding to MHC (Major histocompatibility complex) molecules was assessed in both human and BALB/c.

Results:

After analyzing six possible fusion protein forms using AAY linker, we came up with the most practical form of p24 from 80 to 231 and Nef from 120 to 150 regions (according to their reference sequence of HXB2 strain) using an AAY linker, based on their peptides affinity to MHC molecules which are located in a conserved region among different virus clades. The selected fusion protein contains seventeen MHC I antigenic epitopes, among them KRWIILGLN, YKRWIILGL, DIAGTTSTL and FPDWQNYTP are fully conserved between the virus clades. Furthermore, analyzed class I CTL epitopes showed greater affinity binding to HLA-B 57*01, HLA-B*51:01 and HLA-B 27*02 molecules. The population coverage with the rate of >70% coverage in the Persian population supports this truncated form as an appropriate candidate against HIV-I virus.

Conclusion:

The predicted fusion protein, p24-AAY-Nef in a truncated form with a high rate of T cell epitopes and high conservancy rate among different clades, provides a helpful model for developing a therapeutic vaccine candidate against HIV-1.

Ochoa R, Laio A, Cossio P. Predicting the Affinity of Peptides to Major Histocompatibility Complex Class II by Scoring Molecular Dynamics Simulations. J Chem Inf Model 2019;59:3464-3473. [PMID: 31290667 DOI: 10.1021/acs.jcim.9b00403] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Wang L, Wang HF, Liu SR, Yan X, Song KJ. Predicting Protein-Protein Interactions from Matrix-Based Protein Sequence Using Convolution Neural Network and Feature-Selective Rotation Forest. Sci Rep 2019;9:9848. [PMID: 31285519 PMCID: PMC6614364 DOI: 10.1038/s41598-019-46369-4] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Accepted: 06/10/2019] [Indexed: 01/09/2023] Open

Wang E, Sun H, Wang J, Wang Z, Liu H, Zhang JZH, Hou T. End-Point Binding Free Energy Calculation with MM/PBSA and MM/GBSA: Strategies and Applications in Drug Design. Chem Rev 2019;119:9478-9508. [DOI: 10.1021/acs.chemrev.9b00055] [Citation(s) in RCA: 578] [Impact Index Per Article: 115.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Heo L, Arbour CF, Feig M. Driven to near-experimental accuracy by refinement via molecular dynamics simulations. Proteins 2019;87:1263-1275. [PMID: 31197841 DOI: 10.1002/prot.25759] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Revised: 06/01/2019] [Accepted: 06/07/2019] [Indexed: 12/17/2022]

Baek M, Park T, Heo L, Park C, Seok C. GalaxyHomomer: a web server for protein homo-oligomer structure prediction from a monomer sequence or structure. Nucleic Acids Res 2017;45:W320-W324. [PMID: 28387820 PMCID: PMC5570155 DOI: 10.1093/nar/gkx246] [Citation(s) in RCA: 81] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2017] [Accepted: 04/05/2017] [Indexed: 11/18/2022] Open

Methods for the Refinement of Protein Structure 3D Models. Int J Mol Sci 2019;20:ijms20092301. [PMID: 31075942 PMCID: PMC6539982 DOI: 10.3390/ijms20092301] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 04/24/2019] [Accepted: 05/07/2019] [Indexed: 12/25/2022] Open