Reference Citation Analysis

For: Getov I, Petukh M, Alexov E. SAAFEC: Predicting the Effect of Single Point Mutations on Protein Folding Free Energy Using a Knowledge-Modified MM/PBSA Approach. Int J Mol Sci 2016;17:512. [PMID: 27070572 DOI: 10.3390/ijms17040512] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Received: 01/07/2016] [Accepted: 03/28/2016] [Indexed: 11/16/2022] Open

Cited by Other Article(s)

Liu B, Jiang Y, Yang Y, Chen JX. OmeDDG: Improved Protein Mutation Stability Prediction Based on Predicted 3D Structures. J Phys Chem B 2024;128:67-76. [PMID: 38130113 DOI: 10.1021/acs.jpcb.3c05601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/23/2023]

Wang S, Tang H, Shan P, Wu Z, Zuo L. ProS-GNN: Predicting effects of mutations on protein stability using graph neural networks. Comput Biol Chem 2023;107:107952. [PMID: 37643501 DOI: 10.1016/j.compbiolchem.2023.107952] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 12/13/2022] [Revised: 08/18/2023] [Accepted: 08/25/2023] [Indexed: 08/31/2023]

Chen J, Woldring DR, Huang F, Huang X, Wei GW. Topological deep learning based deep mutational scanning. Comput Biol Med 2023;164:107258. [PMID: 37506452 PMCID: PMC10528359 DOI: 10.1016/j.compbiomed.2023.107258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/09/2023] [Revised: 06/28/2023] [Accepted: 07/08/2023] [Indexed: 07/30/2023]

Pandey P, Panday SK, Rimal P, Ancona N, Alexov E. Predicting the Effect of Single Mutations on Protein Stability and Binding with Respect to Types of Mutations. Int J Mol Sci 2023;24:12073. [PMID: 37569449 PMCID: PMC10418460 DOI: 10.3390/ijms241512073] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 07/11/2023] [Revised: 07/24/2023] [Accepted: 07/26/2023] [Indexed: 08/13/2023] Open

Kan Y, Paung Y, Kim Y, Seeliger MA, Miller WT. Biochemical Studies of Systemic Lupus Erythematosus-Associated Mutations in Nonreceptor Tyrosine Kinases Ack1 and Brk. Biochemistry 2023;62:1124-1137. [PMID: 36854171 PMCID: PMC10052838 DOI: 10.1021/acs.biochem.2c00685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/02/2023]

Tu H, Han Y, Wang Z, Li J. Clustered tree regression to learn protein energy change with mutated amino acid. Brief Bioinform 2022;23:6702668. [PMID: 36124753 DOI: 10.1093/bib/bbac374] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 06/24/2022] [Revised: 07/31/2022] [Accepted: 08/08/2022] [Indexed: 12/14/2022] Open

Yang ZY, Ye ZF, Xiao YJ, Hsieh CY, Zhang SY. SPLDExtraTrees: robust machine learning approach for predicting kinase inhibitor resistance. Brief Bioinform 2022;23:6543900. [PMID: 35262669 DOI: 10.1093/bib/bbac050] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/18/2021] [Revised: 01/17/2022] [Accepted: 01/31/2022] [Indexed: 12/25/2022] Open

Lai J, Yang J, Gamsiz Uzun ED, Rubenstein BM, Sarkar IN. LYRUS: a machine learning model for predicting the pathogenicity of missense variants. Bioinformatics Advances 2021;2:vbab045. [PMID: 35036922 PMCID: PMC8754197 DOI: 10.1093/bioadv/vbab045] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 07/08/2021] [Revised: 12/08/2021] [Accepted: 12/21/2021] [Indexed: 01/27/2023]

Sun T, Chen Y, Wen Y, Zhu Z, Li M. PremPLI: a machine learning model for predicting the effects of missense mutations on protein-ligand interactions. Commun Biol 2021;4:1311. [PMID: 34799678 PMCID: PMC8604987 DOI: 10.1038/s42003-021-02826-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 04/12/2021] [Accepted: 10/26/2021] [Indexed: 02/07/2023] Open

Koirala M, Shashikala HBM, Jeffries J, Wu B, Loftus SK, Zippin JH, Alexov E. Computational Investigation of the pH Dependence of Stability of Melanosome Proteins: Implication for Melanosome formation and Disease. Int J Mol Sci 2021;22:8273. [PMID: 34361043 PMCID: PMC8347052 DOI: 10.3390/ijms22158273] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/07/2021] [Revised: 07/27/2021] [Accepted: 07/29/2021] [Indexed: 11/16/2022] Open

Li G, Pahari S, Murthy AK, Liang S, Fragoza R, Yu H, Alexov E. SAAMBE-SEQ: a sequence-based method for predicting mutation effect on protein-protein binding affinity. Bioinformatics 2021;37:992-999. [PMID: 32866236 PMCID: PMC8128451 DOI: 10.1093/bioinformatics/btaa761] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 06/27/2020] [Revised: 08/17/2020] [Accepted: 08/24/2020] [Indexed: 01/04/2023] Open

Abstract

MOTIVATION

The vast majority of human genetic disorders are associated with mutations that affect protein-protein interactions by altering wild-type binding affinity. Therefore, it is extremely important to assess the effect of mutations on protein-protein binding free energy to assist the development of therapeutic solutions. Currently, the most popular approaches use structural information to deliver the predictions, which precludes their application to genome-scale investigations. Indeed, with the progress of genomic sequencing, researchers frequently need to assess the effect of mutations for which no structure is available.

RESULTS

Here, we report a Gradient Boosting Decision Tree machine learning algorithm, the SAAMBE-SEQ, which is completely sequence-based and does not require structural information at all. SAAMBE-SEQ utilizes 80 features representing evolutionary information, sequence-based features and change of physical properties upon mutation at the mutation site. The approach is shown to achieve Pearson correlation coefficient (PCC) of 0.83 in 5-fold cross validation in a benchmarking test against experimentally determined binding free energy change (ΔΔG). Further, a blind test (no-STRUC) is compiled collecting experimental ΔΔG upon mutation for protein complexes for which structure is not available and used to benchmark SAAMBE-SEQ resulting in PCC in the range of 0.37-0.46. The accuracy of SAAMBE-SEQ method is found to be either better or comparable to most advanced structure-based methods. SAAMBE-SEQ is very fast, available as webserver and stand-alone code, and indeed utilizes only sequence information, and thus it is applicable for genome-scale investigations to study the effect of mutations on protein-protein interactions.

AVAILABILITY AND IMPLEMENTATION

SAAMBE-SEQ is available at http://compbio.clemson.edu/saambe_webserver/indexSEQ.php#started.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
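
The RESULTS paragraph above reports a Pearson correlation coefficient (PCC) from 5-fold cross-validation. As a minimal sketch of how such a figure can be computed, the Python below pools out-of-fold predictions and correlates them with the observed values. It is not the SAAMBE-SEQ implementation: the one-feature least-squares line stands in for the Gradient Boosting Decision Tree model, the data are synthetic, all function names are hypothetical, and pooling the folds before computing one PCC (rather than averaging per-fold PCCs) is an assumption, since the abstract does not say which was used.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def fit_line(xs, ys):
    """Ordinary least squares for y = a * x + b (stand-in for a real model)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum(
        (x - mx) ** 2 for x in xs
    )
    return a, my - a * mx


def cross_validated_pcc(xs, ys, k=5, seed=0):
    """Pool out-of-fold predictions over k folds, then compute one PCC."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k disjoint index sets
    preds = [0.0] * len(xs)
    for held_out in folds:
        held = set(held_out)
        train = [i for i in idx if i not in held]
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        for i in held_out:
            # Each sample is predicted by a model that never saw it.
            preds[i] = a * xs[i] + b
    return pearson(preds, ys)


# Synthetic data: one hypothetical feature and a noisy "experimental" ddG.
rng = random.Random(1)
feature = [rng.uniform(-2.0, 2.0) for _ in range(100)]
ddg = [1.5 * x + rng.gauss(0.0, 0.5) for x in feature]
pcc = cross_validated_pcc(feature, ddg)
print(f"5-fold cross-validated PCC: {pcc:.2f}")
```

Because every prediction comes from a model trained without that sample, the resulting PCC estimates performance on unseen mutations rather than goodness of fit on the training data.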

Hayes RL, Brooks CL. A strategy for proline and glycine mutations to proteins with alchemical free energy calculations. J Comput Chem 2021;42:1088-1094. [PMID: 33844328 DOI: 10.1002/jcc.26525] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 09/27/2020] [Revised: 03/03/2021] [Accepted: 03/05/2021] [Indexed: 11/07/2022]

Bahia MS, Khazanov N, Zhou Q, Yang Z, Wang C, Hong JS, Rab A, Sorscher EJ, Brouillette CG, Hunt JF, Senderowitz H. Stability Prediction for Mutations in the Cytosolic Domains of Cystic Fibrosis Transmembrane Conductance Regulator. J Chem Inf Model 2021;61:1762-1777. [PMID: 33720715 DOI: 10.1021/acs.jcim.0c01207] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Indexed: 11/28/2022]

Abstract

Cystic Fibrosis (CF) is caused by mutations to the Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) chloride channel. CFTR is composed of two membrane spanning domains, two cytosolic nucleotide-binding domains (NBD1 and NBD2) and a largely unstructured R-domain. Multiple CF-causing mutations reside in the NBDs and some are known to compromise the stability of these domains. The ability to predict the effect of mutations on the stability of the cytosolic domains of CFTR and to shed light on the mechanisms by which they exert their effect is therefore important in CF research. With this in mind, we have predicted the effect on domain stability of 59 mutations in NBD1 and NBD2 using 15 different algorithms and evaluated their performances via comparison to experimental data using several metrics including the correct classification rate (CCR), and the squared Pearson correlation (R²) and Spearman's correlation (ρ) calculated between the experimental ΔT_m values and the computationally predicted ΔΔG values. Overall, the best results were obtained with FoldX and Rosetta. For NBD1 (35 mutations), FoldX provided R² and ρ values of 0.64 and -0.71, respectively, with an 86% correct classification rate (CCR). For NBD2 (24 mutations), FoldX R², ρ, and CCR were 0.51, -0.73, and 75%, respectively. Application of the Rosetta high-resolution protocol (Rosetta_hrp) to NBD1 yielded R², ρ, and CCR of 0.64, -0.75, and 69%, respectively, and for NBD2 yielded R², ρ, and CCR of 0.29, -0.27, and 50%, respectively. The corresponding numbers for the Rosetta's low-resolution protocol (Rosetta_lrp) were R² = 0.47, ρ = -0.69, and CCR = 69% for NBD1 and R² = 0.27, ρ = -0.24, and CCR = 63% for NBD2. For NBD1, both algorithms suggest that destabilizing mutations suffer from destabilizing vdW clashes, whereas stabilizing mutations benefit from favorable H-bond interactions. 
Two triple consensus approaches based on FoldX, Rosetta_lrp, and Rosetta_hrp were attempted using either "majority-voting" or "all-voting". The all-voting consensus outperformed the individual predictors, albeit on a smaller data set. In summary, our results suggest that the effect of mutations on the stability of CFTR's NBDs could be largely predicted. Since NBDs are common to all ABC transporters, these results may find use in predicting the effect and mechanism of the action of multiple disease-causing mutations in other proteins.
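
The abstract above contrasts "majority-voting" and "all-voting" consensus over three predictors and notes that all-voting covers a smaller data set. The Python sketch below illustrates one plausible reading of that difference; it is not the authors' protocol. The assumptions are: each predictor's ΔΔG is turned into a class by its sign, with positive meaning destabilizing; all-voting makes a call only when the three predictors agree; and the correct classification rate is the plain fraction of correct calls, which may differ from the definition used in the paper. All numbers and names are invented for illustration.

```python
def classify(ddg, threshold=0.0):
    """Map a predicted ddG to a class (assumed sign convention)."""
    return "destabilizing" if ddg > threshold else "stabilizing"


def majority_vote(labels):
    """Return the label chosen by most voters (three binary voters never tie)."""
    return max(set(labels), key=labels.count)


def all_vote(labels):
    """Return the shared label if every voter agrees, otherwise None."""
    return labels[0] if len(set(labels)) == 1 else None


def ccr(predicted, observed):
    """Fraction of correct calls and the number of mutations that got a call."""
    pairs = [(p, o) for p, o in zip(predicted, observed) if p is not None]
    if not pairs:
        return None, 0
    correct = sum(p == o for p, o in pairs)
    return correct / len(pairs), len(pairs)


# Hypothetical ddG predictions from three predictors for four mutations.
predictions = [
    (1.2, 0.8, 2.0),     # all three say destabilizing
    (-0.5, 0.3, -0.2),   # two of three say stabilizing
    (0.9, -0.1, 0.4),    # two of three say destabilizing
    (-1.0, -0.6, -0.3),  # all three say stabilizing
]
observed = ["destabilizing", "stabilizing", "stabilizing", "stabilizing"]

votes = [[classify(v) for v in row] for row in predictions]
majority_ccr = ccr([majority_vote(v) for v in votes], observed)
all_ccr = ccr([all_vote(v) for v in votes], observed)

print("majority-voting (CCR, n):", majority_ccr)
print("all-voting (CCR, n):", all_ccr)
```

In this toy case all-voting is more accurate but only classifies the two mutations on which the predictors agree, which mirrors the trade-off the abstract describes.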

Blake S, Hemming I, Heng JIT, Agostino M. Structure-Based Approaches to Classify the Functional Impact of ZBTB18 Missense Variants in Health and Disease. ACS Chem Neurosci 2021;12:979-989. [PMID: 33621064 DOI: 10.1021/acschemneuro.0c00758] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Indexed: 01/18/2023] Open

SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability. Int J Mol Sci 2021;22:606. [PMID: 33435356 PMCID: PMC7827184 DOI: 10.3390/ijms22020606] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Received: 12/05/2020] [Revised: 12/23/2020] [Accepted: 01/06/2021] [Indexed: 01/04/2023] Open

Chen Y, Lu H, Zhang N, Zhu Z, Wang S, Li M. PremPS: Predicting the impact of missense mutations on protein stability. PLoS Comput Biol 2020;16:e1008543. [PMID: 33378330 PMCID: PMC7802934 DOI: 10.1371/journal.pcbi.1008543] [Citation(s) in RCA: 93] [Impact Index Per Article: 23.3] [Received: 07/23/2020] [Revised: 01/12/2021] [Accepted: 11/16/2020] [Indexed: 12/12/2022] Open

Abstract

Computational methods that predict protein stability changes induced by missense mutations have made a lot of progress over the past decades. Most of the available methods however have very limited accuracy in predicting stabilizing mutations because existing experimental sets are dominated by mutations reducing protein stability. Moreover, few approaches could consistently perform well across different test cases. To address these issues, we developed a new computational method PremPS to more accurately evaluate the effects of missense mutations on protein stability. The PremPS method is composed of only ten evolutionary- and structure-based features and parameterized on a balanced dataset with an equal number of stabilizing and destabilizing mutations. A comprehensive comparison of the predictive performance of PremPS with other available methods on nine benchmark datasets confirms that our approach consistently outperforms other methods and shows considerable improvement in estimating the impacts of stabilizing mutations. A protein could have multiple structures available, and if another structure of the same protein is used, the predicted change in stability for structure-based methods might be different. Thus, we further estimated the impact of using different structures on prediction accuracy, and demonstrate that our method performs well across different types of structures except for low-resolution structures and models built based on templates with low sequence identity. PremPS can be used for finding functionally important variants, revealing the molecular mechanisms of functional influences and protein design. PremPS is freely available at https://lilab.jysw.suda.edu.cn/research/PremPS/, which allows to do large-scale mutational scanning and takes about four minutes to perform calculations for a single mutation per protein with ~ 300 residues and requires ~ 0.4 seconds for each additional mutation.

The development of computational methods to accurately predict the impacts of amino acid substitutions on protein stability is of paramount importance for the field of protein design and understanding the roles of missense mutations in disease. However, most of the available methods have very limited predictive accuracy for mutations increasing stability and few could consistently perform well across different test cases. Here we present a new computational approach PremPS, which is capable of predicting the effects of single point mutations on protein stability. PremPS employs only ten evolutionary- and structure-based features and is trained on a symmetrical dataset consisting of the same number of cases of stabilizing and destabilizing mutations. Our method was tested against numerous blind datasets and shows a considerable improvement especially in evaluating the effects of stabilizing mutations, outperforming previously developed methods. PremPS is freely available as a user-friendly web server at http://lilab.jysw.suda.edu.cn/research/PremPS/, which is fast enough to handle the large number of cases.

Wang R, Chen J, Hozumi Y, Yin C, Wei GW. Decoding Asymptomatic COVID-19 Infection and Transmission. J Phys Chem Lett 2020;11:10007-10015. [PMID: 33179934 PMCID: PMC8150094 DOI: 10.1021/acs.jpclett.0c02765] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Indexed: 05/24/2023]

Sarkar A, Yang Y, Vihinen M. Variation benchmark datasets: update, criteria, quality and applications. Database: The Journal of Biological Databases and Curation 2020;2020:5710862. [PMID: 32016318 PMCID: PMC6997940 DOI: 10.1093/database/baz117] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Received: 04/12/2019] [Revised: 06/03/2019] [Accepted: 07/01/2019] [Indexed: 02/07/2023]

Abstract

Development of new computational methods and testing their performance has to be carried out using experimental data. Only in comparison to existing knowledge can method performance be assessed. For that purpose, benchmark datasets with known and verified outcome are needed. High-quality benchmark datasets are valuable and may be difficult, laborious and time consuming to generate. VariBench and VariSNP are the two existing databases for sharing variation benchmark datasets used mainly for variation interpretation. They have been used for training and benchmarking predictors for various types of variations and their effects. VariBench was updated with 419 new datasets from 109 papers containing altogether 329 014 152 variants; however, there is plenty of redundancy between the datasets. VariBench is freely available at http://structure.bmc.lu.se/VariBench/. The contents of the datasets vary depending on information in the original source. The available datasets have been categorized into 20 groups and subgroups. There are datasets for insertions and deletions, substitutions in coding and non-coding region, structure mapped, synonymous and benign variants. Effect-specific datasets include DNA regulatory elements, RNA splicing, and protein property for aggregation, binding free energy, disorder and stability. Then there are several datasets for molecule-specific and disease-specific applications, as well as one dataset for variation phenotype effects. Variants are often described at three molecular levels (DNA, RNA and protein) and sometimes also at the protein structural level including relevant cross references and variant descriptions. The updated VariBench facilitates development and testing of new methods and comparison of obtained performances to previously published methods. 
We compared the performance of the pathogenicity/tolerance predictor PON-P2 to several benchmark studies and show that such comparisons are feasible and useful; however, there may be limitations due to a lack of provided details and shared data.

Database URL: http://structure.bmc.lu.se/VariBench

Heydari A, Abolnezhadian F, Sadeghi-Shabestari M, Saberi A, Shamsizadeh A, Ghadiri AA, Ghandil P. Identification of Cytochrome b-245, beta-chain gene mutations, and clinical presentations in Iranian patients with X-linked chronic granulomatous disease. J Clin Lab Anal 2020;35:e23637. [PMID: 33098164 PMCID: PMC7891530 DOI: 10.1002/jcla.23637] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 07/05/2020] [Revised: 10/06/2020] [Accepted: 10/08/2020] [Indexed: 01/25/2023] Open

Mohamadian M, Ghandil P, Naseri M, Bahrami A, Momen AA. A novel homozygous variant in an Iranian pedigree with cerebellar ataxia, mental retardation, and dysequilibrium syndrome type 4. J Clin Lab Anal 2020;34:e23484. [PMID: 33079427 PMCID: PMC7676196 DOI: 10.1002/jcla.23484] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 04/27/2020] [Revised: 06/12/2020] [Accepted: 06/26/2020] [Indexed: 01/20/2023] Open

Structural and Molecular Interaction Studies on Familial Hypercholesterolemia Causative PCSK9 Functional Domain Mutations Reveals Binding Affinity Alterations with LDLR. Int J Pept Res Ther 2020. [DOI: 10.1007/s10989-020-10121-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/23/2022]

Enzyme dysfunction at atomic resolution: Disease-associated variants of human phosphoglucomutase-1. Biochimie 2020;183:44-48. [PMID: 32898648 DOI: 10.1016/j.biochi.2020.08.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 07/08/2020] [Revised: 08/26/2020] [Accepted: 08/30/2020] [Indexed: 11/20/2022]

Mazurenko S. Predicting protein stability and solubility changes upon mutations: data perspective. ChemCatChem 2020. [DOI: 10.1002/cctc.202000933] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Indexed: 12/18/2022]

Insight into the structural and functional analysis of the impact of missense mutation on cytochrome P450 oxidoreductase. J Mol Graph Model 2020;100:107708. [PMID: 32805558 DOI: 10.1016/j.jmgm.2020.107708] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 02/12/2020] [Revised: 07/15/2020] [Accepted: 07/15/2020] [Indexed: 01/26/2023]

Mahase V, Sobitan A, Johnson C, Cooper F, Xie Y, Li L, Teng S. Computational analysis of hereditary spastic paraplegia mutations in the kinesin motor domains of KIF1A and KIF5A. Journal of Theoretical & Computational Chemistry 2020. [DOI: 10.1142/s0219633620410035] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 11/18/2022]

Zhang N, Lu H, Chen Y, Zhu Z, Yang Q, Wang S, Li M. PremPRI: Predicting the Effects of Missense Mutations on Protein-RNA Interactions. Int J Mol Sci 2020;21:5560. [PMID: 32756481 PMCID: PMC7432928 DOI: 10.3390/ijms21155560] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 06/29/2020] [Revised: 07/28/2020] [Accepted: 07/30/2020] [Indexed: 12/23/2022] Open

Sanavia T, Birolo G, Montanucci L, Turina P, Capriotti E, Fariselli P. Limitations and challenges in protein stability prediction upon genome variations: towards future applications in precision medicine. Comput Struct Biotechnol J 2020;18:1968-1979. [PMID: 32774791 PMCID: PMC7397395 DOI: 10.1016/j.csbj.2020.07.011] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Received: 03/15/2020] [Revised: 07/10/2020] [Accepted: 07/14/2020] [Indexed: 12/13/2022] Open

Mutations in FAM50A suggest that Armfield XLID syndrome is a spliceosomopathy. Nat Commun 2020;11:3698. [PMID: 32703943 PMCID: PMC7378245 DOI: 10.1038/s41467-020-17452-6] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Received: 05/30/2019] [Accepted: 06/17/2020] [Indexed: 02/06/2023] Open

Ganakammal SR, Koirala M, Wu B, Alexov E. In-silico analysis to identify the role of MEN1 missense mutations in breast cancer. Journal of Theoretical & Computational Chemistry 2020. [DOI: 10.1142/s0219633620410023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 11/18/2022]

Lu Y, Villoutreix BO, Biswas I, Ding Q, Wang X, Rezaie AR. Thr90Ser Mutation in Antithrombin is Associated with Recurrent Thrombosis in a Heterozygous Carrier. Thromb Haemost 2020;120:1045-1055. [PMID: 32422680 DOI: 10.1055/s-0040-1710590] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Indexed: 12/26/2022]

Banerjee A, Mitra P. Estimating the Effect of Single-Point Mutations on Protein Thermodynamic Stability and Analyzing the Mutation Landscape of the p53 Protein. J Chem Inf Model 2020;60:3315-3323. [PMID: 32401507 DOI: 10.1021/acs.jcim.0c00256] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Indexed: 01/09/2023]

Effects of Single and Double Mutants in Human Glucose-6-Phosphate Dehydrogenase Variants Present in the Mexican Population: Biochemical and Structural Analysis. Int J Mol Sci 2020;21:2732. [PMID: 32326520 PMCID: PMC7215812 DOI: 10.3390/ijms21082732] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 03/24/2020] [Revised: 04/12/2020] [Accepted: 04/13/2020] [Indexed: 11/16/2022] Open

Funk CR, Huey ES, May MM, Peng Y, Michonova E, Best RG, Schwartz CE, Blenda AV. Rare missense variant p.Ala505Ser in the ZAK protein observed in a patient with split-hand/foot malformation from a non-consanguineous pedigree. J Int Med Res 2020;48:300060519879293. [PMID: 32266845 PMCID: PMC7144677 DOI: 10.1177/0300060519879293] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Indexed: 11/26/2022] Open

Gyulkhandanyan A, Rezaie AR, Roumenina L, Lagarde N, Fremeaux-Bacchi V, Miteva MA, Villoutreix BO. Analysis of protein missense alterations by combining sequence- and structure-based methods. Mol Genet Genomic Med 2020;8:e1166. [PMID: 32096919 PMCID: PMC7196459 DOI: 10.1002/mgg3.1166] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 11/26/2019] [Revised: 01/20/2020] [Accepted: 01/27/2020] [Indexed: 12/11/2022] Open

Medina-Ortiz D, Contreras S, Quiroz C, Olivera-Nappa Á. Development of Supervised Learning Predictive Models for Highly Non-linear Biological, Biomedical, and General Datasets. Front Mol Biosci 2020;7:13. [PMID: 32118039 PMCID: PMC7031350 DOI: 10.3389/fmolb.2020.00013] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 09/09/2019] [Accepted: 01/22/2020] [Indexed: 11/13/2022] Open

Abstract

In highly non-linear datasets, attributes or features do not allow readily finding visual patterns for identifying common underlying behaviors. Therefore, it is not possible to achieve classification or regression using linear or mildly non-linear hyperspace partition functions. Hence, supervised learning models based on the application of most existing algorithms are limited, and their performance metrics are low. Linear transformations of variables, such as principal components analysis, cannot avoid the problem, and even models based on artificial neural networks and deep learning are unable to improve the metrics. Sometimes, even when features allow classification or regression in reported cases, performance metrics of supervised learning algorithms remain unsatisfyingly low. This problem is recurrent in many areas of study as, per example, the clinical, biotechnological, and protein engineering areas, where many of the attributes are correlated in an unknown and very non-linear fashion or are categorical and difficult to relate to a target response variable. In such areas, being able to create predictive models would dramatically impact the quality of their outcomes, generating an immediate added value for both the scientific and general public. In this manuscript, we present RV-Clustering, a library of unsupervised learning algorithms, and a new methodology designed to find optimum partitions within highly non-linear datasets that allow deconvoluting variables and notoriously improving performance metrics in supervised learning classification or regression models. The partitions obtained are statistically cross-validated, ensuring correct representativity and no over-fitting. We have successfully tested RV-Clustering in several highly non-linear datasets with different origins. 
The approach herein proposed has generated classification and regression models with high-performance metrics, which further supports its ability to generate predictive models for highly non-linear datasets. Advantageously, the method does not require significant human input, which guarantees a higher usability in the biological, biomedical, and protein engineering community with no specific knowledge in the machine learning area.

Goswami AM. Computational analyses prioritize and reveal the deleterious nsSNPs in human angiotensinogen gene. Comput Biol Chem 2020;84:107199. [PMID: 31931433 DOI: 10.1016/j.compbiolchem.2019.107199] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 10/01/2019] [Revised: 12/26/2019] [Accepted: 12/30/2019] [Indexed: 12/27/2022]

Li C, Jia Z, Chakravorty A, Pahari S, Peng Y, Basu S, Koirala M, Panday SK, Petukh M, Li L, Alexov E. DelPhi Suite: New Developments and Review of Functionalities. J Comput Chem 2019;40:2502-2508. [PMID: 31237360 PMCID: PMC6771749 DOI: 10.1002/jcc.26006] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Received: 04/03/2019] [Revised: 05/07/2019] [Accepted: 06/09/2019] [Indexed: 12/25/2022]

Koirala M, Alexov E. Computational chemistry methods to investigate the effects caused by DNA variants linked with disease. Journal of Theoretical & Computational Chemistry 2019. [DOI: 10.1142/s0219633619300015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Indexed: 02/06/2023]

Novel Genetic Markers for Early Detection of Elevated Breast Cancer Risk in Women. Int J Mol Sci 2019;20:4828. [PMID: 31569399 PMCID: PMC6801521 DOI: 10.3390/ijms20194828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/02/2019] [Revised: 09/20/2019] [Accepted: 09/25/2019] [Indexed: 12/25/2022] Open

Tajielyato N, Alexov E. Modeling pKas of unfolded proteins to probe structural models of unfolded state. Journal of Theoretical & Computational Chemistry 2019. [DOI: 10.1142/s0219633619500202] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Indexed: 11/18/2022]

Kumar V, Pandey P, Idrees D, Prakash A, Lynn A. Delineating the effect of mutations on the conformational dynamics of N-terminal domain of TDP-43. Biophys Chem 2019;250:106174. [DOI: 10.1016/j.bpc.2019.106174] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 12/28/2018] [Revised: 03/06/2019] [Accepted: 04/21/2019] [Indexed: 12/12/2022]

Spellicy CJ, Peng Y, Olewiler L, Cathey SS, Rogers RC, Bartholomew D, Johnson J, Alexov E, Lee JA, Friez MJ, Jones JR. Three additional patients with EED-associated overgrowth: potential mutation hotspots identified? J Hum Genet 2019;64:561-572. [PMID: 30858506 DOI: 10.1038/s10038-019-0585-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Received: 09/20/2018] [Revised: 02/12/2019] [Accepted: 02/13/2019] [Indexed: 12/25/2022]

Chakravorty A, Gallicchio E, Alexov E. A grid-based algorithm in conjunction with a gaussian-based model of atoms for describing molecular geometry. J Comput Chem 2019;40:1290-1304. [PMID: 30698861 PMCID: PMC6506848 DOI: 10.1002/jcc.25786] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 11/02/2018] [Revised: 12/12/2018] [Accepted: 01/06/2019] [Indexed: 11/06/2022]

Peng Y, Alexov E, Basu S. Structural Perspective on Revealing and Altering Molecular Functions of Genetic Variants Linked with Diseases. Int J Mol Sci 2019;20:ijms20030548. [PMID: 30696058 PMCID: PMC6386852 DOI: 10.3390/ijms20030548] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 12/22/2018] [Revised: 01/25/2019] [Accepted: 01/26/2019] [Indexed: 12/25/2022] Open

Qi R, Luo R. Robustness and Efficiency of Poisson-Boltzmann Modeling on Graphics Processing Units. J Chem Inf Model 2018;59:409-420. [PMID: 30550277 DOI: 10.1021/acs.jcim.8b00761] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Indexed: 12/15/2022]

PremPDI estimates and interprets the effects of missense mutations on protein-DNA interactions. PLoS Comput Biol 2018;14:e1006615. [PMID: 30533007 PMCID: PMC6303081 DOI: 10.1371/journal.pcbi.1006615] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Received: 09/11/2018] [Revised: 12/21/2018] [Accepted: 11/01/2018] [Indexed: 01/01/2023] Open

Abstract

Protein-DNA interactions play important roles in the regulation of many vital cellular processes, including transcription, translation, DNA replication and recombination. Sequence variants in DNA-binding proteins that alter protein-DNA interactions may cause significant perturbation or complete abolishment of function, potentially leading to disease. A mechanistic understanding of the impact of variants on protein-DNA interactions is therefore a persistent need. To address this need we introduce a new computational method, PremPDI, that predicts the effect of a single missense mutation in the protein on the protein-DNA interaction and calculates the quantitative change in binding affinity. The PremPDI method is based on molecular mechanics force fields and fast side-chain optimization algorithms, with parameters optimized on experimental sets of 219 mutations from 49 protein-DNA complexes. PremPDI yields very good agreement between predicted and experimental values, with a Pearson correlation coefficient of 0.71 and a root-mean-square error of 0.86 kcal mol^-1. The PremPDI server can map mutations onto a structural protein-DNA complex, calculate the associated changes in binding affinity, determine the deleterious effect of a mutation, and produce a mutant structural model for download. PremPDI can be applied to many tasks, such as identifying potentially damaging mutations in cancer and other diseases. PremPDI is available at http://lilab.jysw.suda.edu.cn/research/PremPDI/.

Developing methods for accurate prediction of the effects of amino acid substitutions on protein-DNA interactions is important for a wide range of biomedical applications, such as understanding the disease-causing mechanisms of missense mutations and guiding protein engineering. Very few methods have been developed for predicting the effects of mutations on protein-DNA binding affinity. Here we report a new computational method, PRedicts the Effects of single Mutations on Protein-DNA Interactions (PremPDI). The core of the PremPDI method is based on molecular mechanics force fields and fast side-chain optimization algorithms, which make the PremPDI algorithm efficient and fast enough to handle a large number of cases. The performance of the PremPDI protocol was tested against experimentally determined binding free energy changes of 219 mutations from 49 protein-DNA complexes and yielded a very good correlation coefficient. The PremPDI webserver is available to the community at http://lilab.jysw.suda.edu.cn/research/PremPDI/.

Valdebenito-Maturana B, Reyes-Suarez JA, Henriquez J, Holmes DS, Quatrini R, Pohl E, Arenas-Salinas M. Mutantelec: An In Silico mutation simulation platform for comparative electrostatic potential profiling of proteins. J Comput Chem 2018;38:467-474. [PMID: 28114729 DOI: 10.1002/jcc.24712] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 10/06/2016] [Revised: 12/06/2016] [Accepted: 12/07/2016] [Indexed: 11/07/2022]

Zhou Y, Fujikura K, Mkrtchian S, Lauschke VM. Computational Methods for the Pharmacogenetic Interpretation of Next Generation Sequencing Data. Front Pharmacol 2018;9:1437. [PMID: 30564131 PMCID: PMC6288784 DOI: 10.3389/fphar.2018.01437] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Received: 08/03/2018] [Accepted: 11/20/2018] [Indexed: 12/21/2022] Open

Srinivasan E, Rajasekaran R. Quantum chemical and molecular mechanics studies on the assessment of interactions between resveratrol and mutant SOD1 (G93A) protein. J Comput Aided Mol Des 2018;32:1347-1361. [PMID: 30368622 DOI: 10.1007/s10822-018-0175-1] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Received: 06/19/2018] [Accepted: 10/24/2018] [Indexed: 12/29/2022]

Peng Y, Sun L, Jia Z, Li L, Alexov E. Predicting protein-DNA binding free energy change upon missense mutations using modified MM/PBSA approach: SAMPDI webserver. Bioinformatics 2018;34:779-786. [PMID: 29091991 DOI: 10.1093/bioinformatics/btx698] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Received: 07/14/2017] [Accepted: 10/27/2017] [Indexed: 12/28/2022] Open