Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Huang LT, Gromiha MM, Ho SY. iPTREE-STAB: interpretable decision tree based method for predicting protein stability changes upon mutations. Bioinformatics 2007;23:1292-3. [PMID: 17379687 DOI: 10.1093/bioinformatics/btm100] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.4] [Indexed: 11/14/2022] Open

Cited by Other Article(s)

Chu SKS, Narang K, Siegel JB. Protein stability prediction by fine-tuning a protein language model on a mega-scale dataset. PLoS Comput Biol 2024;20:e1012248. [PMID: 39038042 DOI: 10.1371/journal.pcbi.1012248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/11/2023] [Accepted: 06/13/2024] [Indexed: 07/24/2024] Open

Zhou L, Tao C, Shen X, Sun X, Wang J, Yuan Q. Unlocking the potential of enzyme engineering via rational computational design strategies. Biotechnol Adv 2024;73:108376. [PMID: 38740355 DOI: 10.1016/j.biotechadv.2024.108376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/27/2023] [Revised: 04/27/2024] [Accepted: 05/08/2024] [Indexed: 05/16/2024]

Ndochinwa OG, Wang QY, Amadi OC, Nwagu TN, Nnamchi CI, Okeke ES, Moneke AN. Current status and emerging frontiers in enzyme engineering: An industrial perspective. Heliyon 2024;10:e32673. [PMID: 38912509 PMCID: PMC11193041 DOI: 10.1016/j.heliyon.2024.e32673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/08/2023] [Revised: 06/05/2024] [Accepted: 06/06/2024] [Indexed: 06/25/2024] Open

Abstract

Protein engineering mechanisms can be an efficient approach to enhance the biochemical properties of various biocatalysts. Immobilization of biocatalysts and the introduction of new-to-nature chemical reactivities are also possible through the same mechanism. Discovering new protocols that enhance catalytically active proteins that are novel in being stable, active, and stereoselective could be identified as an essential area of concurrent bioorganic chemistry (the synergistic relationship between organic chemistry and biochemistry in the context of enzyme engineering). However, with our current level of knowledge about protein folding and its correlation with protein conformation and activities, it is almost impossible to design proteins with specific biological and physical properties. Hence, contemporary protein engineering typically involves reprogramming existing enzymes by mutagenesis to generate new phenotypes with desired properties. These processes ensure that limitations of naturally occurring enzymes are not encountered. For example, researchers have engineered cellulases and hemicellulases to withstand harsh conditions encountered during biomass pretreatment, such as high temperatures and acidic environments. By enhancing the activity and robustness of these enzymes, biofuel production becomes more economically viable and environmentally sustainable. Recent trends in enzyme engineering have enabled the development of tailored biocatalysts for pharmaceutical applications. For instance, researchers have engineered enzymes such as cytochrome P450s and amine oxidases to catalyze challenging reactions involved in drug synthesis. In addition to conventional methods, there has been an increasing application of machine learning techniques to identify patterns in data. These patterns are then used to predict protein structures, enhance enzyme solubility, stability, and function, forecast substrate specificity, and assist in rational protein design. In this review, we discuss recent trends in enzyme engineering to optimize the biochemical properties of various biocatalysts. Using examples relevant to biotechnology, we expatiate on the significance of enzyme engineering and on how these methods could be applied to optimize the biochemical properties of a naturally occurring enzyme.

Kumar R, Jayaraman M, Ramadas K, Chandrasekaran A. Computational identification and analysis of deleterious non-synonymous single nucleotide polymorphisms (nsSNPs) in the human POR gene: a structural and functional impact. J Biomol Struct Dyn 2024;42:1518-1532. [PMID: 37173831 DOI: 10.1080/07391102.2023.2211674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/02/2023] [Accepted: 04/02/2023] [Indexed: 05/15/2023]

Abstract

Cytochrome P450 oxidoreductase (POR) protein is essential for steroidogenesis, and POR gene mutations are frequently associated with P450 Oxidoreductase Deficiency (PORD), a disorder of hormone production. To our knowledge, no previous attempt has been made to identify and analyze the deleterious/pathogenic non-synonymous single nucleotide polymorphisms (nsSNPs) in the human POR gene through an extensive computational approach. Computational algorithms and tools were employed to identify, characterize, and validate the pathogenic SNPs associated with certain diseases. To begin with, all the high-confidence SNPs were collected, and their structural and functional impacts on the protein structures were explored. The results of various in silico analyses affirm that the A287P and R457H variants of POR could destabilize the interactions between the amino acids and the hydrogen bond networks, resulting in functional deviations of POR. The literature study further confirms that the pathogenic mutations (A287P and R457H) are associated with the onset of PORD. Molecular dynamics simulations (MDS) and essential dynamics (ED) studies characterized the structural consequences of prioritized deleterious mutations, representing the structural destabilization that might disrupt POR biological function. The identified deleterious mutations at the cofactor's binding domains might interfere with the essential interactions between the protein and cofactors, thus inhibiting POR catalytic activity. The consolidated insights from the computational analyses can be used to predict potential deleterious mutants and understand the disease's pathological basis and the molecular mechanism of drug metabolism for the application of personalized medication. 
HIGHLIGHTS
NADPH cytochrome P450 oxidoreductase (POR) mutations are associated with a broad spectrum of human diseases.
Identified and analyzed the most deleterious nsSNPs of POR through the sequence and structure-based prediction tools.
Investigated the structural and functional impacts of the most significant mutations (A287P and R457H) associated with PORD.
Molecular dynamics and PCA-based FEL analysis were utilized to probe the mutation-induced structural alterations in POR.
Communicated by Ramaswamy H. Sarma.

Thayyil Menambath D, Adiga U, Rai T, Adiga S, Shetty V. Identification of the SIRT1 gene's most harmful non-synonymous SNPs and their effects on functional and structural features-an in silico analysis. F1000Res 2024;12:66. [PMID: 38283900 PMCID: PMC10822041 DOI: 10.12688/f1000research.128706.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/16/2024] [Indexed: 01/30/2024] Open

Abstract

Introduction

The sirtuin (silent mating type information regulation 2 homolog) 1 (SIRT1) protein plays a vital role in many disorders such as diabetes, cancer, obesity, inflammation, and neurodegenerative and cardiovascular diseases. The objective of this in silico analysis of SIRT1's functional single nucleotide polymorphisms (SNPs) was to gain valuable insight into the harmful effects of non-synonymous SNPs (nsSNPs) on the protein. The study also aimed to use bioinformatics methods to investigate the genetic variations and modifications that may have an impact on the SIRT1 gene's expression and function.

Methods

nsSNPs of the SIRT1 protein were collected from the dbSNP site, from its three (3) different protein accession IDs. These were then fed to various bioinformatic tools such as SIFT, Provean, and I-Mutant to find the most deleterious ones. Functional and structural effects were examined using the HOPE server and I-Tasser. Gene interactions were predicted by STRING software. The SIFT, Provean, and I-Mutant tools detected the three most deleterious nsSNPs (rs769519031, rs778184510, and rs199983221).

Results

Out of 252 nsSNPs, SIFT analysis showed that 94 were deleterious, Provean listed 67 as dangerous, and I-Mutant found 58 nsSNPs resulting in lowered stability of proteins. HOPE modelling of rs199983221 and rs769519031 suggested reduced hydrophobicity due to Ile4Thr and Ile223Ser, resulting in decreased hydrophobic interactions. In contrast, on modelling rs778184510, the mutant protein had a higher hydrophobicity than the wild type.

Conclusions

Our study reports that three nsSNPs (D357A, I223S, I4T) are the most damaging mutations of the SIRT1 gene. Mutations may result in altered protein structure and functions. Such altered protein may be the basis for various disorders. Our findings may be a crucial guide in establishing the pathogenesis of various disorders.

Li G, Jia L, Wang K, Sun T, Huang J. Prediction of Thermostability of Enzymes Based on the Amino Acid Index (AAindex) Database and Machine Learning. Molecules 2023;28:8097. [PMID: 38138586 PMCID: PMC10746113 DOI: 10.3390/molecules28248097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/07/2023] [Revised: 12/06/2023] [Accepted: 12/12/2023] [Indexed: 12/24/2023] Open

Wang S, Tang H, Shan P, Wu Z, Zuo L. ProS-GNN: Predicting effects of mutations on protein stability using graph neural networks. Comput Biol Chem 2023;107:107952. [PMID: 37643501 DOI: 10.1016/j.compbiolchem.2023.107952] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 12/13/2022] [Revised: 08/18/2023] [Accepted: 08/25/2023] [Indexed: 08/31/2023]

Musil M, Jezik A, Horackova J, Borko S, Kabourek P, Damborsky J, Bednar D. FireProt 2.0: web-based platform for the fully automated design of thermostable proteins. Brief Bioinform 2023;25:bbad425. [PMID: 38018911 PMCID: PMC10685400 DOI: 10.1093/bib/bbad425] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 08/31/2023] [Revised: 10/25/2023] [Accepted: 11/01/2023] [Indexed: 11/30/2023] Open

Kunka A, Marques SM, Havlasek M, Vasina M, Velatova N, Cengelova L, Kovar D, Damborsky J, Marek M, Bednar D, Prokop Z. Advancing Enzyme's Stability and Catalytic Efficiency through Synergy of Force-Field Calculations, Evolutionary Analysis, and Machine Learning. ACS Catal 2023;13:12506-12518. [PMID: 37822856 PMCID: PMC10563018 DOI: 10.1021/acscatal.3c02575] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 06/06/2023] [Revised: 08/24/2023] [Indexed: 10/13/2023]

Affiliation(s)

Antonin Kunka Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic; International Clinical Research Center, St. Anne’s University Hospital, Brno 601 77, Czech Republic
Sérgio M. Marques Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic; International Clinical Research Center, St. Anne’s University Hospital, Brno 601 77, Czech Republic
Martin Havlasek Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic
Michal Vasina Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic; International Clinical Research Center, St. Anne’s University Hospital, Brno 601 77, Czech Republic
Nikola Velatova Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic
Lucia Cengelova Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic
David Kovar Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic; International Clinical Research Center, St. Anne’s University Hospital, Brno 601 77, Czech Republic
Jiri Damborsky Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic; International Clinical Research Center, St. Anne’s University Hospital, Brno 601 77, Czech Republic
Martin Marek Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic; International Clinical Research Center, St. Anne’s University Hospital, Brno 601 77, Czech Republic
David Bednar Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic; International Clinical Research Center, St. Anne’s University Hospital, Brno 601 77, Czech Republic
Zbynek Prokop Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 601 77, Czech Republic; International Clinical Research Center, St. Anne’s University Hospital, Brno 601 77, Czech Republic

Azmi MB, Khan W, Azim MK, Nisar MI, Jehan F. Identification of potential therapeutic intervening targets by in-silico analysis of nsSNPs in preterm birth-related genes. PLoS One 2023;18:e0280305. [PMID: 36881567 PMCID: PMC9990928 DOI: 10.1371/journal.pone.0280305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/23/2022] [Accepted: 12/27/2022] [Indexed: 03/08/2023] Open

Abstract

Prematurity is the foremost cause of death in children under 5 years of age. Genetics contributes to 25-40% of all preterm births (PTB), yet we still need to identify specific targets for intervention based on genetic pathways. This study examined the effect of region-specific non-synonymous variations and their transcript-level mutational impact on protein functioning and stability using various in-silico tools. This investigation identifies potential therapeutic targets to manage the challenge of PTB and the corresponding protein cavities, and explores their binding interactions with intervening compounds. We searched 20 genes coding 55 PTB proteins from NCBI. Single Nucleotide Polymorphisms (SNPs) of the concerned genes were extracted from ENSEMBL, and filtration of exonic variants (non-synonymous) was performed. Several in-silico downstream protein functional effect prediction tools were used to identify damaging variants. Rare coding variants were selected with an allele frequency of ≤1% in 1KGD, further supported by South Asian ALFA frequencies and the GTEx gene/tissue expression database. CNN1, COL24A1, IQGAP2 and SLIT2 were identified with 7 rare pathogenic variants found in 17 transcript sequences. The functional impact analyses of rs532147352 (R>H) of CNN1 computed through the PhD-SNP, PROVEAN, SNP&GO, PMut and MutPred2 algorithms showed impending deleterious effects, and the presence of this pathogenic mutation in CNN1 resulted in a large decrease in protein structural stability (ΔΔG, kcal/mol). After structural protein identification, homology modelling of CNN1, which has been previously reported as a biomarker for the prediction of PTB, was performed, followed by stereochemical quality checks of the 3D model. A blind docking approach was used to search the binding cavities and molecular interactions with progesterone, ranked with energetic estimations. Molecular interactions of CNN1 with progesterone were investigated through LigPlot 2D. Further, molecular docking experimentation of CNN1 showed significant interactions at S102, L105, A106, K123, Y124 with five selected PTB-drugs: Allylestrenol (-7.56 kcal/mol), Hydroxyprogesterone caproate (-8.19 kcal/mol), Retosiban (-9.43 kcal/mol), Ritodrine (-7.39 kcal/mol) and Terbutaline (-6.87 kcal/mol). The Calponin-1 gene and its molecular interaction analysis could serve as an intervention target for the prevention of PTB.

Affiliation(s)

Muhammad Bilal Azmi Department of Biochemistry, Dow Medical College, Dow University of Health Sciences, Karachi, Pakistan; Department of Biosciences, Faculty of Life Sciences, Mohammad Ali Jinnah University, Karachi, Pakistan
Waqasuddin Khan Biorepository and Omics Research Group, Department of Pediatrics and Child Health, Faculty of Health Sciences, Medical College, The Aga Khan University, Karachi, Pakistan; Department of Pediatrics and Child Health, Faculty of Health Sciences, Medical College, The Aga Khan University, Karachi, Pakistan; CITRIC Center for Bioinformatics and Computational Biology, Department of Pediatrics and Child Health, Faculty of Health Sciences, Medical College, The Aga Khan University, Karachi, Pakistan
M. Kamran Azim Department of Biosciences, Faculty of Life Sciences, Mohammad Ali Jinnah University, Karachi, Pakistan
Muhammad Imran Nisar Biorepository and Omics Research Group, Department of Pediatrics and Child Health, Faculty of Health Sciences, Medical College, The Aga Khan University, Karachi, Pakistan; Department of Pediatrics and Child Health, Faculty of Health Sciences, Medical College, The Aga Khan University, Karachi, Pakistan; CITRIC Center for Bioinformatics and Computational Biology, Department of Pediatrics and Child Health, Faculty of Health Sciences, Medical College, The Aga Khan University, Karachi, Pakistan
Fyezah Jehan Biorepository and Omics Research Group, Department of Pediatrics and Child Health, Faculty of Health Sciences, Medical College, The Aga Khan University, Karachi, Pakistan; Department of Pediatrics and Child Health, Faculty of Health Sciences, Medical College, The Aga Khan University, Karachi, Pakistan; CITRIC Center for Bioinformatics and Computational Biology, Department of Pediatrics and Child Health, Faculty of Health Sciences, Medical College, The Aga Khan University, Karachi, Pakistan

Patra P, B R D, Kundu P, Das M, Ghosh A. Recent advances in machine learning applications in metabolic engineering. Biotechnol Adv 2023;62:108069. [PMID: 36442697 DOI: 10.1016/j.biotechadv.2022.108069] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Received: 07/17/2022] [Revised: 10/18/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022]

Abstract

Metabolic engineering encompasses several widely-used strategies, which currently hold a high seat in the field of biotechnology as its potential manifests through a plethora of research and commercial products with a strong societal impact. The genomic revolution that occurred almost three decades ago initiated the generation of large omics datasets, which have helped in gaining a better understanding of cellular behavior. The itinerary of metabolic engineering based on these large datasets has allowed researchers to gain detailed insights and a reasonable understanding of the intricacies of biosystems. However, the existing trial-and-error approaches for metabolic engineering are laborious and time-intensive when it comes to the production of target compounds with high yields through genetic manipulations in host organisms. Machine learning (ML) coupled with the available metabolic engineering test instances and omics data brings a comprehensive and multidisciplinary approach that enables scientists to evaluate various parameters for effective strain design. This vast amount of biological data should be standardized through knowledge engineering to train different ML models for providing accurate predictions in gene circuit design, modification of proteins, optimization of bioprocess parameters for scaling up, and screening of hyper-producing robust cell factories. This review briefs on the premise of ML, followed by mentioning various ML methods and algorithms alongside the numerous omics datasets available to train ML models for predicting metabolic outcomes with high accuracy. The combinative interplay between the ML algorithms and biological datasets through knowledge engineering has guided the recent advancements in applications such as CRISPR/Cas systems, gene circuits, protein engineering, metabolic pathway reconstruction, and bioprocess engineering. Finally, this review addresses the probable challenges of applying ML in metabolic engineering, which will guide researchers toward novel techniques to overcome the limitations.

Gong J, Wang J, Zong X, Ma Z, Xu D. Prediction of protein stability changes upon single-point variant using 3D structure profile. Comput Struct Biotechnol J 2022;21:354-364. [PMID: 36582438 PMCID: PMC9791599 DOI: 10.1016/j.csbj.2022.12.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 09/01/2022] [Revised: 12/04/2022] [Accepted: 12/05/2022] [Indexed: 12/13/2022] Open

Maharaj A, Güran T, Buonocore F, Achermann JC, Metherell L, Prasad R, Çetinkaya S. Insights From Long-term Follow-up of a Girl With Adrenal Insufficiency and Sphingosine-1-Phosphate Lyase Deficiency. J Endocr Soc 2022;6:bvac020. [PMID: 35308304 PMCID: PMC8926068 DOI: 10.1210/jendso/bvac020] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 11/29/2021] [Indexed: 11/21/2022] Open

Structural Consequence of Non-Synonymous Single-Nucleotide Variants in the N-Terminal Domain of LIS1. Int J Mol Sci 2022;23:ijms23063109. [PMID: 35328531 PMCID: PMC8955593 DOI: 10.3390/ijms23063109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/08/2022] [Revised: 03/10/2022] [Accepted: 03/11/2022] [Indexed: 02/04/2023] Open

Tarapara B, Shah F. An in-silico analysis to identify structural, functional and regulatory role of SNPs in hMRE11. J Biomol Struct Dyn 2022;41:2160-2174. [PMID: 35048780 DOI: 10.1080/07391102.2022.2028678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/19/2022]

Computational design of a cutinase for plastic biodegradation by mining molecular dynamics simulations trajectories. Comput Struct Biotechnol J 2022;20:459-470. [PMID: 35070168 PMCID: PMC8761609 DOI: 10.1016/j.csbj.2021.12.042] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Received: 11/10/2021] [Revised: 12/29/2021] [Accepted: 12/30/2021] [Indexed: 11/24/2022] Open

Sigamani V, Rajasingh S, Gurusamy N, Panda A, Rajasingh J. In-Silico and In-Vitro Analysis of Human SOS1 Protein Causing Noonan Syndrome - A Novel Approach to Explore the Molecular Pathways. Curr Genomics 2021;22:526-540. [PMID: 35386434 PMCID: PMC8905634 DOI: 10.2174/1389202922666211130144221] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 08/13/2021] [Revised: 11/02/2021] [Accepted: 11/03/2021] [Indexed: 11/22/2022] Open

Udosen B, Soremekun O, Ekenna C, Idowu Omotuyi O, Chikowore T, Nashiru O, Fatumo S. In-silico analysis reveals druggable single nucleotide polymorphisms in angiotensin 1 converting enzyme involved in the onset of blood pressure. BMC Res Notes 2021;14:457. [PMID: 34930451 PMCID: PMC8686250 DOI: 10.1186/s13104-021-05879-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 08/05/2021] [Accepted: 12/06/2021] [Indexed: 12/29/2022] Open

Artificial intelligence challenges for predicting the impact of mutations on protein stability. Curr Opin Struct Biol 2021;72:161-168. [PMID: 34922207 DOI: 10.1016/j.sbi.2021.11.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Received: 06/30/2021] [Revised: 09/15/2021] [Accepted: 11/08/2021] [Indexed: 01/17/2023]

Ranjan P, Das P. Understanding the impact of missense mutations on the structure and function of the EDA gene in X-linked hypohidrotic ectodermal dysplasia: A bioinformatics approach. J Cell Biochem 2021;123:431-449. [PMID: 34817077 DOI: 10.1002/jcb.30186] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/15/2021] [Revised: 11/05/2021] [Accepted: 11/10/2021] [Indexed: 12/19/2022]

Identifying the impact of structurally and functionally high-risk nonsynonymous SNPs on human patched protein using in-silico approach. GENE REPORTS 2021. [DOI: 10.1016/j.genrep.2021.101097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/19/2022]

Iqbal S, Li F, Akutsu T, Ascher DB, Webb GI, Song J. Assessing the performance of computational predictors for estimating protein stability changes upon missense mutations. Brief Bioinform 2021;22:6289890. [PMID: 34058752 DOI: 10.1093/bib/bbab184] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 02/08/2021] [Revised: 04/07/2021] [Accepted: 04/21/2021] [Indexed: 11/14/2022] Open

Abstract

Understanding how a mutation might affect protein stability is of significant importance to protein engineering and for understanding protein evolution and genetic diseases. While a number of computational tools have been developed to predict the effect of missense mutations on protein stability, they are known to exhibit large biases imparted in part by the data used to train and evaluate them. Here, we provide a comprehensive overview of predictive tools, which has provided an evolving insight into the importance and relevance of features that can discern the effects of mutations on protein stability. A diverse selection of these freely available tools was benchmarked using a large mutation-level blind dataset of 1342 experimentally characterised mutations across 130 proteins from ThermoMutDB, a second test dataset encompassing 630 experimentally characterised mutations across 39 proteins from iStable2.0 and a third blind test dataset consisting of 268 mutations in 27 proteins from the newly published ProThermDB. The performance of the methods was further evaluated with respect to the site of mutation, the type of mutant residue and across ranges of pH and temperature. Additionally, the classification performance was also evaluated by classifying the mutations as stabilizing (∆∆G ≥ 0) or destabilizing (∆∆G < 0). The results reveal that the performance of the predictors is affected by the site of mutation and the type of mutant residue. Further, the results show very low performance for pH values 6-8 and temperature higher than 65 for all predictors except iStable2.0 on the S630 dataset. To illustrate how stability and structure change upon single point mutation, we considered four stabilizing, two destabilizing and two stabilizing mutations from two proteins, namely the toxin protein and bovine liver cytochrome. Overall, the results on S268, S630 and S1342 datasets show that the performance of the integrated predictors is better than the mechanistic or individual machine learning predictors. We expect that this paper will provide useful guidance for the design and development of next-generation bioinformatic tools for predicting protein stability changes upon mutations.
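
The sign convention stated in the abstract above (stabilizing when ∆∆G ≥ 0, destabilizing when ∆∆G < 0) can be illustrated with a minimal Python sketch. This only shows the thresholding rule as quoted; the function name `classify_ddg` and the example values are hypothetical and are not taken from the paper or from any of the benchmarked tools. Other predictors may report ∆∆G with the opposite sign, so the threshold direction should be checked against each tool's documentation.

```python
def classify_ddg(ddg_kcal_per_mol: float) -> str:
    """Label a mutation by the sign of its stability change.

    Follows the convention quoted in the abstract:
    ddG >= 0 is stabilizing, ddG < 0 is destabilizing.
    """
    return "stabilizing" if ddg_kcal_per_mol >= 0 else "destabilizing"


# Hypothetical ddG values in kcal/mol, for illustration only.
example_ddg = [1.2, 0.0, -0.8]
labels = [classify_ddg(value) for value in example_ddg]
print(labels)
```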

Ye Z, Yang W, Yang Y, Ouyang D. Interpretable machine learning methods for in vitro pharmaceutical formulation development. FOOD FRONTIERS 2021. [DOI: 10.1002/fft2.78] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Indexed: 12/21/2022] Open

Chang J, Zhang C, Cheng H, Tan YW. Rational Design of Adenylate Kinase Thermostability through Coevolution and Sequence Divergence Analysis. Int J Mol Sci 2021;22:ijms22052768. [PMID: 33803409 PMCID: PMC7967156 DOI: 10.3390/ijms22052768] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/01/2021] [Revised: 03/04/2021] [Accepted: 03/05/2021] [Indexed: 01/09/2023] Open

Ajadi MB, Soremekun OS, Adewumi AT, Kumalo HM, Soliman MES. Functional Analysis of Single Nucleotide Polymorphism in ZUFSP Protein and Implication in Pathogenesis. Protein J 2021;40:28-40. [PMID: 33512633 DOI: 10.1007/s10930-021-09962-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/04/2021] [Indexed: 11/25/2022]

Planas-Iglesias J, Marques SM, Pinto GP, Musil M, Stourac J, Damborsky J, Bednar D. Computational design of enzymes for biotechnological applications. Biotechnol Adv 2021;47:107696. [PMID: 33513434 DOI: 10.1016/j.biotechadv.2021.107696] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Received: 10/07/2020] [Revised: 01/12/2021] [Accepted: 01/13/2021] [Indexed: 12/14/2022]

Affiliation(s)

Joan Planas-Iglesias Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Sérgio M Marques Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Gaspar P Pinto Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Milos Musil Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic; IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 61266 Brno, Czech Republic
Jan Stourac Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Jiri Damborsky Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic.
David Bednar Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic.

Sarkar A, Yang Y, Vihinen M. Variation benchmark datasets: update, criteria, quality and applications. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020;2020:5710862. [PMID: 32016318 PMCID: PMC6997940 DOI: 10.1093/database/baz117] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Received: 04/12/2019] [Revised: 06/03/2019] [Accepted: 07/01/2019] [Indexed: 02/07/2023]

Abstract

Development of new computational methods and testing their performance has to be carried out using experimental data. Only in comparison to existing knowledge can method performance be assessed. For that purpose, benchmark datasets with known and verified outcome are needed. High-quality benchmark datasets are valuable and may be difficult, laborious and time consuming to generate. VariBench and VariSNP are the two existing databases for sharing variation benchmark datasets used mainly for variation interpretation. They have been used for training and benchmarking predictors for various types of variations and their effects. VariBench was updated with 419 new datasets from 109 papers containing altogether 329 014 152 variants; however, there is plenty of redundancy between the datasets. VariBench is freely available at http://structure.bmc.lu.se/VariBench/. The contents of the datasets vary depending on information in the original source. The available datasets have been categorized into 20 groups and subgroups. There are datasets for insertions and deletions, substitutions in coding and non-coding region, structure mapped, synonymous and benign variants. Effect-specific datasets include DNA regulatory elements, RNA splicing, and protein property for aggregation, binding free energy, disorder and stability. Then there are several datasets for molecule-specific and disease-specific applications, as well as one dataset for variation phenotype effects. Variants are often described at three molecular levels (DNA, RNA and protein) and sometimes also at the protein structural level including relevant cross references and variant descriptions. The updated VariBench facilitates development and testing of new methods and comparison of obtained performances to previously published methods. 
We compared the performance of the pathogenicity/tolerance predictor PON-P2 to several benchmark studies, and show that such comparisons are feasible and useful; however, there may be limitations due to lack of provided details and shared data.

Database URL: http://structure.bmc.lu.se/VariBench

Soremekun OS, Ezenwa C, Isewon I, Soliman M, Idowu O, Nashiru O, Fatumo S. Computational and drug target analysis of functional single nucleotide polymorphisms associated with Haemoglobin Subunit Beta (HBB) gene. Comput Biol Med 2020;125:104018. [PMID: 33022520 DOI: 10.1016/j.compbiomed.2020.104018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/12/2020] [Revised: 09/13/2020] [Accepted: 09/20/2020] [Indexed: 10/23/2022]

Current pivotal strategies leading a difficult target protein to a sample suitable for crystallographic analysis. Biochem Soc Trans 2020;48:1661-1673. [PMID: 32677661 DOI: 10.1042/bst20200106] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/16/2020] [Revised: 06/26/2020] [Accepted: 06/30/2020] [Indexed: 12/15/2022]

Sanavia T, Birolo G, Montanucci L, Turina P, Capriotti E, Fariselli P. Limitations and challenges in protein stability prediction upon genome variations: towards future applications in precision medicine. Comput Struct Biotechnol J 2020;18:1968-1979. [PMID: 32774791 PMCID: PMC7397395 DOI: 10.1016/j.csbj.2020.07.011] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Received: 03/15/2020] [Revised: 07/10/2020] [Accepted: 07/14/2020] [Indexed: 12/13/2022] Open

Proteus: An algorithm for proposing stabilizing mutation pairs based on interactions observed in known protein 3D structures. BMC Bioinformatics 2020;21:275. [PMID: 32611389 PMCID: PMC7330979 DOI: 10.1186/s12859-020-03575-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 12/13/2019] [Accepted: 05/28/2020] [Indexed: 11/10/2022] Open

Abstract

Background

Protein engineering has many applications for industry, such as the development of new drugs, vaccines, treatment therapies, food, and biofuel production. A common way to engineer a protein is to perform mutations in functionally essential residues to optimize their function. However, the discovery of beneficial mutations for proteins is a complex task, and experimental validation is time-consuming and costly. Hence, computational approaches have been used to propose new insights for experiments, narrowing the search space and reducing the costs.

Results

In this study, we developed Proteus (an acronym for Protein Engineering Supporter), a new algorithm for proposing mutation pairs in a target 3D structure. These suggestions are based on contacts observed in other known structures from the Protein Data Bank (PDB). Proteus’ basic assumption is that if a non-interacting pair of amino acid residues in the target structure is exchanged to an interacting pair, this could enhance protein stability. This trade is only allowed if the main-chain conformation of the residues involved in the contact is conserved. Furthermore, no steric impediment is expected between the proposed mutations and the surrounding protein atoms. To evaluate Proteus, we performed two case studies with proteins of industrial interest. In the first case study, we evaluated whether the mutations suggested by Proteus for four protein structures enhance the number of inter-residue contacts. Our results suggest that most mutations proposed by Proteus increase the number of interactions within the protein. In the second case study, we used Proteus to suggest mutations for a lysozyme protein. Then, we compared Proteus’ outcomes to mutations with available experimental evidence reported in the ProTherm database. Four mutations, in which our results agree with the experimental data, were found. This could be initial evidence that changes in the side-chain of some residues do not cause disturbances that harm protein structure stability.

Conclusion

We believe that Proteus could be used in combination with other methods to give new insights into the rational development of engineered proteins. The Proteus user-friendly web-based tool is available at <http://proteus.dcc.ufmg.br>.

Nisar H, Pasha U, Mirza MU, Abid R, Hanif K, Kadarmideen HN, Sadaf S. Impact of IL-17F 7488T/C Functional Polymorphism on Progressive Rheumatoid Arthritis: Novel Insight from the Molecular Dynamic Simulations. Immunol Invest 2020;50:416-426. [PMID: 32543936 DOI: 10.1080/08820139.2020.1775642] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Indexed: 12/15/2022]

Sharma A, Kaur S, Duseja A, Changotra H. The autophagy gene ATG16L1 (T300A) variant is associated with the risk and progression of HBV infection. INFECTION GENETICS AND EVOLUTION 2020;84:104404. [PMID: 32526369 DOI: 10.1016/j.meegid.2020.104404] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 03/28/2020] [Revised: 05/12/2020] [Accepted: 06/03/2020] [Indexed: 12/01/2022]

Marabotti A, Scafuri B, Facchiano A. Predicting the stability of mutant proteins by computational approaches: an overview. Brief Bioinform 2020;22:5850907. [PMID: 32496523 DOI: 10.1093/bib/bbaa074] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Received: 02/12/2020] [Revised: 04/07/2020] [Accepted: 04/10/2020] [Indexed: 01/06/2023] Open

Lv X, Chen J, Lu Y, Chen Z, Xiao N, Yang Y. Accurately Predicting Mutation-Caused Stability Changes from Protein Sequences Using Extreme Gradient Boosting. J Chem Inf Model 2020;60:2388-2395. [PMID: 32203653 DOI: 10.1021/acs.jcim.0c00064] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Indexed: 12/14/2022]

Chen CW, Lin MH, Liao CC, Chang HP, Chu YW. iStable 2.0: Predicting protein thermal stability changes by integrating various characteristic modules. Comput Struct Biotechnol J 2020;18:622-630. [PMID: 32226595 PMCID: PMC7090336 DOI: 10.1016/j.csbj.2020.02.021] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Received: 10/15/2019] [Revised: 02/25/2020] [Accepted: 02/27/2020] [Indexed: 11/15/2022] Open

Abstract

Protein mutations can lead to structural changes that affect protein function and result in disease occurrence. In protein engineering, drug design, and optimization industries, mutations are often used to improve protein stability or to change protein properties while maintaining stability. To provide possible candidates for novel protein design, several computational tools for predicting protein stability changes have been developed. Although many prediction tools are available, each tool employs different algorithms and features. This can produce conflicting prediction results that make it difficult for users to decide upon the correct protein design. Therefore, this study proposes an integrated prediction tool, iStable 2.0, which integrates 11 sequence-based and structure-based prediction tools by machine learning and adds protein sequence information as features. Three coding modules are designed for the system, an Online Server Module, a Stand-alone Module and a Sequence Coding Module, to improve the prediction performance of the previous version of the system. The final integrated structure-based classification model has a higher Matthews correlation coefficient than that of the single prediction tool (0.708 vs 0.547, respectively), and the Pearson correlation coefficient of the regression model likewise improves from 0.669 to 0.714. The sequence-based model not only successfully integrates off-the-shelf predictors but also improves the Matthews correlation coefficient of the best single prediction tool by at least 0.161, which is better than the individual structure-based prediction tools. In addition, both the Sequence Coding Module and the Stand-alone Module maintain performance with only a 5% decrease of the Matthews correlation coefficient when the integrated online tools are unavailable. iStable 2.0 is available at http://ncblab.nchu.edu.tw/iStable2.

Lessel I, Chen MJ, Lüttgen S, Arndt F, Fuchs S, Meien S, Thiele H, Jones JR, Shaw BR, Crossman DK, Nürnberg P, Korf BR, Kubisch C, Lessel D. Two novel cases further expand the phenotype of TOR1AIP1-associated nuclear envelopathies. Hum Genet 2020;139:483-498. [PMID: 32055997 PMCID: PMC7078146 DOI: 10.1007/s00439-019-02105-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 11/02/2019] [Accepted: 12/22/2019] [Indexed: 12/19/2022]

Affiliation(s)

Ivana Lessel Institute of Human Genetics, University Medical Center Hamburg-Eppendorf, Martinistrasse 52, 20246, Hamburg, Germany
Mei-Jan Chen Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, 36394, USA
Sabine Lüttgen Institute of Human Genetics, University Medical Center Hamburg-Eppendorf, Martinistrasse 52, 20246, Hamburg, Germany
Florian Arndt Department for Pediatric Cardiology, University Heart Center Hamburg, University Medical Center Hamburg-Eppendorf, 20246, Hamburg, Germany
Sigrid Fuchs Institute of Human Genetics, University Medical Center Hamburg-Eppendorf, Martinistrasse 52, 20246, Hamburg, Germany
Stefanie Meien Institute of Human Genetics, University Medical Center Hamburg-Eppendorf, Martinistrasse 52, 20246, Hamburg, Germany
Holger Thiele Cologne Center for Genomics, University of Cologne, 50931, Cologne, Germany
Julie R Jones Molecular Diagnostic Laboratory, Greenwood Genetic Center, Greenwood, SC, 29646, USA
Brandon R Shaw Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, 36394, USA
David K Crossman Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, 36394, USA
Peter Nürnberg Cologne Center for Genomics, University of Cologne, 50931, Cologne, Germany; Center for Molecular Medicine Cologne, University of Cologne, 50931, Cologne, Germany; Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases, University of Cologne, 50931, Cologne, Germany
Bruce R Korf Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, 36394, USA
Christian Kubisch Institute of Human Genetics, University Medical Center Hamburg-Eppendorf, Martinistrasse 52, 20246, Hamburg, Germany
Davor Lessel Institute of Human Genetics, University Medical Center Hamburg-Eppendorf, Martinistrasse 52, 20246, Hamburg, Germany.

Spielmann A, Brack Y, van Beek H, Flachbart L, Sundermeyer L, Baumgart M, Bott M. NADPH biosensor-based identification of an alcohol dehydrogenase variant with improved catalytic properties caused by a single charge reversal at the protein surface. AMB Express 2020;10:14. [PMID: 31955268 PMCID: PMC6969876 DOI: 10.1186/s13568-020-0946-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 08/20/2019] [Accepted: 01/06/2020] [Indexed: 01/29/2023] Open

Pandurangan AP, Blundell TL. Prediction of impacts of mutations on protein structure and interactions: SDM, a statistical approach, and mCSM, using machine learning. Protein Sci 2020;29:247-257. [PMID: 31693276 PMCID: PMC6933854 DOI: 10.1002/pro.3774] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Received: 08/30/2019] [Revised: 10/31/2019] [Accepted: 10/31/2019] [Indexed: 02/02/2023]

Abstract

Next-generation sequencing methods have not only allowed an understanding of genome sequence variation during the evolution of organisms but have also provided invaluable information about genetic variants in inherited disease and the emergence of resistance to drugs in cancers and infectious disease. A challenge is to distinguish mutations that are drivers of disease or drug resistance, from passengers that are neutral or even selectively advantageous to the organism. This requires an understanding of impacts of missense mutations in gene expression and regulation, and on the disruption of protein function by modulating protein stability or disturbing interactions with proteins, nucleic acids, small molecule ligands, and other biological molecules. Experimental approaches to understanding differences between wild-type and mutant proteins are most accurate but are also time-consuming and costly. Computational tools used to predict the impacts of mutations can provide useful information more quickly. Here, we focus on two widely used structure-based approaches, originally developed in the Blundell lab: site-directed mutator (SDM), a statistical approach to analyze amino acid substitutions, and mutation cutoff scanning matrix (mCSM), which uses graph-based signatures to represent the wild-type structural environment and machine learning to predict the effect of mutations on protein stability. Here, we describe DUET that uses machine learning to combine the two approaches. We discuss briefly the development of mCSM for understanding the impacts of mutations on interfaces with other proteins, nucleic acids, and ligands, and we exemplify the wide application of these approaches to understand human genetic disorders and drug resistance mutations relevant to cancer and mycobacterial infections. 
STATEMENT FOR A BROADER AUDIENCE: Genetic or somatic changes in genes can lead to mutations in human proteins, which give rise to genetic disorders or cancer, or to genes of pathogens leading to drug resistance. Computer software described here, using statistical approaches or machine learning, uses the information from genome sequencing of humans and pathogens, together with experimental or modeled 3D structures of gene products, the proteins, to predict impacts of mutations in genetic disease, cancer and drug resistance.

Mazurenko S, Prokop Z, Damborsky J. Machine Learning in Enzyme Engineering. ACS Catal 2019. [DOI: 10.1021/acscatal.9b04321] [Citation(s) in RCA: 134] [Impact Index Per Article: 26.8] [Indexed: 12/17/2022]

Fang X, Huang J, Zhang R, Wang F, Zhang Q, Li G, Yan J, Zhang H, Yan Y, Xu L. Convolution Neural Network-Based Prediction of Protein Thermostability. J Chem Inf Model 2019;59:4833-4843. [PMID: 31657922 DOI: 10.1021/acs.jcim.9b00220] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Indexed: 12/24/2022]

Affiliation(s)

Xingrong Fang Key Laboratory of Molecular Biophysics, Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China
Jinsha Huang Key Laboratory of Molecular Biophysics, Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China
Rui Zhang Editorial Board of the Journal of Wuhan Institute of Technology, Wuhan Institute of Technology, Wuhan 430074, P. R. China
Fei Wang Key Laboratory of Molecular Biophysics, Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China
Qiuyu Zhang Key Laboratory of Molecular Biophysics, Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China
Guanlin Li Key Laboratory of Molecular Biophysics, Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China
Jinyong Yan Key Laboratory of Molecular Biophysics, Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China
Houjin Zhang Key Laboratory of Molecular Biophysics, Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China
Yunjun Yan Key Laboratory of Molecular Biophysics, Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China
Li Xu Key Laboratory of Molecular Biophysics, Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China

Computational analysis of high-risk SNPs in human CHK2 gene responsible for hereditary breast cancer: A functional and structural impact. PLoS One 2019;14:e0220711. [PMID: 31398194 PMCID: PMC6688789 DOI: 10.1371/journal.pone.0220711] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 02/04/2019] [Accepted: 07/22/2019] [Indexed: 12/18/2022] Open

Abstract

CHK2 mutations are now studied frequently in hereditary breast and ovarian cancer patients, in addition to BRCA1/BRCA2. CHK2 is a tumor suppressor gene that encodes a serine/threonine kinase involved in pathways such as DNA repair, cell cycle regulation and apoptosis in response to DNA damage. CHK2 is a well-studied moderate-penetrance gene that correlates with the third high-risk susceptibility gene, with an increased risk for breast cancer. Hence, before planning a large population study, it is better to scrutinize putative functional SNPs of CHK2 using different computational tools. In this study, we have used various computational approaches to identify nsSNPs that are deleterious to the structure and/or function of the CHK2 protein and that might be causing this disease. Computational analysis was performed with different in silico tools including SIFT, Align GVGD, SNAP-2, PROVEAN, Poly-Phen-2, PANTHER, PhD-SNP, MUpro, iPTREE-STAB, Consurf, InterPro, NCBI Conserved Domain Search tool, ModPred, SPARKS-X, RAMPAGE, Verify-3D, FT Site, COACH and PyMol. Out of 78 nsSNPs of the human CHK2 gene, seven were predicted to be the most functionally significant. Among these seven nsSNPs, p.Arg160Gly, p.Gly210Arg and p.Ser415Phe affect highly conserved residues with a conservation score of 9, and three nsSNPs were predicted to be involved in post-translational modification. The p.Arg160Gly and p.Gly210Arg variants may interfere with the phosphopeptide binding site on the FHA conserved domain. The p.Ser415Phe variant may interfere with formation of the activation loop of the protein-kinase domain and with interactions of CHK2 with its ligand. The study concludes that mutation of serine to phenylalanine at position 415 is a major mutation in the native CHK2 protein which might contribute to its malfunction, ultimately causing disease.
This is the first comprehensive study in which CHK2 gene variants are analyzed using in silico tools; hence, it will be of great help when considering large-scale studies and in developing precision medicines related to these polymorphisms in the era of personalized medicine.

Montanucci L, Capriotti E, Frank Y, Ben-Tal N, Fariselli P. DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations. BMC Bioinformatics 2019;20:335. [PMID: 31266447 PMCID: PMC6606456 DOI: 10.1186/s12859-019-2923-1] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Indexed: 12/20/2022] Open

Abstract

Background

Predicting the effect of single point variations on protein stability constitutes a crucial step toward understanding the relationship between protein structure and function. To this end, several methods have been developed to predict changes in the Gibbs free energy of unfolding (∆∆G) between wild type and variant proteins, using sequence and structure information. Most of the available methods however do not exhibit the anti-symmetric prediction property, which guarantees that the predicted ∆∆G value for a variation is the exact opposite of that predicted for the reverse variation, i.e., ∆∆G(A → B) = −∆∆G(B → A), where A and B are amino acids.
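
The anti-symmetry property stated above can be checked numerically for any predictor. The following is a minimal Python sketch under stated assumptions, not the DDGun method: `TOY_SCORE`, `toy_ddg`, `biased_ddg` and `antisymmetry_bias` are hypothetical names, and the per-residue values are made up purely for illustration.

```python
from itertools import permutations

# Hypothetical per-residue scores (illustrative values only, not taken from DDGun).
TOY_SCORE = {"A": 0.3, "G": -0.5, "L": 1.2, "V": 0.9}


def toy_ddg(wild: str, mutant: str) -> float:
    """Toy predictor: a difference of per-residue scores, anti-symmetric by construction."""
    return TOY_SCORE[mutant] - TOY_SCORE[wild]


def biased_ddg(wild: str, mutant: str) -> float:
    """Same toy predictor with a constant offset, which breaks anti-symmetry."""
    return toy_ddg(wild, mutant) - 0.4


def antisymmetry_bias(predict, residues) -> float:
    """Mean of ddG(A->B) + ddG(B->A) over all ordered residue pairs.

    The value is 0 for a perfectly anti-symmetric predictor; a non-zero value
    measures how far the predictor departs from ddG(A->B) = -ddG(B->A).
    """
    pairs = list(permutations(residues, 2))
    return sum(predict(a, b) + predict(b, a) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    print(antisymmetry_bias(toy_ddg, TOY_SCORE))
    print(antisymmetry_bias(biased_ddg, TOY_SCORE))
```

In this sketch the score-difference predictor yields a bias of zero, while the constant offset shifts every direct and reverse prediction in the same direction, so the two no longer cancel.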

Results

Here we introduce simple anti-symmetric features, based on evolutionary information, which are combined to define an untrained method, DDGun (DDG untrained). DDGun is a simple approach based on evolutionary information that predicts the ∆∆G for single and multiple variations from sequence and structure information (DDGun3D). Our method achieves remarkable performance without any training on the experimental datasets, reaching Pearson correlation coefficients between predicted and measured ∆∆G values of ~ 0.5 and ~ 0.4 for single and multiple site variations, respectively. Surprisingly, DDGun performances are comparable with those of state of the art methods. DDGun also naturally predicts multiple site variations, thereby defining a benchmark method for both single site and multiple site predictors. DDGun is anti-symmetric by construction predicting the value of the ∆∆G of a reciprocal variation as almost equal (depending on the sequence profile) to -∆∆G of the direct variation. This is a valuable property that is missing in the majority of the methods.

Conclusions

Evolutionary information alone combined in an untrained method can achieve remarkably high performances in the prediction of ∆∆G upon protein mutation. Non-trained approaches like DDGun represent a valid benchmark both for scoring the predictive power of the individual features and for assessing the learning capability of supervised methods.

Pandurangan AP, Ochoa-Montaño B, Ascher DB, Blundell TL. SDM: a server for predicting effects of mutations on protein stability. Nucleic Acids Res 2017;45:W229-W235. [PMID: 28525590 PMCID: PMC5793720 DOI: 10.1093/nar/gkx439] [Citation(s) in RCA: 320] [Impact Index Per Article: 64.0] [Received: 02/10/2017] [Accepted: 05/15/2017] [Indexed: 02/02/2023] Open

Khabou B, Trigui A, Boudawara TS, Keskes L, Kamoun H, Barbu V, Fakhfakh F. A homozygous ABCB4 mutation causing an LPAC syndrome evolves into cholangiocarcinoma. Clin Chim Acta 2019;495:598-605. [PMID: 31181191 DOI: 10.1016/j.cca.2019.06.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 03/04/2019] [Revised: 05/16/2019] [Accepted: 06/06/2019] [Indexed: 02/08/2023]

Industrial Cyber-Physical System Evolution Detection and Alert Generation. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9081586] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Indexed: 11/17/2022]

Das SS, Chakravorty N. Identification of deleterious SNPs and their effects on BCL11A, the master regulator of fetal hemoglobin expression. Genomics 2019;112:397-403. [PMID: 30853596 DOI: 10.1016/j.ygeno.2019.03.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 09/07/2018] [Revised: 01/14/2019] [Accepted: 03/04/2019] [Indexed: 12/18/2022]

Desai M, Chauhan JB. Predicting the functional and structural consequences of nsSNPs in human methionine synthase gene using computational tools. Syst Biol Reprod Med 2019;65:288-300. [DOI: 10.1080/19396368.2019.1568611] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 02/06/2023]

Doh CY, Li J, Mamidi R, Stelzer JE. The HCM-causing Y235S cMyBPC mutation accelerates contractile function by altering C1 domain structure. Biochim Biophys Acta Mol Basis Dis 2019;1865:661-677. [PMID: 30611859 DOI: 10.1016/j.bbadis.2019.01.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 08/20/2018] [Revised: 12/18/2018] [Accepted: 01/02/2019] [Indexed: 12/20/2022]

Chen CW, Chang KP, Ho CW, Chang HP, Chu YW. KStable: A Computational Method for Predicting Protein Thermal Stability Changes by K-Star with Regular-mRMR Feature Selection. ENTROPY 2018;20:e20120988. [PMID: 33266711 PMCID: PMC7512587 DOI: 10.3390/e20120988] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 11/21/2018] [Revised: 12/11/2018] [Accepted: 12/16/2018] [Indexed: 11/24/2022]