101
|
Eaton AF, Brown D, Merkulova M. The evolutionary conserved TLDc domain defines a new class of (H +)V-ATPase interacting proteins. Sci Rep 2021; 11:22654. [PMID: 34811399 PMCID: PMC8608904 DOI: 10.1038/s41598-021-01809-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 11/02/2021] [Indexed: 01/26/2023] Open
Abstract
We recently found that nuclear receptor coactivator 7 (Ncoa7) and Oxr1 interact with the proton-pumping V-ATPase. Ncoa7 and Oxr1 belong to a group of proteins playing a role in the oxidative stress response, that contain the conserved “TLDc” domain. Here we asked if the three other proteins in this family, i.e., Tbc1d24, Tldc1 and Tldc2 also interact with the V-ATPase and if the TLDc domains are involved in all these interactions. By co-immunoprecipitation, endogenous kidney Tbc1d24 (and Ncoa7 and Oxr1) and overexpressed Tldc1 and Tldc2, all interacted with the V-ATPase. In addition, purified TLDc domains of Ncoa7, Oxr1 and Tldc2 (but not Tbc1d24 or Tldc1) interacted with V-ATPase in GST pull-downs. At the amino acid level, point mutations G815A, G845A and G896A in conserved regions of the Ncoa7 TLDc domain abolished interaction with the V-ATPase, and S817A, L926A and E938A mutations resulted in decreased interaction. Furthermore, poly-E motifs upstream of the TLDc domain in Ncoa7 and Tldc2 show a (nonsignificant) trend towards enhancing the interaction with V-ATPase. Our principal finding is that all five members of the TLDc family of proteins interact with the V-ATPase. We conclude that the TLDc motif defines a new class of V-ATPase interacting regulatory proteins.
Collapse
Affiliation(s)
- A F Eaton
- Program in Membrane Biology and Division of Nephrology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, 02114, USA
| | - D Brown
- Program in Membrane Biology and Division of Nephrology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, 02114, USA
| | - M Merkulova
- Program in Membrane Biology and Division of Nephrology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, 02114, USA. .,Program in Membrane Biology and Division of Nephrology, Massachusetts General Hospital, Simches Research Center, 128 Cambridge St., Boston, MA, 02114, USA.
| |
Collapse
|
102
|
Modeling coronavirus spike protein dynamics: implications for immunogenicity and immune escape. Biophys J 2021; 120:5592-5618. [PMID: 34767789 PMCID: PMC8577870 DOI: 10.1016/j.bpj.2021.11.009] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 03/19/2021] [Accepted: 11/04/2021] [Indexed: 12/23/2022] Open
Abstract
The ongoing COVID-19 pandemic is a global public health emergency requiring urgent development of efficacious vaccines. While concentrated research efforts have focused primarily on antibody-based vaccines that neutralize SARS-CoV-2, and several first-generation vaccines have either been approved or received emergency use authorization, it is forecasted that COVID-19 will become an endemic disease requiring updated second-generation vaccines. The SARS-CoV-2 surface spike (S) glycoprotein represents a prime target for vaccine development because antibodies that block viral attachment and entry, i.e., neutralizing antibodies, bind almost exclusively to the receptor-binding domain. Here, we develop computational models for a large subset of S proteins associated with SARS-CoV-2, implemented through coarse-grained elastic network models and normal mode analysis. We then analyze local protein domain dynamics of the S protein systems and their thermal stability to characterize structural and dynamical variability among them. These results are compared against existing experimental data and used to elucidate the impact and mechanisms of SARS-CoV-2 S protein mutations and their associated antibody binding behavior. We construct a SARS-CoV-2 antigenic map and offer predictions about the neutralization capabilities of antibody and S mutant combinations based on protein dynamic signatures. We then compare SARS-CoV-2 S protein dynamics to SARS-CoV and MERS-CoV S proteins to investigate differing antibody binding and cellular fusion mechanisms that may explain the high transmissibility of SARS-CoV-2. The outbreaks associated with SARS-CoV, MERS-CoV, and SARS-CoV-2 over the last two decades suggest that the threat presented by coronaviruses is ever-changing and long term. Our results provide insights into the dynamics-driven mechanisms of immunogenicity associated with coronavirus S proteins and present a new, to our knowledge, approach to characterize and screen potential mutant candidates for immunogen design, as well as to characterize emerging natural variants that may escape vaccine-induced antibody responses.
Collapse
|
103
|
Kar RK. Computational Resources for Bioscience Education. Appl Biochem Biotechnol 2021; 193:3418-3424. [PMID: 34101113 PMCID: PMC8184869 DOI: 10.1007/s12010-021-03601-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 05/28/2021] [Indexed: 11/29/2022]
Abstract
With the ongoing laboratory restrictions, it is often challenging for bioscience students to make satisfactory progress in their projects. A long-standing practice in multi-disciplinary research is to use computational and theoretical method to corroborate with experiment findings. In line with the lack of opportunity to access laboratory instruments, the pandemic situation is a win-win scenario for scholars to focus on computational methods. This communication outline some of the standalone tools and webservers that bioscience students can successfully learn and adopt to obtain in-depth insights into biochemistry, biophysics, biotechnology, and bioengineering research work.
Collapse
Affiliation(s)
- Rajiv K Kar
- Faculty II-Mathematics and Natural Sciences, Technische Universität Berlin, Sekr. PC 14, Strasse des 17. Juni 135, D-10623, Berlin, Germany.
| |
Collapse
|
104
|
Samaga YBL, Raghunathan S, Priyakumar UD. SCONES: Self-Consistent Neural Network for Protein Stability Prediction Upon Mutation. J Phys Chem B 2021; 125:10657-10671. [PMID: 34546056 DOI: 10.1021/acs.jpcb.1c04913] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Engineering proteins to have desired properties by mutating amino acids at specific sites is commonplace. Such engineered proteins must be stable to function. Experimental methods used to determine stability at throughputs required to scan the protein sequence space thoroughly are laborious. To this end, many machine learning based methods have been developed to predict thermodynamic stability changes upon mutation. These methods have been evaluated for symmetric consistency by testing with hypothetical reverse mutations. In this work, we propose transitive data augmentation, evaluating transitive consistency with our new Stransitive data set, and a new machine learning based method, the first of its kind, that incorporates both symmetric and transitive properties into the architecture. Our method, called SCONES, is an interpretable neural network that predicts small relative protein stability changes for missense mutations that do not significantly alter the structure. It estimates a residue's contributions toward protein stability (ΔG) in its local structural environment, and the difference between independently predicted contributions of the reference and mutant residues is reported as ΔΔG. We show that this self-consistent machine learning architecture is immune to many common biases in data sets, relies less on data than existing methods, is robust to overfitting, and can explain a substantial portion of the variance in experimental data.
Collapse
Affiliation(s)
- Yashas B L Samaga
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad 500 032, India
| | - Shampa Raghunathan
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad 500 032, India
| | - U Deva Priyakumar
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad 500 032, India
| |
Collapse
|
105
|
Jeon S, Blazyte A, Yoon C, Ryu H, Jeon Y, Bhak Y, Bolser D, Manica A, Shin ES, Cho YS, Kim BC, Ryoo N, Choi H, Bhak J. Regional TMPRSS2 V197M Allele Frequencies Are Correlated with COVID-19 Case Fatality Rates. Mol Cells 2021; 44:680-687. [PMID: 34588322 PMCID: PMC8490206 DOI: 10.14348/molcells.2021.2249] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 06/14/2021] [Accepted: 07/10/2021] [Indexed: 02/08/2023] Open
Abstract
Coronavirus disease, COVID-19 (coronavirus disease 2019), caused by SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), has a higher case fatality rate in European countries than in others, especially East Asian ones. One potential explanation for this regional difference is the diversity of the viral infection efficiency. Here, we analyzed the allele frequencies of a nonsynonymous variant rs12329760 (V197M) in the TMPRSS2 gene, a key enzyme essential for viral infection and found a significant association between the COVID-19 case fatality rate and the V197M allele frequencies, using over 200,000 present-day and ancient genomic samples. East Asian countries have higher V197M allele frequencies than other regions, including European countries which correlates to their lower case fatality rates. Structural and energy calculation analysis of the V197M amino acid change showed that it destabilizes the TMPRSS2 protein, possibly negatively affecting its ACE2 and viral spike protein processing.
Collapse
Affiliation(s)
- Sungwon Jeon
- Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea
- Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
| | - Asta Blazyte
- Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea
- Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
| | - Changhan Yoon
- Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea
- Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
| | - Hyojung Ryu
- Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea
- Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
| | - Yeonsu Jeon
- Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea
- Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
| | - Youngjune Bhak
- Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea
- Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
| | | | - Andrea Manica
- Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK
| | - Eun-Seok Shin
- Division of Cardiology, Department of Internal Medicine, Ulsan Medical Center, Ulsan 44686, Korea
- Personal Genomics Institute (PGI), Genome Research Foundation (GRF), Cheongju 28160, Korea
| | | | | | - Namhee Ryoo
- Department of Laboratory Medicine, Keimyung University School of Medicine, Daegu 42601, Korea
| | - Hansol Choi
- Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea
- Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
| | - Jong Bhak
- Korean Genomics Center (KOGIC), Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Korea
- Department of Biomedical Engineering, College of Information and Biotechnology, UNIST, Ulsan 44919, Korea
- Geromics, Ltd., Cambridge CB1 3NF, UK
- Personal Genomics Institute (PGI), Genome Research Foundation (GRF), Cheongju 28160, Korea
- Clinomics, Inc., Ulsan 44919, Korea
| |
Collapse
|
106
|
Grahame DAS, Dupuis JH, Bryksa BC, Tanaka T, Yada RY. Improving the alkaline stability of pepsin through rational protein design using renin, an alkaline-stable aspartic protease, as a structural and functional reference. Enzyme Microb Technol 2021; 150:109871. [PMID: 34489030 DOI: 10.1016/j.enzmictec.2021.109871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Revised: 06/28/2021] [Accepted: 07/12/2021] [Indexed: 10/20/2022]
Abstract
The present study sought to identify the structural determinants of aspartic protease structural stability and activity at elevated pH. Various hypotheses have been published regarding the features responsible for the unusual alkaline structural stability of renin, however, few structure-function studies have verified these claims. Using pepsin as a model system, and renin as a template for functional and structural alkaline stability, a rational re-design of pepsin was undertaken to identify residues contributing to the alkaline instability of pepsin-like aspartic proteases in regards to both structure and function. We constructed 13 mutants based on this strategy. Among them, mutants D159 L and D60A led to an increase in activity at elevated pH levels (p ≤ 0.05) and E4V and H53F were shown to retain native-like structure at elevated pH (p ≤ 0.05). Previously suggested carboxyl groups Asp11, Asp118, and Glu13 were individually shown not to be responsible for the structural instability or lack of activity at neutral pH in pepsin. The importance of the β-barrel to structural stability was highlighted as the majority of the stabilizing residues identified, and 39% of the weakly conserved residues in the N-terminal lobe, were located in β-sheet strands of the barrel. The results of the present study indicate that alkaline stabilization of pepsin will require reduction of electrostatic repulsions and an improved understanding of the role of the hydrogen bonding network of the characteristic β-barrel.
Collapse
Affiliation(s)
- Douglas A S Grahame
- Department of Food Science, Ontario Agricultural College, University of Guelph, Guelph, ON, N1G 2W1, Canada
| | - John H Dupuis
- Food, Nutrition, and Health Program, Faculty of Land and Food Systems, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Brian C Bryksa
- Department of Food Science, Ontario Agricultural College, University of Guelph, Guelph, ON, N1G 2W1, Canada
| | - Takuji Tanaka
- Department of Food and Bioproduct Sciences, College of Agriculture and Bioresources, University of Saskatchewan, Saskatoon, SK, S7N 5A8, Canada
| | - Rickey Y Yada
- Food, Nutrition, and Health Program, Faculty of Land and Food Systems, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada.
| |
Collapse
|
107
|
D’Onofrio C, Knoll W, Pelosi P. Aphid Odorant-Binding Protein 9 Is Narrowly Tuned to Linear Alcohols and Aldehydes of Sixteen Carbon Atoms. INSECTS 2021; 12:741. [PMID: 34442308 PMCID: PMC8396812 DOI: 10.3390/insects12080741] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 08/16/2021] [Indexed: 01/14/2023]
Abstract
Aphid odorant-binding protein 9 is almost exclusively expressed in antennae and is well conserved between different aphid species. In order to investigate its function, we have expressed this protein and measured ligand-binding affinities to a number of common natural compounds. The best ligands are long-chain aldehydes and alcohols, in particular Z9-hexadecenal and Z11-hexadecenal, as well as 1-hexadecanol and Z11-1-hexadecenol. A model of this protein indicated Lys37 as the residue that is likely to establish strong interactions with the ligands, probably a Schiff base with aldehydes and a hydrogen bond with alcohols. Indeed, when we replaced this lysine with a leucine, the mutated protein lost its affinity to both long aldehydes and alcohols, while the binding of other volatiles was unaffected. Long-chain linear alcohols are common products of molds and have been reported as aphid antifeedants. Corresponding aldehydes, instead, are major components of sex pheromones for several species of Lepidoptera. We speculate that aphids might use OBP9 to avoid mold-contaminated plants as well as competition with lepidopteran larvae.
Collapse
Affiliation(s)
- Chiara D’Onofrio
- Biosensor Technologies, AIT Austrian Institute of Technology GmbH, Konrad-Lorenz Straße 24, 3430 Tulln, Austria; (C.D.); (W.K.)
| | - Wolfgang Knoll
- Biosensor Technologies, AIT Austrian Institute of Technology GmbH, Konrad-Lorenz Straße 24, 3430 Tulln, Austria; (C.D.); (W.K.)
- Department of Physics and Chemistry of Materials, Faculty of Medicine/Dental Medicine, Danube Private University, 3500 Krems, Austria
| | - Paolo Pelosi
- Biosensor Technologies, AIT Austrian Institute of Technology GmbH, Konrad-Lorenz Straße 24, 3430 Tulln, Austria; (C.D.); (W.K.)
| |
Collapse
|
108
|
Gautam V, Nimmanpipug P, Zain SM, Rahman NA, Lee VS. Molecular Dynamics Simulations in Designing DARPins as Phosphorylation-Specific Protein Binders of ERK2. Molecules 2021; 26:molecules26154540. [PMID: 34361694 PMCID: PMC8347146 DOI: 10.3390/molecules26154540] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Revised: 07/20/2021] [Accepted: 07/23/2021] [Indexed: 11/16/2022] Open
Abstract
Extracellular signal-regulated kinases 1 and 2 (ERK1/2) play key roles in promoting cell survival and proliferation through the phosphorylation of various substrates. Remarkable antitumour activity is found in many inhibitors that act upstream of the ERK pathway. However, drug-resistant tumour cells invariably emerge after their use due to the reactivation of ERK1/2 signalling. ERK1/2 inhibitors have shown clinical efficacy as a therapeutic strategy for the treatment of tumours with mitogen-activated protein kinase (MAPK) upstream target mutations. These inhibitors may be used as a possible strategy to overcome acquired resistance to MAPK inhibitors. Here, we report a class of repeat proteins-designed ankyrin repeat protein (DARPin) macromolecules targeting ERK2 as inhibitors. The structural basis of ERK2-DARPin interactions based on molecular dynamics (MD) simulations was studied. The information was then used to predict stabilizing mutations employing a web-based algorithm, MAESTRO. To evaluate whether these design strategies were successfully deployed, we performed all-atom, explicit-solvent molecular dynamics (MD) simulations. Two mutations, Ala → Asp and Ser → Leu, were found to perform better than the original sequence (DARPin E40) based on the associated energy and key residues involved in protein-protein interaction. MD simulations and analysis of the data obtained on these mutations supported our predictions.
Collapse
Affiliation(s)
- Vertika Gautam
- Department of Chemistry, Faculty of Science, University of Malaya, Kuala Lumpur 50603, Malaysia; (V.G.); (S.M.Z.); (N.A.R.)
| | - Piyarat Nimmanpipug
- Department of Chemistry, Faculty of Science, Chiang Mai University, Chiang Mai 50200, Thailand;
- Center of Excellence for Innovation in Analytical Science and Technology (I-ANALY-S-T), Chiang Mai University, Chiang Mai 50200, Thailand
| | - Sharifuddin Md Zain
- Department of Chemistry, Faculty of Science, University of Malaya, Kuala Lumpur 50603, Malaysia; (V.G.); (S.M.Z.); (N.A.R.)
| | - Noorsaadah Abd Rahman
- Department of Chemistry, Faculty of Science, University of Malaya, Kuala Lumpur 50603, Malaysia; (V.G.); (S.M.Z.); (N.A.R.)
| | - Vannajan Sanghiran Lee
- Department of Chemistry, Faculty of Science, University of Malaya, Kuala Lumpur 50603, Malaysia; (V.G.); (S.M.Z.); (N.A.R.)
- Center of Excellence for Innovation in Analytical Science and Technology (I-ANALY-S-T), Chiang Mai University, Chiang Mai 50200, Thailand
- Correspondence:
| |
Collapse
|
109
|
Wang Y, Xue P, Cao M, Yu T, Lane ST, Zhao H. Directed Evolution: Methodologies and Applications. Chem Rev 2021; 121:12384-12444. [PMID: 34297541 DOI: 10.1021/acs.chemrev.1c00260] [Citation(s) in RCA: 203] [Impact Index Per Article: 67.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Directed evolution aims to expedite the natural evolution process of biological molecules and systems in a test tube through iterative rounds of gene diversifications and library screening/selection. It has become one of the most powerful and widespread tools for engineering improved or novel functions in proteins, metabolic pathways, and even whole genomes. This review describes the commonly used gene diversification strategies, screening/selection methods, and recently developed continuous evolution strategies for directed evolution. Moreover, we highlight some representative applications of directed evolution in engineering nucleic acids, proteins, pathways, genetic circuits, viruses, and whole cells. Finally, we discuss the challenges and future perspectives in directed evolution.
Collapse
Affiliation(s)
- Yajie Wang
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Pu Xue
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Mingfeng Cao
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Tianhao Yu
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Stephan T Lane
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Huimin Zhao
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
110
|
Sarfati H, Naftaly S, Papo N, Keasar C. Predicting mutant outcome by combining deep mutational scanning and machine learning. Proteins 2021; 90:45-57. [PMID: 34293212 DOI: 10.1002/prot.26184] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2020] [Revised: 06/01/2021] [Accepted: 07/11/2021] [Indexed: 02/02/2023]
Abstract
Deep mutational scanning provides unprecedented wealth of quantitative data regarding the functional outcome of mutations in proteins. A single experiment may measure properties (eg, structural stability) of numerous protein variants. Leveraging the experimental data to gain insights about unexplored regions of the mutational landscape is a major computational challenge. Such insights may facilitate further experimental work and accelerate the development of novel protein variants with beneficial therapeutic or industrially relevant properties. Here we present a novel, machine learning approach for the prediction of functional mutation outcome in the context of deep mutational screens. Using sequence (one-hot) features of variants with known properties, as well as structural features derived from models thereof, we train predictive statistical models to estimate the unknown properties of other variants. The utility of the new computational scheme is demonstrated using five sets of mutational scanning data, denoted "targets": (a) protease specificity of APPI (amyloid precursor protein inhibitor) variants; (b-d) three stability related properties of IGBPG (immunoglobulin G-binding β1 domain of streptococcal protein G) variants; and (e) fluorescence of GFP (green fluorescent protein) variants. Performance is measured by the overall correlation of the predicted and observed properties, and enrichment-the ability to predict the most potent variants and presumably guide further experiments. Despite the diversity of the targets the statistical models can generalize variant examples thereof and predict the properties of test variants with both single and multiple mutations.
Collapse
Affiliation(s)
- Hagit Sarfati
- Department of Computer Science, Ben-Gurion University of the Negev, Be'er Sheva, Israel
| | - Si Naftaly
- Avram and Stella Goldstein-Goren Department of Biotechnology Engineering and the National Institute of Biotechnology in the Negev, Ben-Gurion University of the Negev, Be'er Sheva, Israel
| | - Niv Papo
- Avram and Stella Goldstein-Goren Department of Biotechnology Engineering and the National Institute of Biotechnology in the Negev, Ben-Gurion University of the Negev, Be'er Sheva, Israel
| | - Chen Keasar
- Department of Computer Science, Ben-Gurion University of the Negev, Be'er Sheva, Israel
| |
Collapse
|
111
|
Shukla V, Runthala A, Rajput VS, Chandrasai PD, Tripathi A, Phulara SC. Computational and synthetic biology approaches for the biosynthesis of antiviral and anticancer terpenoids from Bacillus subtilis. Med Chem 2021; 18:307-322. [PMID: 34254925 DOI: 10.2174/1573406417666210712211557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Revised: 04/18/2021] [Accepted: 04/25/2021] [Indexed: 11/22/2022]
Abstract
Recent advancements in medicinal research have identified several antiviral and anticancer terpenoids that are usually deployed as a source of flavor, fragrances and pharmaceuticals. Under the current COVID-19 pandemic conditions, natural therapeutics with least side effects are the need of the hour to save the patients, especially, which are pre-affected with other medical complications. Although, plants are the major sources of terpenoids; however, for the environmental concerns, the global interest has shifted to the biocatalytic production of molecules from microbial sources. The gram-positive bacterium Bacillus subtilis is a suitable host in this regard due to its GRAS (generally regarded as safe) status, ease in genetic manipulations and wide industrial acceptability. The B. subtilis synthesizes its terpenoid molecules from 1-deoxy-d-xylulose-5-phosphate (DXP) pathway, a common route in almost all microbial strains. Here, we summarize the computational and synthetic biology approaches to improve the production of terpenoid-based therapeutics from B. subtilis by utilizing DXP pathway. We focus on the in-silico approaches for screening the functionally improved enzyme-variants of the two crucial enzymes namely, the DXP synthase (DXS) and farnesyl pyrophosphate synthase (FPPS). The approaches for engineering the active sites are subsequently explained. It will be helpful to construct the functionally improved enzymes for the high-yield production of terpenoid-based anticancer and antiviral metabolites, which would help to reduce the cost and improve the availability of such therapeutics for the humankind.
Collapse
Affiliation(s)
- Vibha Shukla
- Food, Drug and Chemical Toxicology Group, CSIR-Indian Institute of Toxicology Research, Vishvigyan Bhawan, 31 Mahatma Gandhi Marg, Lucknow-226001, India
| | - Ashish Runthala
- Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur-522502, Andhra Pradesh, India
| | | | - Potla Durthi Chandrasai
- Department of Biotechnology, National Institute of Technology Warangal, Warangal-506004, Telangana, India
| | - Anurag Tripathi
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad- 201002, India
| | - Suresh Chandra Phulara
- Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur-522502, Andhra Pradesh, India
| |
Collapse
|
112
|
Sitterlé E, Coste AT, Obadia T, Maufrais C, Chauvel M, Sertour N, Sanglard D, Puel A, D'Enfert C, Bougnoux ME. Large-scale genome mining allows identification of neutral polymorphisms and novel resistance mutations in genes involved in Candida albicans resistance to azoles and echinocandins. J Antimicrob Chemother 2021; 75:835-848. [PMID: 31923309 DOI: 10.1093/jac/dkz537] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 11/22/2019] [Accepted: 12/01/2019] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND The genome of Candida albicans displays significant polymorphism. Point mutations in genes involved in resistance to antifungals may either confer phenotypic resistance or be devoid of phenotypic consequences. OBJECTIVES To catalogue polymorphisms in azole and echinocandin resistance genes occurring in susceptible strains in order to rapidly pinpoint relevant mutations in resistant strains. METHODS Genome sequences from 151 unrelated C. albicans strains susceptible to fluconazole and caspofungin were used to create a catalogue of non-synonymous polymorphisms in genes involved in resistance to azoles (ERG11, TAC1, MRR1 and UPC2) or echinocandins (FKS1). The potential of this catalogue to reveal putative resistance mutations was tested in 10 azole-resistant isolates, including 1 intermediate to caspofungin. Selected mutations were analysed by mutagenesis experiments or mutational prediction effect. RESULTS In the susceptible strains, we identified 126 amino acid substitutions constituting the catalogue of phenotypically neutral polymorphisms. By excluding these neutral substitutions, we identified 22 additional substitutions in the 10 resistant strains. Among these substitutions, 10 had already been associated with resistance. The remaining 12 were in Tac1p (n = 6), Upc2p (n = 2) and Erg11p (n = 4). Four out of the six homozygous substitutions in Tac1p (H263Y, A790V, H839Y and P971S) conferred increases in azole MICs, while no effects were observed for those in Upc2p. Additionally, two homozygous substitutions (Y64H and P236S) had a predicted conformation effect on Erg11p. CONCLUSIONS By establishing a catalogue of neutral polymorphisms occurring in genes involved in resistance to antifungal drugs, we provide a useful resource for rapid identification of mutations possibly responsible for phenotypic resistance in C. albicans.
Collapse
Affiliation(s)
- Emilie Sitterlé
- Unité Biologie et Pathogénicité Fongiques, Département de Mycologie, Institut Pasteur, USC2019 INRA, Paris, France.,Université Paris Diderot, Sorbonne Paris Cité, Paris, France.,Unité de Parasitologie-Mycologie, Service de Microbiologie clinique, Hôpital Necker-Enfants-Malades, Assistance Publique des Hôpitaux de Paris (APHP), Paris, France
| | - Alix T Coste
- Institut de Microbiologie, Université de Lausanne et Centre Hospitalo-Universitaire, Lausanne, Switzerland
| | - Thomas Obadia
- Hub de Bioinformatique et Biostatistique, Département Biologie Computationnelle, Institut Pasteur, USR 3756 CNRS, Paris, France.,Unité Malaria: parasites et hôtes, Département Parasites et Insectes Vecteurs, Institut Pasteur, Paris, France
| | - Corinne Maufrais
- Hub de Bioinformatique et Biostatistique, Département Biologie Computationnelle, Institut Pasteur, USR 3756 CNRS, Paris, France
| | - Murielle Chauvel
- Unité Biologie et Pathogénicité Fongiques, Département de Mycologie, Institut Pasteur, USC2019 INRA, Paris, France
| | - Natacha Sertour
- Unité Biologie et Pathogénicité Fongiques, Département de Mycologie, Institut Pasteur, USC2019 INRA, Paris, France
| | - Dominique Sanglard
- Institut de Microbiologie, Université de Lausanne et Centre Hospitalo-Universitaire, Lausanne, Switzerland
| | - Anne Puel
- Laboratoire de génétique humaine des maladies infectieuses, Necker, INSERM U1163, Paris, France.,Université Paris Descartes, Institut Imagine, Paris, France
| | - Christophe D'Enfert
- Unité Biologie et Pathogénicité Fongiques, Département de Mycologie, Institut Pasteur, USC2019 INRA, Paris, France
| | - Marie-Elisabeth Bougnoux
- Unité Biologie et Pathogénicité Fongiques, Département de Mycologie, Institut Pasteur, USC2019 INRA, Paris, France.,Unité de Parasitologie-Mycologie, Service de Microbiologie clinique, Hôpital Necker-Enfants-Malades, Assistance Publique des Hôpitaux de Paris (APHP), Paris, France.,Université de Paris, Paris, France
| |
Collapse
|
113
|
Montiel V, Bella R, Michel LYM, Esfahani H, De Mulder D, Robinson EL, Deglasse JP, Tiburcy M, Chow PH, Jonas JC, Gilon P, Steinhorn B, Michel T, Beauloye C, Bertrand L, Farah C, Dei Zotti F, Debaix H, Bouzin C, Brusa D, Horman S, Vanoverschelde JL, Bergmann O, Gilis D, Rooman M, Ghigo A, Geninatti-Crich S, Yool A, Zimmermann WH, Roderick HL, Devuyst O, Balligand JL. Inhibition of aquaporin-1 prevents myocardial remodeling by blocking the transmembrane transport of hydrogen peroxide. Sci Transl Med 2021; 12:12/564/eaay2176. [PMID: 33028705 DOI: 10.1126/scitranslmed.aay2176] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2019] [Revised: 12/24/2019] [Accepted: 08/31/2020] [Indexed: 12/31/2022]
Abstract
Pathological remodeling of the myocardium has long been known to involve oxidant signaling, but strategies using systemic antioxidants have generally failed to prevent it. We sought to identify key regulators of oxidant-mediated cardiac hypertrophy amenable to targeted pharmacological therapy. Specific isoforms of the aquaporin water channels have been implicated in oxidant sensing, but their role in heart muscle is unknown. RNA sequencing from human cardiac myocytes revealed that the archetypal AQP1 is a major isoform. AQP1 expression correlates with the severity of hypertrophic remodeling in patients with aortic stenosis. The AQP1 channel was detected at the plasma membrane of human and mouse cardiac myocytes from hypertrophic hearts, where it colocalized with NADPH oxidase-2 and caveolin-3. We show that hydrogen peroxide (H2O2), produced extracellularly, is necessary for the hypertrophic response of isolated cardiac myocytes and that AQP1 facilitates the transmembrane transport of H2O2 through its water pore, resulting in activation of oxidant-sensitive kinases in cardiac myocytes. Structural analysis of the amino acid residues lining the water pore of AQP1 supports its permeation by H2O2 Deletion of Aqp1 or selective blockade of the AQP1 intrasubunit pore inhibited H2O2 transport in mouse and human cells and rescued the myocyte hypertrophy in human induced pluripotent stem cell-derived engineered heart muscle. Treatment of mice with a clinically approved AQP1 inhibitor, Bacopaside, attenuated cardiac hypertrophy. We conclude that cardiac hypertrophy is mediated by the transmembrane transport of H2O2 by the water channel AQP1 and that inhibitors of AQP1 represent new possibilities for treating hypertrophic cardiomyopathies.
Collapse
Affiliation(s)
- Virginie Montiel
- Institute of Experimental and Clinical Research (IREC), Pharmacology and Therapeutics (FATH), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Ramona Bella
- Institute of Experimental and Clinical Research (IREC), Pharmacology and Therapeutics (FATH), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Lauriane Y M Michel
- Institute of Experimental and Clinical Research (IREC), Pharmacology and Therapeutics (FATH), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Hrag Esfahani
- Institute of Experimental and Clinical Research (IREC), Pharmacology and Therapeutics (FATH), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Delphine De Mulder
- Institute of Experimental and Clinical Research (IREC), Pharmacology and Therapeutics (FATH), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Emma L Robinson
- Laboratory of Experimental Cardiology, Department of Cardiovascular Sciences, KULeuven, 3000 Leuven, Belgium
| | - Jean-Philippe Deglasse
- Institute of Experimental and Clinical Research (IREC), Endocrinology, Diabetes and Nutrition (EDIN), Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Malte Tiburcy
- Institute of Pharmacology and Toxicology, University Medical Center Göttingen, 37075 Göttingen, Germany.,DZHK (German Center for Cardiovascular Research), Partner Site Göttingen, 37075 Göttingen, Germany
| | - Pak Hin Chow
- Adelaide Medical School, University of Adelaide, Adelaide, SA 5000, Australia
| | - Jean-Christophe Jonas
- Institute of Experimental and Clinical Research (IREC), Endocrinology, Diabetes and Nutrition (EDIN), Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Patrick Gilon
- Institute of Experimental and Clinical Research (IREC), Endocrinology, Diabetes and Nutrition (EDIN), Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Benjamin Steinhorn
- Cardiovascular Division, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 2115, USA
| | - Thomas Michel
- Cardiovascular Division, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 2115, USA
| | - Christophe Beauloye
- Institute of Experimental and Clinical Research (IREC), Pole of Cardiovascular Research (CARD), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Luc Bertrand
- Institute of Experimental and Clinical Research (IREC), Pole of Cardiovascular Research (CARD), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Charlotte Farah
- Institute of Experimental and Clinical Research (IREC), Pharmacology and Therapeutics (FATH), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Flavia Dei Zotti
- Institute of Experimental and Clinical Research (IREC), Pharmacology and Therapeutics (FATH), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Huguette Debaix
- Institute of Experimental and Clinical Research (IREC), Nephrology (NEFR), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium.,Institute of Physiology, University of Zürich, CH 8057 Zürich, Switzerland
| | - Caroline Bouzin
- 2IP-IREC Imaging Platform, Institute of Experimental and Clinical Research (IREC), Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Davide Brusa
- Flow Cytometry Platform, Institute of Experimental and Clinical Research (IREC), Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Sandrine Horman
- Institute of Experimental and Clinical Research (IREC), Pole of Cardiovascular Research (CARD), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Jean-Louis Vanoverschelde
- Institute of Experimental and Clinical Research (IREC), Pole of Cardiovascular Research (CARD), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium
| | - Olaf Bergmann
- Center for Regenerative Therapies Dresden, Technische Universität Dresden, 01062 Dresden, Germany.,Department of Cell and Molecular Biology, Karolinska Institutet, 171 77 Stockholm, Sweden
| | - Dimitri Gilis
- Computational Biology and Bioinformatics (3BIO-BioInfo), Université Libre de Bruxelles (ULB), 1000 Brussels, Belgium
| | - Marianne Rooman
- Computational Biology and Bioinformatics (3BIO-BioInfo), Université Libre de Bruxelles (ULB), 1000 Brussels, Belgium
| | - Alessandra Ghigo
- Molecular Biotechnology Center, Università di Torino, 10124 Torino, Italy
| | | | - Andrea Yool
- Adelaide Medical School, University of Adelaide, Adelaide, SA 5000, Australia
| | - Wolfram H Zimmermann
- Institute of Pharmacology and Toxicology, University Medical Center Göttingen, 37075 Göttingen, Germany.,DZHK (German Center for Cardiovascular Research), Partner Site Göttingen, 37075 Göttingen, Germany.,Cluster of Excellence "Multiscale Bioimaging: from Molecular Machines to Networks of Excitable Cells" (MBExC), University of Göttingen, 37075 Göttingen, Germany
| | - H Llewelyn Roderick
- Laboratory of Experimental Cardiology, Department of Cardiovascular Sciences, KULeuven, 3000 Leuven, Belgium
| | - Olivier Devuyst
- Institute of Experimental and Clinical Research (IREC), Nephrology (NEFR), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium.,Institute of Physiology, University of Zürich, CH 8057 Zürich, Switzerland
| | - Jean-Luc Balligand
- Institute of Experimental and Clinical Research (IREC), Pharmacology and Therapeutics (FATH), Cliniques Universitaires St Luc and Université Catholique de Louvain (UCLouvain), 1200 Brussels, Belgium.
| |
Collapse
|
114
|
A Deep-Learning Sequence-Based Method to Predict Protein Stability Changes Upon Genetic Variations. Genes (Basel) 2021; 12:genes12060911. [PMID: 34204764 PMCID: PMC8231498 DOI: 10.3390/genes12060911] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 06/08/2021] [Accepted: 06/09/2021] [Indexed: 01/17/2023] Open
Abstract
Several studies have linked disruptions of protein stability and its normal functions to disease. Therefore, during the last few decades, many tools have been developed to predict the free energy changes upon protein residue variations. Most of these methods require both sequence and structure information to obtain reliable predictions. However, the lower number of protein structures available with respect to their sequences, due to experimental issues, drastically limits the application of these tools. In addition, current methodologies ignore the antisymmetric property characterizing the thermodynamics of the protein stability: a variation from wild-type to a mutated form of the protein structure (XW→XM) and its reverse process (XM→XW) must have opposite values of the free energy difference (ΔΔGWM=−ΔΔGMW). Here we propose ACDC-NN-Seq, a deep neural network system that exploits the sequence information and is able to incorporate into its architecture the antisymmetry property. To our knowledge, this is the first convolutional neural network to predict protein stability changes relying solely on the protein sequence. We show that ACDC-NN-Seq compares favorably with the existing sequence-based methods.
Collapse
|
115
|
Raghav PK, Gangenahalli G. PU.1 Mimic Synthetic Peptides Selectively Bind with GATA-1 and Allow c-Jun PU.1 Binding to Enhance Myelopoiesis. Int J Nanomedicine 2021; 16:3833-3859. [PMID: 34113102 PMCID: PMC8187006 DOI: 10.2147/ijn.s303235] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 04/20/2021] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Hematopoietic stem cells' commitment to myelopoiesis builds immunity to prevent infection. This process is controlled through transcription factor, especially Purine rich box 1 (PU.1) protein, which plays a central role in regulating myelopoiesis. The β3/β4 region of PU.1 accommodates a coactivator transcription factor, c-Jun, to activate myelopoiesis. However, an erythroid transcription factor, GATA-1, competes with c-Jun for the β3/β4 region, abolishing myelopoiesis and promoting erythropoiesis. This competitive regulation decides the hematopoietic stem cells' commitment towards either erythroid or myeloid lineage. METHODS Therefore, this study investigated the in vitro and in vivo effect of novel synthetic PU.1 β3/β4 mimic peptide analogs and peptide-loaded hydrophilic poly(D,L-lactide-co-glycolide) (PLGA) nanoparticles. RESULTS The designed peptides significantly increase the expression of corresponding myeloid markers, specifically CD33 in vitro. However, the in vivo delivery of peptide-loaded PLGA nanoparticles, which have sustained release effect of peptides, increases 10.8% of granulocytes as compared to control. CONCLUSION The observations showed that the fabricated nanoparticles protected the loaded peptides from the harsh intracellular environment for a longer duration without causing any toxicity. These findings highlight the possibility to use these peptides and peptide-loaded nanoparticles to increase hematopoietic stem cell commitment to myeloid cells in case of opportunistic infection.
Collapse
Affiliation(s)
- Pawan Kumar Raghav
- Division of Stem Cell and Gene Therapy Research, Institute of Nuclear Medicine and Allied Sciences (INMAS), Delhi, 110054, India
| | - Gurudutta Gangenahalli
- Division of Stem Cell and Gene Therapy Research, Institute of Nuclear Medicine and Allied Sciences (INMAS), Delhi, 110054, India
| |
Collapse
|
116
|
Mahmood MS, Irshad S, Kalsoom U, Batool H, Batool S, Butt TA. In silico analysis of missense Single Nucleotide Variants (SNVs) in HBB gene associated with the β-thalassemia. GENE REPORTS 2021. [DOI: 10.1016/j.genrep.2021.101019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
117
|
Iqbal S, Li F, Akutsu T, Ascher DB, Webb GI, Song J. Assessing the performance of computational predictors for estimating protein stability changes upon missense mutations. Brief Bioinform 2021; 22:6289890. [PMID: 34058752 DOI: 10.1093/bib/bbab184] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Revised: 04/07/2021] [Accepted: 04/21/2021] [Indexed: 11/14/2022] Open
Abstract
Understanding how a mutation might affect protein stability is of significant importance to protein engineering and for understanding protein evolution genetic diseases. While a number of computational tools have been developed to predict the effect of missense mutations on protein stability protein stability upon mutations, they are known to exhibit large biases imparted in part by the data used to train and evaluate them. Here, we provide a comprehensive overview of predictive tools, which has provided an evolving insight into the importance and relevance of features that can discern the effects of mutations on protein stability. A diverse selection of these freely available tools was benchmarked using a large mutation-level blind dataset of 1342 experimentally characterised mutations across 130 proteins from ThermoMutDB, a second test dataset encompassing 630 experimentally characterised mutations across 39 proteins from iStable2.0 and a third blind test dataset consisting of 268 mutations in 27 proteins from the newly published ProThermDB. The performance of the methods was further evaluated with respect to the site of mutation, type of mutant residue and by ranging the pH and temperature. Additionally, the classification performance was also evaluated by classifying the mutations as stabilizing (∆∆G ≥ 0) or destabilizing (∆∆G < 0). The results reveal that the performance of the predictors is affected by the site of mutation and the type of mutant residue. Further, the results show very low performance for pH values 6-8 and temperature higher than 65 for all predictors except iStable2.0 on the S630 dataset. To illustrate how stability and structure change upon single point mutation, we considered four stabilizing, two destabilizing and two stabilizing mutations from two proteins, namely the toxin protein and bovine liver cytochrome. Overall, the results on S268, S630 and S1342 datasets show that the performance of the integrated predictors is better than the mechanistic or individual machine learning predictors. We expect that this paper will provide useful guidance for the design and development of next-generation bioinformatic tools for predicting protein stability changes upon mutations.
Collapse
Affiliation(s)
- Shahid Iqbal
- Computer System Engineering from Ghulam Ishaq Khan Institute of Engineering Sciences and Technology, Pakistan
| | - Fuyi Li
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, the University of Melbourne, Australia
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Japan
| | | | - Geoffrey I Webb
- Monash Centre for Data Science, Faculty of Information Technology, Monash University, Victoria 3800, Australia
| | - Jiangning Song
- Monash Biomedicine Discovery Institute, Monash University, Australia
| |
Collapse
|
118
|
Structural effects driven by rare point mutations in amylin hormone, the type II diabetes-associated peptide. Biochim Biophys Acta Gen Subj 2021; 1865:129935. [PMID: 34044067 DOI: 10.1016/j.bbagen.2021.129935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 05/11/2021] [Accepted: 05/20/2021] [Indexed: 11/27/2022]
Abstract
BACKGROUND Amylin is a 37-amino-acid peptide hormone co-secreted with insulin, which participates in glucose homeostasis. This hormone is able to aggregate in a β-sheet conformation and deposit in islet amyloids, a hallmark in type II diabetes. Since amylin is a gene-encoded hormone, this peptide has variants caused by point mutations that can impact its functions. METHODS Here, we analyzed the structural effects caused by S20G and G33R point mutations which, according to the 1000 Genomes Project, have frequency in East Asian and European populations, respectively. The analyses were performed by means of aggrescan server, SNP functional effect predictors, and molecular dynamics. RESULTS We found that both mutations have aggregation potential and cause changes in the monomeric forms when compared with wild-type amylin. Furthermore, comparative analyses with pramlintide, an amylin drug analogue, allowed us to infer that second α-helix maintenance may be related to the aggregation potential. CONCLUSIONS The S20G mutation has been described as pathologically related, which is in agreement with our findings. In addition, our data suggest that the G33R mutation might have a deleterious effect. The data presented here also provide new therapy opportunities, whether for creating more effective drugs for diabetes or implementing specific treatment for patients with these mutations. GENERAL SIGNIFICANCE Our data could help to better understand the impact of mutations on the wild-type amylin sequence, as a starting point for the evaluation and characterization of other variations. Moreover, these findings could improve the health of patients with type II diabetes.
Collapse
|
119
|
Wilson CJ, Chang M, Karttunen M, Choy WY. KEAP1 Cancer Mutants: A Large-Scale Molecular Dynamics Study of Protein Stability. Int J Mol Sci 2021; 22:5408. [PMID: 34065616 PMCID: PMC8161161 DOI: 10.3390/ijms22105408] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Revised: 05/11/2021] [Accepted: 05/13/2021] [Indexed: 12/30/2022] Open
Abstract
We have performed 280 μs of unbiased molecular dynamics (MD) simulations to investigate the effects of 12 different cancer mutations on Kelch-like ECH-associated protein 1 (KEAP1) (G333C, G350S, G364C, G379D, R413L, R415G, A427V, G430C, R470C, R470H, R470S and G476R), one of the frequently mutated proteins in lung cancer. The aim was to provide structural insight into the effects of these mutants, including a new class of ANCHOR (additionally NRF2-complexed hypomorph) mutant variants. Our work provides additional insight into the structural dynamics of mutants that could not be analyzed experimentally, painting a more complete picture of their mutagenic effects. Notably, blade-wise analysis of the Kelch domain points to stability as a possible target of cancer in KEAP1. Interestingly, structural analysis of the R470C ANCHOR mutant, the most prevalent missense mutation in KEAP1, revealed no significant change in structural stability or NRF2 binding site dynamics, possibly indicating an covalent modification as this mutant's mode of action.
Collapse
Affiliation(s)
- Carter J. Wilson
- Department of Biochemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5C1, Canada; (C.J.W.); (M.C.)
- Department of Applied Mathematics, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5B7, Canada
| | - Megan Chang
- Department of Biochemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5C1, Canada; (C.J.W.); (M.C.)
| | - Mikko Karttunen
- Department of Applied Mathematics, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5B7, Canada
- Department of Chemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 3K7, Canada
- Centre for Advanced Materials and Biomaterials Research, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5B7, Canada
| | - Wing-Yiu Choy
- Department of Biochemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5C1, Canada; (C.J.W.); (M.C.)
| |
Collapse
|
120
|
Zhou Z, Wang X. Rational design and structure-based engineering of alkaline pectate lyase from Paenibacillus sp. 0602 to improve thermostability. BMC Biotechnol 2021; 21:32. [PMID: 33941157 PMCID: PMC8091735 DOI: 10.1186/s12896-021-00693-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 04/26/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Ramie degumming is often carried out at high temperatures; therefore, thermostable alkaline pectate lyase (PL) is beneficial for ramie degumming for industrial applications. Thermostable PLs are usually obtained by exploring new enzymes or reconstructing existing enzyme by rational design. Here, we improved the thermostability of an alkaline pectate lyase (PelN) from Paenibacillus sp. 0602 with rational design and structure-based engineering. RESULTS From 26 mutants, two mutants of G241A and G241V showed a higher thermostability compared with the wild-type PL. The mutant K93I showed increasing specific activity at 45 °C. Subsequently, we obtained combinational mutations (K93I/G241A) and found that their thermostability and specific activity improved simultaneously. The K93I/G241A mutant showed a half-life time of 15.9 min longer at 60 °C and a melting temperature of 1.6 °C higher than those of the wild PL. The optimum temperature decreased remarkably from 67.5 °C to 60 °C, accompanied by a 57% decrease in Km compared with the Km value of the wild-type strain. Finally, we found that the intramolecular interaction in PelN was the source in the improvements of molecular properties by comparing the model structures. Rational design of PelN was performed by stabilizing the α-helices with high conservation and increasing the stability of the overall structure of the protein. Two engineering strategies were applied by decreasing the mutation energy calculated by Discovery Studio and predicting the free energy in the process of protein folding by the PoPMuSiC algorithm. CONCLUSIONS The results demonstrated that the K93I/G241A mutant was more suitable for industrial production than the wild-type enzyme. Furthermore, the two forementioned strategies could be extended to reveal engineering of other kinds of industrial enzymes.
Collapse
Affiliation(s)
- Zhanping Zhou
- Tianjin Sinonocy Biological Technology Co. Ltd., Tianjin, 300308, China
| | - Xiao Wang
- Nanfang College of Sun Yat-Sen University, Guangzhou, 510970, China.
| |
Collapse
|
121
|
A meta-analysis of the activity, stability, and mutational characteristics of temperature-adapted enzymes. Biosci Rep 2021; 41:228416. [PMID: 33871022 PMCID: PMC8150157 DOI: 10.1042/bsr20210336] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 03/29/2021] [Accepted: 04/19/2021] [Indexed: 11/17/2022] Open
Abstract
Understanding the characteristics that define temperature-adapted enzymes has been a major goal of extremophile enzymology in recent decades. In the present study, we explore these characteristics by comparing psychrophilic, mesophilic, and thermophilic enzymes. Through a meta-analysis of existing data, we show that psychrophilic enzymes exhibit a significantly larger gap (Tg) between their optimum and melting temperatures compared with mesophilic and thermophilic enzymes. These results suggest that Tg may be a useful indicator as to whether an enzyme is psychrophilic or not and that models of psychrophilic enzyme catalysis need to account for this gap. Additionally, by using predictive protein stability software, HoTMuSiC and PoPMuSiC, we show that the deleterious nature of amino acid substitutions to protein stability increases from psychrophiles to thermophiles. How this ultimately affects the mutational tolerance and evolutionary rate of temperature adapted organisms is currently unknown.
Collapse
|
122
|
Bahia MS, Khazanov N, Zhou Q, Yang Z, Wang C, Hong JS, Rab A, Sorscher EJ, Brouillette CG, Hunt JF, Senderowitz H. Stability Prediction for Mutations in the Cytosolic Domains of Cystic Fibrosis Transmembrane Conductance Regulator. J Chem Inf Model 2021; 61:1762-1777. [PMID: 33720715 DOI: 10.1021/acs.jcim.0c01207] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Cystic Fibrosis (CF) is caused by mutations to the Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) chloride channel. CFTR is composed of two membrane spanning domains, two cytosolic nucleotide-binding domains (NBD1 and NBD2) and a largely unstructured R-domain. Multiple CF-causing mutations reside in the NBDs and some are known to compromise the stability of these domains. The ability to predict the effect of mutations on the stability of the cytosolic domains of CFTR and to shed light on the mechanisms by which they exert their effect is therefore important in CF research. With this in mind, we have predicted the effect on domain stability of 59 mutations in NBD1 and NBD2 using 15 different algorithms and evaluated their performances via comparison to experimental data using several metrics including the correct classification rate (CCR), and the squared Pearson correlation (R2) and Spearman's correlation (ρ) calculated between the experimental ΔTm values and the computationally predicted ΔΔG values. Overall, the best results were obtained with FoldX and Rosetta. For NBD1 (35 mutations), FoldX provided R2 and ρ values of 0.64 and -0.71, respectively, with an 86% correct classification rate (CCR). For NBD2 (24 mutations), FoldX R2, ρ, and CCR were 0.51, -0.73, and 75%, respectively. Application of the Rosetta high-resolution protocol (Rosetta_hrp) to NBD1 yielded R2, ρ, and CCR of 0.64, -0.75, and 69%, respectively, and for NBD2 yielded R2, ρ, and CCR of 0.29, -0.27, and 50%, respectively. The corresponding numbers for the Rosetta's low-resolution protocol (Rosetta_lrp) were R2 = 0.47, ρ = -0.69, and CCR = 69% for NBD1 and R2 = 0.27, ρ = -0.24, and CCR = 63% for NBD2. For NBD1, both algorithms suggest that destabilizing mutations suffer from destabilizing vdW clashes, whereas stabilizing mutations benefit from favorable H-bond interactions. Two triple consensus approaches based on FoldX, Rosetta_lpr, and Rosetta_hpr were attempted using either "majority-voting" or "all-voting". The all-voting consensus outperformed the individual predictors, albeit on a smaller data set. In summary, our results suggest that the effect of mutations on the stability of CFTR's NBDs could be largely predicted. Since NBDs are common to all ABC transporters, these results may find use in predicting the effect and mechanism of the action of multiple disease-causing mutations in other proteins.
Collapse
Affiliation(s)
| | - Netaly Khazanov
- Department of Chemistry, Bar-Ilan University, Ramat-Gan, 5290002, Israel
| | - Qingxian Zhou
- School of Medicine, Division of Pulmonary, Allergy and Critical Care Medicine, University of Alabama at Birmingham, Birmingham, Alabama 35294, United States
| | - Zhengrong Yang
- School of Medicine, Division of Hematology & Oncology, University of Alabama at Birmingham, Birmingham, Alabama 35294, United States
| | - Chi Wang
- 702 Fairchild Center, MC3423, Department of Biological Sciences, Columbia University, New York, New York 10027, United States
| | - Jeong S Hong
- Department of Paediatrics, Emory University School of Medicine, Atlanta, Georgia 30303, United States
| | - Andras Rab
- Department of Paediatrics, Emory University School of Medicine, Atlanta, Georgia 30303, United States
| | - Eric J Sorscher
- Department of Paediatrics, Emory University School of Medicine, Atlanta, Georgia 30303, United States
| | - Christie G Brouillette
- Department of Biochemistry & Molecular Genetics, University of Alabama at Birmingham, Birmingham, Alabama 35294, United States
| | - John F Hunt
- 702 Fairchild Center, MC3423, Department of Biological Sciences, Columbia University, New York, New York 10027, United States
| | - Hanoch Senderowitz
- Department of Chemistry, Bar-Ilan University, Ramat-Gan, 5290002, Israel
| |
Collapse
|
123
|
Ancien F, Pucci F, Rooman M. In Silico Analysis of the Molecular-Level Impact of SMPD1 Variants on Niemann-Pick Disease Severity. Int J Mol Sci 2021; 22:4516. [PMID: 33925997 PMCID: PMC8123603 DOI: 10.3390/ijms22094516] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2021] [Revised: 04/10/2021] [Accepted: 04/20/2021] [Indexed: 12/12/2022] Open
Abstract
Sphingomyelin phosphodiesterase (SMPD1) is a key enzyme in the sphingolipid metabolism. Genetic SMPD1 variants have been related to the Niemann-Pick lysosomal storage disorder, which has different degrees of phenotypic severity ranging from severe symptomatology involving the central nervous system (type A) to milder ones (type B). They have also been linked to neurodegenerative disorders such as Parkinson and Alzheimer. In this paper, we leveraged structural, evolutionary and stability information on SMPD1 to predict and analyze the impact of variants at the molecular level. We developed the SMPD1-ZooM algorithm, which is able to predict with good accuracy whether variants cause Niemann-Pick disease and its phenotypic severity; the predictor is freely available for download. We performed a large-scale analysis of all possible SMPD1 variants, which led us to identify protein regions that are either robust or fragile with respect to amino acid variations, and show the importance of aromatic-involving interactions in SMPD1 function and stability. Our study also revealed a good correlation between SMPD1-ZooM scores and in vitro loss of SMPD1 activity. The understanding of the molecular effects of SMPD1 variants is of crucial importance to improve genetic screening of SMPD1-related disorders and to develop personalized treatments that restore SMPD1 functionality.
Collapse
Affiliation(s)
- François Ancien
- 3BIO—Computational Biology and Bioinformatics, Université Libre de Bruxelles, Avenue F. Roosevelt 50, 1050 Brussels, Belgium; (F.A.); (F.P.)
- (IB)—Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe, 1050 Brussels, Belgium
| | - Fabrizio Pucci
- 3BIO—Computational Biology and Bioinformatics, Université Libre de Bruxelles, Avenue F. Roosevelt 50, 1050 Brussels, Belgium; (F.A.); (F.P.)
- (IB)—Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe, 1050 Brussels, Belgium
| | - Marianne Rooman
- 3BIO—Computational Biology and Bioinformatics, Université Libre de Bruxelles, Avenue F. Roosevelt 50, 1050 Brussels, Belgium; (F.A.); (F.P.)
- (IB)—Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe, 1050 Brussels, Belgium
| |
Collapse
|
124
|
Caldararu O, Blundell TL, Kepp KP. Three Simple Properties Explain Protein Stability Change upon Mutation. J Chem Inf Model 2021; 61:1981-1988. [PMID: 33848149 DOI: 10.1021/acs.jcim.1c00201] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Accurate prediction of protein stability upon mutation enables rational engineering of new proteins and insights into protein evolution and monogenetic diseases caused by single-point amino acid substitutions. Many tools have been developed to this aim, ranging from energy-based models to machine-learning methods that use large amounts of experimental data. However, as the methods become more complex, the interpretation of the chemistry underlying the protein stability effects becomes obscure. It is thus of interest to identify the simplest prediction model that retains complete amino acid specific interpretation; for a given number of input descriptors, we expect such a model to be almost universal. In this study, we identify such a limiting model, SimBa, a simple multilinear regression model trained on a substitution-type-balanced experimental data set. The model accounts only for the solvent accessibility of the site, volume difference, and polarity difference caused by mutation. Our results show that this very simple and directly applicable model performs comparably to other much more complex, widely used protein stability prediction methods. This suggests that a hard limit of ∼1 kcal/mol numerical accuracy and an R ∼ 0.5 trend accuracy exists and that new features, such as account of unfolded states, water colocalization, and amino acid correlations, are required to improve accuracy to, e.g., 1/2 kcal/mol.
Collapse
Affiliation(s)
- Octav Caldararu
- DTU Chemistry, Technical University of Denmark, Building 206, 2800 Kgs. Lyngby, Denmark
| | - Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, United Kingdom
| | - Kasper P Kepp
- DTU Chemistry, Technical University of Denmark, Building 206, 2800 Kgs. Lyngby, Denmark
| |
Collapse
|
125
|
Janežič M, Dileep KV, Zhang KYJ. A multidimensional computational exploration of congenital myasthenic syndrome causing mutations in human choline acetyltransferase. J Cell Biochem 2021; 122:787-800. [PMID: 33650116 DOI: 10.1002/jcb.29913] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 02/08/2021] [Accepted: 02/09/2021] [Indexed: 11/09/2022]
Abstract
Missense mutations of human choline acetyltransferase (CHAT) are mainly associated with congenital myasthenic syndrome (CMS). To date, several pathogenic mutations have been reported, but due to the rarity and genetic complexity of CMS and difficult genotype-phenotype correlations, the CHAT mutations, and their consequences are underexplored. In this study, we systematically sift through the available genetic data in search of previously unreported pathogenic mutations and use a dynamic in silico model to provide structural explanations for the pathogenicity of the reported deleterious and undetermined variants. Through rigorous multiparameter analyses, we conclude that mutations can affect CHAT through a variety of different mechanisms: by disrupting the secondary structure, by perturbing the P-loop through long-range allosteric interactions, by disrupting the domain connecting loop, and by affecting the phosphorylation process. This study provides the first dynamic look at how mutations affect the structure and catalytic activity in CHAT and highlights the need for further genomic research to better understand the pathology of CHAT.
Collapse
Affiliation(s)
- Matej Janežič
- Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, Yokohama, Kanagawa, Japan.,Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan
| | - Kalarickal V Dileep
- Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, Yokohama, Kanagawa, Japan
| | - Kam Y J Zhang
- Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, Yokohama, Kanagawa, Japan.,Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan
| |
Collapse
|
126
|
Caldararu O, Blundell TL, Kepp KP. A base measure of precision for protein stability predictors: structural sensitivity. BMC Bioinformatics 2021; 22:88. [PMID: 33632133 PMCID: PMC7908712 DOI: 10.1186/s12859-021-04030-w] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 02/15/2021] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Prediction of the change in fold stability (ΔΔG) of a protein upon mutation is of major importance to protein engineering and screening of disease-causing variants. Many prediction methods can use 3D structural information to predict ΔΔG. While the performance of these methods has been extensively studied, a new problem has arisen due to the abundance of crystal structures: How precise are these methods in terms of structure input used, which structure should be used, and how much does it matter? Thus, there is a need to quantify the structural sensitivity of protein stability prediction methods. RESULTS We computed the structural sensitivity of six widely-used prediction methods by use of saturated computational mutagenesis on a diverse set of 87 structures of 25 proteins. Our results show that structural sensitivity varies massively and surprisingly falls into two very distinct groups, with methods that take detailed account of the local environment showing a sensitivity of ~ 0.6 to 0.8 kcal/mol, whereas machine-learning methods display much lower sensitivity (~ 0.1 kcal/mol). We also observe that the precision correlates with the accuracy for mutation-type-balanced data sets but not generally reported accuracy of the methods, indicating the importance of mutation-type balance in both contexts. CONCLUSIONS The structural sensitivity of stability prediction methods varies greatly and is caused mainly by the models and less by the actual protein structural differences. As a new recommended standard, we therefore suggest that ΔΔG values are evaluated on three protein structures when available and the associated standard deviation reported, to emphasize not just the accuracy but also the precision of the method in a specific study. Our observation that machine-learning methods deemphasize structure may indicate that folded wild-type structures alone, without the folded mutant and unfolded structures, only add modest value for assessing protein stability effects, and that side-chain-sensitive methods overstate the significance of the folded wild-type structure.
Collapse
Affiliation(s)
- Octav Caldararu
- DTU Chemistry, Technical University of Denmark, Building 206, 2800, Kgs. Lyngby, Denmark
| | - Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
| | - Kasper P Kepp
- DTU Chemistry, Technical University of Denmark, Building 206, 2800, Kgs. Lyngby, Denmark.
| |
Collapse
|
127
|
Wu R, Wei Z, Zhang L. Structural insight into mutations at 155 position of valosin containing protein (VCP) linked to inclusion body myopathy with Paget disease of bone and frontotemporal Dementia. Saudi J Biol Sci 2021; 28:2128-2138. [PMID: 33911929 PMCID: PMC8071901 DOI: 10.1016/j.sjbs.2021.02.048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 02/10/2021] [Accepted: 02/11/2021] [Indexed: 10/31/2022] Open
Abstract
Mutations in Valosin-containing protein (VCP) have been implicated in the pathology linked to inclusion body myopathy, paget disease of bone and frontotemporal dementia (IBMPFD). VCP is an essential component of AAA-ATPase superfamily involved in various cellular functions. Advanced In-silico analysis was performed using prediction based servers to determine the most deleterious mutation. Molecular dynamics simulation was used to study the protein dynamics at atomic level. Molecular docking was used to study the effect of mutation on ATP/ADP transition in the kinase domain. This ATPase of 806 amino acids has four domains: N-terminal domain, C-terminal domain and two ATPase domains D1 and D2 and each of these domains have a distinct role in its functioning. The mutations in VCP protein are distributed among regions known as hotspots, one such hotspot is codon 155. Three missense mutations reported in this hotspot are R155C, R155H and R155P. Potentiality of the deleteriousness calculated using server based prediction models reveal R155C mutation to be the most deleterious. The atomic insight into the effect of mutation by molecular dynamics simulation revealed major conformational changes in R155C variants ATP binding site in D1 domain. The nucleotide-binding mode at the catalytic pocket of VCP and its three variants at codon 155 showed change in the structure, which affects the ATP-ADP transition kinetics in all the three variants.
Collapse
Affiliation(s)
- Rui Wu
- Department of Neurology, Affiliated Hospital of Zunyi Medical University, Zunyi, Guizhou 563000, China
| | - Zhijie Wei
- Department of Neurology, Affiliated Hospital of Zunyi Medical University, Zunyi, Guizhou 563000, China
| | - Li Zhang
- Department of Neurology, Affiliated Hospital of Zunyi Medical University, Zunyi, Guizhou 563000, China
| |
Collapse
|
128
|
Birolo G, Benevenuta S, Fariselli P, Capriotti E, Giorgio E, Sanavia T. Protein Stability Perturbation Contributes to the Loss of Function in Haploinsufficient Genes. Front Mol Biosci 2021; 8:620793. [PMID: 33598480 PMCID: PMC7882701 DOI: 10.3389/fmolb.2021.620793] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Accepted: 01/07/2021] [Indexed: 11/29/2022] Open
Abstract
Missense variants are among the most studied genome modifications as disease biomarkers. It has been shown that the “perturbation” of the protein stability upon a missense variant (in terms of absolute ΔΔG value, i.e., |ΔΔG|) has a significant, but not predictive, correlation with the pathogenicity of that variant. However, here we show that this correlation becomes significantly amplified in haploinsufficient genes. Moreover, the enrichment of pathogenic variants increases at the increasing protein stability perturbation value. These findings suggest that protein stability perturbation might be considered as a potential cofactor in diseases associated with haploinsufficient genes reporting missense variants.
Collapse
Affiliation(s)
| | | | | | - Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Italy
| | - Elisa Giorgio
- Department of Molecular Medicine, University of Pavia, Italy
| | | |
Collapse
|
129
|
Planas-Iglesias J, Marques SM, Pinto GP, Musil M, Stourac J, Damborsky J, Bednar D. Computational design of enzymes for biotechnological applications. Biotechnol Adv 2021; 47:107696. [PMID: 33513434 DOI: 10.1016/j.biotechadv.2021.107696] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 01/12/2021] [Accepted: 01/13/2021] [Indexed: 12/14/2022]
Abstract
Enzymes are the natural catalysts that execute biochemical reactions upholding life. Their natural effectiveness has been fine-tuned as a result of millions of years of natural evolution. Such catalytic effectiveness has prompted the use of biocatalysts from multiple sources on different applications, including the industrial production of goods (food and beverages, detergents, textile, and pharmaceutics), environmental protection, and biomedical applications. Natural enzymes often need to be improved by protein engineering to optimize their function in non-native environments. Recent technological advances have greatly facilitated this process by providing the experimental approaches of directed evolution or by enabling computer-assisted applications. Directed evolution mimics the natural selection process in a highly accelerated fashion at the expense of arduous laboratory work and economic resources. Theoretical methods provide predictions and represent an attractive complement to such experiments by waiving their inherent costs. Computational techniques can be used to engineer enzymatic reactivity, substrate specificity and ligand binding, access pathways and ligand transport, and global properties like protein stability, solubility, and flexibility. Theoretical approaches can also identify hotspots on the protein sequence for mutagenesis and predict suitable alternatives for selected positions with expected outcomes. This review covers the latest advances in computational methods for enzyme engineering and presents many successful case studies.
Collapse
Affiliation(s)
- Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Sérgio M Marques
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Gaspar P Pinto
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Milos Musil
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic; IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 61266 Brno, Czech Republic
| | - Jan Stourac
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic.
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic.
| |
Collapse
|
130
|
Tu T, Wang Z, Luo Y, Li Y, Su X, Wang Y, Zhang J, Rouvinen J, Yao B, Hakulinen N, Luo H. Structural Insights into the Mechanisms Underlying the Kinetic Stability of GH28 Endo-Polygalacturonase. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2021; 69:815-823. [PMID: 33404235 DOI: 10.1021/acs.jafc.0c06941] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Thermostability is a key property of industrial enzymes. Endo-polygalacturonases of the glycoside hydrolase family 28 have many practical applications, but only few of their structures have been determined, and the reasons for their stability remain unclear. We identified and characterized the Talaromyces leycettanus JCM12802 endo-polygalacturonase TlPGA, which differs from other GH28 family members because of its high catalytic activity, with an optimum temperature of 70 °C. Distinctive features were revealed by comparison of thermophilic TlPGA and all known structures of fungal endo-polygalacturonases, including a relatively large exposed polar accessible surface area in thermophilic TlPGA. By mutating potentially important residues in thermophilic TlPGA, we identified Thr284 as a critical residue. Mutant T284A was comparable to thermophilic TlPGA in melting temperature but exhibited a significantly lower half-life and half-inactivation temperature, implicating residue Thr284 in the kinetic stability of thermophilic TlPGA. Structure analysis of thermophilic TlPGA and mutant T284A revealed that a carbon-oxygen hydrogen bond between the hydroxyl group of Thr284 and the Cα atom of Gln255, and the stable conformation adopted by Gln255, contribute to its kinetic stability. Our results clarify the mechanism underlying the kinetic stability of GH28 endo-polygalacturonases and may guide the engineering of thermostable enzymes for industrial applications.
Collapse
Affiliation(s)
- Tao Tu
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| | - Zhiyun Wang
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| | - Yan Luo
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| | - Yeqing Li
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| | - Xiaoyun Su
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| | - Yuan Wang
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| | - Jie Zhang
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| | - Juha Rouvinen
- Department of Chemistry, University of Eastern Finland, Joensuu 80130, Finland
| | - Bin Yao
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| | - Nina Hakulinen
- Department of Chemistry, University of Eastern Finland, Joensuu 80130, Finland
| | - Huiying Luo
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| |
Collapse
|
131
|
Hou Q, Pucci F, Ancien F, Kwasigroch JM, Bourgeas R, Rooman M. SWOTein: a structure-based approach to predict stability Strengths and Weaknesses of prOTEINs. Bioinformatics 2021; 37:1963–1971. [PMID: 33471089 DOI: 10.1093/bioinformatics/btab034] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Revised: 12/05/2020] [Accepted: 01/15/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Although structured proteins adopt their lowest free energy conformation in physiological conditions, the individual residues are generally not in their lowest free energy conformation. Residues that are stability weaknesses are often involved in functional regions, whereas stability strengths ensure local structural stability. The detection of strengths and weaknesses provides key information to guide protein engineering experiments aiming to modulate folding and various functional processes. RESULTS We developed the SWOTein predictor which identifies strong and weak residues in proteins on the basis of three types of statistical energy functions describing local interactions along the chain, hydrophobic forces and tertiary interactions. The large-scale analysis of the different types of strengths and weaknesses demonstrated their complementarity and the enhancement of the information they provide. Moreover, a good average correlation was observed between predicted and experimental strengths and weaknesses obtained from native hydrogen exchange data. SWOTein application to three test cases further showed its suitability to predict and interpret strong and weak residues in the context of folding, conformational changes and protein-protein binding. In summary, SWOTein is both fast and accurate and can be applied at small and large scale to analyze and modulate folding and molecular recognition processes. AVAILABILITY The SWOTein webserver provides the list of predicted strengths and weaknesses and a protein structure visualization tool that facilitates the interpretation of the predictions. It is freely available for academic use at http://babylone.ulb.ac.be/SWOTein/.
Collapse
Affiliation(s)
- Qingzhen Hou
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Shandong 250002, P. R. China.,National Institute of Health Data Science of China, Shandong University, Shandong 250002, P. R. China.,Computational Biology and Bioinformatics, Université Libre de Bruxelles, Brussels 1050, Belgium
| | - Fabrizio Pucci
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, Brussels 1050, Belgium.,Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe, 1050 Brussels, Belgium
| | - François Ancien
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, Brussels 1050, Belgium.,Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe, 1050 Brussels, Belgium
| | - Jean-Marc Kwasigroch
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, Brussels 1050, Belgium
| | - Raphaël Bourgeas
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, Brussels 1050, Belgium
| | - Marianne Rooman
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, Brussels 1050, Belgium.,Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe, 1050 Brussels, Belgium
| |
Collapse
|
132
|
Strokach A, Lu TY, Kim PM. ELASPIC2 (EL2): Combining Contextualized Language Models and Graph Neural Networks to Predict Effects of Mutations. J Mol Biol 2021; 433:166810. [PMID: 33450251 DOI: 10.1016/j.jmb.2021.166810] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/19/2020] [Accepted: 01/03/2021] [Indexed: 12/21/2022]
Abstract
The ELASPIC web server allows users to evaluate the effect of mutations on protein folding and protein-protein interaction on a proteome-wide scale. It uses homology models of proteins and protein-protein interactions, which have been precalculated for several proteomes, and machine learning models, which integrate structural information with sequence conservation scores, in order to make its predictions. Since the original publication of the ELASPIC web server, several advances have motivated a revisiting of the problem of mutation effect prediction. First, progress in neural network architectures and self-supervised pre-trained has resulted in models which provide more informative embeddings of protein sequence and structure than those used by the original version of ELASPIC. Second, the amount of training data has increased several-fold, largely driven by advances in deep mutation scanning and other multiplexed assays of variant effect. Here, we describe two machine learning models which leverage the recent advances in order to achieve superior accuracy in predicting the effect of mutation on protein folding and protein-protein interaction. The models incorporate features generated using pre-trained transformer- and graph convolution-based neural networks, and are trained to optimize a ranking objective function, which permits the use of heterogeneous training data. The outputs from the new models have been incorporated into the ELASPIC web server, available at http://elaspic.kimlab.org.
Collapse
Affiliation(s)
- Alexey Strokach
- Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Tian Yu Lu
- Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Philip M Kim
- Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada; Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 3E1, Canada.
| |
Collapse
|
133
|
SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability. Int J Mol Sci 2021; 22:ijms22020606. [PMID: 33435356 PMCID: PMC7827184 DOI: 10.3390/ijms22020606] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Revised: 12/23/2020] [Accepted: 01/06/2021] [Indexed: 01/04/2023] Open
Abstract
Modeling the effect of mutations on protein thermodynamics stability is useful for protein engineering and understanding molecular mechanisms of disease-causing variants. Here, we report a new development of the SAAFEC method, the SAAFEC-SEQ, which is a gradient boosting decision tree machine learning method to predict the change of the folding free energy caused by amino acid substitutions. The method does not require the 3D structure of the corresponding protein, but only its sequence and, thus, can be applied on genome-scale investigations where structural information is very sparse. SAAFEC-SEQ uses physicochemical properties, sequence features, and evolutionary information features to make the predictions. It is shown to consistently outperform all existing state-of-the-art sequence-based methods in both the Pearson correlation coefficient and root-mean-squared-error parameters as benchmarked on several independent datasets. The SAAFEC-SEQ has been implemented into a web server and is available as stand-alone code that can be downloaded and embedded into other researchers’ code.
Collapse
|
134
|
Nikam R, Kulandaisamy A, Harini K, Sharma D, Gromiha MM. ProThermDB: thermodynamic database for proteins and mutants revisited after 15 years. Nucleic Acids Res 2021; 49:D420-D424. [PMID: 33196841 PMCID: PMC7778892 DOI: 10.1093/nar/gkaa1035] [Citation(s) in RCA: 90] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 10/14/2020] [Accepted: 10/26/2020] [Indexed: 11/12/2022] Open
Abstract
ProThermDB is an updated version of the thermodynamic database for proteins and mutants (ProTherm), which has ∼31 500 data on protein stability, an increase of 84% from the previous version. It contains several thermodynamic parameters such as melting temperature, free energy obtained with thermal and denaturant denaturation, enthalpy change and heat capacity change along with experimental methods and conditions, sequence, structure and literature information. Besides, the current version of the database includes about 120 000 thermodynamic data obtained for different organisms and cell lines, which are determined by recent high throughput proteomics techniques using whole-cell approaches. In addition, we provided a graphical interface for visualization of mutations at sequence and structure levels. ProThermDB is cross-linked with other relevant databases, PDB, UniProt, PubMed etc. It is freely available at https://web.iitm.ac.in/bioinfo2/prothermdb/index.html without any login requirements. It is implemented in Python, HTML and JavaScript, and supports the latest versions of major browsers, such as Firefox, Chrome and Safari.
Collapse
Affiliation(s)
- Rahul Nikam
- Department of Biotechnology, Bhupat and Jyoti Mehta School of BioSciences, Indian Institute of Technology Madras, Chennai 600 036, Tamilnadu, India
| | - A Kulandaisamy
- Department of Biotechnology, Bhupat and Jyoti Mehta School of BioSciences, Indian Institute of Technology Madras, Chennai 600 036, Tamilnadu, India
| | - K Harini
- Department of Biotechnology, Bhupat and Jyoti Mehta School of BioSciences, Indian Institute of Technology Madras, Chennai 600 036, Tamilnadu, India
| | - Divya Sharma
- Department of Biotechnology, Bhupat and Jyoti Mehta School of BioSciences, Indian Institute of Technology Madras, Chennai 600 036, Tamilnadu, India
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of BioSciences, Indian Institute of Technology Madras, Chennai 600 036, Tamilnadu, India
| |
Collapse
|
135
|
Sukumar S, Krishnan A, Banerjee S. An Overview of Bioinformatics Resources for SNP Analysis. Adv Bioinformatics 2021. [DOI: 10.1007/978-981-33-6191-1_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|
136
|
Contreras F, Nutschel C, Beust L, Davari MD, Gohlke H, Schwaneberg U. Can constraint network analysis guide the identification phase of KnowVolution? A case study on improved thermostability of an endo-β-glucanase. Comput Struct Biotechnol J 2020; 19:743-751. [PMID: 33552446 PMCID: PMC7822948 DOI: 10.1016/j.csbj.2020.12.034] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 12/23/2020] [Accepted: 12/23/2020] [Indexed: 01/02/2023] Open
Abstract
Cellulases are industrially important enzymes, e.g., in the production of bioethanol, in pulp and paper industry, feedstock, and textile. Thermostability is often a prerequisite for high process stability and improving thermostability without affecting specific activities at lower temperatures is challenging and often time-consuming. Protein engineering strategies that combine experimental and computational are emerging in order to reduce experimental screening efforts and speed up enzyme engineering campaigns. Constraint Network Analysis (CNA) is a promising computational method that identifies beneficial positions in enzymes to improve thermostability. In this study, we compare CNA and directed evolution in the identification of beneficial positions in order to evaluate the potential of CNA in protein engineering campaigns (e.g., in the identification phase of KnowVolution). We engineered the industrially relevant endoglucanase EGLII from Penicillium verruculosum towards increased thermostability. From the CNA approach, six variants were obtained with an up to 2-fold improvement in thermostability. The overall experimental burden was reduced to 40% utilizing the CNA method in comparison to directed evolution. On a variant level, the success rate was similar for both strategies, with 0.27% and 0.18% improved variants in the epPCR and CNA-guided library, respectively. In essence, CNA is an effective method for identification of positions that improve thermostability.
Collapse
Affiliation(s)
- Francisca Contreras
- Institute of Biotechnology, RWTH Aachen University, Worringerweg 3, 52074 Aachen, Germany
| | - Christina Nutschel
- John von Neumann Institute for Computing (NIC), Jülich Supercomputing Centre (JSC) and Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich GmbH, 52425 Jülich, Germany
| | - Laura Beust
- Institute of Biotechnology, RWTH Aachen University, Worringerweg 3, 52074 Aachen, Germany
| | - Mehdi D. Davari
- Institute of Biotechnology, RWTH Aachen University, Worringerweg 3, 52074 Aachen, Germany
| | - Holger Gohlke
- John von Neumann Institute for Computing (NIC), Jülich Supercomputing Centre (JSC) and Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich GmbH, 52425 Jülich, Germany
- Institute for Pharmaceutical and Medicinal Chemistry, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Ulrich Schwaneberg
- Institute of Biotechnology, RWTH Aachen University, Worringerweg 3, 52074 Aachen, Germany
- DWI-Leibniz Institute for Interactive Materials, Forckenbeckstraße 50, 52074 Aachen, Germany
| |
Collapse
|
137
|
Guo X, An Y, Chai C, Sang J, Jiang L, Lu F, Dai Y, Liu F. Construction of the R17L mutant of MtC1LPMO for improved lignocellulosic biomass conversion by rational point mutation and investigation of the mechanism by molecular dynamics simulations. BIORESOURCE TECHNOLOGY 2020; 317:124024. [PMID: 32836036 DOI: 10.1016/j.biortech.2020.124024] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 08/12/2020] [Accepted: 08/13/2020] [Indexed: 06/11/2023]
Abstract
To enhance the biomass conversion efficiency, the R17L mutant of the lytic polysaccharide monooxygenase (LPMO) MtC1LPMO with improved catalytic efficiency was constructed via rational point mutation based on the HotSpot Wizard 3.0 and dezyme web servers. Compared with the wild-type (WT) MtC1LPMO, R17L exhibited a 1.8-fold increase of specific activity and 1.92-fold increase of catalytic efficiency (kcat/Km). The degree of increase of the reducing sugar yield from microcrystalline cellulose and three plant biomass materials during synergistic hydrolysis using cellulase in combination with R17L was about 2 times higher than with the WT. Molecular dynamics simulations revealed that the R17L mutation reduced the stability of the region R18-I36, which then weakened the direct interactions between region N24-V31 and the substrate cellohexaose. Consequently, the deflection time of the cellohexaose conformation in R17L was prolonged compared to the WT, which enhanced its catalytic efficiency.
Collapse
Affiliation(s)
- Xiao Guo
- Key Laboratory of Industrial Fermentation Microbiology of the Ministry of Education, Tianjin 300457, PR China; Tianjin Key Laboratory of Industrial Microbiology, Tianjin 300457, PR China; College of Biotechnology, Tianjin University of Science & Technology, Tianjin 300457, PR China
| | - Yajing An
- Key Laboratory of Industrial Fermentation Microbiology of the Ministry of Education, Tianjin 300457, PR China; Tianjin Key Laboratory of Industrial Microbiology, Tianjin 300457, PR China; College of Biotechnology, Tianjin University of Science & Technology, Tianjin 300457, PR China
| | - Chengcheng Chai
- Key Laboratory of Industrial Fermentation Microbiology of the Ministry of Education, Tianjin 300457, PR China; Tianjin Key Laboratory of Industrial Microbiology, Tianjin 300457, PR China; College of Biotechnology, Tianjin University of Science & Technology, Tianjin 300457, PR China
| | - Jingcheng Sang
- Key Laboratory of Industrial Fermentation Microbiology of the Ministry of Education, Tianjin 300457, PR China; Tianjin Key Laboratory of Industrial Microbiology, Tianjin 300457, PR China; College of Biotechnology, Tianjin University of Science & Technology, Tianjin 300457, PR China
| | - Luying Jiang
- Key Laboratory of Industrial Fermentation Microbiology of the Ministry of Education, Tianjin 300457, PR China; Tianjin Key Laboratory of Industrial Microbiology, Tianjin 300457, PR China; College of Biotechnology, Tianjin University of Science & Technology, Tianjin 300457, PR China
| | - Fuping Lu
- Key Laboratory of Industrial Fermentation Microbiology of the Ministry of Education, Tianjin 300457, PR China; Tianjin Key Laboratory of Industrial Microbiology, Tianjin 300457, PR China; College of Biotechnology, Tianjin University of Science & Technology, Tianjin 300457, PR China
| | - Yujie Dai
- Key Laboratory of Industrial Fermentation Microbiology of the Ministry of Education, Tianjin 300457, PR China; Tianjin Key Laboratory of Industrial Microbiology, Tianjin 300457, PR China; College of Biotechnology, Tianjin University of Science & Technology, Tianjin 300457, PR China
| | - Fufeng Liu
- Key Laboratory of Industrial Fermentation Microbiology of the Ministry of Education, Tianjin 300457, PR China; Tianjin Key Laboratory of Industrial Microbiology, Tianjin 300457, PR China; College of Biotechnology, Tianjin University of Science & Technology, Tianjin 300457, PR China.
| |
Collapse
|
138
|
Chen Y, Lu H, Zhang N, Zhu Z, Wang S, Li M. PremPS: Predicting the impact of missense mutations on protein stability. PLoS Comput Biol 2020; 16:e1008543. [PMID: 33378330 PMCID: PMC7802934 DOI: 10.1371/journal.pcbi.1008543] [Citation(s) in RCA: 93] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 01/12/2021] [Accepted: 11/16/2020] [Indexed: 12/12/2022] Open
Abstract
Computational methods that predict protein stability changes induced by missense mutations have made a lot of progress over the past decades. Most of the available methods however have very limited accuracy in predicting stabilizing mutations because existing experimental sets are dominated by mutations reducing protein stability. Moreover, few approaches could consistently perform well across different test cases. To address these issues, we developed a new computational method PremPS to more accurately evaluate the effects of missense mutations on protein stability. The PremPS method is composed of only ten evolutionary- and structure-based features and parameterized on a balanced dataset with an equal number of stabilizing and destabilizing mutations. A comprehensive comparison of the predictive performance of PremPS with other available methods on nine benchmark datasets confirms that our approach consistently outperforms other methods and shows considerable improvement in estimating the impacts of stabilizing mutations. A protein could have multiple structures available, and if another structure of the same protein is used, the predicted change in stability for structure-based methods might be different. Thus, we further estimated the impact of using different structures on prediction accuracy, and demonstrate that our method performs well across different types of structures except for low-resolution structures and models built based on templates with low sequence identity. PremPS can be used for finding functionally important variants, revealing the molecular mechanisms of functional influences and protein design. PremPS is freely available at https://lilab.jysw.suda.edu.cn/research/PremPS/, which allows to do large-scale mutational scanning and takes about four minutes to perform calculations for a single mutation per protein with ~ 300 residues and requires ~ 0.4 seconds for each additional mutation.
Collapse
Affiliation(s)
- Yuting Chen
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Haoyu Lu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Ning Zhang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Zefeng Zhu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Shuqin Wang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Minghui Li
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| |
Collapse
|
139
|
Runthala A, Sai TH, Kamjula V, Phulara SC, Rajput VS, Sangapillai K. Excavating the functionally crucial active-site residues of the DXS protein of Bacillus subtilis by exploring its closest homologues. J Genet Eng Biotechnol 2020; 18:76. [PMID: 33242110 PMCID: PMC7691408 DOI: 10.1186/s43141-020-00087-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 10/21/2020] [Indexed: 11/10/2022]
Abstract
Abstract
Background
To achieve a high yield of terpenoid-based therapeutics, 1-deoxy-d-xylulose-5-phosphate (DXP) pathway has been significantly exploited for the production of downstream enzymes. The DXP synthase (DXS) enzyme, the initiator of this pathway, is pivotal for the convergence of carbon flux, and is computationally studied well for the industrially utilized generally regarded as safe (GRAS) bacterium Bacillus subtilis to decode its vital regions for aiding the construction of a functionally improved mutant library.
Results
For the 546 sequence dataset of DXS sequences, a representative set of 108 sequences is created, and it shows a significant evolutionary divergence across different species clubbed into 37 clades, whereas three clades are observed for the 76 sequence dataset of Bacillus subtilis. The DXS enzyme, sharing a statistically significant homology to transketolase, is shown to be evolutionarily too distant. By the mutual information-based co-evolutionary network and hotspot analysis, the most crucial loci within the active site are deciphered. The 650-residue representative structure displays a complete conservation of 114 loci, and only two co-evolving residues ASP154 and ILE371 are found to be the conserved ones. Lastly, P318D is predicted to be the top-ranked mutation causing the increase in the thermodynamic stability of 6OUW.
Conclusion
The study excavates the vital functional, phylogenetic, and conserved residues across the active site of the DXS protein, the key rate-limiting controller of the entire pathway. It would aid to computationally understand the evolutionary landscape of this industrially useful enzyme and would allow us to widen its substrate repertoire to increase the enzymatic yield of unnatural molecules for in vivo and in vitro applications.
Collapse
|
140
|
Bellacchio E. Mutations Causing Mild or No Structural Damage in Interfaces of Multimerization of the Fibrinogen γ-Module More Likely Confer Negative Dominant Behaviors. Int J Mol Sci 2020; 21:ijms21239016. [PMID: 33260935 PMCID: PMC7730044 DOI: 10.3390/ijms21239016] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2020] [Revised: 11/23/2020] [Accepted: 11/25/2020] [Indexed: 02/02/2023] Open
Abstract
Different pathogenic variants in the same protein or even within the same domain of a protein may differ in their patterns of disease inheritance, with some of the variants behaving as negative dominant and others as autosomal recessive mutations. Here is presented a structural analysis and comparison of the molecular characteristics of the sites in fibrinogen γ-module, a fibrinogen component critical in multimerization processes, targeted by pathogenic variants (HGMD database) and by variants found in the healthy population (gnomAD database). The main result of this study is the identification of the molecular pathogenic mechanisms defining which pattern of disease inheritance is selected by mutations at the crossroad of autosomal recessive and negative dominant modalities. The observations in this analysis also warn about the possibility that several variants reported in the non-pathogenic gnomAD database might indeed be a hidden source of diseases with autosomal recessive inheritance or requiring a combination with other disease-causing mutations. Disease presentation might remain mostly unrevealed simply because the very low variant frequency rarely results in biallelic pathogenic mutations or the coupling with mutations in other genes contributing to the same disease. The results here presented provide hints for a deeper search of pathogenic mechanisms and modalities of disease inheritance for protein mutants participating in multimerization phenomena.
Collapse
Affiliation(s)
- Emanuele Bellacchio
- Area di Ricerca Genetica e Malattie Rare, Bambino Gesù Children's Hospital, IRCCS, Piazza Sant'Onofrio 4, 00165 Rome, Italy
| |
Collapse
|
141
|
Sarkar A, Yang Y, Vihinen M. Variation benchmark datasets: update, criteria, quality and applications. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2020:5710862. [PMID: 32016318 PMCID: PMC6997940 DOI: 10.1093/database/baz117] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 06/03/2019] [Accepted: 07/01/2019] [Indexed: 02/07/2023]
Abstract
Development of new computational methods and testing their performance has to be carried out using experimental data. Only in comparison to existing knowledge can method performance be assessed. For that purpose, benchmark datasets with known and verified outcome are needed. High-quality benchmark datasets are valuable and may be difficult, laborious and time consuming to generate. VariBench and VariSNP are the two existing databases for sharing variation benchmark datasets used mainly for variation interpretation. They have been used for training and benchmarking predictors for various types of variations and their effects. VariBench was updated with 419 new datasets from 109 papers containing altogether 329 014 152 variants; however, there is plenty of redundancy between the datasets. VariBench is freely available at http://structure.bmc.lu.se/VariBench/. The contents of the datasets vary depending on information in the original source. The available datasets have been categorized into 20 groups and subgroups. There are datasets for insertions and deletions, substitutions in coding and non-coding region, structure mapped, synonymous and benign variants. Effect-specific datasets include DNA regulatory elements, RNA splicing, and protein property for aggregation, binding free energy, disorder and stability. Then there are several datasets for molecule-specific and disease-specific applications, as well as one dataset for variation phenotype effects. Variants are often described at three molecular levels (DNA, RNA and protein) and sometimes also at the protein structural level including relevant cross references and variant descriptions. The updated VariBench facilitates development and testing of new methods and comparison of obtained performances to previously published methods. We compared the performance of the pathogenicity/tolerance predictor PON-P2 to several benchmark studies, and show that such comparisons are feasible and useful, however, there may be limitations due to lack of provided details and shared data. Database URL: http://structure.bmc.lu.se/VariBench
Collapse
Affiliation(s)
- Anasua Sarkar
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184 Lund, Sweden
| | - Yang Yang
- School of Computer Science and Technology, Soochow University, No1. Shizi Street, Suzhou, 215006 Jiangsu, China.,Provincial Key Laboratory for Computer Information Processing Technology, No1. Shizi Street, Soochow University, Suzhou, 215006 Jiangsu, China
| | - Mauno Vihinen
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184 Lund, Sweden
| |
Collapse
|
142
|
Li X, Dou Z, Sun Y, Wang L, Gong B, Wan L. A sequence embedding method for enzyme optimal condition analysis. BMC Bioinformatics 2020; 21:512. [PMID: 33167861 PMCID: PMC7653822 DOI: 10.1186/s12859-020-03851-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 10/30/2020] [Indexed: 11/10/2022] Open
Abstract
Background An enzyme activity is influenced by the external environment. It is important to have an enzyme remain high activity in a specific condition. A usual way is to first determine the optimal condition of an enzyme by either the gradient test or by tertiary structure, and then to use protein engineering to mutate a wild type enzyme for a higher activity in an expected condition. Results In this paper, we investigate the optimal condition of an enzyme by directly analyzing the sequence. We propose an embedding method to represent the amino acids and the structural information as vectors in the latent space. These vectors contain information about the correlations between amino acids and sites in the aligned amino acid sequences, as well as the correlation with the optimal condition. We crawled and processed the amino acid sequences in the glycoside hydrolase GH11 family, and got 125 amino acid sequences with optimal pH condition. We used probabilistic approximation method to implement the embedding learning method on these samples. Based on these embedding vectors, we design a computational score to determine which one has a better optimal condition for two given amino acid sequences and achieves the accuracy 80% on the test proteins in the same family. We also give the mutation suggestion such that it has a higher activity in an expected environment, which is consistent with the previously professional wet experiments and analysis. Conclusion A new computational method is proposed for the sequence based on the enzyme optimal condition analysis. Compared with the traditional process that involves a lot of wet experiments and requires multiple mutations, this method can give recommendations on the direction and location of amino acid substitution with reference significance for an expected condition in an efficient and effective way.
Collapse
Affiliation(s)
- Xiangjun Li
- School of Software, Shandong University, Shunhua Road, Jinan, 250101, China
| | - Zhixin Dou
- State Key Laboratory of Microbial Technology, Shandong University, Binhai Road, Qingdao, 266237, China
| | - Yuqing Sun
- School of Software, Shandong University, Shunhua Road, Jinan, 250101, China.
| | - Lushan Wang
- State Key Laboratory of Microbial Technology, Shandong University, Binhai Road, Qingdao, 266237, China
| | - Bin Gong
- School of Software, Shandong University, Shunhua Road, Jinan, 250101, China
| | - Lin Wan
- School of Software, Shandong University, Shunhua Road, Jinan, 250101, China
| |
Collapse
|
143
|
Pacheco-García JL, Cano-Muñoz M, Sánchez-Ramos I, Salido E, Pey AL. Naturally-Occurring Rare Mutations Cause Mild to Catastrophic Effects in the Multifunctional and Cancer-Associated NQO1 Protein. J Pers Med 2020; 10:E207. [PMID: 33153185 PMCID: PMC7711955 DOI: 10.3390/jpm10040207] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 10/27/2020] [Accepted: 11/02/2020] [Indexed: 12/13/2022] Open
Abstract
The functional and pathological implications of the enormous genetic diversity of the human genome are mostly unknown, primarily due to our unability to predict pathogenicity in a high-throughput manner. In this work, we characterized the phenotypic consequences of eight naturally-occurring missense variants on the multifunctional and disease-associated NQO1 protein using biophysical and structural analyses on several protein traits. Mutations found in both exome-sequencing initiatives and in cancer cell lines cause mild to catastrophic effects on NQO1 stability and function. Importantly, some mutations perturb functional features located structurally far from the mutated site. These effects are well rationalized by considering the nature of the mutation, its location in protein structure and the local stability of its environment. Using a set of 22 experimentally characterized mutations in NQO1, we generated experimental scores for pathogenicity that correlate reasonably well with bioinformatic scores derived from a set of commonly used algorithms, although the latter fail to semiquantitatively predict the phenotypic alterations caused by a significant fraction of mutations individually. These results provide insight into the propagation of mutational effects on multifunctional proteins, the implementation of in silico approaches for establishing genotype-phenotype correlations and the molecular determinants underlying loss-of-function in genetic diseases.
Collapse
Affiliation(s)
- Juan Luis Pacheco-García
- Departamento de Química Física, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; (J.L.P.-G.); (M.C.-M.); (I.S.-R.)
| | - Mario Cano-Muñoz
- Departamento de Química Física, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; (J.L.P.-G.); (M.C.-M.); (I.S.-R.)
| | - Isabel Sánchez-Ramos
- Departamento de Química Física, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; (J.L.P.-G.); (M.C.-M.); (I.S.-R.)
| | - Eduardo Salido
- Centre for Biomedical Research on Rare Diseases (CIBERER), Hospital Universitario de Canarias, 38320 Tenerife, Spain;
| | - Angel L. Pey
- Departamento de Química Física y Unidad de Excelencia de Química Aplicada a Biomedicina y Medioambiente (UEQ), Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain
| |
Collapse
|
144
|
Li B, Yang YT, Capra JA, Gerstein MB. Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks. PLoS Comput Biol 2020; 16:e1008291. [PMID: 33253214 PMCID: PMC7728386 DOI: 10.1371/journal.pcbi.1008291] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 12/10/2020] [Accepted: 08/26/2020] [Indexed: 12/22/2022] Open
Abstract
Predicting mutation-induced changes in protein thermodynamic stability (ΔΔG) is of great interest in protein engineering, variant interpretation, and protein biophysics. We introduce ThermoNet, a deep, 3D-convolutional neural network (3D-CNN) designed for structure-based prediction of ΔΔGs upon point mutation. To leverage the image-processing power inherent in CNNs, we treat protein structures as if they were multi-channel 3D images. In particular, the inputs to ThermoNet are uniformly constructed as multi-channel voxel grids based on biophysical properties derived from raw atom coordinates. We train and evaluate ThermoNet with a curated data set that accounts for protein homology and is balanced with direct and reverse mutations; this provides a framework for addressing biases that have likely influenced many previous ΔΔG prediction methods. ThermoNet demonstrates performance comparable to the best available methods on the widely used Ssym test set. In addition, ThermoNet accurately predicts the effects of both stabilizing and destabilizing mutations, while most other methods exhibit a strong bias towards predicting destabilization. We further show that homology between Ssym and widely used training sets like S2648 and VariBench has likely led to overestimated performance in previous studies. Finally, we demonstrate the practical utility of ThermoNet in predicting the ΔΔGs for two clinically relevant proteins, p53 and myoglobin, and for pathogenic and benign missense variants from ClinVar. Overall, our results suggest that 3D-CNNs can model the complex, non-linear interactions perturbed by mutations, directly from biophysical properties of atoms.
Collapse
Affiliation(s)
- Bian Li
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Department of Biological Sciences and Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Yucheng T. Yang
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
| | - John A. Capra
- Department of Biological Sciences and Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Mark B. Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Department of Computer Science, Yale University, New Haven, Connecticut, United States of America
| |
Collapse
|
145
|
Tunstall T, Portelli S, Phelan J, Clark TG, Ascher DB, Furnham N. Combining structure and genomics to understand antimicrobial resistance. Comput Struct Biotechnol J 2020; 18:3377-3394. [PMID: 33294134 PMCID: PMC7683289 DOI: 10.1016/j.csbj.2020.10.017] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 10/15/2020] [Accepted: 10/17/2020] [Indexed: 02/07/2023] Open
Abstract
Antimicrobials against bacterial, viral and parasitic pathogens have transformed human and animal health. Nevertheless, their widespread use (and misuse) has led to the emergence of antimicrobial resistance (AMR) which poses a potentially catastrophic threat to public health and animal husbandry. There are several routes, both intrinsic and acquired, by which AMR can develop. One major route is through non-synonymous single nucleotide polymorphisms (nsSNPs) in coding regions. Large scale genomic studies using high-throughput sequencing data have provided powerful new ways to rapidly detect and respond to such genetic mutations linked to AMR. However, these studies are limited in their mechanistic insight. Computational tools can rapidly and inexpensively evaluate the effect of mutations on protein function and evolution. Subsequent insights can then inform experimental studies, and direct existing or new computational methods. Here we review a range of sequence and structure-based computational tools, focussing on tools successfully used to investigate mutational effect on drug targets in clinically important pathogens, particularly Mycobacterium tuberculosis. Combining genomic results with the biophysical effects of mutations can help reveal the molecular basis and consequences of resistance development. Furthermore, we summarise how the application of such a mechanistic understanding of drug resistance can be applied to limit the impact of AMR.
Collapse
Affiliation(s)
- Tanushree Tunstall
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Stephanie Portelli
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Australia
| | - Jody Phelan
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Taane G. Clark
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - David B. Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Australia
| | - Nicholas Furnham
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| |
Collapse
|
146
|
NGS Panel Testing of Triple-Negative Breast Cancer Patients in Cyprus: A Study of BRCA-Negative Cases. Cancers (Basel) 2020; 12:cancers12113140. [PMID: 33120919 PMCID: PMC7692082 DOI: 10.3390/cancers12113140] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 10/05/2020] [Accepted: 10/23/2020] [Indexed: 01/11/2023] Open
Abstract
Simple Summary In Cyprus, approximately 9% of triple-negative (negative in common breast cancer receptors—estrogen, progesterone, and human epidermal growth factor receptor 2 (HER2) receptors) breast cancer (TNBC) patients carry inherited mutations in the BRCA1/2 breast cancer (BC) susceptibility genes. These mutations increase the risk of BC. However, the contribution of other BC susceptibility genes has not yet been determined. To this end, we aimed to investigate the prevalence of mutations in BRCA1/2-negative TNBC patients in Cyprus. Ninety-five cancer susceptibility genes were sequenced for 163 TNBC patients. The frequency of non-BRCA mutations and especially PALB2 in TNBC patients in Cyprus appears to be higher compared to other populations, and half of the mutation-positive patients were diagnosed over the age of 60 years. Based on these results, we believe that PALB2 and TP53 along with BRCA1/2 genetic testing could be beneficial for a large proportion of TNBC patients in Cyprus, irrespective of their age of diagnosis. Abstract In Cyprus, approximately 9% of triple-negative (estrogen receptor-negative, progesterone receptor-negative, and human epidermal growth factor receptor 2-negative) breast cancer (TNBC) patients are positive for germline pathogenic variants (PVs) in BRCA1/2. However, the contribution of other genes has not yet been determined. To this end, we aimed to investigate the prevalence of germline PVs in BRCA1/2-negative TNBC patients in Cyprus, unselected for family history of cancer or age of diagnosis. A comprehensive 94-cancer-gene panel was implemented for 163 germline DNA samples, extracted from the peripheral blood of TNBC patients. Identified variants of uncertain clinical significance were evaluated, using extensive in silico investigation. Eight PVs (4.9%) were identified in two high-penetrance TNBC susceptibility genes. Of these, seven occurred in PALB2 (87.5%) and one occurred in TP53 (12.5%). Interestingly, 50% of the patients carrying PVs were diagnosed over the age of 60 years. The frequency of non-BRCA PVs (4.9%) and especially PALB2 PVs (4.3%) in TNBC patients in Cyprus appears to be higher compared to other populations. Based on these results, we believe that PALB2 and TP53 along with BRCA1/2 genetic testing could be beneficial for a large proportion of TNBC patients in Cyprus, irrespective of their age of diagnosis.
Collapse
|
147
|
Yazar M, Özbek P. In Silico Tools and Approaches for the Prediction of Functional and Structural Effects of Single-Nucleotide Polymorphisms on Proteins: An Expert Review. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2020; 25:23-37. [PMID: 33058752 DOI: 10.1089/omi.2020.0141] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Single-nucleotide polymorphisms (SNPs) are single-base variants that contribute to human biological variation and pathogenesis of many human diseases. Among all SNP types, nonsynonymous single-nucleotide polymorphisms (nsSNPs) can alter many structural, biochemical, and functional features of a protein such as folding characteristics, charge distribution, stability, dynamics, and interactions with other proteins/nucleotides. These modifications in the protein structure can lead nsSNPs to be closely associated with many multifactorial diseases such as cancer, diabetes, and neurodegenerative diseases. Predicting structural and functional effects of nsSNPs with experimental approaches can be time-consuming and costly; hence, computational prediction tools and algorithms are being widely and increasingly utilized in biology and medical research. This expert review examines the in silico tools and algorithms for the prediction of functional or structural effects of SNP variants, in addition to the description of the phenotypic effects of nsSNPs on protein structure, association between pathogenicity of variants, and functional or structural features of disease-associated variants. Finally, case studies investigating the functional and structural effects of nsSNPs on selected protein structures are highlighted. We conclude that creating a consistent workflow with a combination of in silico approaches or tools should be considered to increase the performance, accuracy, and precision of the biological and clinical predictions made in silico.
Collapse
Affiliation(s)
- Metin Yazar
- Department of Bioengineering, Marmara University, Göztepe, İstanbul, Turkey.,Department of Genetics and Bioengineering, Istanbul Okan University, Tuzla, Istanbul, Turkey
| | - Pemra Özbek
- Department of Bioengineering, Marmara University, Göztepe, İstanbul, Turkey
| |
Collapse
|
148
|
Ferreira KCDV, Fialho LF, Franco OL, de Alencar SA, Porto WF. Benchmarking analysis of deleterious SNP prediction tools on CYP2D6 enzyme. Chem Biol Drug Des 2020; 96:984-994. [DOI: 10.1111/cbdd.13676] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Revised: 02/15/2020] [Accepted: 03/03/2020] [Indexed: 12/19/2022]
Affiliation(s)
- Karla Cristina do Vale Ferreira
- Programa de Pós‐Graduação em Ciências Genômicas e Biotecnologia Universidade Católica de Brasília Brasília Brazil
- Centro de Análises Proteômicas e Bioquímicas Pós‐Graduação em Ciências Genômicas e Biotecnologia Universidade Católica de Brasília Brasília Brazil
| | - Leonardo Ferreira Fialho
- Programa de Pós‐Graduação em Ciências Genômicas e Biotecnologia Universidade Católica de Brasília Brasília Brazil
| | - Octávio Luiz Franco
- Programa de Pós‐Graduação em Ciências Genômicas e Biotecnologia Universidade Católica de Brasília Brasília Brazil
- Centro de Análises Proteômicas e Bioquímicas Pós‐Graduação em Ciências Genômicas e Biotecnologia Universidade Católica de Brasília Brasília Brazil
- S‐Inova Biotech Pós Graduação em Biotecnologia Universidade Católica Dom Bosco Campo Grande Brazil
| | - Sérgio Amorim de Alencar
- Programa de Pós‐Graduação em Ciências Genômicas e Biotecnologia Universidade Católica de Brasília Brasília Brazil
| | - William Farias Porto
- S‐Inova Biotech Pós Graduação em Biotecnologia Universidade Católica Dom Bosco Campo Grande Brazil
- Porto Reports Brasília Brazil
| |
Collapse
|
149
|
Gerasimavicius L, Liu X, Marsh JA. Identification of pathogenic missense mutations using protein stability predictors. Sci Rep 2020; 10:15387. [PMID: 32958805 PMCID: PMC7506547 DOI: 10.1038/s41598-020-72404-w] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Accepted: 08/31/2020] [Indexed: 12/17/2022] Open
Abstract
Attempts at using protein structures to identify disease-causing mutations have been dominated by the idea that most pathogenic mutations are disruptive at a structural level. Therefore, computational stability predictors, which assess whether a mutation is likely to be stabilising or destabilising to protein structure, have been commonly used when evaluating new candidate disease variants, despite not having been developed specifically for this purpose. We therefore tested 13 different stability predictors for their ability to discriminate between pathogenic and putatively benign missense variants. We find that one method, FoldX, significantly outperforms all other predictors in the identification of disease variants. Moreover, we demonstrate that employing predicted absolute energy change scores improves performance of nearly all predictors in distinguishing pathogenic from benign variants. Importantly, however, we observe that the utility of computational stability predictors is highly heterogeneous across different proteins, and that they are all inferior to the best performing variant effect predictors for identifying pathogenic mutations. We suggest that this is largely due to alternate molecular mechanisms other than protein destabilisation underlying many pathogenic mutations. Thus, better ways of incorporating protein structural information and molecular mechanisms into computational variant effect predictors will be required for improved disease variant prioritisation.
Collapse
Affiliation(s)
- Lukas Gerasimavicius
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - Xin Liu
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - Joseph A Marsh
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK.
| |
Collapse
|
150
|
Masso M. Accurate and efficient structure-based computational mutagenesis for modeling fluorescence levels of Aequorea victoria green fluorescent protein mutants. Protein Eng Des Sel 2020; 33:5905305. [DOI: 10.1093/protein/gzaa022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Revised: 07/16/2020] [Accepted: 08/04/2020] [Indexed: 11/13/2022] Open
Abstract
Abstract
A computational mutagenesis technique was used to characterize the structural effects associated with over 46 000 single and multiple amino acid variants of Aequorea victoria green fluorescent protein (GFP), whose functional effects (fluorescence levels) were recently measured by experimental researchers. For each GFP mutant, the approach generated a single score reflecting the overall change in sequence-structure compatibility relative to native GFP, as well as a vector of environmental perturbation (EP) scores characterizing the impact at all GFP residue positions. A significant GFP structure–function relationship (P < 0.0001) was elucidated by comparing the sequence-structure compatibility scores with the functional data. Next, the computed vectors for GFP mutants were used to train predictive models of fluorescence by implementing random forest (RF) classification and tree regression machine learning algorithms. Classification performance reached 0.93 for sensitivity, 0.91 for precision and 0.90 for balanced accuracy, and regression models led to Pearson’s correlation as high as r = 0.83 between experimental and predicted GFP mutant fluorescence. An RF model trained on a subset of over 1000 experimental single residue GFP mutants with measured fluorescence was used for predicting the 3300 remaining unstudied single residue mutants, with results complementing known GFP biochemical and biophysical properties. In addition, models trained on the subset of experimental GFP mutants harboring multiple residue replacements successfully predicted fluorescence of the single residue GFP mutants. The models developed for this study were accurate and efficient, and their predictions outperformed those of several related state-of-the-art methods.
Collapse
Affiliation(s)
- Majid Masso
- Laboratory for Structural Bioinformatics, School of Systems Biology, George Mason University, 10900 University Boulevard MS 5B3, Manassas, VA 20110, USA
| |
Collapse
|